Project Home
Project Home
Documents
Documents
Wiki
Wiki
Discussion Forums
Discussions
Project Information
Project Info
Forum Topic - Debugging a strange deadlock...: (2 Items)
   
Debugging a strange deadlock...  
On 6.3.0SP3 I'm debugging a strange deadlock with process communication on a single node.

I have a thread reply-blocked in MsgSend (thread #19)

# pidin -p 1364182
     pid tid name               prio STATE       Blocked
 1364182   1 het/bin/player.bin  10o MUTEX       1364182-07 #1
 1364182   2 het/bin/player.bin  10o CONDVAR     822849c
 1364182   3 het/bin/player.bin  29o CONDVAR     8216898
 1364182   4 het/bin/player.bin  30o RECEIVE     2
 1364182   5 het/bin/player.bin  10o RECEIVE     2
 1364182   6 het/bin/player.bin  10o RECEIVE     5
 1364182   7 het/bin/player.bin  10o CONDVAR     824609c
 1364182   8 het/bin/player.bin  62o CONDVAR     82445cc
 1364182   9 het/bin/player.bin  10o RECEIVE     21
 1364182  10 het/bin/player.bin  10o CONDVAR     824702c
 1364182  11 het/bin/player.bin  10o CONDVAR     8246de4
 1364182  12 het/bin/player.bin  14o CONDVAR     8246b84
 1364182  13 het/bin/player.bin  14o RECEIVE     25
 1364182  14 het/bin/player.bin  10o CONDVAR     82464b4
 1364182  15 het/bin/player.bin  10o CONDVAR     84e8844
 1364182  16 het/bin/player.bin  10o CONDVAR     84e7754
 1364182  17 het/bin/player.bin  10o CONDVAR     852993c
 1364182  18 het/bin/player.bin  10o CONDVAR     85412dc
 1364182  19 het/bin/player.bin  10o REPLY       1364127
 1364182  20 het/bin/player.bin  10o CONDVAR     854f37c
 1364182  21 het/bin/player.bin  10o CONDVAR     8545c14
 1364182  22 het/bin/player.bin  10o CONDVAR     854e1c4
 1364182  23 het/bin/player.bin  10o CONDVAR     8552ecc
 1364182  24 het/bin/player.bin  10o CONDVAR     8590d6c
 1364182  25 het/bin/player.bin  10o CONDVAR     857bbac
 1364182  26 het/bin/player.bin  10o CONDVAR     857b324
 1364182  27 het/bin/player.bin  30o CONDVAR     82fcf90
 1364182  28 het/bin/player.bin  30o CONDVAR     85a5d30
 1364182  32 het/bin/player.bin  30o CONDVAR     85a1718
 1364182  34 het/bin/player.bin  30o RECEIVE     2
 1364182  35 het/bin/player.bin  30o RECEIVE     2
 1364182  36 het/bin/player.bin  30o RECEIVE     2
 1364182  37 het/bin/player.bin  30o RECEIVE     2
 1364182  38 het/bin/player.bin  30o RECEIVE     2
 1364182  39 het/bin/player.bin  30o RECEIVE     2
 1364182  40 het/bin/player.bin  30o RECEIVE     2
 1364182  41 het/bin/player.bin  10o RECEIVE     2
 1364182  42 het/bin/player.bin  10o MUTEX       1364182-19 #1

The thread is blocked on process 1364127 which is a RM, but it's idle.

# pidin -p 1364127
     pid tid name               prio STATE       Blocked
 1364127   1 rophet/bin/drm.bin  10o MUTEX       1364127-04 #1
 1364127   2 rophet/bin/drm.bin  10o CONDVAR     8170bec
 1364127   3 rophet/bin/drm.bin  10o RECEIVE     2
 1364127   4 rophet/bin/drm.bin  10o RECEIVE     30
 1364127   5 rophet/bin/drm.bin  10o RECEIVE     18
 1364127   6 rophet/bin/drm.bin  10o RECEIVE     18
 1364127   7 rophet/bin/drm.bin  10o RECEIVE     22
 1364127   8 rophet/bin/drm.bin  21o RECEIVE     22
 1364127   9 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  10 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  11 rophet/bin/drm.bin  10o RECEIVE     30
 1364127  12 rophet/bin/drm.bin  10o RECEIVE     38
 1364127  13 rophet/bin/drm.bin  10o RECEIVE     38
 1364127  14 rophet/bin/drm.bin  10o RECEIVE     18
 1364127  15 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  16 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  17 rophet/bin/drm.bin  10o RECEIVE     18
 1364127  18 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  19 rophet/bin/drm.bin  21o RECEIVE     26
 1364127  20 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  21 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  22 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  23 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  24 rophet/bin/drm.bin  10o RECEIVE     26
 1364127  25 rophet/bin/drm.bin  10o RECEIVE     18
 1364127  26 rophet/bin/drm.bin  10o RECEIVE     18
 1364127  27 rophet/bin/drm.bin  10o RECEIVE     18
 1364127  28 rophet/bin/drm.bin ...
View Full Message
Re: Debugging a strange deadlock...  
Ah, sorry, the close() has not been called yet. It just waits in thread #42 for the signal to unblock the read operation
 before calling close(). But it needs to be checked why signal did not unblock the thread as well as why the thread has 
become blocked at all.

You can delete these posts from forum until I verify everything to find more information. Sorry.