This article shows some insights on how to configure Intel Cluster Checker to run over Red Hat Enterprise Linux Server 5.6.
When running Intel Cluster Checker in a system built using Red Hat Enterprise Linux Server 5.6 with hardware having SATA controllers, the clean_ipc test module needs to be configured properly as the operating system will leave shared memory segments running by default.
$ /sbin/lspci | grep SATA00:1f.2 SATA controller: Intel Corporation 82801JI (ICH10 Family) SATA AHCI Controller
$ lsb_release --all | grep DescriptionDescription: Red Hat Enterprise Linux Server release 5.6 (Tikanga)
The clean_ipc test module checks by default that no Inter Process Communication (IPC) facilities are open, meaning that the subsystem is clean in all compute nodes in the cluster. The test module executes the ipcs command to get a list of Shared Memory Segments, Semaphore Arrays, and Message Queues. If there are any entries, it will flag them and fail, unless explicitly configured to allow an exact quantity of active entries.
The initial output of the Intel Cluster Checker tool provided the following diagnostics information:
System V Interprocess Communication, (clean_ipc)....................................................................................................FAILED[ERROR]subtest 'Shared Memory Segments' failed- failing All hosts returned: 'found 3 entries, target was 0'
After checking the test module manual and the associated debug file, it can be seen that the command used to display
IPC status information is the following:
[root@compute-0-0 ~]# ipcs -a------ Shared Memory Segments --------key shmid owner perms bytes nattch status------ Semaphore Arrays --------key semid owner perms nsems0x000000a7 0 root 600 1------ Message Queues --------key msqid owner perms used-bytes messages
Once the manual page of the ipcs command is reviewed, an approach to find the offending process ID can be the following. Then just by checking the ps manual page it is possible to get the actual process name from that original process ID.
$ ipcs -s -i 0Semaphore Array semid=0uid=0 gid=0 cuid=0 cgid=0mode=0600, access_perms=0600nsems = 1otime = Thu May 5 15:29:29 2011ctime = Thu May 5 15:29:28 2011semnum value ncount zcount pid0 1 0 0 13499$ ps -ef | grep 13499root 5094 4938 0 18:36 pts/1 00:00:00 grep 13499root 13499 1 0 15:29 ? 00:00:00 iscsid
The process name is already pointing out that the SCSI subsystem is used that IPC item, but just in case the installed packages database can be queried to find out which is the RPM package owning that process daemon. It is also good to know if the package is in use by other packages as a dependency.
$ rpm -qf /sbin/iscsidiscsi-initiator-utils-188.8.131.522-6.el5$ rpm -e --test iscsi-initiator-utils-184.108.40.2062-6.el5error: Failed dependencies:iscsi-initiator-utils is needed by (installed) mkinitrd-220.127.116.11-68.el5.x86_64iscsi-initiator-utils is needed by (installed) mkinitrd-18.104.22.168-68.el5.i386
It is safe then to assume that the package is required by the operating system to be there, therefore the Intel Cluster Checker test module should be configured to allow those IPC items. More details on the configuration of the clean_ipc test module can be found here.