MC/ServiceGuard A.12.40.00 Enterprise Edition tested on CentOS/RHEL7
all nodes incl. quorum
vi /etc/hosts x.x.x.x sg1.localdomain sg1 x.x.x.x sg2.localdomain sg2 x.x.x.x qs.localdomain qs x.x.x.253 dns x.x.x.254 gw
unless those are XEN guests
#ntpdate -u ntp.obspm.fr ntpdate -u ru.pool.ntp.org yum install ntp
SSH keys
all nodes but quorum
ssh-keygen -t ed25519 cat ~/.ssh/id_ed25519.pub vi ~/.ssh/authorized_keys ssh sg1 ssh sg2 ping -c1 qs ping -c1 sg1 ping -c1 sg2
rpm -q \ xinetd \ sg3_utils \ net-snmp \ lm_sensors \ tog-pegasus \ libnl \ authd \ perl \ perl-Sys-Syslog \ krb5-libs \ zlib \ lsscsi \ systemd \ | grep ^package yum install xinetd net-snmp lm_sensors tog-pegasus libnl perl-Sys-Syslog yum install java
authd
chkconfig --list systemctl list-unit-files | grep auth systemctl start auth.socket systemctl enable auth.socket
MAKE SURE YOU ARE RUNNING THE LATEST UPDATED KERNEL
rpm -qa | grep kernel uname -r
we will need this to compile deadman module
yum install kernel-headers kernel-devel yum groupinstall 'Development Tools'
ERROR: Retrieval of node UUID failed. Unable to retrive UUID on node sg1. Check if "dmidecode" returns valid UUID. ERROR: Retrieval of node SMBIOS failed. Unable to retrive SMBIOS version on node sg1. Check if "dmidecode" returns valid SMBIOS version.
cmcheckconf
is complaining about dmidecode not returning an UUID nor SMBIOS version. I do XEN/PV so there’s no SMBIOS to talk to, hence dmidecode does not provide any info. Let’s fake it.
mv /usr/sbin/dmidecode /usr/sbin/dmidecode.dist vi /usr/sbin/dmidecode #!/bin/ksh cat <<EOF SMBIOS 3.0.0 present. System Information UUID: 00000000-0000-0000-0000-000000000002 EOF chmod +x /usr/sbin/dmidecode
check
/usr/sbin/dmidecode
on all nodes but quorum
mkdir lala/ mount -o loop A.12.40.00_Enterprise_Edition_for_Red_Hat_Enterprise_Linux_7_6_SLES_15_12_11_Oracle_Linux_7_BB097-11006_EVAL.iso lala/ cd /root/lala/ less Readme_Before_Install.txt cd /root/lala/RedHat/RedHat7/Serviceguard/x86_64/ rpm -ivh serviceguard-license-A.12.40.00-0.rhel7.x86_64.rpm rpm -ivh serviceguard-A.12.40.00-0.rhel7.x86_64.rpm
the post install script SHOULD work fine, if you’ve got the corresponding /usr/src/kernels/
against the running kernel. In case you get package postinstall script failures, and fixed your kernel version afterwards, you can still fix it manually - build the deadman module and load it
ls -lF /lib/modules/`uname -r`/source ls -lF /lib/modules/`uname -r`/build ls -ldF /usr/src/kernels/`uname -r`/ cd /usr/local/cmcluster/drivers/ #make clean make -j16 modules make modules_install #modprobe -r deadman depmod -a modprobe deadman lsmod | grep dead cat /proc/misc | grep dead ls -lF /dev/deadman mkinitrd --force /boot/initramfs-`uname -r`.img `uname -r`
on all nodes but quorum
#cp -pi /etc/lvm/lvm.conf /etc/lvm/lvm.conf.dist #vi /etc/lvm/lvm.conf #{hosttags = 1} ls -lF /usr/local/cmcluster/bin/ cat /etc/cmcluster.conf vi ~/.bashrc PATH=$PATH:/usr/local/cmcluster/bin source /etc/cmcluster.conf source ~/.bashrc cp -pi /etc/man_db.conf /etc/man_db.conf.dist vi /etc/man_db.conf MANDATORY_MANPATH /usr/local/cmcluster/doc/man
and check
which cmquerycl man cmquerycl
handy symlink
cd ~/ ln -s /usr/local/cmcluster/conf
on the quorum server only
cd /root/lala/RedHat/RedHat7/QuorumServer/x86_64/ rpm -ivh serviceguard-qs-A.12.40.00-0.rhel7.x86_64.rpm ls -lF /usr/local/qs/conf/ vi /usr/local/qs/conf/qs_authfile sg1 sg2 rpm -ql serviceguard-qs-A.12.40.00-0.rhel7.x86_64 ls -lF /var/log/qs/ systemctl list-unit-files | grep qs systemctl status qs.service systemctl start qs.service systemctl enable qs.service pgrep -a qs netstat -lntup | grep qs
on all nodes but quorum
vi /usr/local/cmcluster/conf/cmclnodelist sg1 root sg2 root
on one node only
cd /usr/local/cmcluster/conf/ ls -lF cmcluster.conf.default cmquerycl -h cmquerycl -q qs -n sg1 -n sg2 -C cluster1.conf.full
we are getting some warnings are we did not define dedicated hearbeat network nor redundant NICs
- 2 or more heartbeat networks OR - 1 heartbeat network with local switch (HP-UX Only) OR - 1 heartbeat network using APA with 2 trunk members (HP-UX Only) OR - 1 heartbeat network using bonding (mode 1 or 4) with 2 slaves (Linux Only)
but it does not prevent us from proceeding
egrep -v '^[[:space:]]*(#|$)' cluster1.conf.full > cluster1.conf vi cluster1.conf
define a cluster name (cannot start with cluster
, don’t ask me why)
CLUSTER_NAME sgcluster1 HOSTNAME_ADDRESS_FAMILY IPV4 QS_HOST qs QS_POLLING_INTERVAL 300000000 NODE_NAME sg1 NETWORK_INTERFACE eth0 HEARTBEAT_IP x.x.x.x NODE_NAME sg2 NETWORK_INTERFACE eth0 HEARTBEAT_IP x.x.x.x MEMBER_TIMEOUT 14000000 AUTO_START_TIMEOUT 600000000 NETWORK_POLLING_INTERVAL 2000000 SUBNET 10.1.1.0 IP_MONITOR OFF MAX_CONFIGURED_PACKAGES 300 ROOT_DISK_MONITOR OFF USER_NAME ANY_USER USER_HOST ANY_SERVICEGUARD_NODE USER_ROLE MONITOR
check
cmcheckconf -C cluster1.conf
populate the ASCII conf among the farm e.g. here from sg2
to sg1
scp cluster1.conf sg1:/usr/local/cmcluster/conf/
and apply
cmapplyconf -k -C cluster1.conf
the cluster should now be status down but configured
cmviewcl -v cmquerycl -v
all nodes but quorum
start and enable the daemon at boot time (idealy in a row…)
systemctl start cmcluster.init.service systemctl enable cmcluster.init.service
join the cluster at startup
cp -pi /usr/local/cmcluster/conf/cmcluster.rc /usr/local/cmcluster/conf/cmcluster.rc.dist vi /usr/local/cmcluster/conf/cmcluster.rc AUTOSTART_CMCLD=1
reboot the cluster nodes
shutdown -r now
cmviewcl -v cmquerycl -v
you can now setup an application/service package with
cmmakepkg cmcheckconf -P cmapplyconf -P
HPE Serviceguard for Linux Trial Software https://www.hpe.com/emea_europe/en/resources/servers/serviceguard-linux-trial.html
Serviceguard Quorum Server https://h20392.www2.hpe.com/portal/swdepot/displayProductInfo.do?productNumber=B8467BA
HPE Serviceguard for Linux Repositories https://downloads.linux.hpe.com/SDR/project/sglx/
Subscribe your system to the sglx-base repository https://downloads.linux.hpe.com/SDR/project/sglx/sglx-base/
Serviceguard for Linux Continentalcluster Patches / Updates https://downloads.linux.hpe.com/SDR/project/sglx/sglx-cc/
Subscribe your system to the sglx-qs repository https://downloads.linux.hpe.com/SDR/project/sglx/sglx-qs/
HPE Serviceguard for Linux - Product documentation https://h20195.www2.hpe.com/v2/default.aspx?cc=us&lc=en&oid=376220
Use of a Quorum Server as the Cluster Lock https://docstore.mik.ua/manuals/hp-ux/en/B3936-90078/ch01s06.html https://docstore.mik.ua/manuals/hp-ux/en/B3936-90078/ch01s06.html
Serviceguard Quorum Server https://h20392.www2.hpe.com/portal/swdepot/displayProductInfo.do?productNumber=B8467BA
HPE Serviceguard for Linux Trial Software https://www.hpe.com/emea_europe/en/resources/servers/serviceguard-linux-trial.html
Repository Listing https://downloads.linux.hpe.com/SDR/repo/sglx-qs/rhel/7/x86_64/current/
Serviceguard for Linux - The cmviewcl Command Does Not Work, if we Can Not Use the rpm Command! https://support.hpe.com/hpsc/doc/public/display?docId=emr_na-c02898778
HP service-guard installation for linux on a single cluster node https://linuxindiablog.wordpress.com/2014/06/09/hp-service-guard-installation-for-linux-on-a-single-cluster-node/
HP service-guard installation for linux on a single cluster node http://ramblingsofanindiantechie.blogspot.com/2014/06/hp-service-guard-installation-for-linux.html
리눅스 service guard https://m.blog.naver.com/woomun/220266431105
Serviceguard for Linux - The cmviewcl Command Does Not Work, if we Can Not Use the rpm Command! https://support.hpe.com/hpsc/doc/public/display?docId=emr_na-c02898778
HP Serviceguard for Linux - cmclconfd: Unable to Open /dev/deadman https://support.hpe.com/hpsc/doc/public/display?docId=emr_na-c02753088
LINUX - What Is the Purpose of LINUX Serviceguard Deadman Driver? https://support.hpe.com/hpsc/doc/public/display?docId=emr_na-c03630952
CONFIG_RETPOLINE: Avoid speculative indirect branches in kernel https://cateee.net/lkddb/web-lkddb/RETPOLINE.html
How to check if Linux kernel is “Retpoline” enabled or not? https://unix.stackexchange.com/questions/435778/how-to-check-if-linux-kernel-is-retpoline-enabled-or-not
Converting system between RHEL and CentOS https://negativo17.org/converting-system-between-rhel-and-centos/
Converting CentOS 6 to RHEL 6 https://www.endpoint.com/blog/2011/12/22/converting-centos-6-to-rhel-6
RedHat Version file https://www.centos.org/forums/viewtopic.php?t=41699
How to get CentOS 8/7/6, Fedora 29-25, RHEL 8/7/6 release version https://computingforgeeks.com/how-to-get-centos-fedora-rhel-release-version/
Accidentally deleted /etc/redhat-release file https://unix.stackexchange.com/questions/209820/accidentally-deleted-etc-redhat-release-file