LINUX.ORG.RU

Pacemaker не запускает ресурсы jboss и pgsql

 , , , ,


0

1

Привет парни!

Тестирую pacemaker на двух серверах.

На двух нодах стоит CentOS 7 x64

jdk-7u80-linux-x64 JBoss 7.1.1 Final Pgsql (PostgreSQL) 9.2.24

pcs --version 0.9.165

Настроил 3 ресурса. IPaddr2 работает без проблем. А вот с jboss и pgsql проблемы. При чем если их запустить командами

/bin/sh /usr/lib/ocf/resource.d/heartbeat/pgsql start /bin/sh /usr/lib/ocf/resource.d/heartbeat/jboss start

они работают, но pacemaker их не видит.

[root@centos-test1 heartbeat]# pcs status --all
Cluster name: test
Stack: corosync
Current DC: centos-test1 (version 1.1.19-8.el7_6.2-c3c624ea3d) - partition with quorum
Last updated: Wed Dec 26 06:58:21 2018
Last change: Wed Dec 26 06:07:27 2018 by root via cibadmin on centos-test1

2 nodes configured
3 resources configured

Online: [ centos-test1 centos-test2 ]

Full list of resources:

 virtual_ip     (ocf::heartbeat:IPaddr2):       Started centos-test1
 jboss  (ocf::heartbeat:jboss): Stopped
 pgsql  (ocf::heartbeat:pgsql): Stopped

Failed Actions:
* jboss_start_0 on centos-test1 'unknown error' (1): call=18, status=Timed Out, exitreason='',
    last-rc-change='Wed Dec 26 06:08:16 2018', queued=0ms, exec=20002ms
* pgsql_start_0 on centos-test1 'not configured' (6): call=15, status=complete, exitreason='',
    last-rc-change='Wed Dec 26 06:07:56 2018', queued=0ms, exec=115ms
* jboss_start_0 on centos-test2 'unknown error' (1): call=14, status=Timed Out, exitreason='',
    last-rc-change='Wed Dec 26 13:07:04 2018', queued=0ms, exec=20002ms


Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled
  

В ocf::heartbeat: были ошибки с переменными окружения, в файлах пришлось явно указывать:

# Initialization:

: /usr/lib/ocf/lib/heartbeat
. /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs

#: ${OCF_FUNCTIONS_DIR=${OCF_ROOT}/lib/heartbeat}
#. ${OCF_FUNCTIONS_DIR}/ocf-shellfuncs

В логах corasync пишет:

Dec 26 14:19:21 [27771] centos-test1    pengine:     info: common_print:        virtual_ip      (ocf::heartbeat:IPaddr2):    Started centos-test1
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: common_print:        jboss   (ocf::heartbeat:jboss): FAILED centos-test1
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: common_print:        pgsql   (ocf::heartbeat:pgsql): Stopped
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: pe_get_failcount:    jboss has failed INFINITY times on centos-test1
Dec 26 14:19:21 [27771] centos-test1    pengine:  warning: check_migration_threshold:   Forcing jboss away from centos-test1 after 1000000 failures (max=1000000)
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: pe_get_failcount:    pgsql has failed INFINITY times on centos-test1
Dec 26 14:19:21 [27771] centos-test1    pengine:  warning: check_migration_threshold:   Forcing pgsql away from centos-test1 after 1000000 failures (max=1000000)
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: pe_get_failcount:    jboss has failed INFINITY times on centos-test2
Dec 26 14:19:21 [27771] centos-test1    pengine:  warning: check_migration_threshold:   Forcing jboss away from centos-test2 after 1000000 failures (max=1000000)
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: native_color:        Resource jboss cannot run anywhere
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: native_color:        Resource pgsql cannot run anywhere
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: LogActions:  Leave   virtual_ip      (Started centos-test1)
Dec 26 14:19:21 [27771] centos-test1    pengine:   notice: LogAction:    * Stop       jboss          (                 centos-test1 )   due to node availability
Dec 26 14:19:21 [27771] centos-test1    pengine:     info: LogActions:  Leave   pgsql   (Stopped)
Dec 26 14:19:21 [27771] centos-test1    pengine:   notice: process_pe_message:  Calculated transition 5, saving inputs in /var/lib/pacemaker/pengine/pe-input-266.bz2
Dec 26 14:19:21 [27772] centos-test1       crmd:     info: do_state_transition: State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE | input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response
Dec 26 14:19:21 [27772] centos-test1       crmd:     info: do_te_invoke:        Processing graph 5 (ref=pe_calc-dc-1545823161-30) derived from /var/lib/pacemaker/pengine/pe-input-266.bz2
Dec 26 14:19:21 [27772] centos-test1       crmd:   notice: te_rsc_command:      Initiating stop operation jboss_stop_0 locally on centos-test1 | action 2
Dec 26 14:19:21 [27772] centos-test1       crmd:     info: do_lrm_rsc_op:       Performing key=2:5:0:19594a89-d772-4748-8c9a-5a7888a82914 op=jboss_stop_0
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_process_request: Forwarding cib_modify operation for section status to all (origin=local/crmd/54)
Dec 26 14:19:21 [27769] centos-test1       lrmd:     info: log_execute: executing - rsc:jboss action:stop call_id:18
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_perform_op:      Diff: --- 0.15.35 2
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_perform_op:      Diff: +++ 0.15.36 (null)
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_perform_op:      +  /cib:  @num_updates=36
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_perform_op:      +  /cib/status/node_state[@id='1']/lrm[@id='1']/lrm_resources/lrm_resource[@id='jboss']/lrm_rsc_op[@id='jboss_last_0']:  @operation_key=jboss_stop_0, @operation=stop, @transition-key=2:5:0:19594a89-d772-4748-8c9a-5a7888a82914, @transition-magic=-1:193;2:5:0:19594a89-d772-4748-8c9a-5a7888a82914, @call-id=-1, @rc-code=193, @op-status=-1, @last-run=1545823161, @last-rc-change=1545823161, @exec-time=0
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_process_request: Completed cib_modify operation for section status: OK (rc=0, origin=centos-test1/crmd/54, version=0.15.36)
Dec 26 14:19:21  jboss(jboss)[28346]:    INFO: JBoss[jboss] is already stopped.
Dec 26 14:19:21 [27769] centos-test1       lrmd:     info: log_finished:        finished - rsc:jboss action:stop call_id:18 pid:28346 exit-code:0 exec-time:21ms queue-time:0ms
Dec 26 14:19:21 [27772] centos-test1       crmd:   notice: process_lrm_event:   Result of stop operation for jboss on centos-test1: 0 (ok) | call=18 key=jboss_stop_0 confirmed=true cib-update=55
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_process_request: Forwarding cib_modify operation for section status to all (origin=local/crmd/55)
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_perform_op:      Diff: --- 0.15.36 2
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_perform_op:      Diff: +++ 0.15.37 (null)
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_perform_op:      +  /cib:  @num_updates=37
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_perform_op:      +  /cib/status/node_state[@id='1']/lrm[@id='1']/lrm_resources/lrm_resource[@id='jboss']/lrm_rsc_op[@id='jboss_last_0']:  @transition-magic=0:0;2:5:0:19594a89-d772-4748-8c9a-5a7888a82914, @call-id=18, @rc-code=0, @op-status=0, @exec-time=21
Dec 26 14:19:21 [27767] centos-test1        cib:     info: cib_process_request: Completed cib_modify operation for section status: OK (rc=0, origin=centos-test1/crmd/55, version=0.15.37)
Dec 26 14:19:21 [27772] centos-test1       crmd:     info: match_graph_event:   Action jboss_stop_0 (2) confirmed on centos-test1 (rc=0)
Dec 26 14:19:21 [27772] centos-test1       crmd:   notice: run_graph:   Transition 5 (Complete=2, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-266.bz2): Complete
Dec 26 14:19:21 [27772] centos-test1       crmd:     info: do_log:      Input I_TE_SUCCESS received in state S_TRANSITION_ENGINE from notify_crmd
Dec 26 14:19:21 [27772] centos-test1       crmd:   notice: do_state_transition: State transition S_TRANSITION_ENGINE -> S_IDLE | input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd
Dec 26 14:19:26 [27767] centos-test1        cib:     info: cib_process_ping:    Reporting our current digest to centos-test1: 1441d742a8ffbf1c1f45b9d38dd1a776 for 0.15.37 (0x55a05cd6c580 0)

Почему не работает не понимаю, подскажите где что посмотреть.