redis集群故障无法自动提升slave

发布时间 2023-07-03 20:29:18作者: mvpbang

问题描述

生产redis集群(3master/3slave)部署在3台虚机上,每个虚机部署2个redis节点,挂了一台虚机导致redis集群异常,分析发现是挂了机器上是2master redis

redis日志

* MASTER <-> REPLICA sync started
# Error condition on socket for SYNC: Connection refused
* Connecting to MASTER x.12.73.126:4379

解决问题

m1、人工提供slave到master恢复集群

redis-cli  //login
cluster nodes
//登陆slave节点执行故障转移,slave->master
cluster failover takeover

m2、备份master rdb,重新初始化redis集群然后导入rdb文件