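The database server malfunctioned, and after rebooting the server TDengine (taosd) will not start. What is the cause? I urgently need to recover the data.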

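【TDengine environment】
Production

【TDengine version】TDengine-server-3.2.1.0

【Operating system and version】Linux Ubuntu

【Deployment method】Non-container deployment

【Number of cluster nodes】

【Number of cluster replicas】

【Business impact】

【Steps to reproduce】Which operations led to the problem

【Problem encountered: symptoms and impact】

【Resource configuration】

【Full screenshot of the error】(Do not paste large blocks of error output; error text is hard to read directly on the forum)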

01/10 13:12:04.409402 00045532 WAL ERROR vgId:1, WAL log entries incomplete in range [65, 217992], index:65, snaphotVer:64
01/10 13:12:04.409411 00045532 WAL ERROR vgId:1, cannot open wal since repair meta file failed
01/10 13:12:04.409449 00045532 MND ERROR failed to open wal since WAL log incomplete. wal:/mnt/cfsdist0/TDengine/data/mnode/wal
01/10 13:12:04.409451 00045532 MND ERROR mnode-wal exec failed since WAL log incomplete, start to cleanup
01/10 13:12:04.409455 00045532 MND ERROR failed to open mnode since WAL log incomplete
01/10 13:12:04.409460 00045532 DND ERROR failed to open mnode since WAL log incomplete
01/10 13:12:04.409462 00045532 DND ERROR node:mnode, failed to open since WAL log incomplete
01/10 13:12:04.409464 00045532 DND ERROR node:mnode, failed to open since WAL log incomplete
01/10 13:12:04.409475 00045532 DND ERROR failed to open nodes since WAL log incomplete
01/10 13:15:10.688313 00046753 WAL ERROR vgId:1, WAL log entries incomplete in range [65, 217992], index:65, snaphotVer:64
01/10 13:15:10.688328 00046753 WAL ERROR vgId:1, cannot open wal since repair meta file failed
01/10 13:15:10.688390 00046753 MND ERROR failed to open wal since WAL log incomplete. wal:/mnt/cfsdist0/TDengine/data/mnode/wal
01/10 13:15:10.688393 00046753 MND ERROR mnode-wal exec failed since WAL log incomplete, start to cleanup
01/10 13:15:10.688398 00046753 MND ERROR failed to open mnode since WAL log incomplete
01/10 13:15:10.688406 00046753 DND ERROR failed to open mnode since WAL log incomplete
01/10 13:15:10.688408 00046753 DND ERROR node:mnode, failed to open since WAL log incomplete
01/10 13:15:10.688412 00046753 DND ERROR node:mnode, failed to open since WAL log incomplete
01/10 13:15:10.688425 00046753 DND ERROR failed to open nodes since WAL log incomplete
01/10 13:15:11.918533 00046833 WAL ERROR vgId:1, WAL log entries incomplete in range [65, 217992], index:65, snaphotVer:64
01/10 13:15:11.918544 00046833 WAL ERROR vgId:1, cannot open wal since repair meta file failed
01/10 13:15:11.918587 00046833 MND ERROR failed to open wal since WAL log incomplete. wal:/mnt/cfsdist0/TDengine/data/mnode/wal
01/10 13:15:11.918590 00046833 MND ERROR mnode-wal exec failed since WAL log incomplete, start to cleanup
01/10 13:15:11.918596 00046833 MND ERROR failed to open mnode since WAL log incomplete
01/10 13:15:11.918604 00046833 DND ERROR failed to open mnode since WAL log incomplete
01/10 13:15:11.918607 00046833 DND ERROR node:mnode, failed to open since WAL log incomplete
01/10 13:15:11.918610 00046833 DND ERROR node:mnode, failed to open since WAL log incomplete
01/10 13:15:11.918622 00046833 DND ERROR failed to open nodes since WAL log incomplete
01/10 13:15:14.040006 00046877 WAL ERROR vgId:1, WAL log entries incomplete in range [65, 217992], index:65, snaphotVer:64
01/10 13:15:14.040027 00046877 WAL ERROR vgId:1, cannot open wal since repair meta file failed
01/10 13:15:14.040079 00046877 MND ERROR failed to open wal since WAL log incomplete. wal:/mnt/cfsdist0/TDengine/data/mnode/wal
01/10 13:15:14.040082 00046877 MND ERROR mnode-wal exec failed since WAL log incomplete, start to cleanup
01/10 13:15:14.040089 00046877 MND ERROR failed to open mnode since WAL log incomplete
01/10 13:15:14.040102 00046877 DND ERROR failed to open mnode since WAL log incomplete
01/10 13:15:14.040105 00046877 DND ERROR node:mnode, failed to open since WAL log incomplete
01/10 13:15:14.040108 00046877 DND ERROR node:mnode, failed to open since WAL log incomplete
01/10 13:15:14.040149 00046877 DND ERROR failed to open nodes since WAL log incomplete
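
The log above repeats the same failure for several start attempts. As a reading aid, here is a minimal Python sketch that groups such lines by the numeric field after the timestamp and counts the distinct messages. The line layout (timestamp, numeric id, module, level, message) is inferred only from the excerpt above, and the names `LINE_RE` and `summarize` are hypothetical, not part of TDengine.

```python
import re
from collections import Counter

# Layout assumed from the pasted excerpt:
# "MM/DD HH:MM:SS.micros <numeric id> <MODULE> <LEVEL> <message>"
LINE_RE = re.compile(
    r"^(?P<ts>\d{2}/\d{2} \d{2}:\d{2}:\d{2}\.\d+) "
    r"(?P<id>\d+) (?P<module>\w+) (?P<level>\w+) (?P<msg>.*)$"
)


def summarize(log_text):
    """Group log lines by their numeric id field and count (module, message) pairs."""
    attempts = {}
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # skip blank lines and anything that does not fit the layout
        counter = attempts.setdefault(match["id"], Counter())
        counter[(match["module"], match["msg"])] += 1
    return attempts


if __name__ == "__main__":
    sample = (
        "01/10 13:12:04.409402 00045532 WAL ERROR vgId:1, cannot open wal\n"
        "01/10 13:15:10.688313 00046753 WAL ERROR vgId:1, cannot open wal\n"
    )
    for attempt_id, messages in summarize(sample).items():
        print(attempt_id, dict(messages))
```

Run against the full excerpt, this should make it easier to see that every attempt fails on the same mnode WAL path with the same range.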

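It looks like the WAL files are corrupted.

You can try the following:

1. Stop the TDengine taosd service.

2. Back up the data directory of the TDengine taosd service.

3. Delete all wal directories under the data directory.

4. Restart the TDengine taosd service.

Then check whether it restarts normally.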
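
Steps 2 and 3 could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: taosd is already stopped, every directory named `wal` under the data directory is a WAL directory (the log only confirms `.../data/mnode/wal`), and the directories are moved to a backup location instead of being deleted, which is a more cautious variation on step 3. The function names and the backup path are hypothetical.

```python
import shutil
from pathlib import Path


def find_wal_dirs(data_dir):
    """Return every directory named 'wal' under data_dir (not nested inside another 'wal')."""
    root = Path(data_dir)
    return sorted(
        p for p in root.rglob("wal")
        if p.is_dir() and "wal" not in p.relative_to(root).parent.parts
    )


def move_wal_dirs(data_dir, backup_dir, dry_run=True):
    """Move each wal directory under data_dir into backup_dir, keeping relative paths.

    With dry_run=True (the default) nothing is touched; the planned moves are only returned.
    """
    root = Path(data_dir)
    backup = Path(backup_dir)
    planned = []
    for wal in find_wal_dirs(root):
        target = backup / wal.relative_to(root)
        planned.append((wal, target))
        if not dry_run:
            target.parent.mkdir(parents=True, exist_ok=True)
            shutil.move(str(wal), str(target))
    return planned


if __name__ == "__main__":
    # Data path taken from the log above; the backup path is a made-up example.
    for src, dst in move_wal_dirs("/mnt/cfsdist0/TDengine/data", "/tmp/tdengine-wal-backup"):
        print(f"would move {src} -> {dst}")
```

Running it first with the default `dry_run=True` lists what would be moved, so the list can be checked before anything is changed.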

I tried this method; a few seconds after starting, it failed again.