CKPT_CORRUPT
tensorflow
data_error
ai_generated
partial
DataLossError: Unable to open table file /path/to/checkpoint: Data loss: file is corrupted
ID: tensorflow/checkpoint-read-failed-corrupt
Fix rate: 70%
Confidence: 82%
Evidence count: 1
First seen: 2023-08-10
Version compatibility
| Version | Status | Introduced | Deprecated | Notes |
|---|---|---|---|---|
| tensorflow 2.8.0 | active | — | — | — |
| tensorflow 2.9.0 | active | — | — | — |
| tensorflow 2.10.0 | active | — | — | — |
Root cause analysis
The checkpoint file is corrupted, typically because of an incomplete write, a disk failure, or an incompatibility between TensorFlow versions.
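An incomplete write can sometimes be spotted before TensorFlow is even asked to load the checkpoint. The sketch below assumes the usual TF2 on-disk layout, where a checkpoint prefix such as `/path/to/checkpoint` is stored as a `.index` file plus one or more `.data-*` shard files. The helper name `checkpoint_files_look_complete` is hypothetical, and the check is only a heuristic: a file can pass it and still be corrupted internally.

```python
import glob
import os


def checkpoint_files_look_complete(ckpt_prefix):
    """Heuristic pre-check for an incomplete checkpoint write.

    Assumes the checkpoint prefix is stored as `<prefix>.index` plus one or
    more `<prefix>.data-*` shards. Returns False when a file is missing or
    empty. A True result does not prove the contents are intact.
    """
    index_path = ckpt_prefix + ".index"
    # glob.escape protects prefixes that contain characters such as [ or *.
    shard_paths = glob.glob(glob.escape(ckpt_prefix) + ".data-*")
    paths = [index_path] + shard_paths
    return (
        os.path.isfile(index_path)
        and bool(shard_paths)
        and all(os.path.getsize(p) > 0 for p in paths)
    )
```

If this returns `False`, the write most likely never finished, and falling back to an older checkpoint is probably the quickest way forward.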
Official documentation
https://www.tensorflow.org/guide/checkpoint
Solution
- Restore from a backup checkpoint or retrain the model. If partial recovery is possible, use `tf.train.load_checkpoint` to inspect the checkpoint and salvage the tensors that are still readable.
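The partial-recovery step might look roughly like the sketch below. It relies on the reader object returned by `tf.train.load_checkpoint`, which exposes `get_variable_to_shape_map()` and `get_tensor(name)`. The function names `salvage_tensors` and `salvage_checkpoint` are hypothetical. The broad `except` is an assumption, since the exact exception raised for a damaged tensor may differ between TensorFlow versions. If the index file itself is unreadable, `tf.train.load_checkpoint` will fail up front and nothing can be salvaged this way.

```python
def salvage_tensors(reader):
    """Split a checkpoint's tensors into readable and unreadable ones.

    `reader` is any object exposing get_variable_to_shape_map() and
    get_tensor(name), such as the one returned by tf.train.load_checkpoint.
    Returns (recovered, failed): name -> value and name -> error text.
    """
    recovered, failed = {}, {}
    for name in sorted(reader.get_variable_to_shape_map()):
        try:
            recovered[name] = reader.get_tensor(name)
        except Exception as exc:  # assumption: error type varies by version
            failed[name] = repr(exc)
    return recovered, failed


def salvage_checkpoint(ckpt_prefix):
    """Open the checkpoint at `ckpt_prefix` and salvage what can be read."""
    # Imported here so salvage_tensors stays usable without TensorFlow.
    import tensorflow as tf

    reader = tf.train.load_checkpoint(ckpt_prefix)
    return salvage_tensors(reader)
```

Calling `salvage_checkpoint("/path/to/checkpoint")` would return the tensors that could still be read alongside the names that failed. The recovered values can then be assigned to a freshly built model, leaving the failed variables to be re-initialized or retrained.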
Ineffective attempts
Common but ineffective approaches:
- 80% failure rate: If the source itself is corrupted, re-downloading won't help.
- 85% failure rate: Corruption is usually at the file level, not the format level.