我写了一个神经网络模型来分析图像并在最后提取 8 个 float 。该模型在我的计算机上运行良好(但速度缓慢),因此我在 TPU 云上尝试它,然后砰!我有一个错误:
I1008 12:58:47.077905 140221679261440 tf_logging.py:115] Error recorded from training_loop: File system scheme '[local]' not implemented (file: '/home/gcloud_iba/Data/CGTR/model/GA_subset/model.ckpt-0_temp_e840841d93124a67b54074b1c0fd7ae4') [[{{node save/SaveV2}} = SaveV2[dtypes=[DT_FLOAT, DT_FLOAT, DT_FLOAT, DT_FLOAT, DT_FLOAT, ..., DT_FLOAT, DT_FLOAT, DT_FLOAT, DT_FLOAT, DT_INT64], _device="/job:worker/replica:0/task:0/device:CPU:0"](save/ShardedFilename, save/SaveV2/tensor_names, save/SaveV2/shape_and_slices, batch_normalization/beta/Read/ReadVariableOp, batch_normalization/beta/Momentum/Read_1/ReadVariableOp, batch_normalization/gamma/Read/ReadVariableOp, batch_normalization/gamma/Momentum/Read_1/ReadVariableOp, batch_normalization/moving_mean/Read/ReadVariableOp, batch_normalization/moving_variance/Read/ReadVariableOp, batch_normalization_1/beta/Read/ReadVariableOp, batch_normalization_1/beta/Momentum/Read_1/ReadVariableOp, batch_normalization_1/gamma/Read/ReadVariableOp, batch_normalization_1/gamma/Momentum/Read_1/ReadVariableOp, batch_normalization_1/moving_mean/Read/ReadVariableOp, batch_normalization_1/moving_variance/Read/ReadVariableOp, conv2d/kernel/Read/ReadVariableOp, conv2d/kernel/Momentum/Read_1/ReadVariableOp, conv2d_1/kernel/Read/ReadVariableOp, conv2d_1/kernel/Momentum/Read_1/ReadVariableOp, conv2d_2/kernel/Read/ReadVariableOp, conv2d_2/kernel/Momentum/Read_1/ReadVariableOp, conv2d_3/kernel/Read/ReadVariableOp, conv2d_3/kernel/Momentum/Read_1/ReadVariableOp, conv2d_4/kernel/Read/ReadVariableOp, conv2d_4/kernel/Momentum/Read_1/ReadVariableOp, conv2d_5/kernel/Read/ReadVariableOp, conv2d_5/kernel/Momentum/Read_1/ReadVariableOp, conv2d_6/kernel/Read/ReadVariableOp, conv2d_6/kernel/Momentum/Read_1/ReadVariableOp, conv2d_7/kernel/Read/ReadVariableOp, conv2d_7/kernel/Momentum/Read_1/ReadVariableOp, conv2d_8/kernel/Read/ReadVariableOp, conv2d_8/kernel/Momentum/Read_1/ReadVariableOp, conv2d_9/kernel/Read/ReadVariableOp, conv2d_9/kernel/Momentum/Read_1/ReadVariableOp, dense/bias/Read/ReadVariableOp, dense/bias/Momentum/Read_1/ReadVariableOp, dense/kernel/Read/ReadVariableOp, dense/kernel/Momentum/Read_1/ReadVariableOp, dense_1/bias/Read/ReadVariableOp, dense_1/bias/Momentum/Read_1/ReadVariableOp, dense_1/kernel/Read/ReadVariableOp, dense_1/kernel/Momentum/Read_1/ReadVariableOp, dense_2/bias/Read/ReadVariableOp, dense_2/bias/Momentum/Read_1/ReadVariableOp, dense_2/kernel/Read/ReadVariableOp, dense_2/kernel/Momentum/Read_1/ReadVariableOp, dense_3/bias/Read/ReadVariableOp, dense_3/bias/Momentum/Read_1/ReadVariableOp, dense_3/kernel/Read/ReadVariableOp, dense_3/kernel/Momentum/Read_1/ReadVariableOp, global_step/Read/ReadVariableOp)]]
我检查了 TPU 是否可以访问硬盘并且它可以工作(我有另一段代码可以使用另一个模型读取相同的数据集)。 我还没有缓存我的数据,但我做了一些预取。除此之外,我看不出有什么不起作用?
感谢您提供的任何帮助!
Pi-r
最佳答案
本地文件系统在 Cloud TPU 上不可用。模型目录(检查点等)和输入数据应存储在 Google Cloud Storage 中(并以“gs://”为前缀)。
更多详情请点击此处
关于tensorflow - TPU本地文件系统不存在?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/52703047/