google-cloud-ml - Cloud-ML 作业 没有这样的文件或目录

标签 google-cloud-ml

我已向 cloud ml 提交了训练作业。但是,它找不到 csv 文件。它就在桶里。这是代码。

# Use scikit-learn to grid search the batch size and epochs
import numpy
from sklearn.model_selection import GridSearchCV
from keras.models import Sequential
from keras.layers import Dense
from keras.wrappers.scikit_learn import KerasClassifier

def create_model():
    model = Sequential()
    model.add(Dense(12, input_dim=11, activation='relu'))
    model.add(Dense(1, activation='sigmoid'))
    model.compile(loss='binary_crossentropy', optimizer='nadam', metrics=['accuracy'])
    return model
seed = 7
numpy.random.seed(seed)

FIL = "gs://bubbly-hexagon-112008-ml/dataset/mixed.csv"
dataset = numpy.loadtxt(FIL, delimiter=",")
X = dataset[:,0:11]
Y = dataset[:,11]

model = KerasClassifier(build_fn=create_model, verbose=1)
batch_size = [10, 20, 40, 60, 80, 100]
epochs = [10, 50, 100, 500, 1000]
param_grid = dict(batch_size=batch_size, nb_epoch=epochs)
grid = GridSearchCV(estimator=model, param_grid=param_grid, n_jobs=-1)
grid_result = grid.fit(X, Y)
print("Best: %f using %s" % (grid_result.best_score_, grid_result.best_params_))
means = grid_result.cv_results_['mean_test_score']
stds = grid_result.cv_results_['std_test_score']
params = grid_result.cv_results_['params']
for mean, stdev, param in zip(means, stds, params):
    print("%f (%f) with: %r" % (mean, stdev, param))

提交作业后,我收到此错误。

Traceback (most recent call last): File "/usr/lib/python2.7/runpy.py", line 162, in _run_module_as_main "__main__", fname, loader, pkg_name) File "/usr/lib/python2.7/runpy.py", line 72, in _run_code exec code in
run_globals File "/root/.local/lib/python2.7/
site-packages/trainer/task.py", line 18, in <module> dataset = numpy.loadtxt(FIL, delimiter=",") File "/root/.local/lib/python2.7/
site-packages/numpy/lib/npyio.py", line 803, in loadtxt fh = iter(open(fname, 'U')) IOError: [Errno 2] No such file or directory: 
'gs://bubbly-hexagon-112008-ml/dataset/mixed.csv'

- 文件位于指定存储桶中,其权限包括 cloud ml 作为读取器。

-我还使用了gcloud beta ml init-project来初始化项目。

-我创建了一个新存储桶并将文件放入其中,但遇到了相同的错误。

-我的存储桶与我提交的作业位于同一区域。

谢谢

最佳答案

来自tensorflow的

file_io效果很好:

from tensorflow.python.lib.io import file_io
import numpy as np
import json

读取 numpy 数组:

with file_io.FileIO(path_npx, 'rb') as f:
    np_arr = np.load( BytesIO(f.read()) )
    print(np_arr)

读取 json 文件:

with file_io.FileIO(path_json, 'r') as f:
    print(json.loads(f.read()))

关于google-cloud-ml - Cloud-ML 作业 没有这样的文件或目录,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/41518774/

相关文章:

machine-learning - 无法从 Google 机器学习 Cloud REST API 获取结果

Tensorflow服务ML引擎在线预测json文件格式

python - 谷歌云机器学习 : Outer dimension for outputs must be unknown

python - 使用自定义容器从 AI Platform 训练作业访问 Google Secret Manager

python - 如何在 google ml api python 客户端中设置请求超时?

tensorflow - 在 ML Engine 中运行分布式训练的正确方法

google-cloud-platform - GCP(AI 平台笔记本)上的“服务器连接错误”

tensorflow - Field : version. deployment_uri Error : The total size of files in gs://my-bucket/ml/is x bytes, 超出了允许的最大值 1073741824 字节

tensorflow - 在本地分发 Tensorflow 训练的最简单方法?

python - 谷歌云机器学习引擎 "Skipping evaluation due to same checkpoint"