所以我想在分割(使用分层)后对训练和测试数据集执行平均目标编码,并且为了做到这一点,必须将它们重新合并在一起。
我该怎么做? ,任何建议将不胜感激? ,谢谢你。
X_train, X_test, y_train, y_test = train_test_split(R,
target,
test_size=0.25,
random_state=7,
stratify=target)
print("Number transactions X_train dataset: ", X_train.shape)
print("Number transactions y_train dataset: ", y_train.shape)
print("Number transactions X_test dataset: ", X_test.shape)
print("Number transactions y_test dataset: ", y_test.shape)
这是输出:
Number transactions X_train dataset: (37779, 89)
Number transactions y_train dataset: (37779,)
Number transactions X_test dataset: (12593, 89)
Number transactions y_test dataset: (12593,)
最佳答案
您可以按行连接数组:
X_combined = np.r_[X_train, X_test]
y_combined = np.r_[y_train, y_test]
您可以在 other 中看到一些额外的深度所以问题。
关于python-3.x - 用sklearn分割后如何重新合并X_test和y_test以及x_train和y_train?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/56212429/