python - 在 for 循环中使用 tf.concat 添加值是否很慢？

我正在使用tensorflow 2.0，并尝试通过稍微优化我的代码来加快我的训练速度。

我批量运行我的模型，并希望保护每批结果的安全，以便在一个张量中获得一个时期结束时的所有结果。

这就是我的代码的样子:

...
for epoch in range(start_epoch, end_epoch):

    # this vector shall hold all results for one epoch
    predictions_epoch = tf.zeros(0,)
   
    for batch in tf_dataset: 
        # get prediction with predictions_batch.shape[0] euqals batch_size
        predictions_batch = model(batch)   
        
        # Add the batch result to the previous results
        predictions_epoch = tf.concat(predictions_batch, predictions_epoch)
        
        # DO SOME OTHER STUFF LIKE BACKPROB
        ...

    # predictions_epoch.shape[0] now equals number of all samples in dataset
    with writer.as_default():
        tf.summary.histogram(name='predictions', data=predictions_epoch, step=epoch)

让我们假设，一个预测只是一个标量值。所以 predictions_batch 是一个 shape=[batchsize,] 的张量。

这种连接方式效果很好。

现在我的问题是: 这个 tf.concat() 操作会减慢我的整个训练速度吗？我还使用了 tf.stack() 来实现此目的，但速度似乎没有区别。

我想知道，因为一旦我使用 Matlab，在 for 循环中向 Vector 添加新值(从而改变其大小)是非常慢的。用零初始化向量，然后在循环中分配值在速度方面更加有效。

对于 tensorflow 来说也是如此吗？或者是否有另一种更“正确”的方法来做一些事情，比如在 for 循环中将张量相加，这样更干净或更快速？我在网上没有找到任何替代解决方案。

感谢您的帮助。

最佳答案

是的，这不是最值得推荐的方法。最好将每个张量添加到列表中并在最后将它们连接一次:

for epoch in range(start_epoch, end_epoch):
    predictions_batches = []
    for batch in tf_dataset:
        predictions_batch = model(batch)
        predictions_batches.append(predictions_batch)
        # ...
    predictions_epoch = tf.concat(predictions_batches)

您还可以使用tf.TensorArray ，如果你想用 tf.function 修饰代码，这可能会更好.

for epoch in range(start_epoch, end_epoch):
    # Pass arguments as required
    # If the number of batches is know or an upper bound
    # can be estimated use that and dynamic_size=False
    predictions_batches = tf.TensorArray(
        tf.float32, INTIAL_SIZE, dynamic_size=True, element_shape=[BATCH_SIZE])
    i = tf.constant(0)
    for batch in tf_dataset:
        predictions_batch = model(batch)
        predictions_batches = predictions_batches.write(i, predictions_batch)
        i += 1
        # ...
    predictions_epoch = predictions_batches.concat()

关于python - 在 for 循环中使用 tf.concat 添加值是否很慢？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/62538267/

python - 在 for 循环中使用 tf.concat 添加值是否很慢？

上一篇：node.js - 服务器发送事件不起作用::错误::net::ERR_INCOMPLETE_CHUNKED_ENCODING 200

下一篇：javascript - 如何更改 Material UI Table Pagination 中的纸张样式