tensorflow - 当只需要第一个元素时，为什么要创建一个新轴？

首先，很抱歉标题含糊

由于我有兴趣了解有关 TensorFlow 和图像分割的更多信息，因此我正在学习他们的教程 ( https://www.tensorflow.org/tutorials/images/segmentation )。然而，我注意到一些我无法完全掌握的东西，在谷歌搜索之后也没有掌握。

在本节中:

def create_mask(pred_mask):
    pred_mask = tf.argmax(pred_mask, axis=-1)
    pred_mask = pred_mask[..., tf.newaxis]
    return pred_mask[0]

首先为 pred_mask 向量创建一个新轴，然后仅选择第一个元素的原因是什么？为什么和我想象的不一样，如下图:

def create_mask(pred_mask):
    pred_mask = tf.argmax(pred_mask, axis=-1)
    return pred_mask

最佳答案

这只是为了让图像保持 3D 张量。例如，如果您的预测形状为 (1, 256, 256, 10)(一批 256x256 图像 10 个类)，则在 tf.argmax() 之后，您可以将收到形状为 (1, 256, 256) 的张量(一批没有 channel 的 256x256 图像)。但通常情况下，如果图像采用 HWC 格式(Height, Width, Channel) 而不仅仅是(Height, Width)，则处理图像会更容易。例如，如果您使用 matplotlib 或 OpenCV，通常需要 HWC 图像。

关于tensorflow - 当只需要第一个元素时，为什么要创建一个新轴？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/58997048/

上一篇：typescript - 我如何 "un-index"类型？

下一篇：r - 如果输入存储在变量中，geom_emoji() 函数不接受输入

python - 如何将 int32 张量转换为 float32

python-3.x - 任何光照(低或高)下物体的图像分割

python - numpy 数组的累积 argmax

tensorflow - 强化学习，如何从 Action 维度空间大于一的高斯分布中采样 Action ？

linux - 在 fedora 27 中安装 gcc 和 g++ 版本 < 6

scala - Scala/Spark数据帧: find the column name corresponding to the max

java - Java 8 流中的 arg 最大值？

image - Matlab中的肺分割

python - 在 opencv python 中获取蒙版的轮廓