Python、keras、卷积自动编码器

我正在尝试在 keras 中创建我的第一个卷积自动编码器，但我在层输出形状方面遇到问题。这是我的代码:

input_img = Input(shape=X_train.shape[1:])

x = Conv2D(32, (3, 3), activation='relu', padding='same', kernel_constraint=maxnorm(3))(input_img)
x = MaxPooling2D(pool_size=(2, 2), padding='same')(x)
x = Conv2D(16, (3, 3), activation='relu', padding='same', kernel_constraint=maxnorm(3))(x)
encoded = MaxPooling2D((2, 2), padding='same')(x)

x = Conv2D(16, (3, 3), activation='relu', padding='same', kernel_constraint=maxnorm(3))(encoded)
x = UpSampling2D((2, 2))(x)
x = Conv2D(32, (3, 3), activation='relu', padding='same', kernel_constraint=maxnorm(3))(x)
x = UpSampling2D((2, 2))(x)
decoded = Conv2D(1, (3, 3), activation='sigmoid', padding='same')(x)

autoencoder = Model(input_img, decoded)
print(autoencoder.summary())
autoencoder.compile(optimizer='adadelta', loss='binary_crossentropy')
autoencoder.fit(X_train, X_train, epochs=50, batch_size=32)

结果:

_________________________________________________________________
Layer (type)                 Output Shape              Param   
=================================================================
input_87 (InputLayer)        (None, 32, 32, 3)         0         
_________________________________________________________________
conv2d_327 (Conv2D)          (None, 32, 32, 3)         9248      
_________________________________________________________________
max_pooling2d_136 (MaxPoolin (None, 32, 16, 2)         0         
_________________________________________________________________
conv2d_328 (Conv2D)          (None, 16, 16, 2)         4624      
_________________________________________________________________
max_pooling2d_137 (MaxPoolin (None, 16, 8, 1)          0         
_________________________________________________________________
conv2d_329 (Conv2D)          (None, 16, 8, 1)          2320      
_________________________________________________________________
up_sampling2d_124 (UpSamplin (None, 16, 16, 2)         0         
_________________________________________________________________
conv2d_330 (Conv2D)          (None, 32, 16, 2)         4640      
_________________________________________________________________
up_sampling2d_125 (UpSamplin (None, 32, 32, 4)         0         
_________________________________________________________________
conv2d_331 (Conv2D)          (None, 1, 32, 4)          289       
=================================================================
Total params: 21,121
Trainable params: 21,121
Non-trainable params: 0
_________________________________________________________________
None

当然还有错误:

ValueError: Error when checking target: expected conv2d_331 to have shape (None, 1, 32, 4) but got array with shape (50000, 32, 32, 3)

你知道我做错了什么吗？为什么最后一个 UpSampling2D 返回该形状？

最佳答案

因此，您的 keras 似乎已将其图像尺寸设置为 channel_first (可能还有 Theano 作为后端)，这意味着输入的最后两个维度被视为空间维度，而不是第二个和第三个维度。另一件事是您的输出应该有 3 过滤器而不是 1 因为这最终会导致目标维度不匹配。总结一下:

为了正确设置您的输入，您需要切换到 tensorflow 并将 channel 顺序更改为 channel_last 或通过以下方式转置您的输入:
```
X = X.transpose([0, 3, 1, 2])
```

更改以下行:

decoded = Conv2D(3, (3, 3), activation='sigmoid', padding='same')(x)

关于Python、keras、卷积自动编码器，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/44466157/

Python、keras、卷积自动编码器

上一篇：python - 使用 Networkx 计算图中的边时出现内存错误

下一篇：python - 从 MSSQL 运行 Python 脚本