python - 2D matrix as input to a convolutional network

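This might be a silly question, but I want to use a convolutional neural network in my deep reinforcement learning project, and I have run into a problem I don't understand. In my project I want to feed the network a 6x7 matrix, which should be equivalent to a 6x7 (42-pixel) black-and-white picture, right?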

class CNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.model = torch.nn.Sequential()
        self.model.add_module("conv_1", torch.nn.Conv2d(in_channels=1, out_channels=16, kernel_size=4, stride = 1))
        self.model.add_module("relu_1", torch.nn.ReLU())
        self.model.add_module("max_pool", torch.nn.MaxPool2d(2))
        self.model.add_module("conv_2", torch.nn.Conv2d(in_channels=16, out_channels=16, kernel_size=4, stride = 1))
        self.model.add_module("relu_2", torch.nn.ReLU())
        self.model.add_module("flatten", torch.nn.Flatten())

        self.model.add_module("linear", torch.nn.Linear(in_features=16*16*16, out_features=7))

    def forward(self, x):
        x = self.model(x)
        return x

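In conv_1, in_channels=1 because I only have 1 matrix (in image recognition terms, that would mean 1 color channel). The other in_channels and out_channels values before the linear layer are arbitrary. I don't know where I should specify the size of the matrix, but the final output should have size 7, which is what I set in the linear layer.

The error I get is: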

RuntimeError: Expected 3D (unbatched) or 4D (batched) input to conv2d, but got input of size: [6, 7]

Best answer

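There are a few problems with your code. First, the reason you get that error message is that the CNN expects a tensor of shape (N, C_in, H_in, W_in), where:

N is the batch size
C_in is the number of input channels
H_in is the height of the input image in pixels
W_in is the width of the input image in pixels

You are only supplying the height and width dimensions. You need to add the channel and batch dimensions explicitly, even if both are just 1: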

model = CNN()

example_input = torch.randn(size=(6, 7))  # this is your input image

print(example_input.shape)  # torch.Size([6, 7])

# output = model(example_input)  # uncommenting this raises your original error (input is only 2D)

example_input = example_input.unsqueeze(0).unsqueeze(0)  # adds the batch and channel dimensions

print(example_input.shape)  # torch.Size([1, 1, 6, 7])

output = model(example_input)  # the dimension error is gone, but a new one appears

However, you will notice that you now get a different error:

RuntimeError: Calculated padded input size per channel: (1 x 2). Kernel size: (4 x 4). Kernel size can't be greater than actual input size

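This is because after the first convolution layer and the max-pooling layer, your data has a spatial shape of 1x2, but the second convolution layer has a kernel size of 4, which makes the operation impossible. A 6x7 input image is very small, so either reduce the kernel size to something that fits, or use a larger image.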
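To see where the 1x2 in the error message comes from, the shape arithmetic can be traced by hand. The sketch below is only an illustration: `conv_out` and `trace_shapes` are hypothetical helper names, not part of PyTorch, and they assume no padding and a dilation of 1, which matches the layers in the question.

```python
def conv_out(size, kernel, stride=1, padding=0):
    """Output length along one axis for Conv2d or MaxPool2d (dilation 1)."""
    return (size + 2 * padding - kernel) // stride + 1


def trace_shapes(h, w, kernel):
    """Spatial shape after conv_1 and after MaxPool2d(2) for a given conv kernel size."""
    h1, w1 = conv_out(h, kernel), conv_out(w, kernel)              # conv_1, stride 1
    h2, w2 = conv_out(h1, 2, stride=2), conv_out(w1, 2, stride=2)  # MaxPool2d(2)
    return (h1, w1), (h2, w2)


# kernel_size=4: the pooled map is 1x2, too small for a second 4x4 kernel
print(trace_shapes(6, 7, kernel=4))  # ((3, 4), (1, 2))

# kernel_size=2: the pooled map is 2x3, so a second 2x2 kernel still fits
print(trace_shapes(6, 7, kernel=2))  # ((5, 6), (2, 3))
```

With kernel_size=2, a second 2x2 convolution on the 2x3 map leaves a 1x2 map, so flattening 16 channels gives 16 * 1 * 2 = 32 features.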

Here is a working example:

import torch
from torch import nn


class CNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.model = torch.nn.Sequential()
        self.model.add_module(
            "conv_1",
            torch.nn.Conv2d(in_channels=1, out_channels=16, kernel_size=2, stride=1),
        )
        self.model.add_module("relu_1", torch.nn.ReLU())
        self.model.add_module("max_pool", torch.nn.MaxPool2d(2))
        self.model.add_module(
            "conv_2",
            torch.nn.Conv2d(in_channels=16, out_channels=16, kernel_size=2, stride=1),
        )
        self.model.add_module("relu_2", torch.nn.ReLU())
        self.model.add_module("flatten", torch.nn.Flatten())

        self.model.add_module("linear", torch.nn.Linear(in_features=32, out_features=7))

    def forward(self, x):
        x = self.model(x)
        return x


model = CNN()
x = torch.randn(size=(6, 7))
x = x.unsqueeze(0).unsqueeze(0)
output = model(x)
print(output.shape) # has shape (1, 7)

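Note that I changed kernel_size to 2, and that the input size of the final linear layer is now 32 (16 channels x 1 x 2 spatial positions). Also, the output has shape (1, 7), where 1 is the batch size, which in our case is just 1. If you only want the 7 output features, just use x = torch.squeeze(x).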
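If you would rather not work out the 32 by hand, one common approach is to push a dummy input through the convolutional part and read off the flattened size. The following is a minimal sketch under that assumption. The names `features`, `head`, and `n_features` are made up for illustration, and the layers simply mirror the working example above.

```python
import torch
from torch import nn

# Convolutional part only, mirroring the working example above.
features = nn.Sequential(
    nn.Conv2d(in_channels=1, out_channels=16, kernel_size=2),
    nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Conv2d(in_channels=16, out_channels=16, kernel_size=2),
    nn.ReLU(),
    nn.Flatten(),
)

# Push one dummy 6x7 board through to discover the flattened size.
with torch.no_grad():
    n_features = features(torch.zeros(1, 1, 6, 7)).shape[1]
print(n_features)  # 32

head = nn.Linear(in_features=n_features, out_features=7)

# A batch of 4 boards already has a batch dimension, so only add the channel dimension.
boards = torch.randn(4, 6, 7).unsqueeze(1)  # shape (4, 1, 6, 7)
batch_out = head(features(boards))
print(batch_out.shape)  # torch.Size([4, 7])

# A single board needs both dimensions; squeeze(0) then drops the batch dimension again.
single = torch.randn(6, 7).unsqueeze(0).unsqueeze(0)  # shape (1, 1, 6, 7)
single_out = head(features(single)).squeeze(0)
print(single_out.shape)  # torch.Size([7])
```

Passing an explicit dimension to squeeze removes only the batch axis, which is a bit safer than a bare torch.squeeze if some other dimension ever happens to be 1.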

Regarding python - 2D matrix as input to a convolutional network, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/72331271/
