python - 将输入提供给 Pytorch LSTM 网络时出现 AttributeError : 'tuple' object has no attribute 'dim' ,

我正在尝试运行以下代码:

import matplotlib.pylab as plt
import numpy as np
import torch
import torch.nn as nn

class LSTM(nn.Module):
    def __init__(self, input_shape, n_actions):
        super(LSTM, self).__init__()

        self.lstm = nn.LSTM(input_shape, 12)
        self.hidden2tag = nn.Linear(12, n_actions)

    def forward(self, x):
        out = self.lstm(x)
        out = self.hidden2tag(out)
        return out


state = [(1,2,3,4,5),(2,3,4,5,6),(3,4,5,6,7),(4,5,6,7,8),(5,6,7,8,9),(6,7,8,9,0)]

device = torch.device("cuda")
net = LSTM(5, 3).to(device)

state_v = torch.FloatTensor(state).to(device)

q_vals_v = net(state_v.view(1, state_v.shape[0], state_v.shape[1]))
_, action = int(torch.max(q_vals_v, dim=1).item())

然后返回这个错误:

Traceback (most recent call last):
  File "/home/dikkerj/Documents/PycharmProjects/LSTMReactor/QuestionStackoverflow.py", line 26, in <module>
    q_vals_v = net(state_v.view(1, state_v.shape[0], state_v.shape[1]))
  File "/home/dikkerj/.local/lib/python3.5/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/dikkerj/Documents/PycharmProjects/LSTMReactor/QuestionStackoverflow.py", line 15, in forward
    out = self.hidden2tag(out)
  File "/home/dikkerj/.local/lib/python3.5/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/dikkerj/.local/lib/python3.5/site-packages/torch/nn/modules/linear.py", line 55, in forward
    return F.linear(input, self.weight, self.bias)
  File "/home/dikkerj/.local/lib/python3.5/site-packages/torch/nn/functional.py", line 1022, in linear
    if input.dim() == 2 and bias is not None:
AttributeError: 'tuple' object has no attribute 'dim'

有人知道怎么解决吗？ (摆脱张量是一个元组，以便它可以被送入 LSTM 网络)

最佳答案

pytorch LSTM 返回一个元组。
所以你会得到这个错误，因为你的线性层 self.hidden2tag 无法处理这个元组。

所以改变:

out = self.lstm(x)

到

out, states = self.lstm(x)

这将修复您的错误，方法是拆分元组，使 out 只是您的输出张量。

out 然后存储隐藏状态，而 states 是另一个包含最后隐藏状态和单元格状态的元组。

你也可以在这里看看:
https://pytorch.org/docs/stable/nn.html#torch.nn.LSTM

您将在最后一行收到另一个错误，因为 max() 也返回一个元组。但这应该很容易修复并且是不同的错误:)

关于python - 将输入提供给 Pytorch LSTM 网络时出现 AttributeError : 'tuple' object has no attribute 'dim' ,，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/53032586/

python - 将输入提供给 Pytorch LSTM 网络时出现 AttributeError : 'tuple' object has no attribute 'dim' ,

上一篇：python - 矩阵中列的最大值列表(没有 Numpy)

下一篇：python - 需要在 bash 中传递 python 数组中的对象