python - urllib - 从 Python2 更新到 Python3

标签 python urllib

我已经尝试适应 the following script .我得到了接下来的内容。

#!/usr/bin/python3

import re
import csv

import urllib.request, urllib.parse

class Spreadsheet(object):
    def __init__(self, key):
        super(Spreadsheet, self).__init__()
        self.key = key

class Client(object):
    def __init__(self, email, password):
        super(Client, self).__init__()
        self.email = email
        self.password = password

    def _get_auth_token(self, email, password, source, service):
        url = "https://www.google.com/accounts/ClientLogin"
        params = {
            "Email": email, "Passwd": password,
            "service": service,
            "accountType": "HOSTED_OR_GOOGLE",
            "source": source
        }
        req = urllib.request.Request(url, urllib.parse.urlencode(params))
        return re.findall(r"Auth=(.*)", urllib.request.urlopen(req).read())[0]

    def get_auth_token(self):
        source = type(self).__name__
        return self._get_auth_token(self.email, self.password, source, service="wise")

    def download(self, spreadsheet, gid=0, format="csv"):
        url_format = "https://spreadsheets.google.com/feeds/download/spreadsheets/Export?key=%s&exportFormat=%s&gid=%i"
        headers = {
            "Authorization": "GoogleLogin auth=" + self.get_auth_token(),
            "GData-Version": "3.0"
        }
        req = urllib.request.Request(url_format % (spreadsheet.key, format, gid), headers=headers)
        return urllib.request.urlopen(req)

if __name__ == "__main__":
    email = "xxx" # (your email here)
    password = "yyyy"
    spreadsheet_id = "zzz" # (spreadsheet id here)

    # Create client and spreadsheet objects
    gs = Client(email, password)
    ss = Spreadsheet(spreadsheet_id)

    # Request a file-like object containing the spreadsheet's contents
    print(gs.download(ss).read())

我的问题是出现以下错误。

Traceback (most recent call last):
  File "/Users/test.py", line 54, in <module>
    print(gs.download(ss).read())
  File "/Users/test.py", line 38, in download
    "Authorization": "GoogleLogin auth=" + self.get_auth_token(),
  File "/Users/test.py", line 33, in get_auth_token
    return self._get_auth_token(self.email, self.password, source, service="wise")
  File "/Users/test.py", line 29, in _get_auth_token
    return re.findall(r"Auth=(.*)", urllib.request.urlopen(req).read())[0]
  File "/Library/Frameworks/Python.framework/Versions/3.2/lib/python3.2/urllib/request.py", line 138, in urlopen
    return opener.open(url, data, timeout)
  File "/Library/Frameworks/Python.framework/Versions/3.2/lib/python3.2/urllib/request.py", line 364, in open
    req = meth(req)
  File "/Library/Frameworks/Python.framework/Versions/3.2/lib/python3.2/urllib/request.py", line 1052, in do_request_
    raise TypeError("POST data should be bytes"
TypeError: POST data should be bytes or an iterable of bytes. It cannot be str.

问题来自方法_get_auth_token中的urllib.request.urlopen(req)。有办法解决这个问题吗?

最佳答案

是的,在发布之前将您的数据编码为字节:

req = urllib.request.Request(url, urllib.parse.urlencode(params).encode('ASCII'))

我在这里假设您的数据是纯 ASCII 的(电子邮件地址通常是,大概您的密码也是)。

关于python - urllib - 从 Python2 更新到 Python3,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/17658804/

相关文章:

python - 将 Html 页面中的数据获取到 Python 数组中

python - 修复 Python requests.exception.InvalidURL : Invalid percent-escape sequence 'u2' error?

将图像下载为 numpy 数组时 Python 崩溃

http-headers - HTTP 基本身份验证不适用于 Python 3

python - 在 Flask 中使用 Redis 跟踪在线用户

python - 使用当前路径从终端打开 Pycharm

python - 在 arm64 上使用 numpy 和 pandas 构建 docker 时出现问题

python - 为用户模型定义两个不同的扩展

python - Python/Django 中未捕获 urllib HTTPError

python - 为什么在目录中循环访问.wav文件时显示错误,但在不循环时工作正常?