I am trying to download files (images and documents) from a website I am scraping. I have to save them to a specific folder. So far I have:
images = re.findall("/([^/]+\.(?:jpg|gif|png))", html)
output = open("output.txt","a+")
output.write("\n" + f"[+] {len(images)} Images Found:" + "\n")
for images in images:
    output.write(images + "\n")
    output.write("Beginning file download with urllib2..." + "\n")
    imageurl = "images"
    urllib.request.urlretrieve(url, "/downloads")
How can I keep each downloaded file's name and file type the same as they are on the website?
This is just the snippet that handles images.
Best answer
You can pass the output file name as the second argument to urllib.request.urlretrieve:
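import os
import re
import urllib.request

# html and url come from the scraping code that precedes this snippet
images = re.findall(r"/([^/]+\.(?:jpg|gif|png))", html)
with open("output.txt", "a+") as output:
    output.write(f"\n[+] {len(images)} Images Found:\n")
    for imagename in images:  # a separate name, so the list is not overwritten
        output.write(imagename + "\n")
        output.write("Beginning file download with urllib...\n")
        # url must be the full address of this image; the second argument
        # is the local path, which keeps the original file name
        urllib.request.urlretrieve(url, os.path.join("/downloads", imagename))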
Just set the imagename variable to the image's file name, for example image.png.
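To also report whether each download succeeded, the idea can be sketched roughly as follows. This is only a minimal sketch, not the answer's original code: the helper names local_path_for and download_all are hypothetical, and it assumes you already have a list of full file URLs rather than bare file names.

```python
import os
import urllib.request
from urllib.parse import unquote, urlsplit


def local_path_for(file_url, folder):
    """Build a local path in `folder` that keeps the file name from the URL."""
    # Take the last path segment of the URL, ignoring any query string.
    name = os.path.basename(unquote(urlsplit(file_url).path))
    if not name:
        raise ValueError(f"no file name in URL: {file_url}")
    return os.path.join(folder, name)


def download_all(file_urls, folder):
    """Download each URL into `folder`; return a {url: succeeded} dict."""
    os.makedirs(folder, exist_ok=True)
    results = {}
    for file_url in file_urls:
        try:
            urllib.request.urlretrieve(file_url, local_path_for(file_url, folder))
            results[file_url] = True
        except (OSError, ValueError) as exc:
            # URLError and HTTPError are subclasses of OSError.
            print(f"[-] Failed: {file_url} ({exc})")
            results[file_url] = False
    return results
```

For instance, download_all(["https://example.com/img/logo.png"], "downloads") would try to save the file as downloads/logo.png (example.com is a placeholder address) and return a dict mapping that URL to True or False, which you can then write to output.txt.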
Hope this helps.
Regarding "python-3.x - Trying to download multiple files in Python and report whether they succeeded", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/47578628/