python - How do I replace a substring within a string in Python?

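I have a very long string that contains img tags with src attributes, and I want to use a regular expression to remove part of each src value.

I tried the following code, but I think there is something wrong with the pattern.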

#!/usr/bin/env python
#encoding: utf-8
import re
url = "<p><img src ='https://xxx.cn/20190504195124718.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2gzNTYzNjM=,size_16,color_FFFFFF,t_70'></img></p><p><img src ='https://xxxx.cn/20190504195124718.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2gzNTYzNjM=,size_16,color_FFFFFF,t_70'></img></p>"

pattern = re.compile(r"https://img-.*(\?x-oss-process.*t_70)")

print(pattern.findall(url))

out = re.sub(pattern, '', url)

print(out)

The first print gives:

['?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2gzNTYzNjM=,size_16,color_FFFFFF,t_70']

The second print gives:

<p><img src =''></img></p>

I want a new string in which the part "?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2gzNTYzNjM=,size_16,color_FFFFFF,t_70" has been removed from every img src, leaving only the base URL, for example "https://xxx.cn/20190504195124718.png".

Like this:

url = "<p><img src ='https://xxx.cn/20190504195124718.png'></img></p><p><img src ='https://xxxx.cn/20190504195124718.png'></img></p>"

How should I write the pattern?

Thanks a lot~

Best answer

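Since you only need to remove part of the string, it is enough to match just that part. Note that the (?#...) at the start of the pattern below is a comment group, not a capturing group: its contents are ignored, so the effective pattern is \?x-oss-process.*?t_70.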

output = re.sub(r"(?#<img.*)\?x-oss-process.*?t_70", '', url)

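The ? added after .* (just before t_70) makes the match non-greedy, so it stops at the first t_70 after each ?x-oss-process. This lets the substitution handle every img tag separately instead of producing one match that runs from the first tag to the last.

From the documentation: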
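The difference between the greedy and non-greedy forms can be seen in a minimal sketch. The string html below is a shortened, hypothetical stand-in for the one in the question (the file names a.png and b.png are made up for illustration):

```python
import re

# Shortened, hypothetical stand-in for the long string in the question.
html = (
    "<p><img src ='https://xxx.cn/a.png?x-oss-process=image/watermark,t_70'></img></p>"
    "<p><img src ='https://xxxx.cn/b.png?x-oss-process=image/watermark,t_70'></img></p>"
)

# Greedy: .* extends to the LAST t_70, so a single match swallows
# everything between the first query string and the end of the second one.
greedy = re.sub(r"\?x-oss-process.*t_70", "", html)

# Non-greedy: .*? stops at the FIRST t_70 after each ?x-oss-process,
# so each img tag is cleaned up on its own.
lazy = re.sub(r"\?x-oss-process.*?t_70", "", html)

print(greedy)  # only the first img tag survives
print(lazy)    # both img tags survive, without their query strings
```

With the greedy pattern the second img tag disappears entirely, which matches the symptom described in the question.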

(?#...)
A comment; the contents of the parentheses are simply ignored.

See the documentation here: https://docs.python.org/2/library/re.html
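As a quick check that the comment group takes no part in matching, the following sketch compares the pattern with and without it. The input string is a shortened, hypothetical example, and the variable names are chosen only for illustration:

```python
import re

# Shortened, hypothetical input with a single img tag.
html = "<img src ='https://xxx.cn/a.png?x-oss-process=image/watermark,t_70'>"

with_comment = r"(?#<img.*)\?x-oss-process.*?t_70"
without_comment = r"\?x-oss-process.*?t_70"

cleaned_with = re.sub(with_comment, "", html)
cleaned_without = re.sub(without_comment, "", html)

# Both patterns remove exactly the same text.
print(cleaned_with == cleaned_without)

# A comment group does not count as a capturing group.
print(re.compile(with_comment).groups)
```

Since both substitutions give the same result, the (?#...) part can be dropped without changing behavior.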

For "python - How do I replace a substring within a string in Python?", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/56007673/
