Python regex: replace a letter if it is not a part of the word in a list

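Suppose I have a list of words like [cat,hat,mat,ate], and I want to remove every letter a from a string such as acatbatmate, giving catbtmate (that is, remove a only where it is not part of a word in the list).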

Currently I do this by splitting the string on the words in the list, using the following code:

import re
# The capturing group makes re.split keep the listed words in the result.
''.join([word.replace('a','')
         if word not in ['cat','hat','mat','ate']
         else word for word in re.split('(cat|hat|mat|ate)','acatbatmate') ])

Can I use re.sub(pattern, repl, string) to remove the letter a directly?

Best answer

You can do this easily with re, as follows:

import re
except_contexts = ['cat','hat','mat','ate']
print(re.sub(r'({})|a'.format("|".join(except_contexts)), lambda x: x.group(1) if x.group(1) else '', 'acatbatmate'))
# => catbtmate

See the Python 2 demo.

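If you are using Python 3.5+, a plain backreference is even simpler, because an unmatched group in the replacement string is replaced with an empty string: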

import re
except_contexts = ['cat','hat','mat','ate']
print(re.sub(r'({})|a'.format("|".join(except_contexts)), r'\1', 'acatbatmate'))

However, if you intend to replace that a with something else rather than delete it, you need to use the lambda expression.
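As a minimal sketch of that replacement case: the helper name replace_outside_words and the '_' replacement character below are illustrative assumptions, not part of the original answer. The lambda returns group 1 unchanged when a listed word matched, and the replacement otherwise.

```python
import re

except_contexts = ['cat', 'hat', 'mat', 'ate']
pattern = re.compile(r'({})|a'.format('|'.join(except_contexts)))

def replace_outside_words(text, replacement):
    # Hypothetical helper: keep a listed word (group 1) as it is; any other
    # match is a standalone "a", which is swapped for `replacement`.
    return pattern.sub(lambda m: m.group(1) if m.group(1) else replacement, text)

result = replace_outside_words('acatbatmate', '_')
print(result)
```

Passing an empty string as the replacement reproduces the deletion behaviour from the first snippet.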

Details

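r'({})|a'.format("|".join(except_contexts)) produces the regex (cat|hat|mat|ate)|a. It matches cat, hat, and so on, capturing them into group 1; when that group matched, we replace the match with the group's contents. Otherwise the match is a lone a, and we replace it with either an empty string or the desired replacement.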
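To see which branch of the alternation fires at each position, the matches can be listed with re.finditer. This is only an illustrative sketch; the use of re.escape is an added precaution (an assumption, not in the original answer) for word lists that might contain regex metacharacters.

```python
import re

except_contexts = ['cat', 'hat', 'mat', 'ate']
# re.escape is an added safeguard; it leaves these plain words unchanged.
pattern = r'({})|a'.format('|'.join(re.escape(w) for w in except_contexts))

# Record (start index, full match, group 1) for every match.
# Group 1 is None when the bare "a" branch matched instead of a listed word.
matches = [(m.start(), m.group(0), m.group(1))
           for m in re.finditer(pattern, 'acatbatmate')]

for start, text, word in matches:
    print(start, text, 'kept' if word else 'replaced')
```

Because the listed words come first in the alternation, the regex engine tries them before falling back to the bare a at each position, which is why the a inside cat and mat is never matched on its own.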

See the regex demo.

Source: the Stack Overflow question "Python regex: replace a letter if it is not a part of the word in a list", https://stackoverflow.com/questions/50743216/
