python - 使用 BeautifulSoup 解码 html 实体

我正在尝试使用 BeautifulSoup 解码实体，但没有成功。

from BeautifulSoup import BeautifulSoup

decoded = BeautifulSoup("&lt;p&gt; &lt;/p&gt;",convertEntities=BeautifulSoup.HTML_ENTITIES)

print decoded

输出根本没有解码。我在这里找到了很多使用这种方法的答案。我做错了什么吗？

我想为此使用 BeautifulSoup，所以请不要费心告诉我标准库有解码实体的方法。

最佳答案

您需要print decoded.contents :

>>> print decoded
&lt;p&gt; &lt;/p&gt;
>>> print decoded.contents
[u'<p> </p>']

关于python - 使用 BeautifulSoup 解码 html 实体，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/10088318/

相关文章：

python - 子字符串列表与字符串列表的 bool 比较