I have a list of dictionaries like this:
d = [{'email': u'[email protected]', 'name': u'xxx'}, {'email': u'[email protected]', 'name': u'xxx1'}, {'email': u'[email protected]', 'name': u'xxx2'}]
I want to remove the u (unicode) prefix, so my output should look like this:
d = [{'email': '[email protected]', 'name': 'xxx'}, {'email': '[email protected]', 'name': 'xxx1'}, {'email': '[email protected]', 'name': 'xxx2'}]
Can anyone tell me how to do this?
Best answer
You can also encode the unicode strings as UTF-8, for example:
In [2]: d = [{'email': u'[email protected]', 'name': u'xxx'}, {'email': u'[email protected]', 'name': u'xxx1'}, {'email': u'[email protected]', 'name': u'xxx2'}]
In [3]: new_d = [{k: v.encode("utf-8") for k, v in elem.items()} for elem in d]
In [4]: new_d
Out[4]:
[{'email': '[email protected]', 'name': 'xxx'},
 {'email': '[email protected]', 'name': 'xxx1'},
 {'email': '[email protected]', 'name': 'xxx2'}]
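The comprehension above assumes every value is a unicode string; a non-string value such as an integer has no encode method and would raise AttributeError. Below is a minimal sketch of a more tolerant variant. The helper name encode_values and the sample data (including the age field) are hypothetical and not part of the original answer. Note also that the session above is Python 2 output: on Python 3 every str is already unicode and prints without a u prefix, so encoding there yields bytes objects that print with a b prefix instead.

```python
def encode_values(records, encoding="utf-8"):
    """Return a copy of records with text values encoded to byte strings.

    Hypothetical helper: values that are not text (ints, None, ...) are
    copied through unchanged instead of raising AttributeError.
    """
    try:
        text_type = unicode  # Python 2: the type displayed with the u prefix
    except NameError:
        text_type = str  # Python 3: every str is already unicode
    return [
        {k: (v.encode(encoding) if isinstance(v, text_type) else v)
         for k, v in record.items()}
        for record in records
    ]


# Illustrative data only; the age field is an assumption added for the demo.
sample = [{"name": u"xxx", "age": 30}, {"name": u"xxx1", "age": None}]
encoded = encode_values(sample)
```

The function builds new dictionaries rather than modifying the input, so the original list d stays available with its unicode values if other code still needs them.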
Regarding python - How to remove ('u') unicode from a list of dictionaries?, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/23403015/