python - 计算列表中所有元素与单个句子的相似度

我需要量化长列表中所有句子与单个句子的相似度。也许使用 Levenshtein 或 difflib。
然后，我必须删除列表中超出某个给定阈值的句子，例如 difflib 中的 90%。

你们能帮忙吗？谢谢!

最佳答案

>>> mylist = ['ham and eggs', 'spam and legs', "it's time to die, mr bond!"]
>>> import difflib
>>> close_matches = difflib.get_close_matches('spam and eggs', mylist)
>>> close_matches
['spam and legs', 'ham and eggs']
>>> set(mylist) - set(close_matches)
set(["it's time to die, mr bond!"])

关于python - 计算列表中所有元素与单个句子的相似度，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/13696337/

上一篇：python - Qt : Updating UI (with signals) from another class and thread

下一篇：Python3.2垂直链接

相关文章：

python - 为什么 reversed() 消除了线程安全？

Java 在方法调用中声明一个 String 数组

ios - 按钮标题文本不清晰

c - 需要对名称链接列表进行合并排序

sql - 如何测试 int2vector 是否恰好包含一个特定值？

comparison - 如何混淆C++变量和函数

python - 在 Pandas 中，如果较小，如何将当前行项目设置为上一个？

python - 如何处理按钮点击事件

python - 如何在 python 中将 EOF 写入 STDIN popen

python - 将 np 数组添加到现有的 pandas 数据框