python - Pytesseract 输出与图像中的文本完全不同

标签 python python-imaging-library textblob python-tesseract

以下代码在我 friend 的 Windows 笔记本电脑上完美运行,但在我的笔记本电脑上,它会输出完全垃圾。我认为这可能与图书馆有关,但我不能肯定地说?你们能告诉我它是否适合您或者您以前是否曾经发生过这种情况吗?

import pytesseract
from PIL import Image, ImageEnhance, ImageFilter
from textblob import TextBlob

originalImage = Image.open("pictures\hp.png")
workingImage = originalImage.copy()
workingImage = originalImage.filter(ImageFilter.MedianFilter())

enhanceSharpness = ImageEnhance.Sharpness(workingImage)
enhanceColor = ImageEnhance.Color(workingImage)

workingImage = enhanceSharpness.enhance(2)
workingImage = enhanceColor.enhance(0)
workingImage = workingImage.convert('1')

text = pytesseract.image_to_string(workingImage)
correctedText = TextBlob(text).correct()

print(correctedText)

这是我们提供给它的图片:enter image description here

与在我 friend 的计算机上打印照片中的文本不同,我的计算机打印以下内容:

of was an am: came: mm: mmm mm bu noma of arm 523w at"
mmmhmg peaches? w 3 cm {mamma a map m: K wm} Or Mummy
didn‘c realm when Am had cm then he mmm Am: Ma mad m

Am!» adam Them was 3 W317); Ll them on arm mmm a‘met
Turn mm "mm wasn't a mm: m 313% Ham sum Am have. term mmm
M‘) It must had: {mm 3 mm fiftht‘ High: Or Him‘cy [asked and
hated :31 me cm of Taxed back Of I Mummy time mound! m:

come and up in: may M maxim} tin: ca: m Am mm‘m h Am raw
made m: mm mm mid and Run“ w Of iwkmg at me 5 Is
cmricbfl mad may or $$ka Or [)unity gave hrmsuifa hut: wake and
put 11% cm my m“ Am Am} Is he rim mere own M {high W
msihmg except & Large may of'dry‘b M. M h my m E“ that thy

最佳答案

我下载了 tesseract-ocr 4.0 版,现在可以完美运行了。感谢大家的帮助!

关于python - Pytesseract 输出与图像中的文本完全不同,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/54724551/

相关文章:

python - numpy 字符数组到字符串

python - group_by 语句、sqlalchemy 和 postgresql 的问题

python-3.x - ValueError:图 block 无法扩展到图像外部,无法处理图像

python - Pickle 可以处理大于我机器上安装的 RAM 的文件吗?

python - 如何使用 pandas 替换所有列中的所有字符串?

Python ElementTree 重复检查器

python - 使用 PIL 为 Django 网站旋转通过电子邮件上传的图像

python - 缩小图像时保留时间戳

sentiment-analysis - Textblob 情感算法

python - 用于情感分析的 nltk NaiveBayesClassifier 训练