python - 在图像中找到矩形并提取其中的文本以将其另存为新图像

标签 python opencv image-processing opencv3.0

我是OpenCV的新手,所以我真的需要您的帮助。我有一堆这样的图像:

enter image description here

我需要检测图像上的矩形,从中提取文本部分并将其另存为新图像。

你能帮我吗?

谢谢!

最佳答案

为了增加Danyals的答案,我添加了一个示例代码,并在注释中编写了步骤。对于此图像,您甚至不需要在图像上执行形态学打开。但是通常建议对图像中的此类噪点进行推荐。干杯!

import cv2
import numpy as np

# Read the image and create a blank mask
img = cv2.imread('napis.jpg')
h,w = img.shape[:2]
mask = np.zeros((h,w), np.uint8)

# Transform to gray colorspace and invert Otsu threshold the image
gray = cv2.cvtColor(img,cv2.COLOR_BGR2GRAY)
_, thresh = cv2.threshold(gray,0,255,cv2.THRESH_BINARY_INV+cv2.THRESH_OTSU)

# ***OPTIONAL FOR THIS IMAGE

### Perform opening (erosion followed by dilation)
#kernel = np.ones((2,2),np.uint8)
#opening = cv2.morphologyEx(thresh, cv2.MORPH_OPEN, kernel)

# ***

# Search for contours, select the biggest and draw it on the mask
_, contours, hierarchy = cv2.findContours(thresh, # if you use opening then change "thresh" to "opening"
                                          cv2.RETR_TREE,cv2.CHAIN_APPROX_NONE)
cnt = max(contours, key=cv2.contourArea)
cv2.drawContours(mask, [cnt], 0, 255, -1)

# Perform a bitwise operation
res = cv2.bitwise_and(img, img, mask=mask)

########### The result is a ROI with some noise
########### Clearing the noise

# Create a new mask
mask = np.zeros((h,w), np.uint8)

# Transform the resulting image to gray colorspace and Otsu threshold the image 
gray = cv2.cvtColor(res,cv2.COLOR_BGR2GRAY)
_, thresh = cv2.threshold(gray,0,255,cv2.THRESH_BINARY+cv2.THRESH_OTSU)

# Search for contours and select the biggest one again
_, contours, hierarchy = cv2.findContours(thresh,cv2.RETR_TREE,cv2.CHAIN_APPROX_NONE)
cnt = max(contours, key=cv2.contourArea)

# Draw it on the new mask and perform a bitwise operation again
cv2.drawContours(mask, [cnt], 0, 255, -1)
res = cv2.bitwise_and(img, img, mask=mask)

# If you will use pytesseract it is wise to make an aditional white border
# so that the letters arent on the borders
x,y,w,h = cv2.boundingRect(cnt)
cv2.rectangle(res,(x,y),(x+w,y+h),(255,255,255),1)

# Crop the result
final_image = res[y:y+h+1, x:x+w+1]

# Display the result
cv2.imshow('img', final_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

结果:

enter image description here

关于python - 在图像中找到矩形并提取其中的文本以将其另存为新图像,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/52797615/

相关文章:

python - 在 pyglet 或 PIL/python 中从远程服务器加载图像

visual-c++ - 从 MinGW 的 MSVC DLL 调用函数

c++ - 如何从视频中选择两帧? opencv C++

python - 我想将数据框的行附加为列

python - 防止 matplotlib 有状态

c++ - 如何改进opencv中的特征检测

c++ - 图像处理-大纲

image - 如何组合一幅图像的相位和不同图像的大小?

MATLAB 图像处理 - 查找图像的边缘和区域

python - 有没有办法更新Python字典的值,但如果键不存在则不添加键?