android - Google Cloud Vision - 无法在包含 2 列文本的图像上提供良好的 OCR

标签 android json google-cloud-platform ocr google-cloud-vision

当我将双列文本图像传递给谷歌云视觉 API 的 TEXT_DETECTION/DOCUMENT_TEXT_DETECTION 时,双列文本的输出顺序不正确,因为它从第一列取一行,然后从另一列取下一行并附加它。

您可以看到输出结果没有按照双列应有的顺序正确对齐。有没有办法更正来自 google vision API 的结果,或者使用 JSON 文件输出更正结果?

To view input image in JPG[click here]

输出-

6.6 Healthcare access times. [56] impairment of their vision can use this to recognize peo- 6.6 Healthcare access ple within their vicinity without them saying a word.152|| Visual impairment has the ability to create consequences for health and well being. Visual impairment is increasing especially among older people. It is recognized that those 6.5.3 Communication development individuals with visual impairment are likely to have lim- ited access to information and healthcare facilities, and Visual impairment can have profound effects on the de- may not receive the best care possible because not all velopment of infant and child communication. The lan- health care professionals are aware of specific needs re- guage and social development of a child or infant can lated to vision. be very delayed by the inability to see the world around them. . A prerequisite of effective health care could very well be having staff that are aware that people may have problems with vision. Social development Social development includes in- teractions with the people surrounding the infant in the Communication and different ways of being able beginning of its life. To a child with vision, a smile from to communicate with visually impaired clients must a parent is the first symbol of recognition and communi be tailored to individual needs and available at all cation, and is almost an instant factor of communication. For a visually impaired infant, recognition of a parent's voice will be noticed at approximately two months old, but a smile will only be evoked through touch between 7 Epidemiology parent and baby. This primary form of communication is greatly delayed for the child and will prevent other forms of communication from developing. Social interactions The WHO estimates that in 2012 there were 285 million are more complicated because subtle visual cues are miss- visually impaired people in the world, of which 246 mil- ing and facial expressions from others are lost. lion had low vision and 39 million were blind.[4] Due to delays in a child's communication development, of those who are blind 90% live in the developing they may appear to be disinterested in social activity with world. [56] Worldwide for each blind person, an average peers, non-communicative and un-education on how to of 3.4 people have low vision, with country and regional communicate with other people. This may cause the child variation ranging from 2.4 to 5.5.[57] to be avoided by peers and consequently over protected By age: Visual impairment is unequally distributed by family members. across age groups. More than 82% of all people who are blind are 50 years of age and older, although they repre- sent only 19% of the world's population. Due to the ex- anguage development with sight much of what is pected number of years lived in blindness (blind years), learned by a child is learned through imitation of others, childhood blindness remains a significant problem, with where as a visually impaired child needs very planned in an estimated 1.4 million blind children below age 15. struction directed at the development of postponed imi- By gender: Available studies consistently indicate that in tation. A visually impaired infant may jabber and imitate every region of the world, and at all ages, females have a words sooner than a sighted child, but may show delay significantly higher risk of being visually impaired than when combining words to say themselves, the child may males. tend to initiate few questions and their use of adjectives is infrequent. Normally the child's sensory experiences are By geography: Visual impairment is not distributed uni- not readily coded into language and this may cause them formly throughout the world. More than 90% of the to store phrases and sentences in their memory and re world's visually impaired live in developing countries. [57] peat them out of context. The language of the blind child Since the estimates of the 1990s, new data based on the does not seem to mirror their developing knowledge of 2002 global population show a reduction in the number the world, but rather their knowledge of the language of of people who are blind or visually impaired, and those others. who are blind from the effects of infectious diseases, but A visually impaired child may also be hesitant to explore an increase in the number of people who are blind from the world around them due to fear of the unknown and conditions related to longer life spans. 157|| also may be discouraged from exploration by overprotec- In 1987, it was estimated that 598,000 people in the tive family members. Without concrete experiences, the United States met the legal definition of blindness. Is Of child is not able to develop meaningful concepts or the this number, 58% were over the age of 65.581 In 1994- language to describe or think about them. 55] 1995, 1.3 million Americans reported legal blindness. 159|

最佳答案

现在云视觉支持多栏文档,提供更好的阅读顺序。

https://cloud.google.com/vision/docs/fulltext-annotations

关于android - Google Cloud Vision - 无法在包含 2 列文本的图像上提供良好的 OCR,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/47281901/

相关文章:

安卓相机 :How to capture an image automatically by specified time

c# - 我如何在 C# .NET(win7 手机)中读入 'nested' 和 'DataContractJsonSerializer' 的 Json 文件?

javascript - AngularJS $scope 未定义 $scope 内的项目

java - 接收带有 2 个数组的 JSON 对象

google-cloud-platform - 如何授予服务帐户访问特定 key 的权限?

java - 有人可以提供 Cloud RuntimeConfig Google Cloud List Variables 的示例吗

android - "Starting emulator for AVD"然后 panic : could not open. ..”

android - 谷歌播放中的所有快照应用程序如何在没有设备根的情况下工作?

java - 将 SharedPreferences 与 for 循环一起使用

google-cloud-platform - Airflow :触发 DAG 运行时出现重复条目​​ mysql 完整性错误