I want to parse a PDF file and extract some content from it. Can anyone suggest specific Perl modules for this?
Thanks.
Best answer
You could try taking a look at
or
If you are trying to parse text out of the document, then it might not be practical. From CAM::PDF::Text:
This module attempts to extract sequential text from a PDF page. This is not a robust process, as PDF text is graphically laid out in arbitrary order. This module uses a few heuristics to try to guess what text goes next to what other text, but may be fooled easily by, say, subscripts, non-horizontal text, changes in font, form fields etc.
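As a rough illustration of what page-by-page text extraction can look like, here is a minimal sketch. It assumes the CAM::PDF distribution from CPAN is installed; the file name `example.pdf` is hypothetical. As the quoted documentation warns, the order of the extracted text is only a heuristic guess, so treat the output as approximate.

```perl
use strict;
use warnings;
use CAM::PDF;

# Hypothetical input file; pass a real path as the first argument.
my $file = shift @ARGV // 'example.pdf';

# new() returns undef on failure; the reason is kept in $CAM::PDF::errstr.
my $pdf = CAM::PDF->new($file)
    or die "Cannot open $file: $CAM::PDF::errstr\n";

# Pages are numbered starting at 1.
for my $page_number (1 .. $pdf->numPages()) {
    # getPageText() guesses the reading order from the page layout,
    # so the result may be jumbled or empty for some documents.
    my $text = $pdf->getPageText($page_number);
    next unless defined $text && length $text;

    print "--- page $page_number ---\n$text\n";
}
```

Run it as `perl extract_text.pl some.pdf`. If the text comes out scrambled, that is the limitation described in the quote above rather than a bug in the script.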
Regarding perl - CPAN Perl module for parsing PDF files, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/9701155/