嘿,我是这项工作的新手,我在编译代码时遇到此错误:
OCR ocr = new OCR();
PDFReader reader = new PDFReader(new File("C:\\Users\\pc\\Downloads\\chk1.pdf"));
reader.open(); // open the file.
int pages = reader.getNumberOfPages();
for(int i=0; i<pages; i++) {
BufferedImage image = reader.getPageAsImage(i); /////null pointer exception here
System.out.println("OCR result:\n" + ocr.recognizeCharacters(image));
}
reader.close(); // finally, close the file.
错误是:
java.lang.NullPointerException
at org.pdfbox.util.operator.pagedrawer.Invoke.a(Unknown Source)
at com.asprise.util.pdf.as.a(Unknown Source)
at com.asprise.util.pdf.as.b(Unknown Source)
at com.asprise.util.pdf.as.a(Unknown Source)
at com.asprise.util.pdf.gV.a(Unknown Source)
at com.asprise.util.pdf.G.l(Unknown Source)
at com.asprise.util.pdf.PDFReader.getPageAsImage(Unknown Source)
at file.tracker.threads.PDFFilerConverter.gotoRead(PDFFilerConverter.java:94)
at file.tracker.threads.PDFFilerConverter.run(PDFFilerConverter.java:60)
at java.lang.Thread.run(Thread.java:744)
有人可以帮我摆脱它吗?
最佳答案
试试这个
for(int i=0; i<pages; i++) {
String txt =reader.extractTextFromPage(i);
System.out.println(“Text result:\n” + txt);
}
关于java - 使用 Asprise 和 Java 对 PDF 进行 OCR,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/21986641/