java - ITextRenderer.setDocumentFromString(html) 导致 javax.xml.transform.TransformerException

标签 java html xml pdf flying-saucer

我正在使用 JSoup 转换一些 HTML。生成最终的 HTML 后,我想将其呈现为 pdf 文件。为此,我使用 Flying Sacer 库。

我尝试渲染的 Html 示例:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head> 
<title></title> 
<meta http-equiv="X-UA-Compatible" content="IE=edge" /> 
<meta name="viewport" content="width=device-width, initial-scale=1" /> 
...

将html渲染为pdf的方法:

private void createPdfFile(String html, String fileName) throws IOException, DocumentException {
    ITextRenderer renderer = new ITextRenderer();

    renderer.setDocumentFromString(html);
    renderer.layout();

    FileOutputStream fos = new FileOutputStream(fileName);
    renderer.createPDF( fos );
    fos.close();
    System.out.println( "File 2: '" + fileName + "' created." );
    }

错误日志:

org.xhtmlrenderer.util.XRRuntimeException: Can't load the XML resource (using TRaX transformer). org.xml.sax.SAXParseException; lineNumber: 1; columnNumber: 64; The system identifier must begin with either a single or double quote character.
    at org.xhtmlrenderer.resource.XMLResource$XMLResourceBuilder.createXMLResource(XMLResource.java:191)
    at org.xhtmlrenderer.resource.XMLResource.load(XMLResource.java:75)
    at org.xhtmlrenderer.pdf.ITextRenderer.setDocumentFromString(ITextRenderer.java:157)
    at main.Main.createPdfFile(Main.java:106)
    at main.Main.getEmailAsset(Main.java:97)
    at main.ServletMain.doGet(ServletMain.java:52)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:624)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:731)
    at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:303)
    at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
    at org.apache.tomcat.websocket.server.WsFilter.doFilter(WsFilter.java:52)
    at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:241)
    at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
    at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:220)
    at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:122)
    at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:505)
    at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:170)
    at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:103)
    at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:950)
    at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:116)
    at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:423)
    at org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1079)
    at org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:620)
    at org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:316)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at org.apache.tomcat.util.threads.TaskThread$WrappingRunnable.run(TaskThread.java:61)
    at java.lang.Thread.run(Unknown Source)
Caused by: javax.xml.transform.TransformerException: org.xml.sax.SAXParseException; lineNumber: 1; columnNumber: 64; The system identifier must begin with either a single or double quote character.
    at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transform(Unknown Source)
    at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transform(Unknown Source)
    at org.xhtmlrenderer.resource.XMLResource$XMLResourceBuilder.createXMLResource(XMLResource.java:189)
    ... 27 more
Caused by: org.xml.sax.SAXParseException; lineNumber: 1; columnNumber: 64; The system identifier must begin with either a single or double quote character.
    at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(Unknown Source)
    at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transformIdentity(Unknown Source)
    ... 30 more

知道为什么会发生这种情况吗?

最佳答案

您需要将 DTD URI 用引号括起来 - 您当前不平衡:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">

在现有代码中,您缺少 http://www.w3... 之前的 "

关于java - ITextRenderer.setDocumentFromString(html) 导致 javax.xml.transform.TransformerException,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/30209462/

相关文章:

java - Spring Boot - log4j2.properties 创建日志文件但不将日志写入文件

iphone - 创建适用于 ipad 和 iphones 的动态 list 文件

javascript - 带有可选 JavaScript 的 CSS 模态弹出窗口

XML - 引用数字属性

java - Netbeans 评论测试用例仍在运行

java - 使用 UCanAccess 通过 JDBC 获取数据库名称

java - 为什么这个构造函数没有给我正确的答案

javascript - 如何检查 ionic 本地存储中的 null 或未定义值并重定向到登录页面?

xml - 如何使用 bash 脚本编辑 XML?

xml - 我如何从 Google Maps API 获取行车时间?