我正在解析 RSS 提要,其中 <description>
标签包含一些像这样的\n 行字符
<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
<item>
<guid>http://www.horseandhound.co.uk/blogs/7619/314307.html</guid>
<title>Chloe Chubb's s<title>Chloe Chubb's showing blog: Horse of the Year Show (HOYS) preparations</title>
<description>So it is a week before HOYS and I have somehow managed to stay a bit oblivious to it all. It is only when I log onto Facebook and see the many, many statuses talking about it that I realise it’s only just around the corner! Then the butterflies begin.
<br />
/>
My pre-HOYS diet was going well until last weekend when me and my best mates headed for a weekend away by the beach. Cue huge amounts of cheese, wine and cake and with the 5 pounds I’d lost, 2 went straight back on.&#8230;</description>
<url>http://www.horseandhound.co.uk/blogs/7619/314307.html</url>
</item>
</channel>
</rss>
在这里,我能够解析第一行“some text in line1..”之前的文本,但从下一行开始,剩余的文本不会被解析。我正在使用 Dom 解析器。请建议如何解决此问题。
最佳答案
您可以使用实体
来表示 XML 属性中的换行符。
可用于表示回车。
Windows 风格的 CRLF 可以表示为
。
参见XML spec了解更多详情。
关于java - 如何解析 XML 标签中的换行符?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/12853239/