我想解析 1 个 url 和后记,我想从中访问一些数据。
try {
Document doc = Jsoup.connect("http://abc.com/en/currency/default.aspx").get();//abc is for example as i cant put site name
Elements td = doc.select("ctl00_ContentPlaceHolder1_currencylist_rptCurrencyList_ctl01_trList"); //this is the name of table row in html page i will show html page snippet also
String temp=td.val();
info.setText(temp);
}
catch (IOException e) {
// TODO Auto-generated catch block
e.printStackTrace();
}
我要解析的html页面 fragment 如下
<tr id="ctl00_ContentPlaceHolder1_currencylist_rptCurrencyList_ctl01_trList">
<td width="400px" class="CurrencyListItems">
UK POUND
</td>
<td width="60px;" class="CurrencyListItemsIN" align="center">
5.72
</td>
<td width="150px;" class="CurrencyListItemsLast">
<table cellspacing ="0" cellpadding ="0" width="100%">
<tr>
<td class="CurrencyListBANKNOTES" align="center">
5.625
</td>
<td class="CurrencyListBANKNOTES2" width="75px" align="center">
5.75
</td>
</tr>
</table>
</td>
我想从上面的 html 英镑,5.625,5.75 我试过上面的代码,但问题是它不是解析 URL,只是如果尝试它就出来了
最佳答案
试试这个:
Element tr = doc.getElementById("ctl00_ContentPlaceHolder1_currencylist_rptCurrencyList_ctl01_trList");
尝试
String contents = tr.text().trim();
contents = contents.replaceAll("\\s+"," ");
contents = contents. replaceAll("\\<.*?>","-");
String []values = contents.split("-");
或
Elements elements = tr.select("*");
for (Element element : elements) {
System.out.println(element.ownText());
}
关于android - 解析html页面,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/8278398/