python - Beautifulsoup 结果到 Pandas 数据框

标签 python pandas beautifulsoup

下面的代码返回一个,结果如下

r = requests.get(url)
soup = bs4.BeautifulSoup(r.text, 'lxml')

mylist = soup.find(attrs={'class': 'table_grey_border'})
print(mylist)

结果 - 它延伸了 1700 行

<table cellpadding="0" cellspacing="2" class="table_grey_border" width="100%">
<tr valign="top">
<td class="verd_black12" width="18%"><b>STOCK CODE</b></td>
<td class="verd_black12" width="42%"><b>NAME OF LISTED SECURITIES</b></td>
<td class="verd_black12" width="19%"><b>BOARD LOT</b></td>
<td class="verd_black12" colspan="4" width="12%"><b>REMARK</b></td>
</tr>
<tr class="tr_normal">
<td class="verd_black12" width="18%">00001</td>
<td class="verd_black12" width="42%"><a href="../../../invest/company/profile_page_e.asp?WidCoID=00001&amp;WidCoAbbName=&amp;Month=&amp;langcode=e" target="_parent">CKH HOLDINGS</a></td>
<td class="verd_black12" width="19%">500</td>
<td align="center" class="verd_black12" width="3%">#</td>
<td align="center" class="verd_black12" width="3%">H</td>
<td align="center" class="verd_black12" width="3%">O</td>
<td align="center" class="verd_black12" width="3%">F</td>
</tr>
<tr class="tr_normal">
<td class="verd_black12" width="18%">00002</td>
<td class="verd_black12" width="42%"><a href="../../../invest/company/profile_page_e.asp?WidCoID=00002&amp;WidCoAbbName=&amp;Month=&amp;langcode=e" target="_parent">CLP HOLDINGS</a></td>
<td class="verd_black12" width="19%">500</td>
<td align="center" class="verd_black12" width="3%">#</td>
<td align="center" class="verd_black12" width="3%">H</td>
<td align="center" class="verd_black12" width="3%">O</td>
<td align="center" class="verd_black12" width="3%">F</td>
</tr>
...

我的问题是,如何将这些行中的每一行放入 Pandas Dataframe 中?我尝试了下面的代码,但返回错误

a = pandas.read_html(mylist)
print(a)

错误

TypeError: 'NoneType' object is not callable

最佳答案

Document :

pandas.read_html(url, attrs={'class': 'table_grey_border'})

关于python - Beautifulsoup 结果到 Pandas 数据框,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/42050796/

相关文章:

python - 帮助python继承

python - 在python中将字符串转换为小写的简单方法

python - 将函数作为命名参数传递以进行条件检查

python - Pandas 对给定开始和结束日期的时间序列进行重采样

python - 检查字符串中是否存在特定子字符串(存在于数据帧的行中)

python - 在 Jupyter Notebook 中使用 BeautifulSoup 抓取表

python - BeautifulSoup 转换成 python 换行符

python - 导致 pandas KeyError 的大索引值

python - 如何在 Pandas 数据框中生成字母数字固定长度列

python - 从网页表格中抓取值