python - BS4 中出现奇怪的错误。 find_all() 返回 None

我正在使用 BS4 和 PhantomJS 来抓取网站。在 Mac 上一切正常，但在 Windows 上我遇到了一个奇怪的错误:find_all() 返回 None，但元素存在!

我的代码:

def get_venues():
    driver = webdriver.PhantomJS(executable_path = path)
    url=web+'#/racing'
    driver.get(url)
    try:
        wait = WebDriverWait(driver,10).until(EC.presence_of_element_located((By.CLASS_NAME, "wrapper")))
    finally:
        content=driver.page_source
        soup=bs4.BeautifulSoup(content, "html5")
        driver.quit()

    b = soup.find(id='content').div
    print(b)
    c = b.ul(attrs={'class': 'main-list'})

    print(c)

和 c 是 None，不应该是 b 那样的情况:

</ul></div><ul class="main-list"><li><div class="collapsible R"><div class="icon race_code_R"></div><span>Thoroughbreds</span><div class="arrow_down_sign"></div></div><ul class="sub-list"><li class="venue cell"><a href="#/meetings/19197">
  <span class="location">Beaudesert</span>
  <div class="goto-sign"></div>
</a>
<a class="next-race" href="#/races/181880/exchange/win">
  <span class="time-left critical">-30m</span>
  <span class="number">R5</span>
</a>
</li><li class="venue cell"><a href="#/meetings/19199">
  <span class="location">Werribee</span>
  <div class="goto-sign"></div>
</a>
<a class="next-race" href="#/races/181900/exchange/win">
  <span class="time-left critical">-38s</span>
  <span class="number">R7</span>
</a>
</li><li c

最佳答案

将代码从 mac 转移到 windows 机器的问题是，在编码文件时使用略有不同的 utf-8 值，如果不被捕获，可能会破坏你的程序，所以我在这里最好的猜测(我不是 python 专家)是不是从你的 Mac 到你的 PC 的过程中，UTF-8 字符发生了变化，现在你的整个程序都被淘汰了，解决这个问题的一种方法是在基于 Windows 的 IDE/编译器/文本编辑器中从头开始重建它希望这有帮助

关于python - BS4 中出现奇怪的错误。 find_all() 返回 None，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/27032991/

python - BS4 中出现奇怪的错误。 find_all() 返回 None

上一篇：python - 如何使用 django for python 获取正在访问我的网站的 IP 地址

下一篇：Python:启动类方法时线程不起作用