javascript - 用 Selenium 刮4

标签 javascript python selenium xpath web-scraping

这是我试图抓取的 HTML 代码:

<div class="vbseo_liked">
<a href="http://www.jamiiforums.com/member.php?u=8355" rel="nofollow">Nyaralego</a>
,
<a href="http://www.jamiiforums.com/member.php?u=8870" rel="nofollow">Sikonge</a>
,
<a href="http://www.jamiiforums.com/member.php?u=8979" rel="nofollow">Ab-Titchaz</a>
and
<a onclick="return vbseoui.others_click(this)" href="http://www.jamiiforums.com/kenyan-news/225589-kenyan-and-tanzanian-surburbs.html#">11 others</a>
like this.
</div>

这是我到目前为止的代码:

from selenium import webdriver
from selenium.webdriver.common.keys import Keys


path_to_chromedriver = '/Users/myMac/Desktop/chromedriver' # change path as needed
#browser = webdriver.Chrome(executable_path = path_to_chromedriver)
browser= webdriver.PhantomJS()
browser.set_window_size(1120, 550)
#browser.get("http://www.jamiiforums.com/login.php?do=logout&logouthash=1427579705-9d07da3339a5f07bb3a4c9d1001b75dc0431a138")
browser.get("http://www.jamiiforums.com/kenyan-news/225589-kenyan-and-tanzanian-surburbs.html")
username = browser.find_element_by_id("navbar_username")
password = browser.find_element_by_name("vb_login_password_hint")

username.send_keys("")
password.send_keys("")

browser.find_element_by_class_name("loginbutton").click()
link = browser.find_element_by_xpath('//div[@class="vbseo_liked"]/a[contains(@onclick, "return vbseoui.others_click(this)")]')
link.click()

print 'succes!!'
browser.close()

我希望能够点击链接:

<a onclick="return vbseoui.others_click(this)" href="http://www.jamiiforums.com/kenyan-news/225589-kenyan-and-tanzanian-surburbs.html#">11 others</a>

单击链接后,然后在屏幕上打印结果。

最佳答案

登录后发生重定向。您需要等待重定向发生才能获取页面源。这是对我有用的:

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

browser = webdriver.PhantomJS()
browser.set_window_size(1120, 550)
browser.get("http://www.jamiiforums.com/kenyan-news/225589-kenyan-and-tanzanian-surburbs.html")

username = browser.find_element_by_id("navbar_username")
password = browser.find_element_by_name("vb_login_password_hint")

username.send_keys("username")
password.send_keys("password")

browser.find_element_by_class_name("loginbutton").click()

wait = WebDriverWait(browser, 10)
wait.until(EC.visibility_of_element_located((By.XPATH, '//h2[contains(., "Redirecting")]')))
wait.until(EC.title_contains('Kenyan & Tanzanian'))

print(browser.page_source)

print 'success!!'
browser.close()

关于javascript - 用 Selenium 刮4,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/29335519/

相关文章:

javascript - 迭代并读取 json 响应

javascript - 嵌套函数的用途或优点是什么?

python - 在有 GIL 的情况下,您可以在 Python 中竞争条件吗?

python - 使用 selenium 从 Highcharts 抓取数据

javascript - 如何从二维(嵌套)数组中删除数组?

javascript - 为什么在此代码中使用 “n--” 运算符会引发错误?

python - 如何使用 pandas python 绘制堆积条形图

python - 如何使用 Django 验证器将日期限制为 1900 或更长

java - 如何在 Jenkins 上自动重新运行失败的 TestNG 测试

c# - 如何使用 Selenium Webdriver (C#) 在 Chrome 中处理浏览器身份验证?