Python Glassdoor API

标签 python api

我正在尝试从他们的 Python API 中获取 glassdoor 数据:

import urllib2

id1 = 'x'
key = 'y'
action = 'employers'
company = 'company'

basepath = 'http://api.glassdoor.com/api/api.htm?v=1&format=json&t.p='
url = basepath + id1 + '&t.k=' + key + '&action=' + action + '&q=' + company + '&userip=192.168.43.42&useragent=Mozilla/5.0'

response = urllib2.urlopen(url)
html = response.read()

我收到以下错误:

>>> response = urllib2.urlopen(url)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "//anaconda/lib/python2.7/urllib2.py", line 154, in urlopen
    return opener.open(url, data, timeout)
  File "//anaconda/lib/python2.7/urllib2.py", line 437, in open
    response = meth(req, response)
  File "//anaconda/lib/python2.7/urllib2.py", line 550, in http_response
    'http', request, response, code, msg, hdrs)
  File "//anaconda/lib/python2.7/urllib2.py", line 475, in error
    return self._call_chain(*args)
  File "//anaconda/lib/python2.7/urllib2.py", line 409, in _call_chain
    result = func(*args)
  File "//anaconda/lib/python2.7/urllib2.py", line 558, in http_error_default
    raise HTTPError(req.get_full_url(), code, msg, hdrs, fp)
urllib2.HTTPError: HTTP Error 403: Forbidden

谁能帮忙...?

谢谢

最佳答案

下面是工作代码,通过添加 BeautifulSoup 模块并在 hdr 变量中设置 User-Agent 进行了一些改进。

import urllib2, sys
from BeautifulSoup import BeautifulSoup

url = "http://api.glassdoor.com/api/api.htm?t.p=yourID&t.k=yourkey&userip=8.28.178.133&useragent=Mozilla&format=json&v=1&action=employers&q="
hdr = {'User-Agent': 'Mozilla/5.0'}
req = urllib2.Request(url,headers=hdr)
response = urllib2.urlopen(req)
soup = BeautifulSoup(response)

希望对你有帮助,谢谢

关于Python Glassdoor API,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/30454571/

相关文章:

Python3 快速检查元素是否在元素集合中的方法

修改列表中包含特定数量(不仅仅是字符)的元素的 Pythonic 方法

python - 如何使 QWidget 圆角

c# - 检测用户是否输入假名的服务或策略?

java - 隐藏枚举常量

c - Postgresql 和 C 数据类型

python - 在 Swift 中导入 Python 代码和库

python - AWS Athena - 操作无法分页 : get_query_results

ruby-on-rails - Rails 中的 Google freebusy api 调用无法识别参数

java - 锁在 API 中等待,以防暴露