ruby-on-rails - 无法使用带有 Mechanize 的表单登录

标签 ruby-on-rails ruby screen-scraping nokogiri mechanize

我正在尝试使用 mechanize 提交表单。但是提交时没有任何反应。我刚刚再次获得登录页面。

形式:

http://affilate.mikkelsenmedia.dk/partnersystem/mylogins.php

require 'Mechanize'
agent = WWW::Mechanize.new
agent.get("http://affilate.mikkelsenmedia.dk/partnersystem/mylogins.php")

form = agent.page.forms.first
form.username = 'username'
form.password = 'password'
form.submit

使用浏览器登录时实时 http 日志:

http://affilate.mikkelsenmedia.dk/partnersystem/mylogins.php

POST /partnersystem/mylogins.php HTTP/1.1
Host: affilate.mikkelsenmedia.dk
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.1; da; rv:1.9.2.13) Gecko/20101203 Firefox/3.6.13
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: da,en-us;q=0.7,en;q=0.3
Accept-Encoding: gzip,deflate
Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7
Keep-Alive: 115
Connection: keep-alive
Referer: http://affilate.mikkelsenmedia.dk/partnersystem/mylogins.php
Cookie: XXX
Content-Type: application/x-www-form-urlencoded
Content-Length: 47
username=username&password=password&send=Submit
HTTP/1.1 200 OK
Date: Fri, 18 Feb 2011 17:07:15 GMT
Server: Apache/2.0.63 (CentOS)
X-Powered-By: PHP/5.1.6
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Content-Length: 77
Connection: close
Content-Type: text/html; charset=UTF-8

控制台:

irb(main):001:0> require 'Mechanize'
=> true
irb(main):002:0> agent = Mechanize.new { |agent|
irb(main):003:1*   agent.user_agent_alias = 'Mac Safari'
<"http://affilate.mikkelsenmedia.dk/partnersystem/mylogins.php")
=> #<Mechanize::Page
 {url
  #<URI::HTTP:0x2a1c770 URL:http://affilate.mikkelsenmedia.dk/partnersystem/mylo
gins.php>}
 {meta}
 {title "Partner System - Log-in"}
 {iframes}
 {frames}
 {links}
 {forms
  #<Mechanize::Form
   {name "loginform"}
   {method "POST"}
   {action "mylogins.php"}
   {fields
    #<Mechanize::Form::Text:0x2836680
     @name="username",
     @node=
      #(Element:0x141b3e8 {
        name = "input",
        attributes = [
          #(Attr:0x13e4458 { name = "type", value = "text" }),
          #(Attr:0x13e4440 { name = "name", value = "username" }),
          #(Attr:0x13e43e0 { name = "size", value = "30" })]
        }),
     @value="">
    #<Mechanize::Form::Field:0x2836230
     @name="password",
     @node=
      #(Element:0x141b22c {
        name = "input",
        attributes = [
          #(Attr:0x13ac148 { name = "type", value = "password" }),
          #(Attr:0x13ac13c { name = "name", value = "password" }),
          #(Attr:0x13ac10c { name = "size", value = "30" })]
        }),
     @value="">}
   {radiobuttons}
   {checkboxes}
   {file_uploads}
   {buttons
    #<Mechanize::Form::Submit:0x2835f90
     @name="send",
     @node=
      #(Element:0x141b01c {
        name = "input",
        attributes = [
          #(Attr:0x13702e0 { name = "type", value = "submit" }),
          #(Attr:0x13702d4 { name = "name", value = "send" }),
          #(Attr:0x13702c8 { name = "class", value = "style2" }),
          #(Attr:0x13702bc { name = "value", value = "Submit" })]
        }),
     @value="Submit">}>}>

irb(main):006:0>   form = agent.page.forms.first
irb(main):007:0>   form.username = 'username'
=> "username"
irb(main):008:0>   form.password = 'password'
=> "password"
irb(main):009:0>   form.submit
=> #<Mechanize::Page
 {url
  #<URI::HTTP:0x2a82e78 URL:http://affilate.mikkelsenmedia.dk/partnersystem/mylo
gins.php>}
 {meta}
 {title "Partner System - Log-in"}
 {iframes}
 {frames}
 {links}
 {forms
  #<Mechanize::Form
   {name "loginform"}
   {method "POST"}
   {action "mylogins.php"}
   {fields
    #<Mechanize::Form::Text:0x2a52c50
     @name="username",
     @node=
      #(Element:0x1529694 {
        name = "input",
        attributes = [
          #(Attr:0x1513c14 { name = "type", value = "text" }),
          #(Attr:0x1513c08 { name = "name", value = "username" }),
          #(Attr:0x1513bfc { name = "size", value = "30" })]
        }),
     @value="">
    #<Mechanize::Form::Field:0x2a52998
     @name="password",
     @node=
      #(Element:0x1529550 {
        name = "input",
        attributes = [
          #(Attr:0x15121d4 { name = "type", value = "password" }),
          #(Attr:0x15121c8 { name = "name", value = "password" }),
          #(Attr:0x15121bc { name = "size", value = "30" })]
        }),
     @value="">}
   {radiobuttons}
   {checkboxes}
   {file_uploads}
   {buttons
    #<Mechanize::Form::Submit:0x2a52758
     @name="send",
     @node=
      #(Element:0x152940c {
        name = "input",
        attributes = [
          #(Attr:0x151062c { name = "type", value = "submit" }),
          #(Attr:0x1510614 { name = "name", value = "send" }),
          #(Attr:0x1510608 { name = "class", value = "style2" }),
          #(Attr:0x15105fc { name = "value", value = "Submit" })]
        }),
     @value="Submit">}>}>

irb(main):010:0>

最佳答案

服务器还会检查提交按钮参数“发送”是否存在。

form.submit 之前添加这一行

form.add_field! 'send','Submit'

关于ruby-on-rails - 无法使用带有 Mechanize 的表单登录,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/5044206/

相关文章:

ruby-on-rails - 在 El Capitan 上安装 Nokogiri (1.6.7) 时出错

javascript - 在 Assets 管道中加载订单 JavaScript 文件

javascript将变量添加到输入字段rails

ruby - MongoDB聚合/展开/组/项目查询组合

ruby - 保存来自网站的所有图像文件

python - 如何使用python迭代scrapy中的XML子节点?

ruby-on-rails - 如何将 'Fixnum' 转换为 Rails 3.1.3 中的日期?

ruby - 如何在单行 ruby​​ -e 中随机播放文件

ruby - 从线程循环中脱离出来

java - 如何使用java从网站中提取数据?