vba - 使用 MSXML2.XMLHTTP 而不是使用 VBA 的 InternetExplorer.Application 登录网站

标签 vba authentication msxml

第一次发帖,

我正在尝试从必须登录的网站页面获取 ID“dadosDoUsuario”。我使用“InternetExplorer.Application”对象使其正常工作,但在使用“MSXML2.5”时无法获取 ID 值。 XMLHTTP”对象。它似乎不会越过登录页面,因为我可以从该页面获取其他 ID(例如:“tituloPagina”)。有人可以提示我登录后如何从页面获取数据吗?谢谢!

InternetExplorer.Application 代码(此代码有效):

Sub testIE()
Dim texto As String

Set ie = CreateObject("InternetExplorer.Application")
my_url = "https://www.nfp.fazenda.sp.gov.br/login.aspx"
With ie
    .Visible = False
    .Navigate my_url

Do Until Not ie.Busy And ie.readyState = 4
    DoEvents
Loop

End With

ie.Document.getelementbyid("userName").Value = "MYUSERNAME"
ie.Document.getelementbyid("Password").Value = "MYPASSWORD"
ie.Document.getelementbyid("Login").Click

Do Until Not ie.Busy And ie.readyState = 4
    DoEvents
Loop

ie.Document.getelementbyid("btnConsultarNFSemestre").Click

Do Until Not ie.Busy And ie.readyState = 4
    DoEvents
Loop

texto = ie.Document.getelementbyid("dadosDoUsuario").innerText
MsgBox texto

ie.Quit

End Sub

MSXML2.XMLHTTP 代码(此代码不起作用):

Sub testXMLHTTP()
Dim xml As Object
Dim html As Object
Dim dados As Object
Dim text As Object

Set xml = CreateObject("MSXML2.XMLHTTP")

Set html = CreateObject("htmlFile")

With xml
  .Open "POST", "https://www.nfp.fazenda.sp.gov.br/Login.aspx", False
  .setRequestHeader "Content-Type", "text/xml"
  .send "userName=MYUSERNAME&password=MYPASSWORD"
  .Open "GET", "https://www.nfp.fazenda.sp.gov.br/Inicio.aspx", False
  .setRequestHeader "Content-Type", "text/xml"
  .send
End With

html.body.innerhtml = xml.responseText

Set objResult = html.GetElementById("dadosDoUsuario")
GetElementById = objResult.innertext

MsgBox GetElementById

End Sub

编辑:我按照@Florent B.建议的步骤操作,并添加了一个 scripcontrol 来获取 __VIEWSTATE、__VIEWSTATEGENERATOR 和 __EVENTVALIDATION 的编码值。成功了!

Sub testXMLHTTP()
Dim xml As Object
Dim html As HTMLDocument
Dim dados As Object
Dim text As Object
Dim html2 As HTMLDocument
Dim xml2 As Object

Set xml = CreateObject("Msxml2.ServerXMLHTTP.6.0")
Set html = CreateObject("htmlFile")


With xml
  .Open "GET", "https://www.nfp.fazenda.sp.gov.br/Login.aspx", False
  .send
End With

strCookie = xml.getResponseHeader("Set-Cookie")

html.body.innerhtml = xml.responseText

Set objvstate = html.GetElementById("__VIEWSTATE")
Set objvstategen = html.GetElementById("__VIEWSTATEGENERATOR")
Set objeventval = html.GetElementById("__EVENTVALIDATION")

vstate = objvstate.Value
vstategen = objvstategen.Value
eventval = objeventval.Value

'URL Encode ViewState
    Dim ScriptEngine As ScriptControl
    Set ScriptEngine = New ScriptControl
    ScriptEngine.Language = "JScript"
    ScriptEngine.AddCode "function encode(vstate) {return encodeURIComponent(vstate);}"
    Dim encoded As String
    encoded = ScriptEngine.Run("encode", vstate)
    vstate = encoded
'URL Encode Event Validation
    ScriptEngine.AddCode "function encode(eventval) {return encodeURIComponent(eventval);}"
    encoded = ScriptEngine.Run("encode", eventval)
    eventval = encoded
'URL Encode ViewState Generator
    ScriptEngine.AddCode "function encode(vstategen) {return encodeURIComponent(vstategen);}"
    encoded = ScriptEngine.Run("encode", vstategen)
    vstategen = encoded

Postdata = "__EVENTTARGET=" & "&__EVENTARGUMENT=" & "&__VIEWSTATE=" & vstate & "&__VIEWSTATEGENERATOR=" & vstategen & "&__EVENTVALIDATION=" & eventval & "&ctl00$ddlTipoUsuario=#rdBtnNaoContribuinte" & "&ctl00$UserNameAcessivel=Digite+o+Usuário" & "&ctl00$PasswordAcessivel=x" & "&ctl00$ConteudoPagina$Login1$rblTipo=rdBtnNaoContribuinte" & "&ctl00$ConteudoPagina$Login1$UserName=MYUSERNAME" & "&ctl00$ConteudoPagina$Login1$Password=MYPASSWORD" & "&ctl00$ConteudoPagina$Login1$Login=Acessar" & "&ctl00$ConteudoPagina$Login1$txtCpfCnpj=Digite+o+Usuário"

Set xml2 = CreateObject("Msxml2.ServerXMLHTTP.6.0")
Set html2 = CreateObject("htmlFile")

With xml2
  .Open "POST", "https://www.nfp.fazenda.sp.gov.br/Login.aspx", False
  .setRequestHeader "Cookie", strCookie
  .setRequestHeader "Content-Type", "application/x-www-form-urlencoded"
  .setRequestHeader "Content-Lenght", Len(Postdata)
  .send (Postdata)
End With

html2.body.innerhtml = xml2.responseText

Set objResult = html2.GetElementById("dadosDoUsuario")
GetElementById = objResult.innertext

MsgBox GetElementById


End Sub

最佳答案

有可能,但没那么容易。

首先,您需要使用 CreateObject("Msxml2.ServerXMLHTTP.6.0") 而不是 CreateObject("MSXML2.XMLHTTP")。

然后按照以下步骤操作:

  1. 打开并将 GET 发送至 https://www.nfp.fazenda.sp.gov.br/login.aspx
  2. 解析并存储响应 header “Set-Cookie”中的 cookie
  3. 解析并存储 HTML 响应中的 __VIEWSTATE、__VIEWSTATEGENERATOR、__EVENTVALIDATION
  4. 使用之前解析的值和您的用户名/密码构建下一个查询的数据:

    __EVENTTARGET:""
    __EVENTARGUMENT:""
    __VIEWSTATE:"..."
    __VIEWSTATEGENERATOR:"..."
    __EVENTVALIDATION:"..."
    ctl00$ddlTipoUsuario:"#rdBtnNaoContribuinte"
    ctl00$UserNameAcessivel:"Digite+o+Usuário"
    ctl00$PasswordAcessivel:"x"
    ctl00$ConteudoPagina$Login1$rblTipo:"rdBtnNaoContribuinte"
    ctl00$ConteudoPagina$Login1$UserName:"..."
    ctl00$ConteudoPagina$Login1$Password:"..."
    ctl00$ConteudoPagina$Login1$Login:"Acessar"
    ctl00$ConteudoPagina$Login1$txtCpfCnpj:"Digite+o+Usuário"
    
  5. 打开帖子到 https://www.nfp.fazenda.sp.gov.br/login.aspx

  6. 使用第 2 步解析的 cookie 设置 header “Cookie”
  7. 设置 header 内容类型:“application/x-www-form-urlencoded”
  8. 将 header Content-Length 设置为数据长度
  9. 发送包含第 4 步中的数据的 POST

关于vba - 使用 MSXML2.XMLHTTP 而不是使用 VBA 的 InternetExplorer.Application 登录网站,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/35906564/

相关文章:

xml - 遍历完整的 XML 文档

不使用客户端 ID 和 key 的 Azure AD Graph API 访问

authentication - 使用服务器端代码手动构建 Google 登录集成流程

javascript - 引用未声明的命名空间前缀 : 'soap' when parsing MSXML soap response using selectSingleNode

excel - 单击 Bloomberg 功能区上的 RefreshWorkbook

regex - 如何在VBA正则表达式中使用非捕获组?

node.js - 检查电子邮件是否存在 Sequelize ( Node js)的正确方法?

javascript - 有没有办法从 MSXML XPath 求值器获取非节点结果?

javascript - MSXML2.XMLHTTP 请求将选择什么版本,没有版本后缀?

excel - 如何在将数据从 Excel 导入 Outlook 时设置固定列宽?