python - 来自 SUDS.client 的未知字符串格式(汤?)的可能解析器

标签 python soap suds

我正在使用 suds 包从网站查询 API,从他们的网站返回的数据如下所示:

(1)。谁能告诉我这是什么格式?

(2)。如果是这样,解析数据的最简单方法是什么?我已经使用 BeautifulSoup 处理过很多 HTML/XML 格式,但在我为这种格式编写正则表达式之前。我很好奇这是某种类型的“流行格式”,实际上已经编写了一些漂亮的解析器。谢谢。

    # Below are the header and tail of the response.. 
    (DetailResult)
{ 

status = (Status){ message = None code = "0" } 

searchArgument = (DetailSearchArgument){ reqPartNumber = "BQ" reqMfg = "T" reqCpn = None } 

detailsDto[] = (DetailsDto){ 

summaryDto = (SummaryDto){ PartNumber = "BQ" seMfg = "T" description = "Fast" } 

packageDto[] = 
(PackageDto){ fetName = "a" fetValue = "b" }, 
(PackageDto){ fetName = "c" fetValue = "d" }, 
(PackageDto){ fetName = "d" fetValue = "z" }, 
(PackageDto){ fetName = "f" fetValue = "Sq" }, 
(PackageDto){ fetName = "g" fetValue = "p" }, 

additionalDetailsDto = (AdditionalDetailsDto){ cr = None pOptions = None inv = None pcns = None } 

partImageDto = None 

riskDto = (RiskDto){ life= "Low" lStage = "Mature" yteol = "10" Date = "2023"} 

partOptionsDto[] = (ReplacementDto){ partNumber = "BQ2" manufacturer = "T" type = "Reel" }, 

inventoryDto[] = 
(InventoryDto){ distributor = "V" quantity = "88" buyNowLink = "https://www..." }, 
(InventoryDto){ distributor = "R" quantity = "7" buyNowLink = "http://www.r." }, 
(InventoryDto){ distributor = "RS" quantity = "2" buyNowLink = "http://www.rs.." }, 
}, 

}

最佳答案

这看起来像是某种嵌套的 repr 输出,类似于 JSON,但带有 结构或对象名称信息(“状态包含消息和代码”)。 如果是嵌套的,仅靠正则表达式无法完成这项工作。这是 pyparsing 的粗略过程 解析器

sample = """
... given sample text ...
 """
from pyparsing import *

# punctuation
LPAR,RPAR,LBRACE,RBRACE,LBRACK,RBRACK,COMMA,EQ = map(Suppress,"(){}[],=")

identifier = Word(alphas,alphanums+"_")

# define some types that can get converted to Python types
# (parse actions will do conversion at parse time)
NONE = Keyword("None").setParseAction(replaceWith(None))
integer = Word(nums).setParseAction(lambda t:int(t[0]))
quotedString.setParseAction(removeQuotes)

# define a placeholder for a nested object definition (since objDefn
# will be referenced within its own definition)
objDefn = Forward()

objType = Combine(LPAR + identifier + RPAR)

objval = quotedString | NONE | integer | Group(objDefn)
objattr = Group(identifier + EQ + objval)

arrayattr = Group(identifier + LBRACK + RBRACK + EQ + Group(OneOrMore(Group(objDefn)+COMMA)) )

# use '<<' operator to assign content to previously declared Forward
objDefn << objType + LBRACE + ZeroOrMore((arrayattr | objattr) + Optional(COMMA)) + RBRACE
# parse sample text
result = objDefn.parseString(sample)

# use pprint to list out indented parsed data
import pprint
pprint.pprint(result.asList())

打印:

['DetailResult',
 ['status', ['Status', ['message', None], ['code', '0']]],
 ['searchArgument',
  ['DetailSearchArgument',
   ['reqPartNumber', 'BQ'],
   ['reqMfg', 'T'],
   ['reqCpn', None]]],
 ['detailsDto',
  [['DetailsDto',
    ['summaryDto',
     ['SummaryDto',
      ['PartNumber', 'BQ'],
      ['seMfg', 'T'],
      ['description', 'Fast']]],
    ['packageDto',
     [['PackageDto', ['fetName', 'a'], ['fetValue', 'b']],
      ['PackageDto', ['fetName', 'c'], ['fetValue', 'd']],
      ['PackageDto', ['fetName', 'd'], ['fetValue', 'z']],
      ['PackageDto', ['fetName', 'f'], ['fetValue', 'Sq']],
      ['PackageDto', ['fetName', 'g'], ['fetValue', 'p']]]],
    ['additionalDetailsDto',
     ['AdditionalDetailsDto',
      ['cr', None],
      ['pOptions', None],
      ['inv', None],
      ['pcns', None]]],
    ['partImageDto', None],
    ['riskDto',
     ['RiskDto',
      ['life', 'Low'],
      ['lStage', 'Mature'],
      ['yteol', '10'],
      ['Date', '2023']]],
    ['partOptionsDto',
     [['ReplacementDto',
       ['partNumber', 'BQ2'],
       ['manufacturer', 'T'],
       ['type', 'Reel']]]],
    ['inventoryDto',
     [['InventoryDto',
       ['distributor', 'V'],
       ['quantity', '88'],
       ['buyNowLink', 'https://www...']],
      ['InventoryDto',
       ['distributor', 'R'],
       ['quantity', '7'],
       ['buyNowLink', 'http://www.r.']],
      ['InventoryDto',
       ['distributor', 'RS'],
       ['quantity', '2'],
       ['buyNowLink', 'http://www.rs..']]]]]]]]

关于python - 来自 SUDS.client 的未知字符串格式(汤?)的可能解析器,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/20133824/

相关文章:

python - 在 Raspberry Pi 上运行的 Docker 容器中访问 picamera 时出错

python - 使用 SQLAlchemy 批量更新

python - 努力使用 Docker 卷和 Flask 写入文件

php - SOAP 空信封。如何处理空响应

java - org.apache.axis.message.SOAPBodyElement 无法转换为 weblogic.xml.xmlnode.XMLNode

python - 如何使用 python suds 客户端进行 kerberos 身份验证

python - 如何处理 : ImportError:/usr/lib/x86_64-linux-gnu/libatk-1. 0.so.0: undefined symbol: g_log_structured_standard

c# - 如何为 WS-Security 生成 UsernameToken?

python - 解析包含相互引用的 XSD 的 WSDL

python - SOAP - 使用 zeep 创建元素 (python)