Python PLY 赋值操作失败

标签 python ply

我正在尝试创建一个将我的脚本作为输入的解释器。我在编写正则表达式时遇到了一些问题。定义的标记之一是将所有字符串视为标记。

import ply.lex as lex
import ply.yacc as yacc

tokens = (
    'STAIRCASE',
    'STAIRCASE_END',
    'STAIR',
    'STAIR_END',
    'TAG',
    'COLON_SYM',
    'LINE_START_SYM',
    'NONE',
    'USER_DEFINED',
    'ARRAY',
    'IS',
)

assignments = {}

t_STAIRCASE                 = r'staircase'
t_TAG                       = r'\(([a-zA-Z0-9\ ])*\)'
t_COLON_SYM                 = r' :'
t_LINE_START_SYM            = r'-'
t_STAIRCASE_END             = 'EOSC'
t_ignore                    = ' \t\n'
t_STAIR                     = 'stair'
t_STAIR_END                 = 'EOS'
t_NONE                      = 'EOP'

这是正则表达式的问题

t_USER_DEFINED              = r'[a-zA-Z0-9]+'

代码继续

t_IS                        = 'is'

def t_error(t):
    print 'Illegal character "%s"' % t.value[0]
    t.lexer.skip(1)
lex.lex()

NONE, STAIRCASE, TAG, STAIRCASE_DESCRIPTION = range(4)
states = ['NONE', 'STAIRCASE','STAIRCASE_DESCRIPTION']
current_state = NONE

def x():
    print "Hi How you doing"

def p_staircase_def(t):
    """STAIRCASE_DEF         : STAIRCASE TAG COLON_SYM STAIRCASE_DESCRIPTION
                             """
    print t[0:]
    help(t)

def p_staircase_description(t):
    """STAIRCASE_DESCRIPTION : LINE_START_SYM DICTONARY STAIRCASE_DESCRIPTION
                             | STAIRCASE_END STAIR_DEF
                             """
    print t[0:]

def p_dictonary(t):
    """
    DICTONARY                : USER_DEFINED IS USER_DEFINED
                             """

这里是我的赋值操作,实际上它创建了一个变量字典

    temp = { t[1] : t[2] }
    print assignments.update( temp )
def p_stair_def(t):
    """STAIR_DEF             : STAIR TAG COLON_SYM STAIR_DESCRIPTION
                             """
    print t[0:]

def p_stair_description(t):
    """STAIR_DESCRIPTION     : LINE_START_SYM DICTONARY STAIR_DESCRIPTION
                             | STAIR_END STAIR_DEF
                             | STAIR_END
                             """
    print t[0:]

def p_error(t):
    print 'Syntax error at "%s"' % t.value if t else 'NULL'
    global current_state
    current_state = NONE

yacc.yacc()

file_input = open("x.staircase","r")
yacc.parse(file_input.read())

这是我的解释器“x.staircase”需要接受的示例输入

staircase(XXXX XXX XXX):
- abc is 23183       # which need to {'abc' : '23183'}
- bcf is fda
- deh is szsC
EOSC
stair(XXXX XXX XXX):
- lkm is 35
- raa is 233
EOS
stair(XXXX XXX XXX):
- faa is zxhfb
- faa is 1
EOS

我得到的错误

Syntax error at "staircase"
[Finished in 0.1s]

BELOW 代码有效,但输入文件不符合预期。

import ply.lex as lex
import ply.yacc as yacc

tokens = (
    'STAIRCASE',
    'STAIRCASE_END',
    'STAIR',
    'STAIR_END',
    'TAG',
    'COLON_SYM',
    'LINE_START_SYM',
    'NONE',
    'USER_DEFINED',
    'ARRAY',
    'IS',
)

assignments = {}

t_STAIRCASE                 = r'staircase'
t_TAG                       = r'\(([a-zA-Z0-9\ ])*\)'
t_COLON_SYM                 = r' :'
t_LINE_START_SYM            = r'-'
t_STAIRCASE_END             = 'EOSC'
t_ignore                    = ' \t\n'
t_STAIR                     = 'stair'
t_STAIR_END                 = 'EOS'
t_NONE                      = 'EOP'

##########################################
Here is the issue with this regular exprission

It worked, If I Use this
t_USER_DEFINED              = r'a'

Instead of this
#t_USER_DEFINED              = r'[a-zA-Z0-9]+'

But, when it comes to my input file it only accept one variable called 'a'
##########################################
Code continues

t_IS                        = 'is'

def t_error(t):
    print 'Illegal character "%s"' % t.value[0]
    t.lexer.skip(1)
lex.lex()

NONE, STAIRCASE, TAG, STAIRCASE_DESCRIPTION = range(4)
states = ['NONE', 'STAIRCASE','STAIRCASE_DESCRIPTION']
current_state = NONE

def x():
    print "Hi How you doing"

def p_staircase_def(t):
    """STAIRCASE_DEF         : STAIRCASE TAG COLON_SYM STAIRCASE_DESCRIPTION
                             """
    print t[0:]
    help(t)

def p_staircase_description(t):
    """STAIRCASE_DESCRIPTION : LINE_START_SYM DICTONARY STAIRCASE_DESCRIPTION
                             | STAIRCASE_END STAIR_DEF
                             """
    print t[0:]

def p_dictonary(t):
    """
    DICTONARY                : USER_DEFINED IS USER_DEFINED
                             """

HERE is my assignment operation, actually it create a dictionary of variables

    temp = { t[1] : t[2] }
    print assignments.update( temp )

def p_stair_def(t):
    """STAIR_DEF             : STAIR TAG COLON_SYM STAIR_DESCRIPTION
                             """
    print t[0:]

def p_stair_description(t):
    """STAIR_DESCRIPTION     : LINE_START_SYM DICTONARY STAIR_DESCRIPTION
                             | STAIR_END STAIR_DEF
                             | STAIR_END
                             """
    print t[0:]

def p_error(t):
    print 'Syntax error at "%s"' % t.value if t else 'NULL'
    global current_state
    current_state = NONE

yacc.yacc()

file_input = open("x.staircase","r")
yacc.parse(file_input.read())

下面给出了一个示例 input("x.staircase") 的工作原理。但正如您所见,所有 USER_DEFINED 始终是一个

staircase(XXXX XXX XXX):
- a is a       # which need to {'abc' : '23183'}
- a is a
- a is a
EOSC
stair(XXXX XXX XXX):
- a is a
- a is a
EOS
stair(XXXX XXX XXX):
- a is a
- a is a 1
EOS

最佳答案

请(重新)阅读 Ply manual 中 Ply 的词法分析器如何识别标记的描述。 .特别注意订购规则;因为模式变量是从长到短排序的,t_USER_DEFINED 模式在任何关键字模式(例如 staircase)之前被尝试,所以没有一个关键字会被识别。 (这就是为什么将 t_USER_DEFINED 缩短为单个字符会改变词法行为的原因。)

有一个很好的线索表明这是问题所在而不是作业生成:错误消息是在标记 staircase 处触发的,远在遇到作业之前。通过在 p_error 函数中打印 t.typet.value,您会得到另一条线索。 (或者,当然,通过在尝试解析任何内容之前测试标记器。)

如果您通读到我链接的 Ply 手册部分的末尾,您会找到有关如何使用扫描仪功能和辅助关键字词典处理关键字标记的建议。我强烈建议您使用它作为模型。

另请注意,冒号前必须有一个空格字符:

t_COLON_SYM                 = r' :'

但是您的样本输入在冒号前没有空格。

关于Python PLY 赋值操作失败,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/55545590/

相关文章:

python - 词法分析器错误处理 PLY Python

python - PLY : Parsing error. 规则意外匹配空标记集

python - python 中的正则表达式使用 OR 表示单引号或双引号

python - conda与caffe一起安装openCV 3.4.2,但不能单独删除

python - 缺少输出,在 Python 中使用 range()

python - 解析隐式与显式时间运算符

r - 如何计算数据框中的唯一行?

python - 如何在同一选项卡中使用 selenium 函数? (在 python 上)

python - 将小函数转换为协程

python - 在 Python 中使用 plyj 解析 Java 源代码