c++ - 提升精神 : parse a section of an input

我有数千行输入，每行由 3 个整数和一个逗号组成，看起来像这样:

5 6 10,
8 9 45,
.....

如何创建仅解析输入的特定部分的语法，例如前 100 行或从 1000 到 1200 行并忽略其余部分。

我的语法目前是这样的:

qi::int_ >> qi::int_ >> qi::int_ >> qi::lit(",");

但显然它会解析整个输入。

最佳答案

您可以只查找感兴趣的点并在那里解析 100 行。

关于如何从 spirit 跳过 100 行的草图:

Live On Coliru

#define BOOST_SPIRIT_DEBUG
#include <boost/spirit/include/qi.hpp>
#include <boost/fusion/adapted/std_tuple.hpp>
#include <tuple>

namespace qi = boost::spirit::qi;

int main() {
    using It  = boost::spirit::istream_iterator;
    using Tup = std::tuple<int, int, int>;

    It f(std::cin >> std::noskipws), l;
    std::vector<Tup> data;

    using namespace qi;

    if (phrase_parse(f, l,
            omit [ repeat(100) [ *(char_ - eol) >> eol ] ] >> // omit 100 lines
            repeat(10) [  int_ >> int_ >> int_ >> ',' >> eol ], // parse 10 3-tuples
            blank, data))
    {
        int line = 100;
        for(auto tup : data)
            std::cout << ++line << "\t" << boost::fusion::as_vector(tup) << "\n";
    }

}

当使用一些随机输入进行测试时

od -Anone -t d2 /dev/urandom -w6 | sed 's/$/,/g' | head -200 | tee log | ./test
echo ============== VERIFY WITH sed:
nl log | sed -n '101,110p'

它会打印一些预期的东西，比如:

101 (15400 5215 -20219)
102 (26426 -17361 -6618)
103 (-15311 -6387 -5902)
104 (22737 14339 16074)
105 (-28136 21003 -11594)
106 (-11020 -32377 -4866)
107 (-24024 10995 22766)
108 (3438 -19758 -10931)
109 (28839 22032 -7204)
110 (-25237 23224 26189)
============== VERIFY WITH sed:
   101    15400   5215 -20219,
   102    26426 -17361  -6618,
   103   -15311  -6387  -5902,
   104    22737  14339  16074,
   105   -28136  21003 -11594,
   106   -11020 -32377  -4866,
   107   -24024  10995  22766,
   108     3438 -19758 -10931,
   109    28839  22032  -7204,
   110   -25237  23224  26189,

关于c++ - 提升精神 : parse a section of an input，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/33070299/

c++ - 提升精神 : parse a section of an input

上一篇：c++ - 创建 DNA 序列的补码并将其反转 C++

下一篇：c++ - 在 int 变量中查找 '1' 的奇偶校验