I had a parser that worked well in Scala Packrat parser combinators. I would like to try something faster with the Fastparse library. However, it cannot handle left-recursion infinite loops. Is there any standard way to cope with that?
sealed trait Expr
case class Num(value: java.lang.Number) extends Expr
case class Div(a: Expr, b: Expr) extends Expr
def num[_: P] = P(CharIn("0-9").rep(1).!).map(n => Num(n.toInt))
def div[_: P] = P(expr ~ "/" ~ expr).map(Div.tupled)
def expr[_: P]: P[Expr] = P(div | num)
I don't know much about Fastparse, but I'll try to answer your question nevertheless. Right now, your grammar looks something like this:
expr ::= div | num
div ::= expr "/" expr
num ::= 0 | 1 | ...
So if you wanted to parse 1/2
as an expression, it would first try to match div
. To do that, it would try to match expr
again, and basically go on infinitely. We can fix this by putting num
before div
, as suggested in a comment above:
expr ::= num | div
Or
def expr[_: P]: P[Expr] = P(num | div)
Success! Or is it? Upon looking more closely at the result, you'll see that it's not a Div(Num(1), Num(2))
but rather just a Num(1)
. To fix this, use End
def expr[_: P]: P[Expr] = P((num | div) ~ End)
And now it fails, saying it found "/2". It successfully matches num
first, so it has no reason to think that that first number is part of a division operation. So we will have to use div
before num
after all, to make sure the bigger pattern is used, but something needs to be done to avoid recursion. We can refactor it like this:
expr ::= div
div ::= num ("/" num)*
div
doesn't just match division, it can also match a single number, but it tries to match division when possible. In Scala, that would be:
def div[_: P] = P(num ~ ("/" ~/ num).rep).map {
case (a, ops) => ops.foldLeft(a: Expr){ case (a, b) => Div(a, b) }
}
def expr[_: P]: P[Expr] = P(div ~ End)
This way, we can match "1/2", "1/2/3", "1/2/3/4", etc.
Output for parse("1/2/3/4", expr(_))
is Parsed.Success(Div(Div(Div(Num(1),Num(2)),Num(3)),Num(4)), 7)