
Takmela: an algorithm for parsing and Datalog query execution

by Mohamed Samy (samy2004@gmail.com)

A lot of modern (and old) parsing algorithms utilize a data structure called the Graph-structured Stack (GSS). You can find it in GLL, implicitly in Earley’s parser, and in many more. I believe this data structure is a very interesting piece of math/CS, and that it might have a larger role to play. Today I’ll show you two algorithms: the first is for general parsing (that is, parsing any CFG) and the second is for executing Datalog queries. We’ll realize that the second algorithm is exactly the same as the first, parameterized by a different set of data types and actions that apply to the GSS. If those two algorithms work as advertised, then we can probably generalize this same algorithm to many other operations, all expressed as the same basic routine.

This article is long and requires some knowledge of parsing (not too advanced, I promise), but if you like computer science I feel you would really, really enjoy reading it; so let’s go!

The Takmela parser

The parser version of Takmela depends on combining two CS concepts: the graph-structured stack and the continuation.

We can begin to understand how Takmela works by looking at a common problem in parsing context-free grammars, namely the left-recursion problem. Consider the classic arithmetic expression example:

expr → expr ‘+’ term
     | term

term → NUM

Let’s try to parse it with the input 1+2+3

A naive top-down parser will try to ‘invoke’ expr, attempt to parse the first alternative, and so immediately try to recursively invoke expr again without consuming any input. This leads to an infinite loop.
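To see the loop concretely, here is a hypothetical naive recursive-descent parser for this grammar, sketched in Java (the function names and failure convention are illustrative, not taken from Takmela’s code):

// Hypothetical naive recursive-descent sketch, NOT Takmela.
// Returns the input position after a successful parse, or -1 on failure.
class NaiveParser {
    // expr → expr '+' term | term
    static int expr(String in, int pos) {
        int p = expr(in, pos);   // left-recursive call at the same position:
                                 // recurses forever (in practice, a stack overflow)
        if (p >= 0 && p < in.length() && in.charAt(p) == '+')
            return term(in, p + 1);
        return term(in, pos);    // the second alternative is never even reached
    }

    // term → NUM
    static int term(String in, int pos) {
        return (pos < in.length() && Character.isDigit(in.charAt(pos))) ? pos + 1 : -1;
    }
}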

A less naive parsing algorithm can do the following: if it encounters left recursion in expr it will freeze parsing at that point and try the other alternatives first. If one of the others succeeds, the parser will go back in time into the first (recursive) alternative, and substitute any ‘successes’ of expr obtained so far into the left-recursive call. Awesome, right?

Let’s try to work through a hypothetical time-jumping parser, and parse 1+2 using our left recursive grammar.

We’ll use a notation like expr/0 to indicate a ‘call’ to expr, i.e. a parsing attempt, starting at input position 0.

We’ll use a notation like expr → expr • ‘+’ term to indicate a CodePos, for example the preceding CodePos says "we are in the middle of running the first alternative of expr, we have successfully parsed the recursive call, and the ‘code’ is now right before trying to match the plus sign"

Let’s run our parser!

Rules:

expr → expr ‘+’ term
     | term

term → NUM

Input: 1 + 2

Trace: (interactive; see the full Takmela trace of 1+2 later in this article)

If we try to use the same algorithm to parse 1+2+3, a rather interesting thing happens: we will resume the frozen expr/0 twice! Once as above, when 1 has been fully parsed as an expr, and a second time when 1+2 has been fully parsed (i.e. when the full trace is completed). When 1+2 is parsed it will resume expr/0 at expr → expr • ‘+’ term, all ready to match + and 3.

We are using a technique of (1) freezing, or setting aside, the remaining execution of some parse, and (2) invoking it with results from other parts of the program.

This concept of a ‘frozen program that we can re-execute’ is called a continuation, and it is the core of what we’re trying to do with Takmela.
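If continuations are new to you: in mainstream-language terms, a continuation is just "the rest of the work", reified as a value that can be stored and invoked later, possibly more than once. A rough Java analogue of the idea (an illustration only, not Takmela’s actual representation):

import java.util.ArrayList;
import java.util.List;
import java.util.function.IntConsumer;

class ContinuationDemo {
    public static void main(String[] args) {
        List<IntConsumer> pending = new ArrayList<>();

        // "Whenever expr/0 succeeds, ending at some position endPos, resume here."
        IntConsumer afterExpr = endPos ->
            System.out.println("resume caller at position " + endPos);

        pending.add(afterExpr);    // freeze: file away the rest of the work

        pending.get(0).accept(1);  // expr/0 succeeded ending at 1 → resume
        pending.get(0).accept(3);  // ...and again ending at 3 → resume once more
    }
}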

Notice, however, that implementing our hypothetical time-jump parser feels rather complex, doesn’t it? We’d need to implement something resembling a call stack for the non-left-recursive parsing, while also representing continuations, resuming them, and so on.

To simplify things we’ll go further: everything will use continuations. There will be no ‘normal calls’, and our parser doesn’t even need to analyze the grammar to detect which calls are left-recursive.

It turns out this "suspend all the things" approach makes the parser rather powerful: all the nasty things (left recursion, ambiguity, wrong behavior due to nullable nonterminals) become normal processing steps when we don’t use direct calls. The algorithm will just keep eating away at the input, branching as many times as it needs.

We are now in a position to understand the full algorithm. Let’s define the little pieces from which we’ll form our parsing engine…

How Takmela parsing works

A Call is a pair (non_terminal, input_pos); for example a call (term, 5), or term/5, means we’re trying to parse a term starting from input position 5.

A CodePos is a place in a rule body, indicating what has been parsed so far; we’ll represent it here by a heavy dot. For example, in the first alternative of expr we could have a CodePos like this: expr → expr ‘+’ • term, which means we have parsed an expr and a plus sign but are yet to parse the final term.

A Continuation is a tuple (Call, CodePos)

The role of a continuation is to tell us what should happen after a callee has succeeded. For example, suppose a rule a → b c d was called at position 5, calling b moved us to position 7, and the sub-call to c at position 7 has succeeded; what should happen after c’s success? We should resume the call to a/5 right before d, thus one of the continuations for the callee c/7 is (a/5, a → b c • d).

Notice that the call a/5 means "attempting to parse an ‘a’ starting at position 5"; resuming that call will continue from wherever c has finished parsing, not from position 5.
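As a tiny concrete illustration, here is that continuation built out of plain Java records (a simplification: CodePos is shown as a string; these types are illustrative, not Takmela’s actual ones):

class ContinuationExample {
    record Call(String rule, int pos) {}
    record Continuation(Call call, String codePos) {}   // codePos simplified to a string

    public static void main(String[] args) {
        // the continuation stored for the callee c/7 in the example above:
        Continuation k = new Continuation(new Call("a", 5), "a → b c • d");
        System.out.println(k);
    }
}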

Our program state is stored in these variables:

K is the continuation table; it maps Call → Set<Continuation>, as in the example above. This is our graph-structured stack. (If you’re into writing interpreters: this graph looks like a family of related call stacks in one compact data structure.)

S is the success table, or memoisation table; it stores all calls that have succeeded so far. It maps Call → Set<final_input_pos>. For example, an item (expr, 5) → 10 means that a parse for expr starting from position 5 has succeeded, and the rest of the parsing (resumed via a continuation) should continue at position 10.

If we want to store parse trees, we can have an additional entry in the success table alongside the final position. More about parse trees in our example with ambiguity.
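Concretely, a success entry that also carries a tree might look like this (a sketch; Tree is a stand-in type):

import java.util.*;

class SuccessTableSketch {
    interface Tree {}                                 // stand-in for a parse-tree type
    record Call(String ruleName, int inputPos) {}
    record Success(int finalInputPos, Tree tree) {}   // final position, plus the tree alongside it

    Map<Call, Set<Success>> S = new HashMap<>();
}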

In addition to K and S, we will have newK and newS, where we collect any newly created continuations or successes at a given iteration of the algorithm. They have the same structure as the original K & S.

A final part of the program state is the Awakenings set, used to prevent redundant re-parsing of past successful parses. Awakenings is a set of (input_pos, continuation) pairs, so having an awakening like (5, (a/2, a → b c • d)) means "we have already processed the call to c from its parent (a, 2), and this call to c has succeeded at position 5".

In an imagined ML-like language, our program state would be fully expressed like this:

  type Call = (RuleName, InputPos)
  type Continuation = (Call, CodePos)

  K :: Map<Call, Set<Continuation>>
  S :: Map<Call, Set<InputPos>>
  newK :: sametype K
  newS :: sametype S
  Awakenings :: Set<(InputPos, Continuation)>
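
In Java (the language of the implementation linked later), the same state might be declared as follows. This is a sketch: the names are mine and need not match the actual implementation.

import java.util.*;

class TakmelaState {
    record Call(String ruleName, int inputPos) {}
    // which branch of which rule, plus the offset of the heavy dot within it
    record CodePos(String ruleName, int branch, int offset) {}
    record Continuation(Call call, CodePos codePos) {}
    record Awakening(int inputPos, Continuation cont) {}

    Map<Call, Set<Continuation>> K = new HashMap<>();     // continuation table: the GSS
    Map<Call, Set<Integer>>      S = new HashMap<>();     // success (memoisation) table
    Map<Call, Set<Continuation>> newK = new HashMap<>();  // continuations created this iteration
    Map<Call, Set<Integer>>      newS = new HashMap<>();  // successes created this iteration
    Set<Awakening> awakenings = new HashSet<>();          // guard against redundant resumptions
}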

The parser works in iterations. It starts with exactly one call (calling the root non-terminal at input position zero), and each iteration processes pending calls, matching tokens as far as it can until it either finds a nonterminal (in which case it calls/processes it) or reaches the end of the rule (in which case the rule succeeds). This processing might generate new continuations and successes.

(Reminder: continuations are entries in K/newK, created by calling non-terminals; successes are entries in S/newS, created by process(..) working through a rule until its end.)

After processing, new continuations and successes are joined; a new success might resume some existing continuation (going ‘back in time’). Similarly, a new call/continuation might join with an existing success, meaning "there are existing results of calling the same rule at the same position (outside of this continuation); let’s resume the new call’s continuations with them".

Any new items (continuations and successes) from the joining will be processed in the next iteration.

(If you’re familiar with Datalog implementation techniques all of this should sound familiar; Takmela is basically a special case of semi-naive evaluation.)

In pseudocode:
K = {}; newK = {}  // init continuations
S = {}; newS = {}  // init successes

callRootRule() // invokes 'process' on the rule’s branches, to match tokens etc
               // might generate some newK and newS records

while(true)
{
    kWorklist = diff(newK, K)    // we need only the continuations not seen before
    sWorklist = diff(newS, S)    // ditto
    
    if(kWorklist.size == 0 && sWorklist.size == 0)
    { 
       break // stop if we reach a fixed point
    }     

    newK = {}
    newS = {}
    K = merge(kWorklist, K)
    S = merge(sWorklist, S)
    
    joinNewContinuationsWithPastSuccesses(kWorklist, S) // may invoke `process` on some nonterminals
                                                        // and may generate (schedule) new conts.
                                                        // and successes
                                                        
    joinNewSuccessesWithPastContinuations(sWorklist, K) // same as above
}
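
The loop leans on two small operations, diff and merge, over our map-of-sets tables. In Java they might look like this (a sketch matching the state shapes above):

import java.util.*;

class Worklists {
    // Entries of 'fresh' that are not already present in 'seen'.
    static <K, V> Map<K, Set<V>> diff(Map<K, Set<V>> fresh, Map<K, Set<V>> seen) {
        Map<K, Set<V>> out = new HashMap<>();
        fresh.forEach((key, vals) -> {
            Set<V> novel = new HashSet<>(vals);
            novel.removeAll(seen.getOrDefault(key, Set.of()));
            if (!novel.isEmpty()) out.put(key, novel);
        });
        return out;
    }

    // Fold the worklist into the accumulated table (mutates and returns 'acc').
    static <K, V> Map<K, Set<V>> merge(Map<K, Set<V>> worklist, Map<K, Set<V>> acc) {
        worklist.forEach((key, vals) ->
            acc.computeIfAbsent(key, k -> new HashSet<>()).addAll(vals));
        return acc;
    }
}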

This is the essential outline of the algorithm, we’ll now explain it again in pseudocode but in detail, and show the role of the awakenings set too.

Takmela's parsing algorithm in detail
K = {}; newK = {}  // init continuations
S = {}; newS = {}  // init successes

awakenings = {}

// call the start rule; its continuation resumes a synthetic rule 'root → startRule'
// right after the start symbol (that's the Root → e • seen in the traces below)
call(startRule, ("root", 0), 0, <root → startRule •>)

while(true)
{
    kWorklist = diff(newK, K)
    sWorklist = diff(newS, S)
    
    if(kWorklist.size == 0 && sWorklist.size == 0)
    { 
        break  // stop if we reach a fixed point
    }     

    newK = {}
    newS = {}
    
    K = merge(kWorklist, K)
    S = merge(sWorklist, S)

    awakenings.clear()
    
    joinNewContinuationsWithPastSuccesses(kWorklist, S) // may invoke 'process' on some nonterminals
                                                        // and may generate (schedule) new conts. and
                                                        // successes
                                                        
    joinNewSuccessesWithPastContinuations(sWorklist, K) // same as above
}

func call(string callee, Call caller, int inputPosNow, CodePos codePos)
{
    // 'caller' is the full Call (rule name + the position it was originally called at)
    // of the calling rule, so the continuation resumes the caller at its original call;
    // e.g. when a/5 calls c at position 7, the continuation stored for c/7 is
    // (a/5, a → b c • d), matching the Continuation type defined earlier
    Cont cont = (caller, codePos)
    bool firstCall = ! ( K.containsKey((callee, inputPosNow)) || newK.containsKey((callee, inputPosNow)) )
    
    newK[callee, inputPosNow].add(cont) // schedule a 'join new continuation with successes'

    if (firstCall)
    {
        process(callee, inputPosNow)
    }
}

func process(string ruleName, int inputPos)
{
    Rule r = rules[ruleName]

    for (RuleBranch b : r.branches)
    {
        processBranch((ruleName, inputPos), b.codeStart(), inputPos)
    }
}

func processBranch(Call thisCall, CodePos codePos, int inputPosNow)
{
    (ruleName, callPos) = thisCall  // callPos: where this rule was originally called;
                                    // inputPosNow: where we are starting/resuming now
    
    if (ruleName == "root" && eof(inputPosNow))
    {
        fire event onSuccessfulParse()   // ← this is where the algorithm actually yields results
                                         // if we return parse trees, we’d pass them here
        return
    }
    
    bool stop = false
    int inputPos = inputPosNow
    
    while (codePos != endOfRuleBranch)
    {
        if (codePos is Terminal)
        {
            Terminal t = codePos
            if (!match(t, inputPos))
            {
                stop = true
                break
            }
            else
            {
                inputPos++
            }
        }
        else if (codePos is Nonterminal)
        {
            Nonterminal toCall = codePos
            call(toCall.name, thisCall, inputPos, next(codePos))
            stop = true
            break
        }
        
        codePos = next(codePos)
        
    } // end while
    
    if (!stop)
    {
        // Reached the end of this rule branch without stopping due to a match failure
        // or a nonterminal call.
        // The rule has succeeded: record the success under the rule's *original* call
        // position (not the resumption position), thereby scheduling a
        // 'join new success with continuations'

        newS[ruleName, callPos].add(inputPos)
    }
}

func joinNewContinuationsWithPastSuccesses(kWorklist, S)
{
    for (k : kWorklist.entries())
    {
        (callee, posAtCallee) = k.key
        Set cs = k.value
        for (Cont c : cs)
        {
            successes = S[callee, posAtCallee]
            for (int s : successes)
            {
                if (!awakenings.contains((s, c)))
                {
                    awakenings.add((s, c))
                    processBranch(c.call, c.codePos, s)   // resume the caller at the success position
                }
            }
        }
    }
}

func joinNewSuccessesWithPastContinuations(sWorklist, K)
{
    for (s : sWorklist)
    {
        (callee, posAtCallee) = s.key
        successPositions = s.value
        for (int p : successPositions)
        {
            contSet = K[callee, posAtCallee]
            
            for (cont : contSet)
            {
                if (!awakenings.contains((p, cont)))
                {
                    awakenings.add((p, cont))
                    processBranch(cont.call, cont.codePos, p)
                }
            }
        }
    }
}

We will now do a trace of parsing 1+2 using Takmela, then again with 1+2+3

Let's trace how Takmela parses 1+2

Hint: if the trace appears confusing this might help

You can also open this trace in a separate page: link here

Call start nonterminal
#0: Called e/0 at the top level (will process)
    Processing e/0 codePos e → • e `+` t inputPos 0
    Calling: e/0 with Cont e/0 ; e → e• `+` t [already processed]
    Processing e/0 codePos e → • t inputPos 0
    Calling: t/0 with Cont e/0 ; e → t• [will process]
    Processing t/0 codePos t → • NUM inputPos 0
    Match NUM to 1, matched; inputPos now 1
    Reached end of rule, success! t/0 → 1
    New continuations:
        e/0 ⇐ e/0 ; e → e• `+` t ; e
        t/0 ⇐ e/0 ; e → t• ; e
    Successes so far: t/0 → 1, t('1')

Iteration 0
    Worklist - successes
        jS t/0 → 1 t('1')
    Worklist - continuations
        nJ e/0 ⇐ e/0 ; e → e• `+` t
        S⤚ t/0 ⇐ e/0 ; e → t•
    #1: JoinS: t/0 → 1 will resume cont: e/0 ; e → t•
        Processing e/0 codePos e → t• inputPos 1
        Reached end of rule, success! e/0 → 1
    Successes so far: t/0 → 1, t('1') | e/0 → 1, e(t('1'))

Iteration 1
    Worklist - successes
        jS e/0 → 1 e(t('1'))
    Worklist - continuations
    #2: JoinS: e/0 → 1 will resume cont: Root/0 ; Root → e•
        × Didn't reach end of input, not a successful parse
    Successes so far: t/0 → 1, t('1') | e/0 → 1, e(t('1'))
    #3: JoinS: e/0 → 1 will resume cont: e/0 ; e → e• `+` t
        Processing e/0 codePos e → e• `+` t inputPos 1
        Match + to +, matched; inputPos now 2
        Calling: t/2 with Cont e/0 ; e → e `+` t• [will process]
        Processing t/2 codePos t → • NUM inputPos 2
        Match NUM to 2, matched; inputPos now 3
        Reached end of rule, success! t/2 → 3
    New continuations:
        t/2 ⇐ e/0 ; e → e `+` t• ; e(e(t('1')), '+')
    Successes so far: t/0 → 1, t('1') | e/0 → 1, e(t('1')) | t/2 → 3, t('2')

Iteration 2
    Worklist - successes
        jS t/2 → 3 t('2')
    Worklist - continuations
        S⤚ t/2 ⇐ e/0 ; e → e `+` t•
    #4: JoinS: t/2 → 3 will resume cont: e/0 ; e → e `+` t•
        Processing e/0 codePos e → e `+` t• inputPos 3
        Reached end of rule, success! e/0 → 3
    Successes so far: t/0 → 1, t('1') | e/0 → 1, e(t('1')) | t/2 → 3, t('2') | e/0 → 3, e(e(t('1')),'+',t('2'))

Iteration 3
    Worklist - successes
        jS e/0 → 3 e(e(t('1')),'+',t('2'))
    Worklist - continuations
    #5: JoinS: e/0 → 3 will resume cont: Root/0 ; Root → e•
        Reached end of input, successful parse!
    Successes so far: t/0 → 1, t('1') | e/0 → 1, e(t('1')) | t/2 → 3, t('2') | e/0 → 3, e(e(t('1')),'+',t('2')) | Root/0 → 3, Root(e(e(t('1')),'+',t('2')))
    #6: JoinS: e/0 → 3 will resume cont: e/0 ; e → e• `+` t
        Processing e/0 codePos e → e• `+` t inputPos 3
        Match `+` to (eof), fail
    Successes so far: t/0 → 1, t('1') | e/0 → 1, e(t('1')) | t/2 → 3, t('2') | e/0 → 3, e(e(t('1')),'+',t('2')) | Root/0 → 3, Root(e(e(t('1')),'+',t('2')))

Iteration 4
    Worklist - successes
        nJ Root/0 → 3 Root(e(e(t('1')),'+',t('2')))
    Worklist - continuations
    (No new calls or successes that can be further processed)

Iteration 5 [Fixed point reached]

The next step is to trace 1+2+3, which you'll find here.

For many more examples, you can go here.

If you want actual code, you’ll find the Java implementation here. If you want examples more complex than 1+2, you’ll find that I’ve included some tricky cases in the Examples TOC, including ambiguity and nullable rules. And if you want to see some real-world usage of Takmela, the demo program itself uses Takmela to parse a subset of GraphViz dot files: this is the grammar and this is the code for parsing it.

The parsing algorithm is implemented in about 500 lines of Java code, and using more expressive languages would likely reduce that size even further; not bad for a general parser. Remember that Takmela should work with any context-free grammar, including ones with left recursion (both direct and indirect), ambiguity, nullable non-terminals, and even left-recursion-like behavior that results from nullable non-terminals (e.g. a → b a c when b is nullable). And it does all that without any analysis or transformation of its grammar; it just runs.

But let’s talk about Datalog…

Takmelogic: A Datalog execution engine based on Takmela

First, a very quick introduction to Datalog. Datalog is a language of facts, and of rules for deducing new facts; consider the classic example below, consisting of four facts followed by three rules:
// Notice that constants always start with small letters while
// variables will always start with capital letters
parent(a, b).
parent(b, c).
parent(a, d).
parent(d, e).

// If there’s a Z such that X is a parent of Z, and Z is a 
// parent of Y, we can infer the relation "X is a grandparent of Y"
grandparent(X, Y):- parent(X, Z), parent(Z, Y).

// All parents are ancestors
ancestor(X, Y):- parent(X, Y).

// An ancestor of an ancestor is also an ancestor
// Notice that Z is used in exactly the same way as in ‘grandparent’
ancestor(X, Y):- ancestor(X, Z), ancestor(Z, Y).

Datalog is an amazing language: you supply it with some straightforward rules and a database (even one as simple as the four facts above) and it will make you feel as if the computer is thinking. Here are some sample runs of Datalog queries against the above program:

?- parent(a, X)
X = [b, d]

?- grandparent(X, e)
X = [a]

?- ancestor(a, X)
X = [b, c, d, e]

?- ancestor(X, c)
X = [b, a]

There are many ways to implement Datalog; they can be roughly classified as top-down and bottom-up.

A top-down algorithm will start with a query like ancestor(X, c) and try to run any subgoals required to solve the parent query; e.g. ancestor(X, c) yields a call to parent(X, c) from the first ancestor rule, and calls to ancestor(X, Z) & ancestor(Z, c) from the second.

A bottom-up algorithm, like the famous semi-naive algorithm, will start from the facts and keep applying the rules in an attempt to infer all possible data, as if your queries were all variables; it would be as if you always had to run grandparent(V0, V1) or ancestor(V0, V1).
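
To make the bottom-up style concrete, here is a minimal semi-naive evaluation of the two ancestor rules over the four parent facts above, sketched in Java (this is plain semi-naive, not Takmelogic):

import java.util.HashSet;
import java.util.Set;

class SemiNaiveAncestor {
    record Pair(String x, String y) {}

    public static void main(String[] args) {
        Set<Pair> parent = Set.of(new Pair("a", "b"), new Pair("b", "c"),
                                  new Pair("a", "d"), new Pair("d", "e"));

        Set<Pair> ancestor = new HashSet<>(parent);  // ancestor(X,Y) :- parent(X,Y).
        Set<Pair> delta = new HashSet<>(parent);     // tuples that were new in the last round

        while (!delta.isEmpty()) {
            Set<Pair> next = new HashSet<>();
            // ancestor(X,Y) :- ancestor(X,Z), ancestor(Z,Y).
            // Semi-naive: only join combinations where at least one side is new.
            for (Pair d : delta)
                for (Pair a : ancestor) {
                    if (d.y().equals(a.x())) next.add(new Pair(d.x(), a.y()));
                    if (a.y().equals(d.x())) next.add(new Pair(a.x(), d.y()));
                }
            next.removeAll(ancestor);   // keep only genuinely new tuples
            ancestor.addAll(next);
            delta = next;
        }

        System.out.println(ancestor);   // all ancestor pairs, as if the query had no constants
    }
}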

Top-down Datalog engines obviously do less work when the query has some solution-constraining data, like ancestor(X, c) [note: not talking about constraint-solving]. There are clever algorithms, like the so-called Magic Sets, that fool bottom-up algorithms into working like top-down algorithms. They do this by generating a few new rules and a new query, such that running semi-naive on the modified program will simulate a top-down query engine, constraining data and all, running the original program. It’s really a sign of humanity’s ingenuity.

Let’s think about how a Datalog query might be executed. It feels as if any query engine (let’s focus on top-down) has to do the following: (1) work through a rule’s body subgoal by subgoal, keeping track of how far it has gotten; (2) match simple goals against the stored facts; (3) ‘call’ other rules as subgoals; and (4) carry variable bindings forward, resuming with whatever results those subgoals produce.

This sounds a lot like parsing, and especially like general parsers a la Takmela. To take the idea further I literally did a copy/paste on Takmela’s parser, changed various things, and tested it with some Datalog programs. It worked.

                          Parsing                                 Datalog
Input position            An integer                              n/a
Call                      Rule name + input pos                   Rule name + any constant arguments,
                                                                  e.g. ancestor(a, ?)
CodePos                   Where are we in a non-terminal rule?    Where are we in a Datalog rule
                                                                  (aka a Horn clause)?
Continuation              Call + CodePos + "parse tree so far"    Call + CodePos + any bound intermediate
                          if we want parse trees                  variables, e.g. {Z = b}
Success                   Call → Set<FinalInputPos>               Call → Set<ResultTuple>
Surface matching          Matching terminals                      Querying facts
More complex matching     Calling non-terminals                   Calling subgoals
When resuming a           Continue from last input position       Make sure variables match
continuation

Datalog as implemented in Takmelogic doesn’t have the concept of an input position, because facts can be matched multiple times while executing a query. Perhaps in some variant of the language (linear logic?) a concept of "facts used so far" would correspond to the input position.

A continuation in Datalog-via-Takmela also includes the values of any variables that have already been bound, reminding us of the relationship between a continuation and a programming language’s call stack.
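
Correspondingly, the Takmelogic state differs from the parser’s mainly in these types; a Java sketch mirroring the comparison table above (names illustrative, not the actual implementation’s):

import java.util.*;

class TakmelogicStateSketch {
    // A call is a predicate plus whichever arguments are bound to constants,
    // e.g. ancestor(a, ?) would be ("ancestor", {0 → "a"}).
    record Call(String predicate, Map<Integer, String> boundArgs) {}
    record CodePos(String predicate, int clause, int subgoal) {}  // position in a clause body

    // Unlike the parser's continuation, this one also carries the
    // intermediate variable bindings accumulated so far, e.g. {Z = "b"}.
    record Continuation(Call call, CodePos codePos, Map<String, String> bindings) {}

    Map<Call, Set<Continuation>> K = new HashMap<>();   // the GSS again
    Map<Call, Set<List<String>>> S = new HashMap<>();   // Call → set of result tuples
}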

Let's see a trace of ancestor(b, X), which can be found here. You'll find it closely resembles our traces of Takmela's parsing, with the GSS, successes, and so on. The source code for the core Takmelogic engine can be found here; I suggest opening it side by side with Takmela's parser code and comparing them!

A Java implementation of the first iteration (lol) of Takmelogic can be found within the same project as the Takmela parser in the takmelogic package.

More traces can be found in the Trace TOC page

Adding to Takmelogic

Sometimes while executing a Datalog query, one needs to wait until a certain subquery has been completely executed. Consider for example a special operation count that works like this:

num_ancestors(X, N):-
	N:= count ancestor(Y, X).

The predicate num_ancestors will attempt to find all tuples where Y is an ancestor of X and return their count. It is important to schedule the execution such that a call to num_ancestors(d, N) waits until all possible results of ancestor(Y, d) have been computed. This is also important for negation: if we want to test that a certain predicate is not true, we need to make sure no future inference of this predicate will invalidate our conclusion.

A plain Takmelogic implementation doesn’t have that guarantee but we can add it. There's a technique that's compatible with the GSS approach we're using, but perhaps that's a topic for another article.

Comparison with earlier work

Earley’s algorithm is the ancestor of many general parsers. It uses a GSS implicitly.

Earley's parsing technique has been applied to Prolog/Datalog queries, as presented in the 1983 paper "Parsing as Deduction" and improved upon by many other researchers, so I'm definitely not the first person to consider a link between general parsing and Datalog!

Takmela is closely related to the parsing technique presented in Mark Johnson's "Squibs and Discussions: Memoization in Top-Down Parsing" (link). The main difference is that Johnson's technique uses Scheme's built-in continuations, while Takmela represents continuations as an explicit data structure. This opens the door to implementing the parser in typical programming languages that don't support continuations, and access to the GSS enables features like visualizing a running parser, or optimizations that e.g. compress or prune the graph-structured stack (if I understood correctly -- and I may have not -- Marpa seems to make great use of such optimizations).

Takmela was directly inspired by GLL and started as an attempt to simplify it in order to understand it (because I needed a parser for my other projects and wasn’t satisfied with existing tools). In terms of generality and worst-case time complexity Takmela can’t beat GLL, which represents the state of the art in parsing right now. If Takmela adds something, it is simplicity of understanding and implementation, which could make it easier for more people to make use of general parsing or to add improvements.

A really important part is missing

The bad news: I haven’t written a correctness proof for Takmela, nor worked out the algorithm’s time complexity. I’ve tried, but it seems I lack the academic experience needed to do so. I do have some rough ideas on how to start, but that's it.

Before you dismiss me outright, please consider that it’s okay to ask for help, right? I want to release this work to the world, but I fully acknowledge my lack of the knowledge needed to finish it. I’m not writing this article to celebrate some accomplishment but for the sake of outreach. If you’re a person who finds this research direction interesting and who actually knows what they’re doing, you’re so welcome to contact me about mentoring, guidance, supervision, or other forms of collaboration. My email is samy2004@gmail.com

What would you gain out of this? I suggest the possibility of discovering something that makes the field a slightly more interesting place: this article is about a ~500-line general parser and a similarly sized Datalog interpreter, both of which can be explained to someone else in about an hour. The only things missing are the correctness and complexity proofs; with those, both general parsing and Datalog could become much more ubiquitously used. We also have the potential of the GSS as a new computer science workhorse, through introducing more variants of Takmela’s algorithm for many other uses. Thank you for giving someone the benefit of the doubt in the name of pursuing ideas!