Commit 04191de
committed
Short-circuit nullable-fallback and inline single-branch fragments
Two grammar/parser refinements that both reduce recursive calls:
* In parse_recursive(): when the rule has a per-token branch selector but
the current token is not in any branch's FIRST and the rule itself is
nullable, return 'matched empty' immediately instead of descending into
nullable branches that would recursively do the same thing. This alone
eliminates ~460k recursive calls on the MySQL corpus.
* At grammar build time, expand every single-branch fragment rule into
its call sites. Fragments exist only to factor shared sub-sequences and
their children are already flattened into the parent AST node, so
splicing them directly into parent branches is a no-op for the
resulting tree but removes an entire recursive call per use. 480 of the
grammar's fragments qualify.
Also drops the dead terminal branch at the top of parse_recursive() (the
branch loop inlines terminal matching, so parse_recursive is only ever
called with non-terminal rule ids) and the always-false empty-branches
guard.
End-to-end parser benchmark:
Before: ~22,400 QPS After: ~27,500 QPS (+23%)1 parent 6812717 commit 04191de
2 files changed
Lines changed: 92 additions & 39 deletions
Lines changed: 71 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
110 | 110 | | |
111 | 111 | | |
112 | 112 | | |
| 113 | + | |
113 | 114 | | |
114 | 115 | | |
115 | 116 | | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
116 | 187 | | |
117 | 188 | | |
118 | 189 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
31 | 38 | | |
32 | | - | |
33 | | - | |
34 | | - | |
35 | | - | |
36 | | - | |
37 | | - | |
38 | | - | |
39 | | - | |
40 | | - | |
41 | | - | |
42 | | - | |
43 | | - | |
44 | | - | |
45 | | - | |
46 | | - | |
47 | | - | |
48 | | - | |
49 | | - | |
50 | | - | |
51 | | - | |
52 | | - | |
53 | | - | |
54 | | - | |
55 | | - | |
56 | | - | |
| 39 | + | |
57 | 40 | | |
58 | 41 | | |
59 | 42 | | |
60 | 43 | | |
61 | 44 | | |
62 | | - | |
63 | | - | |
64 | | - | |
65 | | - | |
66 | | - | |
67 | | - | |
68 | | - | |
69 | | - | |
70 | | - | |
71 | | - | |
72 | | - | |
73 | | - | |
74 | | - | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
75 | 54 | | |
76 | | - | |
| 55 | + | |
77 | 56 | | |
78 | 57 | | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
79 | 61 | | |
80 | 62 | | |
81 | 63 | | |
| |||
0 commit comments