bench: add memory benchmarks #255

rgrinberg · 2024-04-15T20:16:33Z

$ dune exec benchmarks/memory.exe will demonstrate the pathological example

Signed-off-by: Rudi Grinberg <[email protected]>

vouillon · 2024-04-16T11:13:44Z

The automata remembers the last size + 1 characters, which takes an exponential amount of memory.

In this case, one only needs to check that there are size zeroes or ones after the first 1, which would only require a linear amount of memory. For that, when building the automaton one would need to see that repn (set "01") (n + 1) (Some (n + 1) is subsumed by repn (set "01") n (Some n).

But this is very fragile. For a longest match semantics or a first match / greedy semantics, one really needs to remember all these characters. And if we change the regular expression only a little bit, like below, one cannot avoid the exponential behavior either.

  seq
    [ rep (set "01")
    ; char '1'
    ; repn (set "01") size (Some size)
    ; char 'x'
    ]

DFAs are not good at counting...

bench: add memory benchmarks

6ba83be

Signed-off-by: Rudi Grinberg <[email protected]>

rgrinberg force-pushed the ps/rr/bench__add_memory_benchmarks branch from aa84882 to 6ba83be Compare April 15, 2024 20:46

rgrinberg merged commit ec98adc into master Apr 15, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bench: add memory benchmarks #255

bench: add memory benchmarks #255

rgrinberg commented Apr 15, 2024 •

edited

Loading

vouillon commented Apr 16, 2024

bench: add memory benchmarks #255

bench: add memory benchmarks #255

Conversation

rgrinberg commented Apr 15, 2024 • edited Loading

vouillon commented Apr 16, 2024

rgrinberg commented Apr 15, 2024 •

edited

Loading