Skip to content

Supplemental statistics for SPE paper

Shunsuke Kanda edited this page Feb 23, 2023 · 1 revision

We received a question about our SPE paper that the length of a sentence (i.e., search text) used in Section 4 is unknown.

To supplement it, we show the statistics as follows:

Dataset Source #sentences #chars/sentence #bytes/sentence
English Pizza&Chili Corpus 1,000,000 60.5 60.5
Japanese BCCWJ 1,000,000 33.8 99.5
Clone this wiki locally