Skip to content

Commit

Permalink
fix #60
Browse files Browse the repository at this point in the history
  • Loading branch information
dhruvil410 committed Mar 19, 2021
1 parent c924822 commit d181905
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion src/sentences/sentence_splitting.jl
Original file line number Diff line number Diff line change
Expand Up @@ -120,7 +120,8 @@ function postproc_splits(sentences::AbstractString)
sentences = replace(sentences, r"(\bMs\.)\n" => s"\1 ")
sentences = replace(sentences, r"(\bMrs\.)\n" => s"\1 ")


# no sentence break in between two words with no punctuation
sentences=replace(sentences,r"([a-zA-Z0-9])\n([a-zA-Z0-9])"=>s"\1 \2")


# possible TODO: filter excessively long / short sentences
Expand Down

0 comments on commit d181905

Please sign in to comment.