Skip to content

Reorganize Input Processing To Handle Prefix Morphological Rules

Compare
Choose a tag to compare
@prohippo prohippo released this 08 Nov 08:19
· 64 commits to master since this release

PyElly tokenization of input was improperly dealing with prefix recognition and splitting off. This was main due to problems with the '+' character used to marking prefixes and separated roots. The problems showed up in the "chemic" example application, which has to find chemical nomenclature in various styles of text. The names are too numerous to list out fully and so requires a bit of analysis.