Releases · clulab/timenorm

This release improves the linking process in the Scala version of the parser, especially for the Between operator:

Linking of the Between operator has been improved by checking the text and its position relative to the child entity. E.g., "until 2019" cannot be linked with End-Interval.
Unknown is now the default value for the Type property if the case is not found in date-types.txt.
For Frequency, the value of the Type operator is Other.
date-types.txt has been refined by removing unnecessary cases and adding new ones, such as hours and seconds.

In parseBatchToXML, the text is now cleaned by replacing each control character with one space character, which avoids wrong spans when the text contains consecutive control characters (^C, ^L, ...). Previously, a run of consecutive control characters was replaced with a single space character.
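The effect on spans can be illustrated with a minimal Python sketch. This is not the code used in parseBatchToXML: the function names and the exact set of characters treated as control characters (ASCII 0x00-0x1F and 0x7F here) are assumptions made for illustration only.

```python
import re

# Assumption: "control character" means ASCII 0x00-0x1F or 0x7F.
# The release notes do not say which characters timenorm actually replaces.
CONTROL_CHAR = re.compile(r"[\x00-\x1f\x7f]")
CONTROL_RUN = re.compile(r"[\x00-\x1f\x7f]+")


def clean_one_to_one(text: str) -> str:
    """Replace every control character with one space (length is preserved)."""
    return CONTROL_CHAR.sub(" ", text)


def clean_collapsing(text: str) -> str:
    """Replace each run of control characters with one space (length may shrink)."""
    return CONTROL_RUN.sub(" ", text)


# Two consecutive control characters (^C and ^L) before the time expression.
raw = "due\x03\x0c2019"
start = raw.index("2019")
end = start + len("2019")

# Offsets computed on the raw text still select "2019" after one-to-one cleaning.
one_to_one_span = clean_one_to_one(raw)[start:end]

# After collapsing, the cleaned text is one character shorter,
# so the same offsets no longer select the whole expression.
collapsed_span = clean_collapsing(raw)[start:end]
```

Because the one-to-one replacement keeps the cleaned text the same length as the input, character offsets found in the cleaned text can be used directly against the original text.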

This release includes a number of major changes:

Packages have been reorganized to separate the new time normalizer based on the SCATE schema and the old time normalizer based on the TimeML schema.
The neural SCATE-based normalizer now directly returns the SCATE operators (TimeExpression objects), which now include character offsets of where each operator was found.
The anti-xml dependency has been removed in favor of the standard scala-xml. The scala-xml APIs are ugly, but anti-xml is no longer maintained.
The English number parsing grammar that was included as part of the old time normalizer has been simplified and separated out into its own API, WordsToNumber, so it can be used by the neural parser.
The build is now based on SBT instead of Maven.

Releases: clulab/timenorm

timenorm-1.0.5

timenorm-1.0.4

timenorm-1.0.3

timenorm-1.0.2

timenorm-1.0.1

timenorm-1.0.0

timenorm-0.12.1

timenorm-0.12.0

timenorm-0.11.2

timenorm-0.11.1_2.11.11