Skip to content

Commit

Permalink
[en] new dict, start fixing tests
Browse files Browse the repository at this point in the history
[en] fix tests

[en] multiwords

[en] disambiguation: Even

[en] multiwords

[en] multiwords

[en] multiwords

[en] multiwords

[en] multiwords

[en] fix some FPs, by @AzadehSafakish

[en] multiwords

[en] do not synthesize: saith

[en] multiwords

[en] multiwords

[en] multiwords, empty files: spelling.txt, added.txt

[en] dummy commit
  • Loading branch information
jaumeortola committed Jan 11, 2025
1 parent c32f97f commit 842fa0f
Show file tree
Hide file tree
Showing 23 changed files with 128 additions and 1,872 deletions.

Large diffs are not rendered by default.

Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,16 @@
<token postag="NNP?S|VB[PD]|SENT_END|PCT" postag_regexp="yes"/>
</equivalence>
</unification>

<rule id="INSTAL_INSTALL" name="instal/install">
<pattern>
<and>
<token inflected="yes">install</token>
<token inflected="yes">instal</token>
</and>
</pattern>
<disambig action="remove"><wd lemma="instal"></wd></disambig>
</rule>

<rule name="or" id="REMOVE_JJ_FOR_OR">
<!-- work around strange dict entry -->
Expand Down Expand Up @@ -75,7 +85,8 @@
<token chunk_re=".-VP">gives</token>
</pattern>
<disambig action="remove"><wd pos="NNS"/></disambig>
<example type="ambiguous" inputform="gives[gif/NNS,give/NNS,give/VBZ]" outputform="gives[give/VBZ]">Next, GOPUSA <marker>gives</marker> you information on Enron and the Democrats as reported by Bob Webster, the editor of OpinioNet.com.</example>
<!-- gif/NNS, -->
<example type="ambiguous" inputform="gives[give/NNS,give/VBZ]" outputform="gives[give/VBZ]">Next, GOPUSA <marker>gives</marker> you information on Enron and the Democrats as reported by Bob Webster, the editor of OpinioNet.com.</example>
</rule>

<rulegroup id="CONTRACTIONS" name="contractions: special cases">
Expand Down Expand Up @@ -1556,7 +1567,7 @@
<disambig action="filterall"/>
<example type="untouched">One car, two dogs, seven cats</example>
<example type="ambiguous" inputform="one[one/CD,one/JJ,one/NN,one/PRP]" outputform="one[one/PRP]">No <marker>one</marker> trusts him any more.</example>
<example type="ambiguous" inputform="No[No/NNP,no/DT,no/NN]" outputform="No[no/DT]"><marker>No</marker> one trusts him any more.</example>
<example type="ambiguous" inputform="No[no/DT,no/NN]" outputform="No[no/DT]"><marker>No</marker> one trusts him any more.</example>
</rule>

<rule id="ONE_OF_THE_MOST" name="'most' as RBS">
Expand Down Expand Up @@ -3205,7 +3216,7 @@ Incorrectly touched. <example type="untouched">Incorrect grammar. Those present
<example type="untouched">Let's are not expensive in this part of town.</example><!-- All 'untouched' examples contain an incorrect possessive 's with a noun -->
<example type="untouched">Our let's in this prestigious property are taken very quickly.</example>
<example type="untouched">... but whose let's go quickly?</example>
<example type="ambiguous" inputform="Let[Let/NNP,let/NN,let/VB,let/VBD,let/VBN,let/VBP]" outputform="Let[let/VBP]"><marker>Let</marker>'s do a test.</example>
<example type="ambiguous" inputform="Let[let/NN,let/VB,let/VBD,let/VBN,let/VBP]" outputform="Let[let/VBP]"><marker>Let</marker>'s do a test.</example>
</rule>
<rule>
<pattern>
Expand Down Expand Up @@ -3241,7 +3252,7 @@ Incorrectly touched. <example type="untouched">Incorrect grammar. Those present
</pattern>
<disambig action="filter" postag="VB.*"/>
<example type="untouched">Let was an unincorporated community in Gilmer County, West Virginia, United States.</example>
<example type="ambiguous" inputform="Let[Let/NNP,let/NN,let/VB,let/VBD,let/VBN,let/VBP]" outputform="Let[let/VB,let/VBD,let/VBN,let/VBP]"><marker>Let</marker> go!</example>
<example type="ambiguous" inputform="Let[let/NN,let/VB,let/VBD,let/VBN,let/VBP]" outputform="Let[let/VB,let/VBD,let/VBN,let/VBP]"><marker>Let</marker> go!</example>
</rule>

<rule id="BE_OUT" name="OUT as RB">
Expand Down Expand Up @@ -10606,7 +10617,7 @@ Incorrectly touched. <example type="untouched">Incorrect grammar. Those present
<example type="untouched">I thought I'd already told you oil and water don't mix.</example>
<example type="untouched">Who told you oil and water actually mixes well?</example>
<example type="untouched">... as he said to Jane and I milk and honey is the food of the gods.</example>
<example type="ambiguous" inputform="email[email/NN:U,email/VB,email/VBP]" outputform="email[email/VBP]">Currently we <marker>email</marker> and share the stuff.</example><!-- #2539-->
<!--<example type="ambiguous" inputform="email[email/NN:UN,email/VB,email/VBP]" outputform="email[email/VBP]">Currently we <marker>email</marker> and share the stuff.</example><!-\- #2539-\->-->
</rule>

<rule id="PRP_MD_RB_VB" name="pronoun + modal + adverb + noun/verb > verb"><!-- #2396 -->
Expand Down Expand Up @@ -10651,7 +10662,7 @@ Incorrectly touched. <example type="untouched">Incorrect grammar. Those present
</pattern>
<disambig postag="JJS"/>
<example type="untouched">Most of his writing is rubbish.</example>
<example type="ambiguous" inputform="Most[Most/NNP,most/NN:U,much/JJS,much/RBS]" outputform="Most[much/JJS]"><marker>Most</marker> are around 40-50 μm in diameter.</example>
<example type="ambiguous" inputform="Most[most/NN:U,much/JJS,much/RBS]" outputform="Most[much/JJS]"><marker>Most</marker> are around 40-50 μm in diameter.</example>
</rule>

<rule id="LESS" name="less as connective (revenues less expenses)">
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -8,4 +8,5 @@ mixt mix VBN
showed show VBN
busses bus NNS
hid hide VBN
a'ight alright RG
a'ight alright RG
saith say VBZ
Loading

0 comments on commit 842fa0f

Please sign in to comment.