Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

whitaker's: missing meanings for words with multi dict differing principal parts spelling #3

Closed
balmas opened this issue Aug 8, 2018 · 1 comment
Labels

Comments

@balmas
Copy link
Member

balmas commented Aug 8, 2018

It looks like there is a problem with the Whitaker's output when we have multiple dict elements with differing frequency or spelling of principal parts. We get only one mean element and it's not clear which dict element it applies to and the other meanings are missing. This might be a problem with our xml API enhancement to whitakers or it might be in the original source. Either way, fixing it would require revisiting the Ada code...

Example output below.

Words found so far that have this issue:

aberis
adero
adjuvo (adiuvabo, adiuvante,...)
alo ( 'alitus' vs 'altus' in principal parts)
amicio ('amixi' vs 'amicui' in principal parts)
apta
auxilio
beatricem (trico vs tricor)
blandiatur (blandio vs blandior)
caedo (caecidi vs cacidi)
cape (here we have some garbage in one of the hdwds "capio, capere, additional, forms")
clave, claves, clavis
comedo (comessus vs comestus vs comesus)
como
commoraris (commoro vs commoror)
congredior
contemplur (contemplo vs contemplor)
coque (coquos vs coquus)
criminati (crimino vs criminor)
cunctor (cunctor; cunctari; cunctatus vs cuncto; cunctare; cunctavi; cunctatus)
desino (desino; desinere; desivi; desitus vs desino; desinare; desavi; desatus)
duco (some garbage in one of the hdwds "duco; ducere; additional; forms")
edo (essus vs esus)
emere (emereo vs emereor)
excurro (excurro; excurrere; excucurri; excursus vs excurro; excurrere; excurri; excursus)
felem (felis vs feles)
grammaticae
ibis
imitandum (imito/imitor)
industrius (industriior vs industrior)
inferus
insuper
iocari (joco vs jocor)
itinera (itiner vs itiner)
lacrimante (lacrimo vs lacrimor)
lactis (lac vs lact)
lamentari (lamento/lamentor)
latrina (latrina/latrinum)
lavo (lavatus v lautos v lotus)
merendam (mereo vs mereor)
mille (millis vs milis)
misereror (misereo vs misereror vs miseret)
obsonatum (obsono vs obsonor)
odi (odeo vs odio)
ostendere (ostendo vs ostendeo)
pantheum (vs pantheom)
physicae (a big mess)
poto (potatus vs potus)
pradium (prandii vs prandi(i)
prodito (prodo vs prodeo)
promo (prompsi vs promsi)
pungo (pupugi vs pepgui)
quasi
salit (salo vs saleo)
scio (scivi vs scivi(ii)
scrutari (scruto vs scrutor)
septimia (septim vs septem)
sicut (adv vs conjunction)
spondeo (spopondi vs spepondi)
tueor (tuitus vs tutus)
vello (volsi vs velli)

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <oac:Annotation xmlns:oac="http://www.openannotation.org/ns/" rdf:about="urn:TuftsMorphologyService:aberis:whitakerLat">
    <dcterms:creator xmlns:dcterms="http://purl.org/dc/terms/">
      <foaf:Agent xmlns:foaf="http://xmlns.com/foaf/0.1/" rdf:about="net.alpheios:tools:wordsxml.v1"/>
    </dcterms:creator>
    <dcterms:created xmlns:dcterms="http://purl.org/dc/terms/">2018-08-08T14:21:53.417956</dcterms:created>
    <dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/">Short definitions and morphology from Words by William Whitaker, Copyright 1993-2007.</dc:rights>
    <oac:hasTarget>
      <rdf:Description rdf:about="urn:word:aberis"/>
    </oac:hasTarget>
    <dc:title xmlns:dc="http://purl.org/dc/elements/1.1/"/>
    <oac:hasBody rdf:resource="urn:uuid:idm140640518845224"/>
    <oac:Body rdf:about="urn:uuid:idm140640518845224">
      <rdf:type rdf:resource="cnt:ContentAsXML"/>
      <cnt:rest xmlns:cnt="http://www.w3.org/2008/content#">
        <entry>
          <infl>
            <term xml:lang="lat">
              <stem>ab</stem>
              <suff>eris</suff>
            </term>
            <pofs order="3">verb</pofs>
            <conj>5th</conj>
            <var>1st</var>
            <tense>future</tense>
            <voice>active</voice>
            <mood>indicative</mood>
            <pers>2nd</pers>
            <num>singular</num>
          </infl>
          <dict>
            <hdwd xml:lang="lat">absum, abesse, abfui, abfuturus</hdwd>
            <pofs order="3">verb</pofs>
            <freq order="3">lesser</freq>
            <src>Lewis+Short</src>
          </dict>
          <dict>
            <hdwd xml:lang="lat">absum, abesse, afui, afuturus</hdwd>
            <pofs order="3">verb</pofs>
            <freq order="6">very frequent</freq>
            <src>Ox.Lat.Dict.</src>
          </dict>
          <mean>be away/absent/distant/missing; be free/removed from; be lacking; be distinct;</mean>
        </entry>
      </cnt:rest>
    </oac:Body>
  </oac:Annotation>
</rdf:RDF>
@balmas balmas added the bug label Aug 8, 2018
@balmas balmas changed the title whitaker's: output for aberis is incomplete whitaker's: missing meanings Aug 8, 2018
@balmas balmas changed the title whitaker's: missing meanings whitaker's: missing meanings for words with multi dict differing freq Aug 8, 2018
@balmas balmas changed the title whitaker's: missing meanings for words with multi dict differing freq whitaker's: missing meanings for words with multi dict differing freq or principal parts Aug 8, 2018
@balmas balmas changed the title whitaker's: missing meanings for words with multi dict differing freq or principal parts whitaker's: missing meanings for words with multi dict differing freq or principal parts spelling Aug 8, 2018
@balmas balmas changed the title whitaker's: missing meanings for words with multi dict differing freq or principal parts spelling whitaker's: missing meanings for words with multi dict differing principal parts spelling Aug 13, 2018
@balmas
Copy link
Member Author

balmas commented Aug 13, 2018

Looking at these more closely, I think most, if not all are due to differences in the spellings of the principal parts. We probably normalize this via the morph-client see alpheios-project/morph-client#17

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant