Skip to content

Latest commit

 

History

History
141 lines (108 loc) · 4.98 KB

pending.org

File metadata and controls

141 lines (108 loc) · 4.98 KB

Pending

type in wordnet-br a mismatch type s in WordNet-3

Evidence word “gordo” (br a -> en s). ruim (br a -> en s). Solucao, arquivo com triplas sameAs entre synsets-a e synsets-s.

sufixos “a” e “s” na WordNet-EN

Para ser possível mapeamento com a SentiWordNet, fazer os sameAs como para a WordNet-BR.

Improve debug of the functions

http://xach.livejournal.com/316925.html

improves in the Wordnet-br

leão, bunda, tabu, seminal, presente.

remove lemma from wordsenses

cleanup the triple store before serialize it.

IndexSense and WordSense must be merged.

bc field from worndet-br? bc from the others?

base concepts and core concepts are mapped to types. The number of base concepts in the XML files from Wordnet-BR do not have any special meaning.

Identify what to extract from index.XXXX files.

No new information.

total workflow

File process.lisp

tagCount

verb sentences examples

parsing adj marks

adj and adv synsets (sattelities)

It looks like the relations between synsets encode this information presented in the sense key.

lexnames

Testing

This tests can be runned using Slime connected to the remove Allegro Lisp on Amazon.

Classes

The SPARQL query for obtain the list of classes and number of intances for each classe is given below:

select ?cls (count(?res) as ?count)
where  {
  ?res a ?cls .
}
group by ?cls

In the next block we can also use :results output and :table in the :results-format parameter.

(sparql:run-sparql query :results-format :lists)

Properties

propertycountptr?pcounter
wn30:syntacticMarker105500
wn30:example399300
wn30:frame2164900
wn30:lexicographerFile11765900
wn30:gloss11765900
wn30:synsetId11765900
wn30:lexicalForm14873000
wn30:lemma14873000
wn30:tagCount20694100
wn30:senseNumber20694100
wn30:senseKey20694100
wn30:lexfile20694100
wn30:lexicalId20697800
rdf:label20697800
wn30:wordNumber20697800
wn30:word20697800
wn30:containsWordSense20697800
rdf:type47832700
wn30:participleOf73173
wn30:causes2201220
wn30:entails4081408
wn30:substanceMeronymOf7971797
wn30:substanceHolonymOf7971797
wn30:attribute127811278
wn30:classifiesByRegion136011360
wn30:classifiedByRegion136011360
wn30:classifiesByUsage137611376
wn30:classifiedByUsage137611376
wn30:sameVerbGroupAs175011750
wn30:adverbPertainsTo322213222
wn30:seeAlso327213272
wn30:adjectivePertainsTo480114801
wn30:classifiedByTopic665416654
wn30:classifiesByTopic665416654
wn30:antonymOf797917979
wn30:hasInstance857718577
wn30:instanceOf857718577
wn30:partHolonymOf909719097
wn30:partMeronymOf909719097
wn30:memberHolonymOf12293112293
wn30:memberMeronymOf12293112293
wn30:similarTo21386121386
wn30:derivationallyRelated74708174708
wn30:hypernymOf89089189089
wn30:hyponymOf89089189089
377583