Skip to content
/ aelius Public

Python/NLTK-based package for shallow parsing of Brazilian Portuguese

License

Notifications You must be signed in to change notification settings

LR-POR/aelius

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Aelius Brazilian Portuguese POS-Tagger and Corpus Annotation Tool http://sourceforge.net/projects/aelius/

Author: Leonel F. de Alencar <[email protected]>

Copyright (C) 2010-2013 Leonel F. de Alencar

For license information, see LICENSE.txt

Aelius is an ongoing free software, open source project aiming at developing a suite of Python, NLTK-based modules and interfaces to external freely available tools for shallow parsing of Brazilian Portuguese. It also includes language resources such as language models, sample texts, and gold standards. Presently, Aelius already offers facilities for POS-tagging and chunking corpora and outputting annotations in different formats, such as in XML in the TEI P5 encoding scheme.


Documentation: http://aelius.sourceforge.net/manual.html

  
Mailing Lists: see Aelius homepage

 
Contributing: If you would like to contribute to Aelius,
    please see Aelius homepage

Donating: Have you found the toolkit helpful?  Please support Aelius development
    by donating to the project via PayPal, using the link on the Aelius homepage.

Redistributing: Aelius source code is distributed under the Apache 2.0 License.
    Aelius documentation is distributed under the Creative Commons
    Attribution-Noncommercial-No Derivative Works 3.0 United States license.
    Aelius corpora and language models are provided under the terms
    given in the README and/or LICENSE file
    for these corpora and language models; all are redistributable, and available for non-commercial use.
    Aelius may be freely redistributed, subject to the provisions of these licenses.

Citing: If you publish work that uses Aelius, please cite the following:

APA style
Alencar, L. F. de. (2013). Aelius Brazilian Portuguese POS-Tagger and Corpus Annotation Tool (Version 0.9.7) [Software]. Available from http://aelius.sourceforge.net.

ABNT style
Alencar, Leonel Figueiredo de. Aelius Brazilian Portuguese POS-Tagger and Corpus Annotation Tool, versão 0.9.7. Fortaleza: [s.n.], 2013. Disponível em: <http://aelius.sourceforge.net/>. Acesso em: 25 fev. 2013.

The first version of Aelius was presented at the following conference, whose proceedings will hopefully be published soon; a draft of the presentation is available on-line:

Alencar, Leonel Figueiredo de. Aelius: uma ferramenta para anotação automática de corpora usando o NLTK. ELC 2010, The 9th Brazilian Corpus Linguistics Meeting, Porto Alegre, Brazil, Rio Grande do Sul Catholic University (PUCRS), October 8 and 9, 2010. 
http://corpuslg.org/gelc/elc2010.php?paged=2
http://corpuslg.org/gelc/media/blogs/elc2010/slides/Figueiredo_de_Alencar.pdf

Acknowledgments: my Computational Linguistics students at the Universidade Federal do Ceará; Marcel Caraciolo for some code improvement suggestions and feedback on the Aelius documentation.

This file is based on NTLK's README.txt
http://code.google.com/p/nltk/source/browse/trunk/nltk/README.txt