Skip to content

Latest commit

 

History

History
136 lines (110 loc) · 6.15 KB

PE504-Natural-Language-Processing.org

File metadata and controls

136 lines (110 loc) · 6.15 KB

<<<PE504>>> NATURAL LANGUAGE PROCESSING

{{{credits}}}

LTPC
3003

COURSE OBJECTIVES

  • To learn language models
  • To understand the levels of knowledge in language processing
  • To develop NLP applications.

{{{unit}}}

UNIT IOVERVIEW AND LANGUAGE MODELING8

Origins and challenges of NLP – Knowledge in language processing – NLP applications; Language Modeling: Language and grammar – Grammar-based language models – Lexical functional grammar – Government and binding; Statistical Language Model: N-gram model – Smoothing techniques.

{{{unit}}}

UNIT IIWORD LEVEL AND SYNTACTIC ANALYSIS10

Word Level Analysis: Regular expressions – Survey of morphology – Word and sentence tokenization – Stemmer – Word classes – Part-of-Speech Tagging: HMM POS tagging; Syntactic Analysis: Constituency – Context-free grammar – Dependency Grammar; Parsing: Top-down – Bottom-up – Ambiguity – Early algorithm – CYK – Probabilistic CFG – Probabilistic CYK parsing; Tree banks.

{{{unit}}}

UNIT IIISEMANTIC ANALYSIS9

The representation of Meaning: Meaning representation – Computational desiderata for representation; Lexical Semantics: Word senses – Relations – WordNet – Thematic roles – Selectional restrictions; Word Sense Disambiguation: Dictionary-based – Supervised – Minimally-supervised – Unsupervised; Word Similarity: Thesaurus methods – Distributional methods.

{{{unit}}}

UNIT IVDISCOURSE PROCESSING, IR AND IE9

Discourse Processing: Reference resolution – Anaphora resolution algorithms – Co-reference resolution; Information Retrieval: The vector space model – Term weighting – Evaluation of IR; Information Extraction: Named entity recognition – Relation detection and classification.

{{{unit}}}

UNIT VMACHINE TRANSLATION AND QUESTION ANSWERING9

Machine Translation(MT): Problems in machine translation – Classical MT – Statistical MT; Factoid Question Answering: Question processing – Passage retrieval – Answer processing – Evaluation of factoid answers.

\hfill Total Periods: 45

COURSE OUTCOMES

Upon the completion of the course the students should be able to:

  • Describe the language models (K3)
  • Explain levels of knowledge in language processing (K3)
  • Apply computational methods in semantic and discourse processing (K3)
  • Apply NLP techniques to MT, IR, IE, QA and Summarization systems (K2)
  • Apply evaluation metrics for different NLP applications (K3).

TEXT BOOKS

  1. Daniel Jurafsky and James H Martin, “Speech and Language Processing: An introduction to Natural Language Processing, Computational Linguistics and Speech Recognition”, 2nd Edition, Prentice Hall, 2008.
  2. Tanveer Siddiqui, U S Tiwary, “Natural Language Processing and Information Retrieval”, Oxford University Press, 2008.

REFERENCES

  1. Christopher D Manning, Hinrich Schutze, “Foundations of Statistical Natural Language Processing”, MIT Press, 1999.
  2. Nitin Indurkhya, Fred J Damerau, “Handbook of Natural Language Processing”, 2nd Edition, CRC Press, 2010.
  3. Steven Bird, Ewan Klein, “Natural Language Processing with Python”, O’Reilly Media, 2009.
  4. Ruslan Mitkov, “The Oxford Handbook of Computational Linguistics”, Oxford University Press, 2009.
  5. NLTK – Natural Language Tool Kit - http://www.nltk.org/.

** CO PO MAPPING :noexport:

PO1PO2PO3PO4PO5PO6PO7PO8PO9PO10PO11PO12PSO1PSO2PSO3
CO1322
CO2322
CO3322
CO43233
CO5322
Score1510311
Course Mapping3233