Video, audio, and text analysis references

cesine edited this page Oct 2, 2011 · 5 revisions

Table of Contents

Eye Gaze

Eye Tracking

Cameras,” Boston University Computer Science Technical Report 2005-012.

Natural Language Processing

Transcription, subtitling

Speech Stream Segmentation

  • Willie Walker, Paul Lamere, Philip Kwok, Bhiksha Raj, Rita Singh, Evandro Gouvea, Peter Wolf, Joe Woelfel. (2004) Sphinx-4: A Flexible Open Source Framework for Speech Recognition. Technical Report SMLI TR2004-0811, Sun Microsystems Inc.

Phonetic Analysis

Statistical Analysis and Results Visualization