This is a python implementation of LDA using gibbs sampling algorithm.
The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 16 sentences about one piece on wikipedia.
The following picture shows the top 10 words in the 10 topics (set K = 10) generated by this algorithm over 5000 chinese sina social news.
The following picture shows the top 10 words in the 30 topics (set K = 30) generated by this algorithm over 5000 chinese sina social news.
- ZhikaiZhang
- Email [email protected]
- Blog http://zhikaizhang.cn
- 自然语言处理之LDA