- Data Analytics: Overview and Key Concepts
- Data Science Project Procedure
- Machine Learning Methodologies
- Machine Learning Modeling Example: PP Article Classification Model
- Logistic Regression Formulation
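- Logistic Regression Training: Gradient Descent
- Multinomial Logistic Regression
- Performance Evaluation of Regression Models: MAE, MAPE, MSE, RMSE
- Performance Evaluation of Classification Models: Accuracy, Balanced Accuracy, F1 Score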
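- Deep Neural Networks: Overview
- Convolutional Neural Networks: The Convolution Concept, Representative CNN Architectures
- Recurrent Neural Networks, LSTM, GRU
- Autoencoders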
- Ensembles: Background
- Bagging & Random Forest
- AdaBoost & Gradient Boosting Machine
- Anomaly Detection
- Density-Based Anomaly Detection
- Model-Based Anomaly Detection
- Clustering: Overview and Validity Measures
- K-Means Clustering
- Hierarchical Clustering
- Density-Based Clustering: DBSCAN
Topic 1: Introduction to Text Analytics [Slide]
- Text Analytics: Background, Applications, Challenges, and Process [Video]
- Text Analytics Process [Video]
Topic 2: Text Preprocessing [Slide]
- Introduction to Natural Language Processing (NLP) [Video]
- Lexical analysis [Video]
- Syntax analysis & Other topics in NLP [Video]
- Reading materials
Topic 3: Text Representation I: Classic Methods [Slide]
- Bag of words, Word weighting, N-grams [Video]
Topic 5: Text Representation II: Distributed Representation [Slide]
- Neural Network Language Model (NNLM) [Video]
- Word2Vec [Video]
- GloVe [Video]
- FastText, Doc2Vec, and Other Embeddings [Video]
- Reading materials
Topic 6: Dimensionality Reduction [Slide]
- Dimensionality Reduction Overview, Supervised Feature Selection [Video]
- Unsupervised Feature Extraction [Video]
- Reading materials
- Sequence-to-Sequence Learning [Video]
- Transformer [Video]
- ELMo: Embeddings from Language Models [Video]
- GPT: Generative Pre-Training of a Language Model [Video]
- BERT: Bidirectional Encoder Representations from Transformers [Video]
- GPT-2: Language Models are Unsupervised Multitask Learners [Video]
- Transformer to T5 [Slide], [Video], Presented by Yukyoung Lee.
- Reading materials
- Topic modeling overview & Latent Semantic Analysis (LSA), Probabilistic Latent Semantic Analysis: pLSA [Video]
- LDA: Document Generation Process [Video]
- LDA Inference: Collapsed Gibbs Sampling, LDA Evaluation [Video]
- Reading materials
- Recommended video lectures
