elephant-sense

Evaluates the quality of the content itself using machine learning.

You can try it from here.

Setup

Get a Qiita API token and set it as an environment variable.

$ export QiitaToken=xxx

(only the read_qiita scope is required)

Then build the image from the Dockerfile and run it!
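
As a rough illustration of how the token might be used, the sketch below reads `QiitaToken` from the environment and attaches it to a Qiita API v2 request as a Bearer token. The helper names `token_from_env` and `build_items_request` are hypothetical and do not come from this repository; the request is only built, not sent.

```python
import os
import urllib.parse
import urllib.request

# Public Qiita API v2 endpoint that lists posts ("items").
QIITA_ITEMS_URL = "https://qiita.com/api/v2/items"


def token_from_env(name="QiitaToken"):
    """Read the API token from the environment (hypothetical helper)."""
    token = os.environ.get(name)
    if not token:
        raise RuntimeError(f"Set the {name} environment variable first")
    return token


def build_items_request(token, page=1, per_page=20):
    """Build, but do not send, an authenticated request for one page of posts."""
    query = urllib.parse.urlencode({"page": page, "per_page": per_page})
    request = urllib.request.Request(f"{QIITA_ITEMS_URL}?{query}")
    request.add_header("Authorization", f"Bearer {token}")
    return request
```

Passing the result to `urllib.request.urlopen` would perform the actual call; keeping construction separate makes it easy to check the URL and header without network access.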

For Training the Model

Data Preparation

Place the Qiita posts in data/raw/items
- You can get Qiita posts through the Qiita API.
- Each post is one JSON file named after its post id (like 0a0000aa0a0000a00aa0.json).
Place the annotated file labeled_qiita_posts.csv in data/raw.
- Its format is No,url,Title, followed by annotator1, annotator2... (you can name the columns as you like).
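
The layout above can be sketched as follows. This is only an illustration under stated assumptions: each post is a dict with an `id` key (as in Qiita API responses), the CSV has a header row, and every column after the first three holds one annotator's label. The function names `save_post` and `read_annotations` are hypothetical, not part of this repository.

```python
import csv
import json
from pathlib import Path


def save_post(post, items_dir="data/raw/items"):
    """Write one post to <items_dir>/<post id>.json and return the path."""
    items_dir = Path(items_dir)
    items_dir.mkdir(parents=True, exist_ok=True)
    path = items_dir / f"{post['id']}.json"
    path.write_text(json.dumps(post, ensure_ascii=False), encoding="utf-8")
    return path


def read_annotations(csv_path="data/raw/labeled_qiita_posts.csv"):
    """Return one dict per row; assumes a header row and labels from column 4 on."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        rows = list(csv.reader(f))
    header, body = rows[0], rows[1:]
    annotators = header[3:]  # column names are free-form, so take them by position
    return [
        {"url": row[1], "title": row[2], "labels": dict(zip(annotators, row[3:]))}
        for row in body
    ]
```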

Data Preprocessing

Run the following script.

python scripts/data/make_data.py

Then the labeled JSON files are stored in data/processed/items.

Next, execute preprocessing.

python scripts/data/preprocessing.py

posts.json will be created in data/processed/.
posts.json includes the split tokens of each post. You can use this to get the words in the posts.
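
One way the tokens might be used is to count word frequencies across posts. The structure of posts.json is not described here, so this sketch assumes a JSON list of objects that each carry a list of tokens under a `tokens` key; that key name and both function names are assumptions, so adjust them to the actual file.

```python
import json
from collections import Counter


def load_posts(path="data/processed/posts.json"):
    """Load the preprocessed posts (assumed to be a JSON list of objects)."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def count_words(posts, tokens_key="tokens"):
    """Count token occurrences over all posts; tokens_key is an assumed field name."""
    counts = Counter()
    for post in posts:
        counts.update(post.get(tokens_key, []))
    return counts
```

For example, `count_words(load_posts()).most_common(20)` would list the twenty most frequent tokens, provided the assumed layout matches.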

chakki-works/elephant_sense
