Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

语料如何构造 #5

Open
zmingshi opened this issue Jun 20, 2020 · 3 comments
Open

语料如何构造 #5

zmingshi opened this issue Jun 20, 2020 · 3 comments

Comments

@zmingshi
Copy link

麻烦问下,这个语料如何构造呢?可以分享一些经验吗

@ZhuiyiTechnology
Copy link
Owner

百度知道爬取

@chenjun0210
Copy link

尝试爬了,但是反爬被禁了。。。请问爬好的数据就直接用了吗?有做什么其他额外的数据预处理吗?不一定百度推荐的相似query就是语义相关的吧,也会有噪音吧。

@zhangtaochn
Copy link

@ZhuiyiTechnology 那请问下,你们这个数据量有多大呢,达到了这个效果

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants