Skip to content

Code used in the paper, Feature2Vec: Distributional semantic modelling of human property knowledge

License

Notifications You must be signed in to change notification settings

dumbshow/Feature2Vec

 
 

Repository files navigation

Feature2Vec

arxiv

Code used in the paper Feature2Vec: Distributional semantic modelling of human property knowledge.

Feature2Vec

Requirements

Code ran on ubuntu 16.04 with anaconda python distribution with python 3.6.9. Packages include,

  • numpy
  • pandas
  • scikit-learn
  • spacy
  • tensorflow-gpu
  • keras-gpu
  • jupyter lab
  • conda_nb

Note that the model probably trains faster on CPU but GPU versions of tensorflow and keras were used during experiments. We have also included a neural network class based on recent work https://www.mdpi.com/2504-2289/3/2/30 . We compare it with our model using hyper parameters that we found to work well (although better paramters may be found which could improve results).

Spacy spacy requires the en_core_web_lg language model.

python -m spacy download en_core_web_lg

Pretrained Embeddings

Pretrained embeddings for features, along with save/load functions, are now available. The embeddings are placed in the embeddings folder as .txt files. The embeddings were trained using all concepts and features. Feature2Vec still requires the spacy word embeddings and property norm datasets to run. See tsne below for example of possible analysis.

For another good visualization, see the tensorflow projection kindly provided by Marijn van Vliet.

How to Cite

Please use the following bibtex entry:

@article{derby2019feature2vec,
  title={Feature2Vec: Distributional semantic modelling of human property knowledge},
  author={Derby, Steven and Miller, Paul and Devereux, Barry},
  journal={arXiv preprint arXiv:1908.11439},
  year={2019}
}


About

Code used in the paper, Feature2Vec: Distributional semantic modelling of human property knowledge

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 69.9%
  • Jupyter Notebook 29.9%
  • Shell 0.2%