Experimenting with LTSM RNN.

Code largely from: https://github.com/deeplearning4j/dl4j-examples

Goal: To experiment tuning/using DL4J. To enable this, have largely taken code samples from DL4J repo, with the aim to tune/build upon.

LSTM Hyperparameter Tuning

https://deeplearning4j.org/lstm.html http://karpathy.github.io/2015/05/21/rnn-effectiveness/

Ideas:

-Add benchmarking in to track how long it takes to train. -Load params from JSON. -Try optimizing methods as per guide. -Speed up training - optimize JVM.

Optimizations:

Tried: Changing JVM options (fill in) - no changes. Approx run time ~ 45 mins. (1 epoch)

Default DL4J settings: Approx run time ~ 45 mins. (1 epoch) Set in pom.xml: <nd4j.backend>nd4j-native-platform</nd4j.backend>

Changing to CUDA: Approx run time ~ 15 mins. (1 epoch) Set in pom.xml: <nd4j.backend>nd4j-cuda-8.0-platform</nd4j.backend> https://developer.nvidia.com/cuda-downloads

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src/main/java/orgkwahsog		src/main/java/orgkwahsog
Shakespeare1.txt		Shakespeare1.txt
pom.xml		pom.xml
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Experimenting with LTSM RNN.

LSTM Hyperparameter Tuning

Ideas:

Optimizations:

About

Releases

Packages

Languages

kwahsog/GravesLTSMTraining

Folders and files

Latest commit

History

Repository files navigation

Experimenting with LTSM RNN.

LSTM Hyperparameter Tuning

Ideas:

Optimizations:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages