- Original Work
- Motivations
- Quickstart
- Continue Training (--action continue)
- Getting the perplexity of a text (--action test)
- Line by line perplexity (--action ppl)
- Line by line loglikes (--action loglikes)
- Line by line prediction (--action predict)
- Results
## Original Work
Our work is based on the RNN LM tutorial on tensorflow.org, following the paper by Zaremba et al., 2014.
The tutorial uses the PTB dataset (tgz). We intend to work with various datasets, which is why we made the names more generic by removing the several "PTB" prefixes initially present in the code.

Original sources: TensorFlow v0.11 - PTB

See also:
- Nelken's "tf" repo, which inspired our work by the way it implements features we are interested in.
- Benoit Favre's tf rnn lm
## Motivations
- Getting started with TensorFlow
- Make RNN LM manipulation easy in practice (easily view/edit configs, cancel/resume training, multiple output modes...)
- Train RNN LMs for ASR with Kaldi (especially using the `loglikes` mode)
## Quickstart

```bash
git clone https://github.com/pltrdy/tf_rnnlm
cd tf_rnnlm
python word_lm.py --help
```
Downloading the PTB dataset:

```bash
./tools/get_ptb.sh
```
Training the small model:

```bash
./tools/train.sh
# (roughly equivalent to:)
python word_lm.py --action train --data_path=./simple-examples/data --model_dir=./small_model --config small
```
Training a custom model:

```bash
mkdir custom_model

# Generate a new config file
chmod +x gen_config.py
./gen_config.py small custom_model/config

# Edit whatever you want
vi custom_model/config

# Train it. It will automatically look for a 'config' file in the model
# directory since no --config is set. It will look for 'train.txt',
# 'test.txt' and 'valid.txt' in --data_path; these files must be present.
python word_lm.py --action train --data_path=./simple-examples/data --model_dir=./custom_model
```
You can also use command-line options to override configuration parameters directly, e.g. `--batch_size 128`, `--max_max_epoch 30`, etc. (see `./word_lm.py --help`).
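For illustration, the config file might contain parameters along the following lines. This is a hypothetical sketch: the key names are taken from the original PTB tutorial's configurations, and the actual on-disk format is whatever `gen_config.py` writes.

```
# Hypothetical config contents -- actual format/keys come from gen_config.py
batch_size: 20
num_layers: 2
hidden_size: 200
num_steps: 20
learning_rate: 1.0
max_max_epoch: 13
```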
Note: data files are expected to be called `train.txt`, `test.txt` and `valid.txt`; `get_ptb.sh` creates symlinks for that purpose.
## Continue Training (--action continue)

One can continue an interrupted training with the following command:

```bash
python word_lm.py --action continue --data_path=./simple-examples/data --model_dir=./model
```

where `./model` must contain `config`, `word_to_id`, `checkpoint` and the corresponding `.ckpt` files.
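For instance, a model directory might look like this (a hypothetical listing; the exact checkpoint file names depend on how TensorFlow saved the model):

```
model/
├── config
├── word_to_id
├── checkpoint
└── model.ckpt        # possibly several .ckpt files
```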
## Getting the perplexity of a text (--action test)

```bash
# Compute and output the perplexity of ./simple-examples/data/test.txt using the LM in ./model
python word_lm.py --action test --data_path=./simple-examples/data --model_dir=./model
```
## Line by line perplexity (--action ppl)

Runs the model on each `stdin` line and prints its perplexity (more precisely its 'per-word perplexity', i.e. `exp(costs/iters)`):

```bash
cat ./data/test.txt | python word_lm.py --action ppl --model_dir=./model
```
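As a minimal sketch of that formula (illustrative names, not the actual code from `word_lm.py`), where `costs` is the accumulated cross-entropy of the line and `iters` the number of words processed:

```python
import math

def per_word_perplexity(costs, iters):
    # costs: summed natural-log cross-entropy over the line
    # iters: number of words (time steps) processed
    return math.exp(costs / iters)
```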
## Line by line loglikes (--action loglikes)

Runs the model on each `stdin` line and prints its 'loglikes' (i.e. `-costs/log(10)`).
Note: this mode is meant in particular for Kaldi rescoring.

```bash
cat ./data/test.txt | python word_lm.py --action loglikes --model_dir=./model
```
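With the same illustrative `costs` variable, the conversion is just a sign flip and a change of base (dividing a natural-log value by log(10) yields the base-10 logarithm Kaldi expects):

```python
import math

def loglikes(costs):
    # costs: summed natural-log cross-entropy of the line;
    # negate it and convert from natural log to base 10.
    return -costs / math.log(10)
```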
## Line by line prediction (--action predict)

Runs the model on each `stdin` line and prints prediction information:

```bash
cat ./data/test.txt | python word_lm.py --action predict --model_dir=./model
```
It will print a JSON object with the following structure:
- `ppl`: perplexity of the line (float)
- `predictions`: array with one entry per word of the line:
  - `word`: current word
  - `target`: next word
  - `prob`: probability associated with the target
  - `pred_word`: predicted word
  - `pred_prob`: probability associated with the predicted word
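For illustration, the output for one line might look like this (words and numbers are made up, and the exact field order may differ):

```json
{
  "ppl": 142.7,
  "predictions": [
    {"word": "the", "target": "cat", "prob": 0.002,
     "pred_word": "company", "pred_prob": 0.031},
    {"word": "cat", "target": "sat", "prob": 0.0007,
     "pred_word": "is", "pred_prob": 0.058}
  ]
}
```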
## Results

Quoting `tensorflow.models.rnn.ptb.ptb_word_lm.py:22`:

> There are 3 supported model configurations:

| config | epochs | train | valid  | test   |
|--------|--------|-------|--------|--------|
| small  | 13     | 37.99 | 121.39 | 115.91 |
| medium | 39     | 48.45 | 86.16  | 82.07  |
| large  | 55     | 37.87 | 82.62  | 78.29  |

The train/valid/test figures are word-level perplexities after the given number of epochs. The exact results may vary depending on the random initialization.