Error on training when used preprocess_shards #69

ghost · 2016-11-18T13:39:47Z

Due to large training dataset I had to use the preprocess_shards in order to split it. When running the train.lua i get the following error:
loading data...
/home/sergio/torch/install/bin/luajit: /home/sergio/torch/install/share/lua/5.1/hdf5/group.lua:312: HDF5Group:read() - no such child 'num_source_features' for [HDF5Group 33554432 /]
Seems like 'num_source_features' is used in preprocess but not in shards.
Could you please advice? Thanks

guillaumekln · 2016-11-18T13:52:42Z

Unfortunately, preprocesor-shards.py still lags behind in terms of features due to heavy code duplication with preprocess.lua. In the mean time, you can use the updated implementation from @mdasadul:

https://github.com/mdasadul/seq2seq-attn/blob/bcd899ec990da6b2c5c616aab5ac77b5c7760dc6/preprocess-shards.py

See #49.

ghost · 2016-11-18T15:20:26Z

Thanks!

ghost closed this as completed Nov 18, 2016

guillaumekln mentioned this issue Dec 12, 2016

preprocess-shards compatibility with train #77

Closed

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error on training when used preprocess_shards #69

Error on training when used preprocess_shards #69

ghost commented Nov 18, 2016

guillaumekln commented Nov 18, 2016

ghost commented Nov 18, 2016

Error on training when used preprocess_shards #69

Error on training when used preprocess_shards #69

Comments

ghost commented Nov 18, 2016

guillaumekln commented Nov 18, 2016

ghost commented Nov 18, 2016