
ADBench LSTM test #152

Merged
merged 2 commits into from
Nov 14, 2019
Conversation

toelli-msft
Contributor

No description provided.

@toelli-msft
Contributor Author

This collection of code contains a C++ wrapper for adbench-lstm.ks so that it can be tested against a Python reference implementation. It would be good to get at least one pair of eyes on it before merging.
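One common shape for such a test is to drive both implementations with the same inputs and compare the outputs numerically. A minimal sketch with hypothetical helper names (the PR's actual harness may differ):

```python
import numpy as np

def check_against_reference(impl_fn, reference_fn, inputs, tol=1e-8):
    # Hypothetical helper: run both implementations on the same
    # inputs and compare elementwise within a tolerance.
    got = np.asarray(impl_fn(*inputs))
    want = np.asarray(reference_fn(*inputs))
    assert got.shape == want.shape, (got.shape, want.shape)
    assert np.allclose(got, want, atol=tol), np.max(np.abs(got - want))

# Stand-in functions, just to show the call shape:
check_against_reference(lambda x: x * 2.0, lambda x: x + x, (np.arange(3.0),))
```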

@pashminacameron

@toelli-msft It appears you have a simple LSTM implementation. I added a sequence LSTM implementation. As I am not sure what the purpose here is, lstm2.py may not be what you wanted, but having it in there will help the discussion. I think we should test at least the sort of LSTM I have pushed before making any speed claims.

gates = np.concatenate((inp, hidden, inp, hidden), 0) * weight + bias
hidden_size = hidden.shape[0]

forget = sigmoid(gates[0:hidden_size])
Contributor

Maybe assert something here about size/shape of hidden vs size of inp ?
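For illustration, such a precondition might look like the sketch below (a hypothetical helper; the shapes are inferred from the gates line above, where the gates vector concatenates (inp, hidden, inp, hidden)):

```python
import numpy as np

def check_lstm_shapes(weight, bias, hidden, inp):
    # Hypothetical precondition sketch: the diagonal gates multiply
    # the concatenated (inp, hidden, inp, hidden) vector elementwise,
    # so inp and hidden must have the same length, and weight/bias
    # must cover all four gate blocks.
    assert inp.shape == hidden.shape, (inp.shape, hidden.shape)
    assert weight.shape == (4 * hidden.shape[0],), weight.shape
    assert bias.shape == weight.shape, (bias.shape, weight.shape)

check_lstm_shapes(np.ones(12), np.zeros(12), np.zeros(3), np.ones(3))
```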

Contributor Author

I'm trying to change the original source code as little as possible. See the new top-level comment.

ypred, new_state = lstm_predict(main_params, extra_params, all_states[t], _input)
all_states.append(new_state)
ynorm = ypred - np.log(sum(np.exp(ypred), 2))
ygold = sequence[t + 1]
Contributor

super-nit: is that yg_old or y_gold?

Contributor Author

The latter, but: I'm trying to change the original source code as little as possible. See the new top-level comment.
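As an aside, the `ypred - log(sum(exp(ypred)))` expression above is a log-softmax normalization. For reference only, since the PR deliberately keeps ADBench's original form, a numerically stable version subtracts the maximum before exponentiating:

```python
import numpy as np

def log_softmax(ypred):
    # Stable log-softmax: shifting by the max avoids overflow in exp,
    # and is mathematically equal to ypred - log(sum(exp(ypred))).
    m = np.max(ypred)
    return ypred - (m + np.log(np.sum(np.exp(ypred - m))))

ynorm = log_softmax(np.array([1.0, 2.0, 3.0]))
# exp(ynorm) sums to 1 by construction
```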

@@ -0,0 +1,40 @@
# There's a lot of duplication between this and
# build_and_test_mnistcnn.sh, but we will follow the Rule of Three
Contributor

add similar comment to build_and_test_mnistcnn.sh? (I think the Wikipedia reference is probably unnecessary/OTT but it is humorous to include it ;) )

Contributor Author

OK, good idea


"""
There are many formulations of LSTMs. This code follows the formulation from
https://cs224d.stanford.edu/lecture_notes/LectureNotes4.pdf with some simplifications
Contributor

Maybe a comment what this is for? Do I understand correctly that this is not what you are testing against, it's for eyeball comparison only or something?


Alan - here's the reference for what this is for.

from ksc.adbench_lstm.lstm import (
lstm_model, lstm_predict, lstm_objective, sigmoid)

ten = np.ndarray
Contributor

Is this ten used anywhere? I don't see it. (And d ?)

Contributor Author

Thanks, it's not used. (d is used in several places though)

import random
import numpy as np

from ksc.adbench_lstm.lstm import (
Contributor

@acl33 acl33 Nov 14, 2019

Personal preference but I would probably import ksc.adbench_lstm.lstm as k and then I'm testing a.lstm_model against k.lstm_model etc. Up to you...

Contributor Author

I like that idea, thanks!

Contributor

@acl33 acl33 left a comment

LGTM, a few nits/suggestions.

@toelli-msft
Contributor Author

Thanks Alan and Pashmina.

For those who are wondering why this is not a standard LSTM implementation: we have to copy exactly whatever ADBench does so that we are comparing like-for-like. ADBench explicitly titles its graph "D-LSTM" (Diagonal LSTM) to make clear that it is not a standard one. We definitely want ADBench to have a standard LSTM, and there is an open ADBench issue to implement one, but it has not been implemented yet.
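To make the distinction concrete, here is a rough side-by-side sketch (not the PR's code; names and shapes are illustrative only): the D-LSTM applies per-gate weight vectors elementwise, where a standard LSTM applies full weight matrices.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def dlstm_gates(weight, bias, hidden, inp):
    # Diagonal LSTM (as tested here): the weight vector multiplies the
    # concatenated (inp, hidden, inp, hidden) vector elementwise, so
    # each gate unit sees only its own component.
    return sigmoid(np.concatenate((inp, hidden, inp, hidden), 0) * weight + bias)

def std_lstm_gates(W, b, hidden, inp):
    # Standard LSTM: a full matrix multiply mixes every input and
    # hidden component into every gate unit.
    return sigmoid(W @ np.concatenate((inp, hidden), 0) + b)

h, x = np.zeros(3), np.ones(3)
g_diag = dlstm_gates(np.ones(12), np.zeros(12), h, x)          # shape (12,)
g_full = std_lstm_gates(np.ones((12, 6)), np.zeros(12), h, x)  # shape (12,)
```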

Hopefully the new comment on top of lstm.py makes this clearer too.

Pashmina, thanks for the reference implementation of a standard LSTM. Perhaps ADBench can use it as a reference.

@pashminacameron pashminacameron left a comment

Thanks for the context, Tom.

@toelli-msft toelli-msft merged commit c20e98c into master Nov 14, 2019
@toelli-msft toelli-msft deleted the toelli/adbench-lstm-test branch November 14, 2019 16:14
3 participants