
[CAY-1251] Introduce worker-side model cache #1252

Merged · 24 commits · Nov 29, 2017

Conversation

@wynot12 (Contributor) commented Nov 2, 2017

Resolves #1251

This PR decouples computation and communication in ML training by introducing a worker-side model cache.
Note that the cache eviction/refresh policy should be improved; the current version simply refreshes the cache at a 10-second interval.

Users can turn on this feature with the -model_cache_enabled option.
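The refresh behavior described above can be sketched as follows. This is a minimal illustration, not the PR's actual code: the class and method names (RefreshingModelCache, pullFromServer, applyLocalUpdate) are hypothetical, and the real implementation lives in the project's ModelAccessor.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

/**
 * Minimal sketch of a worker-side model cache with a fixed refresh
 * interval, matching the crude policy described in the PR: drop
 * everything once the interval elapses and re-pull lazily on access.
 */
class RefreshingModelCache<K, V> {
  private final Map<K, V> cache = new ConcurrentHashMap<>();
  private final long refreshIntervalMs;
  private final Function<K, V> pullFromServer;  // stand-in for the network pull
  private volatile long lastRefreshMs;

  RefreshingModelCache(long refreshIntervalMs, Function<K, V> pullFromServer) {
    this.refreshIntervalMs = refreshIntervalMs;
    this.pullFromServer = pullFromServer;
    this.lastRefreshMs = System.currentTimeMillis();
  }

  /** Serves reads from the cache, wiping it once the interval has elapsed. */
  V get(K key) {
    long now = System.currentTimeMillis();
    if (now - lastRefreshMs >= refreshIntervalMs) {
      cache.clear();          // simple time-based refresh, as in this PR
      lastRefreshMs = now;
    }
    return cache.computeIfAbsent(key, pullFromServer);
  }

  /** Mirrors a pushed delta into the local copy between refreshes. */
  void applyLocalUpdate(K key, V value) {
    cache.put(key, value);
  }
}
```

With a 10-second interval, reads between refreshes never touch the servers, which is what decouples computation from communication.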

wynot12 requested a review from yunseong on November 2, 2017 16:59

@wynot12 (Contributor, Author) commented Nov 2, 2017

[image: convergence graph]

Here's a graph comparing convergence performance with and without the cache.
(Experiment setup: Optiplex cluster, NMF, Netflix 1x, 5 epochs)

@wynot12 (Contributor, Author) commented Nov 2, 2017

#1253 will resolve the test failure.

@yunseong (Contributor) left a comment

Thanks for the PR! It looks good overall, but I have a few concerns:

  1. What if we use more than one trainer thread? Wouldn't that case issue roughly 2x the pulls if the cache is not warmed up (link)?
  2. What do the indices in the graph (e.g., cache2, cache3) mean?
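Regarding concern 1: one way to keep a cold cache from issuing duplicate pulls when several trainer threads request the same key at once is a per-key atomic load. A sketch under hypothetical names (DedupingCache, serverPulls — not the PR's code), relying on the guarantee that ConcurrentHashMap.computeIfAbsent runs the loader at most once per key:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

/**
 * Sketch: dedupes concurrent pulls for the same key on a cold cache.
 */
class DedupingCache {
  private final ConcurrentHashMap<String, Integer> cache = new ConcurrentHashMap<>();
  final AtomicInteger serverPulls = new AtomicInteger();  // exposed for illustration

  int pull(String key) {
    // ConcurrentHashMap.computeIfAbsent applies the loader atomically and
    // at most once per key, so racing threads trigger a single server pull.
    return cache.computeIfAbsent(key, k -> {
      serverPulls.incrementAndGet();
      return 42;  // stand-in for the real network fetch
    });
  }
}
```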

this.modelTable = tableAccessor.getTable(modelTableId);
this.modelUpdateFunction = modelUpdateFunction;

// TODO #00: introduce a sophisticated cache refresh/eviction policy
Review comment (Contributor):

Please update the issue number to #1254

modelTable.updateNoReply(key, deltaValue);
pushTracer.recordTime(1);

// update local cache. oldValue always exists
Review comment (Contributor):

I'm a bit confused by the phrase oldValue always exists: does it imply that push() always occurs after the parameters have been loaded via pull()? If so, could you please add a comment explaining why oldValue is guaranteed to exist?

}

/**
* This method does not care about cache.
Review comment (Contributor):

It'd be great if we clarified the differences between pull(List<K> keys, Table table) and pull(List<K> keys). The distinction is missing in the base interface, but this implementation differentiates them (with cache vs. without cache).

@wynot12 (Contributor, Author) replied Nov 7, 2017:

Hmm, actually this part is exactly the same as the no-cache version.

Review comment (Contributor):

Oh, then the question is really: why does this method not care about the cache, while pull(final List<K> keys) gets its data from the cache?

This is what prompted my original question above: what are the differences between the two methods?

@wynot12 (Contributor, Author) replied:

pull(List<K> keys, Table table) is for accessing other tables that have no cache.
This ModelAccessor implementation provides a cache only for the table held in its field.

@wynot12 (Contributor, Author) replied:

pull(List<K> keys, Table table) was added to support offline model evaluation, so it may look out of place in the ModelAccessor interface.
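The distinction discussed in this thread can be sketched as below. This is a hypothetical simplification, not the PR's actual ModelAccessor: SimpleTable and CachedModelAccessor are stand-in names, and the real types carry more behavior.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

/** Simplified stand-in for the real Table type (hypothetical). */
interface SimpleTable<K, V> {
  V get(K key);
}

/**
 * pull(keys) serves reads from the cache of the accessor's own model
 * table, while pull(keys, table) reads an arbitrary table directly
 * (e.g. for offline model evaluation) and therefore bypasses the cache.
 */
class CachedModelAccessor<K, V> {
  private final SimpleTable<K, V> modelTable;   // the one table that gets a cache
  private final Map<K, V> cache = new HashMap<>();

  CachedModelAccessor(SimpleTable<K, V> modelTable) {
    this.modelTable = modelTable;
  }

  /** Cached path: only for the accessor's own model table. */
  List<V> pull(List<K> keys) {
    return keys.stream()
        .map(k -> cache.computeIfAbsent(k, modelTable::get))
        .collect(Collectors.toList());
  }

  /** Uncached path: any table, always hits the servers. */
  List<V> pull(List<K> keys, SimpleTable<K, V> table) {
    return keys.stream().map(table::get).collect(Collectors.toList());
  }
}
```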

@wynot12 (Contributor, Author) commented Nov 7, 2017

@yunseong I'll update this PR to use a Guava loading cache soon.
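A Guava loading cache could look something like the sketch below (this is an illustration of Guava's API, not the PR's eventual code; the key/value types and the 10-second and size bounds are assumptions). Unlike the global 10-second wipe, refreshAfterWrite reloads entries lazily and per entry once they are stale.

```java
import com.google.common.cache.CacheBuilder;
import com.google.common.cache.CacheLoader;
import com.google.common.cache.LoadingCache;
import java.util.concurrent.TimeUnit;
import java.util.function.Function;

class GuavaModelCacheSketch {
  static LoadingCache<String, Integer> build(Function<String, Integer> pullFromServer) {
    return CacheBuilder.newBuilder()
        .refreshAfterWrite(10, TimeUnit.SECONDS)  // per-entry refresh, not a global wipe
        .maximumSize(100_000)                     // bounds worker memory via eviction
        .build(new CacheLoader<String, Integer>() {
          @Override
          public Integer load(String key) {
            return pullFromServer.apply(key);     // one pull per cold key
          }
        });
  }
}
```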

@wynot12 (Contributor, Author) commented Nov 7, 2017

In the above graph, the integers appended to the labels (e.g., cache2, cache3) denote repeated experiments with the cache enabled. Sorry for the confusion.

@wynot12 (Contributor, Author) commented Nov 28, 2017

@yunseong how about merging this PR?
Since the cache implementation is separate from the original one and caching is disabled by default, it does not conflict with our main work, which will use the non-cache version.

@yunseong (Contributor) replied:
Agreed. The PR looks good and I'm merging it. Thanks!
