Various fixes to reproducible benchmarks #1800

cjnolet · 2023-09-01T23:14:01Z

No description provided.

cjnolet · 2023-09-01T23:47:55Z

/ok to test

cjnolet · 2023-09-02T00:42:01Z

/ok to test

cjnolet · 2023-09-02T00:57:32Z

/ok to test

cjnolet · 2023-09-02T03:28:20Z

/ok to test

achirkin · 2023-09-05T07:24:18Z

cpp/bench/ann/src/hnswlib/hnswlib_wrapper.h

  };

  using typename ANN<T>::AnnSearchParam;
  struct SearchParam : public AnnSearchParam {
    int ef;
-    int num_threads{1};
+    int num_threads = omp_get_num_procs();


Does this work well with num_threads > n_queries? We had similar logic in the raft host refinement, and performance on small batches (n_queries = 1) was horrible because the overhead of managing many threads far outweighed the amount of work.

Good question. This one is challenging because we don't (and shouldn't) know the number of queries when we set this argument. We could set it to something small(ish) like 8 or 16, but that would just lower the saturation point for larger batch queries. In general, I know the batch sizes for online systems are going to be long-tailed, with 1 in the main mass and >= 100 in the tail.

The problem is that when we run larger batch sizes, we aren't giving hnsw a fair try at all. My thinking was to take the middle ground and set this to the number of cores. I guess I should measure the impact directly. What kind of perf difference are you seeing for, say, a batch size of 10 when the thread pool contains as many threads as there are available cores?

Maybe this explicit pool does a better job than the OpenMP machinery that we rely upon in the refine operation. But there, I got something like a ~100x boost for a single-query batch (n_queries = 1, 72 cores). I did some other refactoring at the same time, though.
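One way to reconcile the two concerns above is to keep the number of cores as the configured default and cap the thread count by the batch size at search time, once n_queries is known. The following is only a minimal sketch of that idea; `effective_num_threads` is a hypothetical helper and is not part of this PR or of hnswlib.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Hypothetical helper (not from the PR): cap the number of worker threads at
// the number of queries in the batch, so that a single-query batch runs on one
// thread instead of paying to coordinate the whole pool.
inline int effective_num_threads(int requested_threads, std::size_t n_queries)
{
  assert(requested_threads >= 1);
  // An empty batch still gets one thread so callers never see zero.
  if (n_queries == 0) { return 1; }
  // Compare in std::size_t to avoid narrowing a large batch size to int.
  const std::size_t capped =
    std::min<std::size_t>(static_cast<std::size_t>(requested_threads), n_queries);
  return static_cast<int>(capped);
}
```

With 72 requested threads, a batch of 1 query would use 1 thread, a batch of 10 would use 10, and a batch of 10000 would use all 72. Whether this actually helps the hnswlib wrapper would still need to be measured, as discussed above.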

cjnolet · 2023-09-05T16:47:37Z

/ok to test

cpp/bench/ann/src/common/util.hpp

Co-authored-by: Artem M. Chirkin <[email protected]>

cjnolet · 2023-09-08T19:19:24Z

/ok to test

cjnolet · 2023-09-08T20:06:53Z

/merge

cjnolet · 2023-09-11T16:48:38Z

/ok to test

cjnolet · 2023-09-11T16:48:46Z

/merge

achirkin and others added 30 commits August 9, 2023 13:19

ANN-benchmarks: switch to use gbench

bd738ec

Disable NVTX if the nvtx3 headers are missing

7473c62

Merge branch 'branch-23.10' into enh-google-benchmarks

aa10d7c

Merge branch 'branch-23.10' into enh-google-benchmarks

bed126c

Merge remote-tracking branch 'upstream/branch-23.10' into python-ann-…

09ea7a7

…bench-use-gbench

try to run gbench executable

2917886

Allow to compile ANN_BENCH without CUDA

49732b1

Merge remote-tracking branch 'rapidsai/branch-23.10' into enh-google-…

76cfb40

…benchmarks

Fix style

9b588af

Adapt ANN benchmark python scripts

6d6c17d

Make the default behavior to produce one executable per benchmark

b89b27d

Fix style problems / pre-commit

163a40c

Merge branch 'branch-23.10' into enh-google-benchmarks

0bb51a3

Merge remote-tracking branch 'rapidsai/branch-23.10' into enh-google-…

2b9f649

…benchmarks

Merge branch 'branch-23.10' into enh-google-benchmarks

9728f7e

Merge remote-tracking branch 'origin/branch-23.10' into enh-google-be…

7b1bf01

…nchmarks

Adding k and batch-size options to run.py

1daf2bf

Merge branch 'branch-23.10' - CONFIGS ONLY - dataset_memtype follows …

4e0a53e

…in the next commit

Add dataset_memory_type/query_memory_type as build/search parameters

04893c9

middle of merge, not building

b24fcf7

Tuning guide

30f7467

Merge remote-tracking branch 'artem/enh-google-benchmarks' into enh-g…

3e35121

…oogle-benchmarks

compiling, index building successful, search failing

f927f69

Merge remote-tracking branch 'corey/enh-google-benchmarks' into pytho…

404cd10

…n-ann-bench-use-gbench

FEA first commit rebasing changes on gbench branch

2f19c44

FIX fixing straggling changes from rebase

e0586de

Fix FAISS using a destroyed stream from previous benchmark case

0eaa7e0

Merge remote-tracking branch 'artem/enh-google-benchmarks' into enh-g…

9896963

…oogle-benchmarks

Fixing issue in conf file and stubbing out parameter tuning guide

4062d6f

Adding CAGRA to tuning guide

7141c21

FIX correct conditional

15b0dc0

dantegd and others added 2 commits September 1, 2023 19:17

FIX for single gpu arch detection in CMake

d863ce6

Merge branch 'dev-enh-google-benchmarks' into imp-google-benchmarks

c271a4e

dantegd and others added 2 commits September 1, 2023 19:56

FIX PR review fixes and a {yea}

0d60c56

More fixes

0193607

cjnolet added 2 commits September 1, 2023 23:27

Merge remote-tracking branch 'origin/branch-23.10' into dev-enh-googl…

fcc158a

…e-benchmarks

Merge branch 'dev-enh-google-benchmarks' into imp-google-benchmarks

047e941

achirkin reviewed Sep 5, 2023

cjnolet added 3 commits September 5, 2023 12:43

Merge branch 'branch-23.10' into imp-google-benchmarks

b7a6d9a

Merge remote-tracking branch 'origin/branch-23.10' into imp-google-be…

9244674

…nchmarks

Merge branch 'imp-google-benchmarks' of github.com:cjnolet/raft into …

432fa45

…imp-google-benchmarks

github-actions bot removed CMake python ci labels Sep 5, 2023

achirkin reviewed Sep 5, 2023

cpp/bench/ann/src/common/util.hpp (outdated, resolved)

cjnolet and others added 2 commits September 7, 2023 13:08

Update cpp/bench/ann/src/common/util.hpp

732b923

Co-authored-by: Artem M. Chirkin <[email protected]>

Merge branch 'branch-23.10' into imp-google-benchmarks

ef112d0

divyegala approved these changes Sep 8, 2023

ajschmidt8 removed the request for review from a team September 11, 2023 13:22

Merge branch 'branch-23.10' into imp-google-benchmarks

be6cd5c

rapids-bot bot merged commit c59c9d1 into rapidsai:branch-23.10 Sep 11, 2023
54 checks passed
