Fori loop simple case test test #7028

Conversation

ManfeiBai
Collaborator

No description provided.

qihqi and others added 30 commits April 17, 2024 15:31
Summary:
This pull request integrates FlashAttention with SPMD. It works by creating a manual sharding region for the kernel, which means we wrap all the inputs with enable_manual_sharding and all the outputs with disable_manual_sharding.

Added a new test file because the original test file is not SPMD aware.

Test Plan:
PJRT_DEVICE=TPU python test/test_pallas_spmd.py
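The wrapping pattern described above can be sketched in plain Python. This is an illustrative stand-in, not the actual torch_xla implementation: `enable_manual_sharding` / `disable_manual_sharding` here are simplified mock-ups that only demonstrate the input/output wrapping around a kernel (the slicing and tiling do not produce numerically correct distributed attention), and `NUM_DEVICES` is a hypothetical mesh size.

```python
import numpy as np

NUM_DEVICES = 4  # hypothetical mesh size

def enable_manual_sharding(x, num_devices=NUM_DEVICES):
    """Stand-in: reduce a globally sharded array to one device's local shard."""
    shard_rows = x.shape[0] // num_devices
    return x[:shard_rows]  # pretend we are device 0

def disable_manual_sharding(local, num_devices=NUM_DEVICES):
    """Stand-in: restore the global shape from the local shard."""
    return np.tile(local, (num_devices,) + (1,) * (local.ndim - 1))

def flash_attention_kernel(q, k, v):
    """Stand-in for the Pallas kernel: plain softmax attention."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

def sharded_flash_attention(q, k, v):
    # Wrap every input with enable_manual_sharding ...
    q_l, k_l, v_l = (enable_manual_sharding(t) for t in (q, k, v))
    out_local = flash_attention_kernel(q_l, k_l, v_l)
    # ... run the kernel on local shards, then wrap every output
    # with disable_manual_sharding to recover the global view.
    return disable_manual_sharding(out_local)

q = k = v = np.ones((8, 16), dtype=np.float32)
out = sharded_flash_attention(q, k, v)
print(out.shape)  # global shape restored: (8, 16)
```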
Add docs for fori_loop/while_loop and a simple user guide with a basic test case
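For readers unfamiliar with the construct, the semantics of a `fori_loop` (as in the JAX-style API this work follows) can be expressed as a short pure-Python reference; the exact torch_xla signature may differ, so treat this as a sketch of the intended behavior.

```python
def fori_loop(lower, upper, body_fun, init_val):
    """Reference semantics: apply body_fun(i, carry) for i in [lower, upper)."""
    val = init_val
    for i in range(lower, upper):
        val = body_fun(i, val)
    return val

# Simple test case: accumulate the sum 0 + 1 + ... + 9.
result = fori_loop(0, 10, lambda i, acc: acc + i, 0)
print(result)  # 45
```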
Summary:
This PR adds a preliminary user guide for Pallas.

Test Plan:
Skip CI.
Summary:
Clarify some caveats for using the TPU docker images in the landing page.

Test Plan:
Skip CI.
will-cromar and others added 23 commits April 26, 2024 15:12
Summary:
This PR is to add segment ids to the flash attention wrapper. The segment ids are a way to create an attention mask where each token can only attend to other tokens within the same segment. The mask is therefore a block diagonal matrix.

To support it, we further split the flash attention forward pass into tracing and execution parts, and implement all the shape operations needed to make it compatible with the kernel.

Test Plan:
PJRT_DEVICE=TPU python test/test_pallas.py
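The block-diagonal mask induced by segment ids can be illustrated with NumPy. This is a hedged sketch of the masking rule only (token i may attend to token j iff they share a segment id); `segment_mask` is a hypothetical helper name, not the kernel's actual API.

```python
import numpy as np

def segment_mask(segment_ids_q, segment_ids_kv):
    # Attention is allowed only between tokens in the same segment.
    return segment_ids_q[:, None] == segment_ids_kv[None, :]

seg = np.array([0, 0, 1, 1, 1])
mask = segment_mask(seg, seg)
print(mask.astype(int))
# With sorted segment ids the mask is block diagonal:
# [[1 1 0 0 0]
#  [1 1 0 0 0]
#  [0 0 1 1 1]
#  [0 0 1 1 1]
#  [0 0 1 1 1]]
```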
@ManfeiBai ManfeiBai closed this May 6, 2024