
[Pallas] Support XLA_USE_BF16 #6817

Merged: 2 commits merged into master from alanwaketan/pallas_bf16 on Mar 26, 2024
Conversation

alanwaketan (Collaborator) commented:

Summary:
XLA_USE_BF16=1 makes all internal XLA tensors use BF16, but the torch.Tensor wrappers still report torch.float. To address this, we need to set the JAX tracer dtypes correctly so that the correct Mosaic is produced.

Test Plan:
PJRT_DEVICE=TPU python test/test_pallas.py -v -k test_flash_attention_wrapper_bf16

alanwaketan requested a review from JackCaoG on March 25, 2024 at 21:48
alanwaketan self-assigned this on Mar 25, 2024
@@ -74,9 +75,14 @@ def make_kernel_from_pallas(kernel: Callable, output_shape_dtype_fn: Callable):
import jax._src.pallas.mosaic.pallas_call_registration

def convert_torch_dtype_to_jax(dtype: torch.dtype) -> jnp.dtype:
    XLA_USE_BF16 = os.environ.get("XLA_USE_BF16", "0") == "1"
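For context only, here is a minimal sketch of how a conversion like this could honor the flag; the branch structure and the set of dtypes handled are illustrative assumptions, not the exact contents of this diff:

```python
# Illustrative sketch, not the exact diff: map a torch dtype to the JAX dtype
# the Pallas tracer should see, overriding float32 when XLA_USE_BF16=1 because
# the underlying XLA tensors are BF16 even though the wrappers say torch.float.
import os

import jax.numpy as jnp
import torch


def convert_torch_dtype_to_jax(dtype: torch.dtype) -> jnp.dtype:
  use_bf16 = os.environ.get("XLA_USE_BF16", "0") == "1"
  if dtype == torch.float32:
    # Under XLA_USE_BF16=1 the tracer must see bfloat16 so that the
    # generated Mosaic matches the actual on-device dtype.
    return jnp.bfloat16 if use_bf16 else jnp.float32
  if dtype == torch.bfloat16:
    return jnp.bfloat16
  if dtype == torch.int32:
    return jnp.int32
  raise ValueError(f"Unsupported dtype: {dtype}")
```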
JackCaoG (Collaborator) commented on this line:

I think this should work in most cases, but the correct thing to do is to expose this static var:

static bool use_bf16 = ShouldUseBF16();

Otherwise there is a risk of XLA_USE_BF16 being changed during runtime...

alanwaketan (Collaborator, Author) replied:

Let me move this to the global scope to mimic the same effect.
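Roughly, that would look like the sketch below: the env var is sampled once at import time, mirroring how the C++ static is initialized only once (function body elided):

```python
# Sketch of the proposed change: read XLA_USE_BF16 once at module scope so it
# cannot flip mid-run, mimicking the C++ `static bool use_bf16`.
import os

import jax.numpy as jnp
import torch

XLA_USE_BF16 = os.environ.get("XLA_USE_BF16", "0") == "1"


def convert_torch_dtype_to_jax(dtype: torch.dtype) -> jnp.dtype:
  # The dtype mapping itself is unchanged; it now reads the module-level
  # constant instead of re-reading the environment on every call.
  ...
```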

JackCaoG (Collaborator) replied:

Oh, what I mean is that we should expose a pybind to call UseBF16. That way we won't have a case where the C++ static variable is false while the Python static variable is true.
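For illustration, the pybind route could look roughly like this on the Python side; `_xla_use_bf16` is a made-up binding name used only to show the idea, not an existing torch_xla API:

```python
# Hypothetical sketch: query the C++ side for its cached static instead of
# re-reading the env var in Python, so the two layers can never disagree.
import torch_xla

# NOTE: `_xla_use_bf16` is a made-up binding name; it stands in for a pybind
# that would return the value of the C++ `static bool use_bf16`.
XLA_USE_BF16 = torch_xla._XLAC._xla_use_bf16()
```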

alanwaketan (Collaborator, Author) replied:

That seems like overkill given the source is the same env var...

JackCaoG (Collaborator) replied:

Well, I am fine with the current approach, knowing it's unlikely for us to run into this issue in real life.

alanwaketan (Collaborator, Author) replied:

Thanks, Jack!

alanwaketan force-pushed the alanwaketan/pallas_bf16 branch from 860a296 to 26e87d7 on March 26, 2024 at 01:14
alanwaketan merged commit 73c31db into master on Mar 26, 2024
18 checks passed
alanwaketan deleted the alanwaketan/pallas_bf16 branch on March 26, 2024 at 18:02