Support stash_type attribute for onnx.LayerNormalization #3888

Merged
merged 2 commits into llvm:main from layer_norm on Nov 27, 2024

Conversation

@jinchen62 (Collaborator) commented Nov 22, 2024

Fixes nod-ai/SHARK-ModelDev#888

If stash_type differs from the input/result dtype:

  1. convert x to stash_type
  2. calculate mean and var in stash_type (x is already in stash_type)
  3. convert back to result_dtype before the stage-two calculation
  4. convert the mean and var outputs if their dtypes differ from stash_type
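
A minimal NumPy sketch of this dtype flow (illustrative only: the function name, eps default, and var-based formulation are placeholders, not the actual torch-mlir lowering):

```python
import numpy as np

def layer_norm_with_stash(x, scale, bias, stash_dtype=np.float32, eps=1e-5):
    result_dtype = x.dtype
    x_stash = x.astype(stash_dtype)                # 1. x -> stash_type
    mean = x_stash.mean(axis=-1, keepdims=True)    # 2. stats in stash_type
    var = x_stash.var(axis=-1, keepdims=True)
    normalized = (x_stash - mean) / np.sqrt(var + eps)
    # 3. cast back to result_dtype before stage two (scale/bias)
    y = normalized.astype(result_dtype) * scale + bias
    return y, mean, var                            # 4. mean/var kept in stash_type
```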

e2e test added in nod-ai/SHARK-TestSuite#399

@zjgarvey (Collaborator)

I think we should probably support the stash_type arg by separating the two stages of computation, as suggested by the ONNX docs: https://onnx.ai/onnx/operators/onnx__LayerNormalization.html. If an onnx op actually has different result and stash types, we would likely see numeric mismatches in those situations unless we perform the computation correctly.
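
For intuition, a tiny NumPy repro of that concern, using float16 in place of bf16 (stock NumPy has no bf16) and an explicit accumulator dtype to force low-precision accumulation:

```python
import numpy as np

x = (np.random.randn(4, 4096) * 8).astype(np.float16)
n = x.shape[-1]
mean_lo = x.sum(axis=-1, dtype=np.float16) / np.float16(n)  # stats in result dtype
mean_hi = x.astype(np.float32).sum(axis=-1) / n             # stats "stashed" in f32
print(np.abs(mean_lo.astype(np.float32) - mean_hi).max())   # generally nonzero
```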

Another option is to allow LayerNormalization to be function-expanded on import via

```python
function_expansion_allowlists_by_domain: Optional[Dict[str, set[str]]] = field(...)
```
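
A hypothetical usage sketch, assuming this field lives on the Config dataclass in torch_mlir.extras.onnx_importer (the module path is my assumption; the field name is from the snippet above):

```python
from torch_mlir.extras.onnx_importer import Config  # assumed module path

config = Config()
# Expand LayerNormalization from the default ("") ONNX domain into its
# function body on import, instead of lowering the op directly.
config.function_expansion_allowlists_by_domain = {
    "": {"LayerNormalization"},
}
```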

In any case, we should put together a few e2e tests for this op:

  1. With bf16 result type and bf16 stash type
  2. With bf16 result type and unspecified stash type
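
Case 1 could be exercised with a small hand-built model along these lines (a sketch using the standard onnx.helper API; shapes and names are arbitrary):

```python
from onnx import helper, TensorProto

node = helper.make_node(
    "LayerNormalization", ["x", "scale", "bias"], ["y"],
    axis=-1,
    stash_type=TensorProto.BFLOAT16,  # case 1: explicit bf16 stash type
)
graph = helper.make_graph(
    [node], "ln_bf16",
    [helper.make_tensor_value_info("x", TensorProto.BFLOAT16, [2, 4]),
     helper.make_tensor_value_info("scale", TensorProto.BFLOAT16, [4]),
     helper.make_tensor_value_info("bias", TensorProto.BFLOAT16, [4])],
    [helper.make_tensor_value_info("y", TensorProto.BFLOAT16, [2, 4])],
)
model = helper.make_model(graph)  # for case 2, omit stash_type entirely
```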

@jinchen62 changed the title from "Remove stash_type check for onnx.LayerNormalization lowering" to "Support stash_type attribute for onnx.LayerNormalization" on Nov 24, 2024
@zjgarvey (Collaborator) left a comment


I suppose I still don't get why this works, since the two stages mentioned in the ONNX docs aren't separated in the torch op. Technically, aren't we supposed to cast back to the original result type before the final mul and add?

I think it is fine to merge this as-is, considering the examples we tested seem to give correct numerics. If we end up seeing numeric failures in models with layer normalization and unusual dtypes, we can always fall back on the function expander on import.

Can you mention the relevant e2e tests in the commit message?

@jinchen62 (Collaborator, Author)

@zjgarvey Yeah, I convert the dtype back before stage two in the decomposition if the dtypes differ. Added the link to the e2e test in the commit message.

@jinchen62 merged commit 7452460 into llvm:main on Nov 27, 2024 (3 checks passed).
@jinchen62 deleted the layer_norm branch on Nov 27, 2024 at 00:47.

Successfully merging this pull request may close these issues:

(torch-to-onnx) FLUX.1 - bf16 onnx.LayerNormalization failing to legalize