Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
JIT: Added SVE
GetFfr
,SetFfr
,LoadVectorFirstFaulting
,GatherVectorFirstFaulting
#104502JIT: Added SVE
GetFfr
,SetFfr
,LoadVectorFirstFaulting
,GatherVectorFirstFaulting
#104502Changes from 6 commits
d0efc9e
42148fd
a7773ac
76b42bd
bb01e37
a602b24
60d410a
aee87d7
7f3bb3c
d952ff1
3923946
461b6a3
d5b8675
07833e3
ce5a9bd
05fb46d
c63f878
a4533fe
72d1dea
6c28927
fb2012e
aca6759
d781fdc
a73fe35
81882a4
ad5ec2e
0f88d8e
10cf342
e7507bb
aef79cd
690e7ad
b23fac7
0c8b688
3184b77
5bb0b3d
823e847
86715e5
8b0f000
044dbda
0655d4b
9d7f22f
18f8f52
0755372
567a442
3ac987d
e8f7fcd
77ec96c
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
what is
op3Reg
here? Should beREG_NA
. Can we add an assert for it?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
op3Reg
would beREG_NA
. I'll add an assert for it.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't we need special codegen for
GetFfr
too otherwise it will generate RDFFR (predicated) instead of RDFFR (unpredicated)?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yea, we can add an optimization to use SETFFR if op1 is contained and IsAllBitsSet. I will add the opt.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, this comment is specifically for
GetFfr
and not forSetFfr
.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, gotcha. I mis-read.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why does this have
SpecialEffects
flag? Having memoryload should be good enough?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It causes a side-effect. If I do not have that flag on there, this API can get dead-code eliminated.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is different for this API from other API? Do you have an example where this gets dead-code eliminated?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I mean, it's simple to write something that will be dead-code eliminated:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
same here. why do we need
SpecialEffect
flag?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Because this API causes a side-effect.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see, so the MemoryLoad is going to force it to be a global ref, so it should take care of it?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In liveness, there is this logic:
So, this is what I believe I encountered without the flag. The nodes would get removed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The handling of
GatherVectorFirstFaulting
should be exactly similar toGatherVectorByteZeroExtend
or any otherGatherVector*
APIs in terms of liveness, etc. Can you double check why other APIs are not removed while this one is?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FirstFaulting is side-effectful because it sets the FFR register. The non-FirstFaulting APIs do not do this, therefore, we don't care if they get dead-code eliminated.