Fix {to,from}_array UB when repr(simd) produces padding #342
Conversation
// it results in better codegen with optimizations disabled, but we should
// probably just use `transmute` once that works on const generic types.
// FIXME: We currently use a pointer store instead of `transmute_copy` because `repr(simd)`
// results in padding for non-power-of-2 vectors (so vectors are larger than arrays).
If vectors are larger than arrays and more aligned (or the same size and same alignment), then I think the read is always fine for this? Because if not, we'll have to remove https://doc.rust-lang.org/std/simd/struct.Simd.html#method.as_mut_array. (Which, come to think of it, means that we can always implement `to_array` as `*self.as_array()`, since we currently require `T: Copy`.)
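The `*self.as_array()` suggestion can be sketched on a stand-in type. This is an illustration only: `PaddedVec3` is hypothetical (the real `Simd` type uses `repr(simd)`, which requires nightly), but it mimics the key property that the vector's in-memory representation is larger than `[T; N]` for non-power-of-2 lane counts.

```rust
// Illustration only: a hypothetical vector-like type whose in-memory
// representation is larger than `[f32; 3]`, mimicking the padding that
// `repr(simd)` adds for non-power-of-2 lane counts.
#[repr(C)]
struct PaddedVec3 {
    elems: [f32; 3],
    _pad: f32, // padding lane; must never be read as part of the array
}

impl PaddedVec3 {
    // Hands out a reference to the array *prefix* only, so dereferencing it
    // reads exactly `size_of::<[f32; 3]>()` bytes and never the padding.
    fn as_array(&self) -> &[f32; 3] {
        &self.elems
    }

    // The suggested implementation: because the element type is `Copy`,
    // `to_array` can simply copy out of `as_array`.
    fn to_array(&self) -> [f32; 3] {
        *self.as_array()
    }
}

fn main() {
    let v = PaddedVec3 { elems: [1.0, 2.0, 3.0], _pad: 0.0 };
    assert_eq!(v.to_array(), [1.0, 2.0, 3.0]);
}
```

The point of the shape is that the dereference is typed as `[f32; 3]`, so the copy is exactly array-sized regardless of how much padding the vector representation carries.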
Good point, I changed it to `*self.as_array()`.
I'm not a reviewer here, but FWIW this looks good to me.
///
/// # Safety
/// Writing to `ptr` must be safe, as if by `<*mut [T; N]>::write_unaligned`.
const unsafe fn store(self, ptr: *mut [T; N]) {
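The contract referenced in this safety comment can be illustrated standalone. This sketch assumes the requirement is exactly that of `<*mut [T; N]>::write_unaligned` (valid for writes, but no alignment requirement); `store_into` is a hypothetical, non-generic helper, not the method under review.

```rust
// Sketch of the documented contract: `dst` must be valid for a write of
// `[u32; 3]`, but, as with `write_unaligned`, it need not be aligned.
unsafe fn store_into(src: [u32; 3], dst: *mut [u32; 3]) {
    // No alignment requirement: this lowers to an unaligned store.
    dst.write_unaligned(src);
}

fn main() {
    // Deliberately misaligned destination: one byte into a byte buffer.
    let mut buf = [0u8; 16];
    let dst = unsafe { buf.as_mut_ptr().add(1) } as *mut [u32; 3];
    unsafe {
        store_into([1, 2, 3], dst);
        assert_eq!(dst.read_unaligned(), [1, 2, 3]);
    }
}
```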
I think you'll want to explicitly copy `self` to a temporary first, and then memcpy from the temporary. AFAICT, LLVM optimizes that to a vector load/store because of the vector instructions generated by the temporary, whereas just calling memcpy generates only a memcpy, so LLVM doesn't see any vector operations and may generate less-efficient code. Compare `store` vs. `store2` -- `store2` has the temporary: https://godbolt.org/z/jWsc6ezP4
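The two shapes being compared can be sketched as follows. `SimdLike` is a hypothetical stand-in (the real type is `repr(simd)`, which is what actually makes LLVM see vector operations when the temporary is materialized); the sketch only shows the structural difference between the two functions.

```rust
use core::ptr;

// Hypothetical stand-in for a padded SIMD type; the real one is `repr(simd)`.
#[derive(Clone, Copy)]
#[repr(C)]
struct SimdLike {
    elems: [f32; 3],
    _pad: f32,
}

impl SimdLike {
    // Shape 1 (`store` in the godbolt link): memcpy straight out of `self`.
    // LLVM only ever sees a memcpy, so it has no vector operations to fold
    // the copy into.
    unsafe fn store(self, dst: *mut [f32; 3]) {
        ptr::copy_nonoverlapping(self.elems.as_ptr(), dst.cast::<f32>(), 3);
    }

    // Shape 2 (`store2`): copy `self` to a local temporary first. For a
    // `repr(simd)` type, materializing the temporary emits vector
    // instructions, letting LLVM lower the memcpy to a vector load/store.
    unsafe fn store2(self, dst: *mut [f32; 3]) {
        let tmp = self; // the explicit temporary
        ptr::copy_nonoverlapping(tmp.elems.as_ptr(), dst.cast::<f32>(), 3);
    }
}

fn main() {
    let v = SimdLike { elems: [1.0, 2.0, 3.0], _pad: 0.0 };
    let (mut a, mut b) = ([0.0f32; 3], [0.0f32; 3]);
    unsafe {
        v.store(&mut a);
        v.store2(&mut b);
    }
    // Both shapes write the same bytes; they differ only in codegen.
    assert_eq!(a, b);
}
```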
Isn't it intentional, though, that it's not a vector read? Because I read the OP as saying that that would generate an over-long read.
No, because LLVM IR defines vector `load`/`store` instructions to never read/write the padding (when `align` is small enough), as I explained on Zulip (see #341). `read_unaligned` has no such guarantee.
So we intentionally want an LLVM `load`/`store` instruction with `align <elem-align>`, not the full SIMD type's alignment.
It's not written as a vector store, because that would be wrong (writing past the end of the array), but it should lower as one. I added the temporary with a comment as to why it's necessary.
Oh, I see, I was mixing up the LLVM type and the Rust type.
crates/core_simd/src/vector.rs
Outdated
self.store(tmp.as_mut_ptr());
tmp.assume_init()
}
*self.as_array()
This doesn't generate a vector `store` instruction, so LLVM may produce less optimal code.
Force-pushed from cc3e99b to c504f01
Addresses #341 (but doesn't introduce an intrinsic).