Move base64 encoding/decoding to Rust side #13845

aapoalas · 2022-03-05T13:27:52Z

Attempt to at least somewhat fix #13838.

Before:

base64_roundtrip:       n = 10, dt = 4.157s, r = 2/s, t = 415700000ns/op

After:

base64_roundtrip:       n = 10, dt = 1.936s, r = 5/s, t = 193600000ns/op

aapoalas · 2022-03-05T13:30:18Z

ext/web/internal.d.ts

@@ -43,8 +43,8 @@ declare namespace globalThis {
        result: string;
        position: number;
      };
-      forgivingBase64Encode(data: Uint8Array): string;
-      forgivingBase64Decode(data: string): Uint8Array;
+      forgivingBase64Encode(data: string): string;


It's a bit of a shame that the strings (presumably) have to be copied. I wonder if it would be possible to have an optional &str serde?

aapoalas · 2022-03-05T13:33:56Z

ext/web/lib.rs

+    if char::is_ascii_whitespace(&c) {
+      ws = true;
+    } else if c == '=' {
+      if i != len - 1 && i != len - 2 {


There is a bit of a possibility for false negative here: Even if there are '=' characters at the end it does not mean that they'll be removed below since that depends on whitespace removal etc.

As such, it would be more appropriate to track '=' markers at end with an enum: None, One, Two. Then compare to those when '=' is being removed and reset the enum appropriately. If reset does not happen, it means that the '=' at the end were invalid characters.

aapoalas · 2022-03-05T13:34:45Z

ext/web/lib.rs

+  });
+
+  // "Remove all ASCII whitespace from data"
+  let input = if ws { input.replace(|c| char::is_ascii_whitespace(&c), "") } else { input };


It would be preferable if the replacement could be done on the same iter as the invalid characters but that's probably not possible.

aapoalas · 2022-03-05T13:35:24Z

ext/web/lib.rs

  _: (),
 ) -> Result<String, AnyError> {
+  let char_count = s.chars().count();
+  let s = s.into_bytes();
+  if s.len() != char_count {


Properly this should be s.len() > char_count but it doesn't really matter.

aapoalas · 2022-03-05T13:36:14Z

ext/web/lib.rs

@@ -190,14 +205,21 @@ fn op_base64_decode(
      err
    ))
  })?;
-  Ok(ZeroCopyBuf::from(out))
+  Ok(String::from_utf8(out).unwrap())


Not sure if .unwrap() here is valid.

aapoalas · 2022-03-05T13:37:14Z

Ooops, a better perf optimization had already been posted.

Move base64 encoding/decoding to Rust side

79dc591

aapoalas requested review from crowlKats and lucacasonato as code owners March 5, 2022 13:27

aapoalas commented Mar 5, 2022

View reviewed changes

aapoalas closed this Mar 5, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move base64 encoding/decoding to Rust side #13845

Move base64 encoding/decoding to Rust side #13845

aapoalas commented Mar 5, 2022

aapoalas Mar 5, 2022

aapoalas Mar 5, 2022

aapoalas Mar 5, 2022

aapoalas Mar 5, 2022

aapoalas Mar 5, 2022

aapoalas commented Mar 5, 2022

Move base64 encoding/decoding to Rust side #13845

Move base64 encoding/decoding to Rust side #13845

Conversation

aapoalas commented Mar 5, 2022

Before:

After:

aapoalas Mar 5, 2022

Choose a reason for hiding this comment

aapoalas Mar 5, 2022

Choose a reason for hiding this comment

aapoalas Mar 5, 2022

Choose a reason for hiding this comment

aapoalas Mar 5, 2022

Choose a reason for hiding this comment

aapoalas Mar 5, 2022

Choose a reason for hiding this comment

aapoalas commented Mar 5, 2022