You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jan 9, 2025. It is now read-only.
Zombie driver fails when url contains "high bytes", non-ascii characters. The following example contains a valid Hungarian with accented characters.
Desktop browsers and Mink Goutte driver translate the high bytes correctly:
Zombie driver sends string as-is to javascript, then bytes above
0x7f
go wrong somewhere in Zombie:It's a bit strange how characters are truncated:
é
becomes\xe9
that is character code inISO-8859-1
ű
becomesq
because this character does not exists in that code pageCharacters that don't exist in
ISO-8859-1
encoding are represented with regular letters, for exampleq
, damage is irreversible.Example shows that desktop browsers translate non-asci characters to percent-encoded bytes using their UTF-8 character codes:
é
becomes%C3%A9
ű
becomes%C5%B1
That's correct, web servers expect urls in this way.
The text was updated successfully, but these errors were encountered: