Releases: rspeer/python-ftfy
v6.3.1
v6.3.0
- Switched packaging from poetry to uv.
- Uses modern Python packaging exclusively (no setup.py).
- Added support for mojibake in Windows-1257 (Baltic).
- Detects mojibake for "Ü" in an uppercase word, such as "ZURÜCK".
- Expanded a heuristic that notices improbable punctuation.
- Fixed a false positive involving two concatenated strings, one of which began with the § sign.
- Rewrote
chardata.py
to be more human-readable and debuggable, instead of being full of keysmash-like character sets.
See CHANGELOG.md for the full changelog.
Trusted Publishing is now supposed to create these releases on GitHub at the same time that it publishes to PyPI, following the user guide. It didn't, but it's supposed to.
I think I've fixed the problem (upgrading to sigstore/[email protected]
from the broken v2.1.1), and maybe future releases really will be as simple as pushing a tag.
v6.2.3
See CHANGELOG.md for version changes.
Can you tell that I'm creating these releases manually? I've set up a GitHub action that publishes to PyPI, which is reasonably well documented, but I can't find one that creates a release here on GitHub with the Python package included. Please let me know (or make a PR) if you know how.
v6.0.3
Updates in 6.0.x:
- New function: ftfy.fix_and_explain() can describe all the transformations that happen when fixing a string. This is similar to what ftfy.fixes.fix_encoding_and_explain() did in previous versions, but it can fix more than the encoding.
- fix_and_explain() and fix_encoding_and_explain() are now in the top-level ftfy module.
- Changed the heuristic entirely. ftfy no longer needs to categorize every Unicode character, but only characters that are expected to appear in mojibake.
- Because of the new heuristic, ftfy will no longer have to release a new version for every new version of Unicode. It should also run faster and use less RAM when imported.
- The heuristic ftfy.badness.is_bad(text) can be used to determine whether there appears to be mojibake in a string. Some users were already using the old function sequence_weirdness() for that, but this one is actually designed for that purpose.
- Instead of a pile of named keyword arguments, ftfy functions now take in a TextFixerConfig object. The keyword arguments still work, and become settings that override the defaults in TextFixerConfig.
- Added support for UTF-8 mixups with Windows-1253 and Windows-1254.
- Overhauled the documentation: https://ftfy.readthedocs.org
- Requires Python 3.6 or later.
v5.5.1
See CHANGELOG.md for release notes.