unicode - downgrade non-ASCII symbols to closest 7-bit ASCII equivalent (preferably Java)
Is there a simple / lightweight solution to replace at least some non-ASCII symbols with their closest ASCII equivalents? For example, this string
abc-åäö.txt
to
abc-aao.txt
A little background: ZIP tools do not handle UTF-8 file names well, so a downgrade is required. AFAICR, Google's "Download attachment as a single zip file" feature replaces every non-ASCII symbol with the '_' character.
PS: The code can be in any other language; if it is more or less understandable, I will port it to Java. PPS: This is my first question so far, so please go easy on me if something is not right.
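Look at java.text.Normalizer.
It can help you convert characters to their equivalents: normalizing to a decomposed form splits an accented letter into a base letter plus combining marks, and the marks can then be stripped.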
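A minimal sketch of that idea, assuming NFD decomposition followed by removal of combining marks is what is wanted. The class name `AsciiDowngrade` and the method `toAscii` are hypothetical, not part of the JDK, and the final fallback to '_' is an assumption borrowed from the behaviour described in the question:

```java
import java.text.Normalizer;

// Hypothetical helper class, shown only to illustrate java.text.Normalizer.
final class AsciiDowngrade {

    static String toAscii(String input) {
        // NFD splits each accented letter into a base letter plus combining marks.
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        // \p{M} matches the combining marks; remove them and keep the base letters.
        String stripped = decomposed.replaceAll("\\p{M}+", "");
        // Assumption: anything still outside 7-bit ASCII is replaced with '_'.
        return stripped.replaceAll("[^\\x00-\\x7F]", "_");
    }

    public static void main(String[] args) {
        // \u00e5 \u00e4 \u00f6 are the letters from the question's example.
        System.out.println(toAscii("abc-\u00e5\u00e4\u00f6.txt")); // prints abc-aao.txt
    }
}
```

Letters that have no canonical decomposition (for example 'ø' or 'ß') are not touched by the normalization step, so in this sketch they end up as '_'. A small hand-written mapping table would be needed to turn them into something more readable.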