unicode - downgrade non-ascii symbols to closest 7-bit ASCII equivalent (preferrably Java) -


Is there a simple / light solution to replace at least some non-ASCII symbols for related ASCII analogs? For example, this string

  abc-åäö.txt  

to

  abc-aao.txt  

A little background: ZIP-tools do not support UTF-8 credentials, so downgrades are required. AFAICR Google replaces any non-ASCII symbol with the '_' character with the "Download attachment as a single zip file" feature.

PS: This code can be done in any other language, if it is more or less understood, I will close Java. PPS: My first question is so far, please drop me down from the ground no it's fine?

Look at java.text.Normalizer This can help you to convert equivalent characters:


Comments

Popular posts from this blog

paypal - How to know the URL referrer in PHP? -

oauth - Facebook OAuth2 Logout does not remove fb_ cookie -

wpf - Line breaks and indenting for the XAML of a saved FlowDocument? -