unicode - downgrade non-ascii symbols to closest 7-bit ASCII equivalent (preferrably Java) -


Is there a simple / light solution to replace at least some non-ASCII symbols for related ASCII analogs? For example, this string

  abc-åäö.txt  

to

  abc-aao.txt  

A little background: ZIP-tools do not support UTF-8 credentials, so downgrades are required. AFAICR Google replaces any non-ASCII symbol with the '_' character with the "Download attachment as a single zip file" feature.

PS: This code can be done in any other language, if it is more or less understood, I will close Java. PPS: My first question is so far, please drop me down from the ground no it's fine?

Look at java.text.Normalizer This can help you to convert equivalent characters:


Comments

Popular posts from this blog

c# - sqlDecimal to decimal clr stored procedure Unable to cast object of type 'System.Data.SqlTypes.SqlDecimal' to type 'System.IConvertible' -

Calling GetGUIThreadInfo from Outlook VBA -

Obfuscating Python code? -