Should we store Japanese/French and other non-ASCII characters in java source code? -
I am using some Japanese / French words in some regular expressions inside my source code. I do not want to convert them into track by \ u notation because it would be difficult and it would be difficult to catch the bug.
Do we have a standard practice in dealing with non-ASCII characters in the source code or is it OK to use them.
Thanks
This is risky as the program behaves now on the machine platform Depending on the default encoding on which the program is compiled or compiler logic And it also makes it difficult to catch the bug.
If there are some such reges, then I like to use Unicode migration. If there is a lot, I use UTF-8 for bullet cutting and source code, but only after I have it
- a build script (and the app can be used with that script
- Some unit tests which confirm that regexes are working correctly
- An automated build server Which runs unit tests for each build
Comments
Post a Comment