Java – Convert Chinese character to Unicode with native2ascii
The native2ascii is a handy tool build-in in the JDK, which is used to convert a file with ‘non-Latin 1’ or ‘non-Unicode’ characters to ‘Unicode-encoded’ characters.
1. Create a file (source.txt)
Create a file named “source.txt”, put some Chinese characters inside, and save it as “UTF-8” format.
Use native2ascii command to convert it into Unicode format.
C:\>native2ascii -encoding utf8 c:\source.txt c:\output.txt
The native2ascii will read all the characters from “c:\source.txt” and encode it with “utf8” format, and output all encoded characters to “c:\output.txt”
3. Read Output
Open the “c:\output.txt”, you will see the all encoded characters, e.g \ufeff\u6768\u6728\u91d1