SAX Error – MalformedByteSequenceException: Invalid byte 1 of 1-byte UTF-8 sequence.
Published: December 17, 2009 , Updated: April 2, 2010 , Author: mkyong
Problem
When some special UTF-8 characters inside a XML file, and your SAX’s parser is not configure to parse the UTF-8 properly, the following exception will be thrown.
com.sun.org.apache.xerces.internal.impl.io.MalformedByteSequenceException: Invalid byte 1 of 1-byte UTF-8 sequence. ...
Solution
The solution is quite simple, get the content in UTF-8 format, and override the SAX input source.
File file = new File("c:\\file-utf.xml"); InputStream inputStream= new FileInputStream(file); Reader reader = new InputStreamReader(inputStream,"UTF-8"); InputSource is = new InputSource(reader); is.setEncoding("UTF-8"); saxParser.parse(is, handler);
You can read the full example here – how do read UTF-8 XML file with SAX parser
Any Java questions or problems? please post at this JavaNullPointer.com forum, see you there ~
Thanks! This helped me resolve my issue.
Thanks, great and quick help!
[...] SAX Error – MalformedByteSequenceException: Invalid byte 1 of 1-byte UTF-8 sequence Common SAX error for XML file contains Unicode character. [...]
[...] you used normal SAX’s way to parse it, you may encounter this “Invalid byte 1 of 1-byte UTF-8 sequence” [...]