How to read XML file in Java – (SAX Parser)
SAX parser is work differently with DOM parser, it either load any XML document into memory nor create any object representation of the XML document. Instead, the SAX parser use callback function (org.xml.sax.helpers.DefaultHandler) to informs clients of the XML document structure.
SAX Parser is faster and uses less memory than DOM parser.See following SAX callback methods :
- startDocument() and endDocument() – Method called at the start and end of an XML document.
- startElement() and endElement() – Method called at the start and end of a document element.
- characters() – Method called with the text contents in between the start and end tags of an XML document element.
1. XML file
Create a simple XML file.
<?xml version="1.0"?> <company> <staff> <firstname>yong</firstname> <lastname>mook kim</lastname> <nickname>mkyong</nickname> <salary>100000</salary> </staff> <staff> <firstname>low</firstname> <lastname>yin fong</lastname> <nickname>fong fong</nickname> <salary>200000</salary> </staff> </company>
2. Java file
Use SAX parser to parse the XML file.
import javax.xml.parsers.SAXParser; import javax.xml.parsers.SAXParserFactory; import org.xml.sax.Attributes; import org.xml.sax.SAXException; import org.xml.sax.helpers.DefaultHandler; public class ReadXMLFile { public static void main(String argv[]) { try { SAXParserFactory factory = SAXParserFactory.newInstance(); SAXParser saxParser = factory.newSAXParser(); DefaultHandler handler = new DefaultHandler() { boolean bfname = false; boolean blname = false; boolean bnname = false; boolean bsalary = false; public void startElement(String uri, String localName,String qName, Attributes attributes) throws SAXException { System.out.println("Start Element :" + qName); if (qName.equalsIgnoreCase("FIRSTNAME")) { bfname = true; } if (qName.equalsIgnoreCase("LASTNAME")) { blname = true; } if (qName.equalsIgnoreCase("NICKNAME")) { bnname = true; } if (qName.equalsIgnoreCase("SALARY")) { bsalary = true; } } public void endElement(String uri, String localName, String qName) throws SAXException { System.out.println("End Element :" + qName); } public void characters(char ch[], int start, int length) throws SAXException { if (bfname) { System.out.println("First Name : " + new String(ch, start, length)); bfname = false; } if (blname) { System.out.println("Last Name : " + new String(ch, start, length)); blname = false; } if (bnname) { System.out.println("Nick Name : " + new String(ch, start, length)); bnname = false; } if (bsalary) { System.out.println("Salary : " + new String(ch, start, length)); bsalary = false; } } }; saxParser.parse("c:\\file.xml", handler); } catch (Exception e) { e.printStackTrace(); } } }
Result
Start Element :company Start Element :staff Start Element :firstname First Name : yong End Element :firstname Start Element :lastname Last Name : mook kim End Element :lastname Start Element :nickname Nick Name : mkyong End Element :nickname Start Element :salary Salary : 100000 End Element :salary End Element :staff Start Element :staff Start Element :firstname First Name : low End Element :firstname Start Element :lastname Last Name : yin fong End Element :lastname Start Element :nickname Nick Name : fong fong End Element :nickname Start Element :salary Salary : 200000 End Element :salary End Element :staff End Element :company
This example may encounter exceptions for UTF-8 XML file, please read this article about how to read the XML “UTF-8″ file in SAX
You may interest to read this How to read XML file in Java – (DOM Parser)
hi
i am trying to find difference between two xml files please help me on this….
[...] A partir de esto podríamos, en lugar de escribir por pantalla los valores, procesarlos y crear atributos de objetos de clases que hayamos definido nosotros. Un ejemplo de esto se puede encontrar aquí. [...]
super example
Very straightforward, simple example. Nice job.
how to search xml file data based on the given keyword from html form using sax parser?
Hi
I work in Middle office and need to parse and write thousands of fixml messages. Can you please let me know the fastet xml parser some thing like that (10000/min) and share a tutorial
Thanks
Sudar
I want to parse a content some thing like this. It is html formatted content.
I want the values present in tags like name ,age,sex etc. How can I do that?
Service: Employee File 10 records
Company: A2B company
Name
Employee number
Sex
Age
Designation
Date
Mark
1001
Male
25
Analyst
2005-02-01
ricky
1005
Male
28
Analyst
2008-12-01
David
1007
Male
35
SeniorAnalyst
2005-08-11
hilary
1008
female
28
maketing
2001-02-01
Hi,
Can anyone please tell me how to Parse multiple XML files and store the result in a single TEXT file.
Just read xml file by file normal and store the result in a single text file. What’s your problem?
can u explain with an example what is the difference between SAX Parser and DOM Parser?
DOM – reads all structure into memory, and data stays in memory, and next you can read your data from memory, and make a lot of operations such as search. (useful for small files)
SAX – nothing is stored in memory, so you can’t restore any data by later operations. file is parsed once, and you must catch data while it is parsed (useful for large files)
Thanks for your invaluable inputs. Some extras,
DOM – Random access to the XML file.
SAX – Using callback mechanism, parse XML file from top to bottom hierarchy, ya usually in large file which required access sequentially.
I tried to read xml file ( approx. 600 MB ) on 3.2GB RAM computer and i got outofmemory exception with XOM, VTD-XML ex. Only this code makes it successfully. Thank you
yes, SAX is designed to parse large XML file. Thanks for sharing your experience.
[...] ?SAX??xml?? [...]
Perfect Hello World to see how to parse a file using SAX. Thank you!
On executing the code, I get “javax.xml.parsers.FactoryConfigurationError: Provider org.apache.xerces.jaxp.SAXParserFactoryImpl not found”.
I’ve already added following JARs to classpath:
xercesImpl-2_9_1 , xml-apis-2_9_1
Any help is greatly appreciated !!
I think that this is the best tutorial about SAX and XML!
[...] Read XML file – (SAX Parser) Example to read a XML file with SAX parser. [...]
[...] You can compare the syntax with SAX and DOM parser in the following two examples – How to read XML file in Java – (SAX Parser) – How to read XML file in Java – (DOM [...]
[...] Written on December 17, 2009 at 6:04 am by mkyong Here’s my previous article about how to read XML file in Java SAX Parser. The previous example is working fine to parse the plain text (ANSI) XML file, however, it may [...]
[...] This is the Java’s example i used to read the XML file with SAX. [...]
while i try to compile the sax parser code i got the following error:
org.xml.sax.SAXParseException: Content is not allowed in prolog.
at org.apache.xerces.util.ErrorHandlerWrapper.createSAXParseException(Unknown Source)
at org.apache.xerces.util.ErrorHandlerWrapper.fatalError(Unknown Source)
at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source)
at org.apache.xerces.impl.XMLScanner.reportFatalError(Unknown Source)
at org.apache.xerces.impl.XMLDocumentScannerImpl$PrologDispatcher.dispatch(Unknown Source)
at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
at javax.xml.parsers.SAXParser.parse(Unknown Source)
at javax.xml.parsers.SAXParser.parse(Unknown Source)
at com.mypack.ReadXMLFileSAX.main(ReadXMLFileSAX.java:99)
This may cause by the invalid XML content. please read
http://www.mkyong.com/java/sax-error-content-is-not-allowed-in-prolog/
Or, can you provide me your XML file?
[...] If you want to compare different implementation between DOM and SAX, Please visit here for How to read XML file in Java – (SAX Parser) [...]