| Java, Eclipse and Web programming Tutorials |
Version 1.1
Copyright © 2008 - 2009 Lars Vogel
08.12.2009
| Revision History | ||
|---|---|---|
| Revision 0.6 | 26.02.2008 | Lars Vogel |
| Sax, Dom, Stax | ||
| Revision 0.7 | 21.10.2008 | Lars Vogel / Marcus Rieck |
| Added Jaxb | ||
| Revision 0.7.1 | 20.02.2009 | Lars Vogel |
| Corrected typo in intro XML example | ||
| Revision 0.7.2 | 20.04.2009 | Lars Vogel |
| Fixed incorrect package and setter in JAXB example | ||
| Revision 0.8 | 21.07.2009 | Lars Vogel |
| Valid XML | ||
| Revision 0.9 | 05.11.2009 | Lars Vogel |
| Improved Stax reader example with nestled XML elements | ||
| Revision 1.0 | 06.11.2009 | Lars Vogel |
| Added link to RSS article | ||
| Revision 1.1 | 08.12.2009 | Lars Vogel |
| fixed example XML (comment was not closed) | ||
Table of Contents
XML stands for Extensible Markup Language and was defined 1998 by the World Wide Web Consortium (W3C).
A XML document consists out of elements, each element has a start tag, content and an end tag. A XML document must have exactly one root element, e.g. one tag which encloses the remaining tags. XML makes a difference between capital and non-capital letters.
A XML file is required to be well-formated.
Well-formated XML must apply to the following conditions:
A XML document always starts with a prolog (see below for an explanation of what a prolog is)
Every tag has a closing tag.
All tags are completely nested.
A XML file is valid if it is well-formated and if it is contains a link to a XML schema and is valid according to the schema.
In general the following is considered as advantages in using XML for data processing / representation.
XML is Plain text
XML allows the data identification without any display information
Style can be defined via XSL
Easily processed due to it regular and consistent notation
XML files are hierarchical
The following is a valid, well-formated XML file.
<?xml version="1.0"?> <!-- This is a comment --> <address> <name>Lars </name> <street> Test </street> <telephon number= "0123"/> </address>
A XML document always starts with a prolog which describes XML file. This prolog can be minimal, e.g. <?xml version="1.0"?> or can contain other information, e.g. the encoding, e.g. <?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
A tag which doesn't enclose any content is know as an "empty tag". For example: <flag/>
Java contains several methods to access XML. The following is a short overview of the available methods. The methods I consider most useful will then get later demonstrated.
W3C Standard for programming API for general programming languages. Access is done over an object tree.
Sequential reading of XML files. Can not be used to create XML documents.
SAX provides an Event-Driven XML Processing following the Push-Parsing Model. What this model means is that in SAX, Applications will register Listeners in the form of Handlers to the Parser and will get notified through Call-back methods. Here the SAX Parser takes the control over Application thread by Pushing Events to the Application.
Streaming API for XML, simply called StaX, is an API for reading and writing XML Documents. Introduced in Java 6.0 and considered as superior to SAX and DOM.
JAXB is a Java standard that defines how Java objects are converted to/from XML (specified using a standard set of mappings. JAXB defines a programmer API for reading and writing Java objects to / from XML documents and a service provider which / from from XML documents allows the selection of the JAXB implementation
JAXB applies a lot of defaults thus making reading and writing of XML via Java very easy.
Streaming API for XML, called StaX, is an API for reading and writing XML Documents.
StaX is a Pull-Parsing model. Application can take the control over parsing the XML documents by pulling (taking) the events from the parser.
The core StaX API falls into two categories and they are listed below. They are
Cursor API
Event Iterator API
Applications can any of these two API for parsing XML documents. The following will focus on the event iterator API as I consider it more convenient to use.
The event iterator API has two main interfaces: XMLEventReader for parsing XML and XMLEventWriter for generating XML.
This example is stored in project "de.vogella.xml.stax.reader".
Applications loop over the entire document requesting for the Next Event. The Event Iterator API is implemented on top of Cursor API.
In this example we will read the following XML document and create objects from it. file.
<?xml version="1.0" encoding="UTF-8"?> <config> <item date="January 2009"> <mode>1</mode> <unit>900</unit> <current>1</current> <interactive>1</interactive> </item> <item date="February 2009"> <mode>2</mode> <unit>400</unit> <current>2</current> <interactive>5</interactive> </item> <item date="December 2009"> <mode>9</mode> <unit>5</unit> <current>100</current> <interactive>3</interactive> </item> </config>
Define therefore the following class to store the individual entries of the XML file.
package de.vogella.xml.stax.model;
public class Item {
private String date;
private String mode;
private String unit;
private String current;
private String interactive;
public String getDate() {
return date;
}
public void setDate(String date) {
this.date = date;
}
public String getMode() {
return mode;
}
public void setMode(String mode) {
this.mode = mode;
}
public String getUnit() {
return unit;
}
public void setUnit(String unit) {
this.unit = unit;
}
public String getCurrent() {
return current;
}
public void setCurrent(String current) {
this.current = current;
}
public String getInteractive() {
return interactive;
}
public void setInteractive(String interactive) {
this.interactive = interactive;
}
@Override
public String toString() {
return "Item [current=" + current + ", date=" + date + ", interactive="
+ interactive + ", mode=" + mode + ", unit=" + unit + "]";
}
}
The following reads the XML file and creates a List of object Items from the entries in the XML file.
package de.vogella.xml.stax.read;
import java.io.FileInputStream;
import java.io.FileNotFoundException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import javax.xml.stream.XMLEventReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.events.Attribute;
import javax.xml.stream.events.EndElement;
import javax.xml.stream.events.StartElement;
import javax.xml.stream.events.XMLEvent;
import de.vogella.xml.stax.model.Item;
public class StaXParser {
static final String DATE = "date";
static final String ITEM = "item";
static final String MODE = "mode";
static final String UNIT = "unit";
static final String CURRENT = "current";
static final String INTERACTIVE = "interactive";
@SuppressWarnings({ "unchecked", "null" })
public List<Item> readConfig(String configFile) {
List<Item> items = new ArrayList<Item>();
try {
// First create a new XMLInputFactory
XMLInputFactory inputFactory = XMLInputFactory.newInstance();
// Setup a new eventReader
InputStream in = new FileInputStream(configFile);
XMLEventReader eventReader = inputFactory.createXMLEventReader(in);
// Read the XML document
Item item = null;
while (eventReader.hasNext()) {
XMLEvent event = eventReader.nextEvent();
if (event.isStartElement()) {
StartElement startElement = event.asStartElement();
// If we have a item element we create a new item
if (startElement.getName().getLocalPart() == (ITEM)) {
item = new Item();
// We read the attributes from this tag and add the date attribute to our object
Iterator<Attribute> attributes = startElement
.getAttributes();
while (attributes.hasNext()) {
Attribute attribute = attributes.next();
if (attribute.getName().toString().equals(DATE));
item.setDate(attribute.getValue());
}
}
if (event.isStartElement()) {
if (event.asStartElement().getName().getLocalPart()
.equals(MODE)) {
event = eventReader.nextEvent();
item.setMode(event.asCharacters().getData());
continue;
}
}
if (event.asStartElement().getName().getLocalPart().equals(UNIT)) {
event = eventReader.nextEvent();
item.setUnit(event.asCharacters().getData());
continue;
}
if (event.asStartElement().getName().getLocalPart().equals(CURRENT)) {
event = eventReader.nextEvent();
item.setCurrent(event.asCharacters().getData());
continue;
}
if (event.asStartElement().getName().getLocalPart().equals(INTERACTIVE)) {
event = eventReader.nextEvent();
item.setInteractive(event.asCharacters().getData());
continue;
}
}
// If we reach the end of an item element we add it to the list
if (event.isEndElement()){
EndElement endElement = event.asEndElement();
if (endElement.getName().getLocalPart() ==(ITEM)){
items.add(item);
}
}
}
} catch (FileNotFoundException e) {
e.printStackTrace();
} catch (XMLStreamException e) {
e.printStackTrace();
}
return items;
}
}
You can test the parser via the following test program. Please note that the file config.xml must exist in the Java project folder.
package de.vogella.xml.stax.read;
import java.util.List;
import de.vogella.xml.stax.model.Item;
public class TestRead {
public static void main(String args[]) {
StaXParser read = new StaXParser();
List<Item> readConfig = read.readConfig("config.xml");
for (Item item : readConfig) {
System.out.println(item);
}
}
}
This example is stored in project "de.vogella.xml.stax.writer".
Lets assume you would like to write the following simple XML file.
<?xml version="1.0" encoding="UTF-8"?> <config> <mode>1</mode> <unit>900</unit> <current>1</current> <interactive>1</interactive> </config>
StaX does not provide functionality to format the XML file automatically. So you have to add end-of-lines and tab information to your XML file.
package de.vogella.xml.stax.writer;
import java.io.FileOutputStream;
import javax.xml.stream.XMLEventFactory;
import javax.xml.stream.XMLEventWriter;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.events.Characters;
import javax.xml.stream.events.EndElement;
import javax.xml.stream.events.StartDocument;
import javax.xml.stream.events.StartElement;
import javax.xml.stream.events.XMLEvent;
public class StaxWriter {
private String configFile;
public void setFile(String configFile) {
this.configFile = configFile;
}
public void saveConfig() throws Exception {
// Create a XMLOutputFactory
XMLOutputFactory outputFactory = XMLOutputFactory.newInstance();
// Create XMLEventWriter
XMLEventWriter eventWriter = outputFactory
.createXMLEventWriter(new FileOutputStream(configFile));
// Create a EventFactory
XMLEventFactory eventFactory = XMLEventFactory.newInstance();
XMLEvent end = eventFactory.createDTD("\n");
// Create and write Start Tag
StartDocument startDocument = eventFactory.createStartDocument();
eventWriter.add(startDocument);
// Create config open tag
StartElement configStartElement = eventFactory.createStartElement("",
"", "config");
eventWriter.add(configStartElement);
eventWriter.add(end);
// Write the different nodes
createNode(eventWriter, "mode", "1");
createNode(eventWriter, "unit", "901");
createNode(eventWriter, "current", "0");
createNode(eventWriter, "interactive", "0");
eventWriter.add(eventFactory.createEndElement("", "", "config"));
eventWriter.add(end);
eventWriter.add(eventFactory.createEndDocument());
eventWriter.close();
}
private void createNode(XMLEventWriter eventWriter, String name,
String value) throws XMLStreamException {
XMLEventFactory eventFactory = XMLEventFactory.newInstance();
XMLEvent end = eventFactory.createDTD("\n");
XMLEvent tab = eventFactory.createDTD("\t");
// Create Start node
StartElement sElement = eventFactory.createStartElement("", "", name);
eventWriter.add(tab);
eventWriter.add(sElement);
// Create Content
Characters characters = eventFactory.createCharacters(value);
eventWriter.add(characters);
// Create End node
EndElement eElement = eventFactory.createEndElement("", "", name);
eventWriter.add(eElement);
eventWriter.add(end);
}
}
And a little test.
>
package de.vogella.xml.stax.writer;
public class TestWrite {
public static void main(String[] args) {
StaxWriter configFile = new StaxWriter();
configFile.setFile("config2.xml");
try {
configFile.saveConfig();
} catch (Exception e) {
e.printStackTrace();
}
}
}
For another (more complex example of using Stax please see Reading and creating RSS feeds via Java (with Stax)
JAXB uses annotations to indicate the central elements.
Table 1.
| Annotation | Description |
|---|---|
| @XmlRootElement(namespace = "namespace") | Define the root element for a XML tree |
| @XmlType(propOrder = { "field2", "field1",.. }) | Allows to define the order in which the fields are written in the XML file |
| @XmlElement(name = "neuName") | Define the XML element which will be used. Only need to be used if the neuNeu is different then the JavaBeans Name |
Create a new Java project called "de.vogella.xml.jaxb".
Create the following domain model with the JAXB annotations.
package de.vogella.xml.jaxb.model;
import javax.xml.bind.annotation.XmlElement;
import javax.xml.bind.annotation.XmlRootElement;
import javax.xml.bind.annotation.XmlType;
@XmlRootElement(name = "book")
// If you want you can define the order in which the fields are written
// Optional
@XmlType(propOrder = { "author", "name", "publisher", "isbn" })
public class Book {
private String name;
private String author;
private String publisher;
private String isbn;
// If you like the variable name, e.g. "name", you can easily change this
// name for your XML-Output:
@XmlElement(name = "bookName")
public String getName() {
return name;
}
public void setName(String name) {
this.name = name;
}
public String getAuthor() {
return author;
}
public void setAuthor(String author) {
this.author = author;
}
public String getPublisher() {
return publisher;
}
public void setPublisher(String publisher) {
this.publisher = publisher;
}
public String getIsbn() {
return isbn;
}
public void setIsbn(String isbn) {
this.isbn = isbn;
}
}
package de.vogella.xml.jaxb.model;
import java.util.ArrayList;
import javax.xml.bind.annotation.XmlElement;
import javax.xml.bind.annotation.XmlElementWrapper;
import javax.xml.bind.annotation.XmlRootElement;
//This statement means that class "Bookstore.java" is the root-element of our example
@XmlRootElement(namespace = "de.vogella.xml.jaxb.model")
public class Bookstore {
// XmLElementWrapper generates a wrapper element around XML representation
@XmlElementWrapper(name = "bookList")
// XmlElement sets the name of the entities
@XmlElement(name = "book")
private ArrayList<Book> bookList;
private String name;
private String location;
public void setBookList(ArrayList<Book> bookList) {
this.bookList = bookList;
}
public ArrayList<Book> getBooksList() {
return bookList;
}
public String getName() {
return name;
}
public void setName(String name) {
this.name = name;
}
public String getLocation() {
return location;
}
public void setLocation(String location) {
this.location = location;
}
}
Create the following test program for writing and reading the XML file.
package test;
import java.io.FileReader;
import java.io.FileWriter;
import java.io.IOException;
import java.io.Writer;
import java.util.ArrayList;
import javax.xml.bind.JAXBContext;
import javax.xml.bind.JAXBException;
import javax.xml.bind.Marshaller;
import javax.xml.bind.Unmarshaller;
import de.vogella.xml.jaxb.model.Book;
import de.vogella.xml.jaxb.model.Bookstore;
public class BookMain {
private static final String BOOKSTORE_XML = "./bookstore-jaxb.xml";
public static void main(String[] args) throws JAXBException, IOException {
ArrayList<Book> bookList = new ArrayList<Book>();
// create books
Book book1 = new Book();
book1.setIsbn("978-0060554736");
book1.setName("The Game");
book1.setAuthor("Neil Strauss");
book1.setPublisher("Harpercollins");
bookList.add(book1);
Book book2 = new Book();
book2.setIsbn("978-3832180577");
book2.setName("Feuchtgebiete");
book2.setAuthor("Charlotte Roche");
book2.setPublisher("Dumont Buchverlag");
bookList.add(book2);
// create bookstore, assigning book
Bookstore bookstore = new Bookstore();
bookstore.setName("Fraport Bookstore");
bookstore.setLocation("Frankfurt Airport");
bookstore.setBookList(bookList);
// create JAXB context and instantiate marshaller
JAXBContext context = JAXBContext.newInstance(Bookstore.class);
Marshaller m = context.createMarshaller();
m.setProperty(Marshaller.JAXB_FORMATTED_OUTPUT, Boolean.TRUE);
m.marshal(bookstore, System.out);
Writer w = null;
try {
w = new FileWriter(BOOKSTORE_XML);
m.marshal(bookstore, w);
} finally {
try {
w.close();
} catch (Exception e) {
}
}
// get variables from our xml file, created before
System.out.println();
System.out.println("Output from our XML File: ");
Unmarshaller um = context.createUnmarshaller();
Bookstore bookstore2 = (Bookstore) um.unmarshal(new FileReader(
BOOKSTORE_XML));
for (int i = 0; i < bookstore2.getBooksList().toArray().length; i++) {
System.out.println("Book " + (i + 1) + ": "
+ bookstore2.getBooksList().get(i).getName() + " from "
+ bookstore2.getBooksList().get(i).getAuthor());
}
}
}
If you run the BookMain a XML file will be created from the object. Afterwards the file is read again and the object are created again.
XPath (XML Path Language) is a language for selecting / searching nodes from an XML document. Java 5 introduced the javax.xml.xpath package which provides a XPath library.
The following explains how to use XPath to query an XML document via Java.
The following explains how to use XPath. Create a new Java project called "UsingXPath".
Create the following xml file.
<?xml version="1.0" encoding="UTF-8"?> <people> <person> <firstname>Lars</firstname> <lastname>Vogel</lastname> <city>Heidelberg</city> </person> <person> <firstname>Jim</firstname> <lastname>Knopf</lastname> <city>Heidelberg</city> </person> <person> <firstname>Lars</firstname> <lastname>Strangelastname</lastname> <city>London</city> </person> <person> <firstname>Landerman</firstname> <lastname>Petrelli</lastname> <city>Somewhere</city> </person> <person> <firstname>Lars</firstname> <lastname>Tim</lastname> <city>SomewhereElse</city> </person> </people>
Create a new package "myxml" and a new Java class "QueryXML".
package myxml;
import java.io.IOException;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpression;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;
public class QueryXML {
public void query() throws ParserConfigurationException, SAXException,
IOException, XPathExpressionException {
// Standard of reading a XML file
DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
factory.setNamespaceAware(true);
DocumentBuilder builder;
Document doc = null;
XPathExpression expr = null;
builder = factory.newDocumentBuilder();
doc = builder.parse("person.xml");
// Create a XPathFactory
XPathFactory xFactory = XPathFactory.newInstance();
// Create a XPath object
XPath xpath = xFactory.newXPath();
// Compile the XPath expression
expr = xpath.compile("//person[firstname='Lars']/lastname/text()");
// Run the query and get a nodeset
Object result = expr.evaluate(doc, XPathConstants.NODESET);
// Cast the result to a DOM NodeList
NodeList nodes = (NodeList) result;
for (int i=0; i<nodes.getLength();i++){
System.out.println(nodes.item(i).getNodeValue());
}
// New XPath expression to get the number of people with name lars
expr = xpath.compile("count(//person[firstname='Lars'])");
// Run the query and get the number of nodes
Double number = (Double) expr.evaluate(doc, XPathConstants.NUMBER);
System.out.println("Number of objects " +number);
// Do we have more then 2 people with name lars?
expr = xpath.compile("count(//person[firstname='Lars']) >2");
// Run the query and get the number of nodes
Boolean check = (Boolean) expr.evaluate(doc, XPathConstants.BOOLEAN);
System.out.println(check);
}
public static void main(String[] args) throws XPathExpressionException, ParserConfigurationException, SAXException, IOException {
QueryXML process = new QueryXML();
process.query();
}
}
Thank you for practicing with this tutorial.
Please note that I maintain this website in my private time. If you like the information I'm providing please help me by donating.For questions and discussion around this article please use the www.vogella.de Google Group. Also if you note an error in this article please post the error and if possible the correction to the Group.
I believe the following is a very good guideline for asking questions in general and also for the Google group How To Ask Questions The Smart Way.
http://www.ibm.com/developerworks/xml/library/x-atomtordf/ Convert Atom to RDF via Stax from Brian M. Carey
http://www.vogella.de/articles/RSSFeed/article.html Read and write RSS feeds via Java (Stax)
http://java.sun.com/developer/technicalArticles/WebServices/jaxb/ JAXB Overview
http://www.ibm.com/developerworks/library/x-javaxpathapi.html The Java XPath API (by Elliotte Rusty Harold)