XML Document Object Model
The .NET framework offers classes that can read and manipulate XML documents. These classes are located in the System.Xmlnamespace. .NET uses the XML Document Object Model which is a set of classes that represents different parts of an XML document (and the document itself). The following are some of the classes offered by XML DOM.
Class | Description |
---|---|
XmlNode | Represents a node in the XML document tree. Nodes are anything you see in an xml file such as elements attributes, comments, and even white spaces. This class is the base class of the other XML DOM classes. |
XmlDocument | Represents the actual XML document. Can be used to load an existing XML file or save a constructed XML file. |
XmlElement | Represents a single XML element. |
XmlAttribute | Represents a single XML attribute. |
XmlText | Represents that text inside an XML element. |
XmlComment | Represents an XML comment. |
Figure 1 – XML DOM Classes
An XML Document is represented by the XmlDocument class. It contains the actual XML markup that you will be reading or manipulating. An XML Document should start with a root element which is also called the document element. This can be represented as an XmlElement. Everything inside it is its child nodes. You must also understand the concept of parent and child nodes. A parent node is a node that contains child nodes. As an example, consider this element that contains multiple elements:
<Product>
<ProductID>001</ProductID>
<ProductName>Shampoo</ProductName>
<Price>10</Price>
</Product>
Here, the parent node is the Product element. It contains two child nodes which are the ProductID, ProductName, and Price elements. If for example the ProductName still has more elements inside it, then it will be considered as a parent node of those elements it contains.
Let’s take a look at some properties and methods of each of these classes.
XmlNode
The XmlNode represents any part of the XML and is the base class of other classes in the XML DOM. For example, the XmlElement class is derived from the XmlNode so it shares XmlNode’s properties and methods. The following are some useful properties of the XmlNodebase class.
Property | Description |
---|---|
Attributes | A collection of XmlAttribute objects which represents the attributes that this node contain. |
ChildNodes | A collection of XmlNodes that serves as child nodes of this node. |
FirstChild | Gets the first child node of this node. |
HasChildNodes | Tells whether this node has child nodes. |
InnerText | Gets all the inner text of all the child nodes of these nodes. If multiple inner texts are present, then they are combined into a single string. |
InnerXml | Gets the XML string between the opening and closing tags of this node. |
LastChild | Get the last child node of this node. |
Name | Gets the name of the node. |
NextSibling | Get the node immediately following this node. |
NodeType | Gets the type of this node. |
OuterXml | Gets the markup representing this node and all its child nodes. |
OwnerDocument | Gets the XmlDocument to which this node belongs. |
ParentNode | Gets the parent node of this node. |
PreviousSibling | Get the node immediately preceding this node. |
Value | Gets or sets the value of this node. |
Figure 2 – XmlNode Properties
The Attributes property of a node contains a list of attributes that a node or element contains. Each attribute is represented by the XmlAttribute class. Consider this example:
<Person name="John Smith" age="30"></Person>
This element’s Attributes property contains two XmlAttribute objects that hold data for the name and age attribute.
The ChildNodes property is a collection of child nodes of the current node. Each child node is also an XmlNode object. You can test if the node has child nodes by checking the value of the HasChildNodes property. This will return true if it has at least one or more child nodes, and false if it has none. The FirstChild and LastChild properties simply get the first and last child nodes of the current node. PreviousSibling and NextSibling properties gets the previous and next nodes of the current node.
<PreviousNode></PreviousNode>
<CurrentNode></CurrentNode>
<NextNode></NextNode>
The Parent node gets the reference the parent of a child node. If no parent exists, then this property contains null.
The Name property indicates the name of the element.
<Person>Example</Person>
The Name of the above element is Person.
The InnerText retrieves the text located between the opening and closing tags of a control. If you are working with a comment, then this property returns the text of the comment. If the element has multiple nodes inside it, the InnerText property gets all the inner texts of all the child nodes and combine them into a single string. Consider the following XML markup.
<Example>
<Sample1>Text1</Sample1>
<Sample2>Text2</Sample2>
<Sample3>
<Sample4>Text3</Sample4>
</Sample3>
</Example>
The InnerText property will produce the output Text1Text2Text3. Notice that the third child element has another child node and the InnerText property still successfully retrieve the text inside that child node.
The InnerXml is similar to InnerText property, but this property returns the actual XML markup inside an element. The OuterXmlproperty is similar to InnerXml but includes the current node in the results.
The OwnerDocument property returns the reference to an XmlDocument that owns the current node.
The NodeType contains a value from System.Xml.XmlNodeType enumeration. You can tell the type of a node by using this property. The table below lists the values of the XmlNodeType enumeration.
Value | Description |
---|---|
Element | The node is an element. |
Attribute | The node is an attribute. |
Text | The node is a text. |
Comment | The node is a comment. |
Document | The root element of a document. |
Whitespace | A whitespace such as space, new lines or tab. |
XmlDeclaration | An XML Declaration element. |
Figure 3 – XmlNodeType Enumeration Values
Finally, the Value property specifies the value of the node. The Value property varies on each type of node. For example, calling the Value property a comment returns the text of the comment. The Value property of an attribute is the value of the attribute. Note that you must use InnerText or InnerXml to access contents of an XmlElement instead of using the Value property.
The following are some useful methods of the XmlNode class.
Method | Description |
---|---|
AppendChild | Adds a new child node for this node. |
InsertAfter | Insert a node immediately after the end of this node. |
InsertBefore | Insert a node immediately before the beginning of this node. |
PrependChild | Adds a node to the beginning of the list of the child nodes of this node. |
RemoveAll | Removes the child nodes and attributes of this node. |
RemoveChild | Removes a specified child node. |
ReplaceChild | Replaces an old child node with a new one. |
SelectNodes | Selects a list of node matching an XPath expression. |
SelectSingleNode | Selects the first node that matches the XPath expression. |
WriteContentTo | Writes all the child nodes of a node to an XmlWriter. |
WriteTo | Writes the current node to a specified XmlWriter. |
Figure 4 – XmlNode Methods
Some of this methods will be demonstrated in later lessons about writing and reading XML documents.
XmlDocument
The XmlDocument class represents the actual XML document and all its contents. The following are some properties exclusive to this class.
Property | Description |
---|---|
DocumentElement | The root element of the document. |
PreserveWhitespace | Tells whether to preserve white spaces in the content of the document. |
The following are some methods exclusive to XmlDocument class.
Method | Description |
---|---|
CreateAttribute | Creates an XmlAttribute object. |
CreateComment | Creates an XmlComment object. |
CreateElement | Creates an XmlElement object. |
CreateTextNode | Creates a XmlText object with the specified text. |
CreateXmlDeclaration | Creates an XmlDeclaration with the specified values. |
GetElementById | Gets the XmlElement with the specified ID. |
GetElementsByTagName | Returns a list of descendant elements that matches the specified name. |
Load | Loads an XML document from a file. |
LoadXml | Loads an XML from a string. |
Save | Saves the content of this XmlDocument into a specified file. |
Notice the methods which starts with Create. This methods are used to create XML DOM objects. For example, if you want to create an element for the current document, then you use the CreateElement method. These is because using the DOM classes’ constructors are prohibited due to its protection level. Most of these methods will be demonstrated in later lessons.
XmlElement
The XmlElement class represents a single XML element. The following are some useful properties of the XmlElement class.
Property | Description |
---|---|
Attributes | Contains all the attributes of an element. |
HasAttributes | Tells whether the element contains at least one or more attributes. |
IsEmpty | Specifies whether this element, when empty, should be written in short tag format (<element />) or long tag format (<element></element>). A value of true uses short tag format and false for long tag format. |
Value | The text between the opening tag and closing tag of this element. If it contains child elements, then this property will be empty. |
The following are some methods exclusive to XmlElement.
Method | Description |
---|---|
GetAttribute | Returns the value of the attribute with the specified name. |
GetAttributeNode | Returns the XmlAttribute with the specified name. |
GetElementsByTagName | Returns a list of child elements that matches the specified name. |
The XmlText and XmlComment classes represents a text inside an element and a comment respectively. They are too simple to have their own sections in this lesson so I decided not to discuss them in detail.