Archive for the ‘ .Net ’ Category

Xml Serialization – Tips & Tricks


Let’s say we have an XSD representing a library with a list of books and employees in it. 

<?xml version="1.0" encoding="utf-8"?>

<xsd:schema id="Lib"
           targetNamespace="http://schemas.ali.com/lib/"
           elementFormDefault="qualified"
           xmlns="http://schemas.ali.com/lib/"
           xmlns:mstns="http://schemas.ali.com/lib/"
           xmlns:xsd="http://www.w3.org/2001/XMLSchema"
           version="1.0"
           attributeFormDefault="unqualified">

  <xsd:element name="Library" type="LibraryType" />

  <xsd:complexType name="LibraryType">
    <xsd:all>
      <xsd:element name="Books" type="BooksType" minOccurs="0" maxOccurs="1" />
      <xsd:element name="Employees" type="EmployeesType" minOccurs="0" maxOccurs="1" />
    </xsd:all>
  </xsd:complexType>

  <xsd:complexType name="BooksType">
    <xsd:sequence minOccurs="0" maxOccurs="unbounded">
      <xsd:element name="Book" type="BookType" />
    </xsd:sequence>
  </xsd:complexType>

  <xsd:complexType name="EmployeesType">
    <xsd:sequence minOccurs="0" maxOccurs="unbounded">
      <xsd:element name="Employee" type="EmployeeType" />
    </xsd:sequence>
  </xsd:complexType>

  <xsd:complexType name="BookType">
    <xsd:attribute name="Title" type="xsd:string" use="required" />
    <xsd:attribute name="Author" type="xsd:string" use="optional" />
  </xsd:complexType>

  <xsd:complexType name="EmployeeType">
    <xsd:attribute name="Name" type="xsd:string" use="required" />
  </xsd:complexType>

</xsd:schema>

Here is a sample Xml using this XSD: 

<?xml version="1.0" encoding="utf-8" ?>

<Library xmlns="http://schemas.ali.com/lib/">
  <Books>
    <Book Title="Book 1" Author="Ali"/>
    <Book Title="Book 2" Author="Sara"/>
  </Books>
  <Employees>
    <Employee Name="Ali"/>
    <Employee Name="Sara"/>
  </Employees>
</Library>

Tip 1 – Generating Code from XSD 

We’d like to have an object representation of this Xml. Thus, we’ll use the Xml Schema Definition tool to generate .Net C# code from the XSD, as follows: 

  • Start Visual Studio Command Prompt
  • Run this command: xsd “path to XSD file” -language:CS /classes /outputdir:”path to output directory”

As a result, a .cs file will be generated and copied to the output directory. Take a look at the file GeneratedLibrary.cs

Tip 2 – Using List<T> instead of array 

The <Books> and <Employees> elements are generated as arrays. So I would like to change those arrays to List<T> objects to make it easier to add items to them instead of having to worry about the size of the array and expanding it. Take a look at the modified class LibraryWithLists.cs. However, this is still not good enough because if I want to create a library with one book, I’ll have to write the following code: 

LibraryType library = new LibraryType();
library.Books = new BooksType();
library.Books.Book = new List<BookType>();
BookType newBook = new BookType();
newBook.Title = "Book 1";
newBook.Author = "Author 1";
library.Books.Book.Add(newBook);

But that doesn’t seem neat enough. I want to write something like: 

LibraryType library = new LibraryType();
library.Books = new List<BookType>();
BookType newBook = new BookType();
newBook.Title = "Book 1";
newBook.Author = "Author 1";
library.Books.Add(newBook);

Thus, I’ll need to get rid of the class BooksType and change it in the declaration from “private BooksType booksField;” to “private List<BookType> booksField;”. However, making that change only is not enough. We need to tell the serializer that the new property is an XmlArrayItem and not an XmlElement. Take a look at the resulting code in Library.cs.  

Tip 3 – Serializing Object to Xml 

Now, we should be able to write code and generate Xml from the object created. For example, writing the code below 

//  Create a library
LibraryType library = new LibraryType();

//  Create Books tag
library.Books = new List<BookType>();

//  Add 5 books to the library
for (int i = 1; i <= 5; i++)
{
    BookType book = new BookType();
    book.Title = string.Format("Book {0}", i);
    book.Author = string.Format("Author {0}", i);
    library.Books.Add(book);
}

//  Create employees tag
library.Employees = new List<EmployeeType>();

//  Add 3 employees to the library
for (int i = 1; i <= 3; i++)
{
    EmployeeType employee = new EmployeeType();
    employee.Name = string.Format("Book {0}", i);
    library.Employees.Add(employee);
}

//  Now that the object is created, serialize it and print out resulting Xml
XmlSerializer serializer = new XmlSerializer(typeof(LibraryType));
StringWriter sw = new StringWriter();
serializer.Serialize(sw, library);
Console.WriteLine("Object serialized to Xml:\n\n{0}", sw.ToString());

would result in the following Xml 

<?xml version="1.0" encoding="utf-16"?>
<Library xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns="http://schemas.ali.com/lib/">
  <Books>
    <BookType Title="Book 1" Author="Author 1" />
    <BookType Title="Book 2" Author="Author 2" />
    <BookType Title="Book 3" Author="Author 3" />
    <BookType Title="Book 4" Author="Author 4" />
    <BookType Title="Book 5" Author="Author 5" />
  </Books>
  <Employees>
    <EmployeeType Name="Book 1" />
    <EmployeeType Name="Book 2" />
    <EmployeeType Name="Book 3" />
  </Employees>
</Library>

Tip 4 – Serializing without Namespace 

If you look at the serialized Xml above, you’ll notice the extra xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance“,  xmlns:xsd=”http://www.w3.org/2001/XMLSchema”  and xmlns=”http://schemas.ali.com/lib/“. These namespaces are added by default. In my case, I only care about the  xmlns=”http://schemas.ali.com/lib/” which is the url for the XSD of my Xml file. To get rid of the above namespaces and keep the one referring to the XSD, we’ll need to pass our custom XmlSerializerNamespaces object to the Serialize() method. 

//  Create our own xml serializer namespace
//  Avoiding default xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
//  and xmlns:xsd="http://www.w3.org/2001/XMLSchema"
XmlSerializerNamespaces ns = new XmlSerializerNamespaces(); 

//  Add lib namespace with empty prefix
ns.Add("", "http://schemas.ali.com/lib/"); 

//  Now serialize by passing the XmlSerializerNamespaces object
//  as a parameter to the Serialize() method
XmlSerializer serializer = new XmlSerializer(typeof(LibraryType));
StringWriter sw = new StringWriter();
serializer.Serialize(sw, library, ns);
Console.WriteLine("Object serialized to Xml:\n\n{0}", sw.ToString());

If you want to get rid of the namespaces altogether, you can simply write ns.Add(“”, “”) instead of ns.Add(“”, “http://schemas.ali.com/lib/“)

Tip 5 – Changing Encoding

If you look at the generated Xml above, you’ll notice in the Xml declaration that the encoding is set to utf-16. To make this UTF8 encoding, we’ll need to change the Stream object settings before we do the serialization. To do this, replace the code below

StringWriter sw = new StringWriter();
serializer.Serialize(sw, library, ns);

with

//  Serialize the object to Xml with UTF8 encoding
MemoryStream ms = new MemoryStream();
XmlTextWriter xmlTextWriter = new XmlTextWriter(ms, Encoding.UTF8);
xmlTextWriter.Formatting = Formatting.Indented;
serializer.Serialize(xmlTextWriter, library, ns);
ms = (MemoryStream)xmlTextWriter.BaseStream;
string xml = Encoding.UTF8.GetString(ms.ToArray());

To make this more generic, we can create a static method that can do serialization with any encoding.

/// <summary>
/// Serializes the object to Xml based on encoding and name spaces.
/// </summary>
/// <param name="serializer"></param>
/// <param name="encoding"></param>
/// <param name="ns"></param>
/// <param name="objectToSerialize"></param>
/// <returns></returns>
public static string Serialize(XmlSerializer serializer,
                           Encoding encoding,
                           XmlSerializerNamespaces ns,
                           object objectToSerialize)
{
    MemoryStream ms = new MemoryStream();
    XmlTextWriter xmlTextWriter = new XmlTextWriter(ms, encoding);
    xmlTextWriter.Formatting = Formatting.Indented;
    serializer.Serialize(xmlTextWriter, objectToSerialize, ns);
    ms = (MemoryStream)xmlTextWriter.BaseStream;
    return encoding.GetString(ms.ToArray());
}

Now we can write something like the below

string xml = Serialize(serializer, Encoding.UTF8, ns, library);

Tip 6 – Removing Xml Declaration

Let’s say you want to completely remove the Xml Declarartion <?xml Version=”1.0″ Encoding=”utf-8″?> from your serialized Xml. You can do so neatly by using an XmlWriterSettings class and setting its OmitXmlDeclaration property to true. Here is how the above Serialize method would change to support this:

Thus, we can do something like the below to omit the Xml declaration

/// <summary>
/// Serializes the object to Xml based on encoding and name spaces.
/// </summary>
/// <param name="serializer">XmlSerializer object (passing as param to avoid creating one every time)</param>
/// <param name="encoding">The encoding of the serialized Xml</param>
/// <param name="ns">The namespaces to be used by the serializer</param>
/// <param name="omitDeclaration">Whether to omit Xml declarartion or not</param>
/// <param name="objectToSerialize">The object we want to serialize to Xml</param>
/// <returns></returns>
public static string Serialize(XmlSerializer serializer,
                               Encoding encoding,
                               XmlSerializerNamespaces ns,
                               bool omitDeclaration,
                               object objectToSerialize)
{
    MemoryStream ms = new MemoryStream();
    XmlWriterSettings settings = new XmlWriterSettings();
    settings.Indent = true;
    settings.OmitXmlDeclaration = omitDeclaration;
    settings.Encoding = encoding;
    XmlWriter writer = XmlWriter.Create(ms, settings);
    serializer.Serialize(writer, objectToSerialize, ns);
    return encoding.GetString(ms.ToArray()); ;
}

Tip 7 – Deserializing Xml to Object 

string xml = Serialize(serializer, Encoding.Default, ns, true, library);

This functionality is really cool. Load Xml into an object I can understand and easily interact with. For example, you’d write the following code to read the contents of Xml file “Sample1.xml” and convert it to object LibraryType.

//  Read the first Xml file
TextReader tr = new StreamReader("Sample1.xml");

//  Deserialize the Xml file into a LibraryType object
XmlSerializer serializer = new XmlSerializer(typeof(LibraryType));
LibraryType lib1 = (LibraryType)serializer.Deserialize(tr);

If you look at the object lib1 using the debugger, you’ll see that it’s properly loaded:

LibraryType Object
LibraryType Object

We can add a new book to this object and serialize it back to Xml using the following code

if (lib1.Books == null)
{
    lib1.Books = new List<BookType>();
}

BookType newBook = new BookType();
newBook.Title = "Book 3";
lib1.Books.Add(newBook);

//  Serialize back the library type object and output Xml
StringWriter sw = new StringWriter();
serializer.Serialize(sw, lib1);
Console.WriteLine("{0}:\n\n{1}", "Sample1.xml", sw.ToString());

The resulting Xml would look something like:

<?xml version="1.0" encoding="utf-16"?>
<Library xmlns="http://schemas.ali.com/lib/">
  <Books>
    <Book Title="Book 1" Author="Ali" />
    <Book Title="Book 2" Author="Sara" />
    <Book Title="Book 3" />
  </Books>
  <Employees>
    <Employee Name="Ali" />
    <Employee Name="Sara" />
  </Employees>
</Library>

Tip 8 – Resolving Empty Lists Issue

Now, let’s try the same deserialization code above, but on xml file “Sample2.xml” which has no <Employees> tag. When we deserialize xml into an object then serialize back into Xml, we get the following:

<?xml version="1.0" encoding="utf-16"?>
<Library xmlns="http://schemas.ali.com/lib/">
  <Books>
    <Book Title="Book 1" Author="Ali" />
    <Book Title="Book 2" Author="Sara" />
  </Books>
  <Employees />
</Library>

Notice the extra <Employees/> which we really didn’t intend to have in our Xml. I guess the reason for this issue is because XmlSerializer is initializing all List<T> variables in the object on deserialization (verified that by watching the object in the debugger after deserialization) and thus we’ll get this empty tag <Employees/> when we serialize back. To resolve this issue, there are two workarounds. The first approach is definitely better, but you might find the other approach helpful based on your needs.

Approach 1

Don’t bother with the empty lists. Just clean up their corresponding empty Xml tags on serialization. The method below takes a string represenation of the Xml and removes all empty tags.

/// <summary>
/// //////////Deletes empty Xml tags from the passed xml
/// </summary>
/// <param name="xml"></param>
/// <returns></returns>
public static string CleanEmptyTags(String xml)
{
    Regex regex = new Regex(@"(\s)*<(\w)*(\s)*/>");
    return regex.Replace(xml, string.Empty);
}

With the method above in mind, our Serialize method would change as follows:

public static string Serialize(XmlSerializer serializer,
                               Encoding encoding,
                               XmlSerializerNamespaces ns,
                               bool omitDeclaration,
                               object objectToSerialize)
{
    MemoryStream ms = new MemoryStream();
    XmlWriterSettings settings = new XmlWriterSettings();
    settings.Indent = true;
    settings.OmitXmlDeclaration = omitDeclaration;
    settings.Encoding = encoding;
    XmlWriter writer = XmlWriter.Create(ms, settings);
    serializer.Serialize(writer, objectToSerialize, ns);
    string xml = encoding.GetString(ms.ToArray());
    xml = CleanEmptyTags(xml);
    return xml;
}

Approach 2

Call the deserialize as usual then set any empty instantiated lists (Count == 0) to null. Here is the static method and its helper method that does the job.


/// <summary>
/// Deserializes the passed Xml then deallocates any instantiated and empty lists.
/// </summary>
/// <param name="serializer"></param>
/// <param name="tr"></param>
/// <param name="objectNamespace"></param>
/// <returns></returns>
public static object Deserialize(XmlSerializer serializer, TextReader tr, string objectNamespace)
{
    //  Deserialize Xml into object
    object objectToReturn = serializer.Deserialize(tr);

    //  Clean up empty lists
    CleanUpEmptyLists(objectToReturn, objectNamespace);

    return objectToReturn;
}

/// <summary>
/// Sets any empty lists in the passed object to null. If the passed object itself is a list,
/// the method returns true of it's empty and false otherwise.
/// </summary>
/// <param name="o"></param>
/// <param name="objectNamespace"></param>
/// <returns></returns>
public static bool CleanUpEmptyLists(object o, string objectNamespace)
{
    //  Skip if the object is already null
    if (o == null)
    {
        return false;
    }

    //  Get the types of the object
    Type type = o.GetType();

    //  If this is an empty list, set it to null
    if (o is IList)
    {
        IList list = (IList)o;

        if (list.Count == 0)
        {
            return true;
        }
        else
        {
            foreach (object obj in list)
            {
                CleanUpEmptyLists(obj, objectNamespace);
            }
        }

        return false;
    }
    //  Ignore any objects that aren't in our namespace for perf reasons
    //  and to avoid getting errors on trying to get into every little detail
    else if (type.Namespace != objectNamespace)
    {
        return false;
    }

    //  Loop over all properties and handle them
    foreach (PropertyInfo property in type.GetProperties())
    {
        //  Get the property value and clean up any empty lists it contains
        object propertyValue = property.GetValue(o, null);
        if (CleanUpEmptyLists(propertyValue, objectNamespace))
        {
            property.SetValue(o, null, null);
        }
    }

    return false;
}

Using the above static method, you can now deserialize as follows:

//  Deserialize the Xml file into a LibraryType object
XmlSerializer serializer = new XmlSerializer(typeof(LibraryType));
LibraryType lib = (LibraryType)Deserialize(serializer, tr, typeof(LibraryType).Namespace);

If you know of a better solution, please let me know.

Source

Download full source code here. You can use it to try out the above tips one by one.

Generating a GUID


A GUID is a 128-bit integer (16 bytes) that can be used across all computers and networks wherever a unique identifier is required. Such an identifier has a very low probability of being duplicated. The term is usually referred to as GUID (Globally Unique Identifier) when working with Microsoft technologies. Otherwise, it is referred to as UUID (Universally Unique Identifier).

Here are multiple ways to generate a GUID:

Visual Studio

Run guidgen.exe. There are two ways to run guidgen:

  1. Open Visual Studio IDE, go to Tool menu, and select the Create Guid menu item. This option is available in VS 2008 but not in VS 2010. So use method 2 if you have VS 2010 installed.
  2. Open Visual Studio Command Prompt (from start menu) and enter the following command: guidgen.

Using any of the methods above, you get the following:

guidgen

You can use any of the 4 formats available, but usually the Registry format (currently selected) is the mostly used one. To generate a new GUID, click New GUID. To copy the GUID to the clipboard, click Copy.

.Net

You can generate a Guid using any .Net language. Here is how you can do it with C#.

Guid myGuid = System.Guid.NewGuid();

SQL Server

A GUID is represented as type uniqueidentifier in T-SQL. It is generated as shown below.

SELECT newid()

Online

If none of the above methods are available for you, you can get one generated online at http://www.guidgenerator.com/.

Follow

Get every new post delivered to your Inbox.

%d bloggers like this: