This thesis investigates whether it is possible to create a system that converts documents between Office Open XML (OOXML), Open Document Format (ODF) and HyperText Markup Language (HTML). In addition, some studies of the formats in their context are carried out.
An examination of their specifications shows that the structure of content is the same across the three formats. Regarding formatting, the open standards differ. Between OOXML and ODF, the difference lies merely in how the document is structured. HTML, on the other hand, needs CSS in order to achieve presentation qualities equivalent to those of the other two formats.
A system was designed to create mappings between the formats. The scope was narrowed down to converting only text, without any presentational qualities. This was done to locate the lowest common denominator to build further on, and to make sure that conversion between the formats was feasible. With this restriction, the resulting system successfully converted documents containing text. The only exception was that it failed to convert the contents of DOCX to ODT and HTML. The remaining conversions ran successfully: between HTML and ODT, from ODT to DOCX, and from HTML to DOCX.
Even though the system did not carry out complete conversions between all the formats, it shows great promise towards accomplishing this, hence affirming the possibility of converting between the three aforementioned formats.