Global Companies Relying on Business Critical xDoc Based Applications... 

 

 
 

CambridgeDocs xDoc Word to HTML Conversions

Questions? Download xDoc View Download Documentation
xDoc provides out-of-the-box support to convert your Microsoft Word *.doc files into HTML using pure Java and XML transformation technologies.

With xDoc, you can transform your Word documents for deployment on internal and external websites using the best of multi-threaded J2SE transformation technologies, and you can do this on Solaris or Linux servers as well as on Windows 2000/XP machines, and with any Java application server, be it BEA WebLogic or IBM WebSphere.  And since xDoc reads the binary *.doc file directly, without needing to automate Microsoft Word, you get enhanced, robust server performance in doing your conversions.

Furthermore, because of the open XML-based approach that xDoc uses in transforming the Word documents, you have programmatic access to the content before it gets transformed, which allows you to index it in an Oracle or Documentum database, say, or modify a Word "template" with real data before publishing it to the Web.

Java Word to HTML Conversion Benefits:
Convert Microsoft Word documents on Windows, Linux and Solaris machines
Convert documents in a multi-threaded server environment
Works with any Java application server like BEA WebLogic or IBM WebSphere
Create custom documents on a server using *.doc files as templates
Index Word content in a Documentum or Oracle Database

xDoc uses two steps to transform your Word *.doc files into HTML. 

As part of its integrated multi-step approach, xDoc first uses the Java Word Driver to read a binary Microsoft *.doc file, and convert it into a stylistic XML output that captures all of the document's content, styles, formatting, layout and graphics information.

Next, xDoc uses the Java XSLT Driver to transform that stylistic XML into HTML, using provided, out-of-the-box stylesheets, as shown below.