CambridgeDocs xDoc Word to XML Conversions
xDoc transforms your Microsoft Word documents using the Java Word Driver, the most sophisticated and complete Java-based parser for Microsoft *.doc files available today. xDoc uses the Java Word Driver as part of its integrated, multi-step approach to converting content. The Java Word Driver reads the binary Microsoft *.doc format and extracts as much information as possible, including text, formatting, styles, layout, and graphical information. It then outputs a complete stylistic XML rendering of the document. The stylistic XML is "non-lossy" and gives you unprecedented programmatic access to both the original Word content and its formatting, which you can then map to XML schemas and DTDs such as DocBook and DITA. Furthermore, you can use a Microsoft Word *.doc file as a template and fill it in with live data for multi-channel publishing, for example by converting the content to HTML, PDF, and RTF. And, for the first time, you can do all of this on Solaris and Linux servers as well as on Windows machines, because the Java Word Driver is cross-platform and does not automate Microsoft Word in any way.
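To make the idea of mapping stylistic XML to a schema such as DocBook more concrete, here is a minimal sketch in Python. It is only an illustration: the `<document>` and `<para style="...">` markup is a hypothetical stand-in, not the actual format emitted by the Java Word Driver, and the function name `stylistic_to_docbook` is likewise invented for this example. It assumes that heading paragraphs carry a style name beginning with "Heading".

```python
import xml.etree.ElementTree as ET

# Hypothetical stylistic XML. This is NOT the real xDoc output format;
# the element and attribute names are assumptions for illustration only.
STYLISTIC_XML = """\
<document>
  <para style="Heading 1">Installation</para>
  <para style="Normal">Unpack the archive.</para>
  <para style="Normal">Run the installer.</para>
  <para style="Heading 1">Usage</para>
  <para style="Normal">Start the server.</para>
</document>
"""


def stylistic_to_docbook(xml_text):
    """Map style-tagged paragraphs to a DocBook-like article tree.

    Paragraphs whose style starts with "Heading" open a new <section>
    with a <title>; all other paragraphs become <para> elements inside
    the most recently opened section (or the article itself if no
    heading has been seen yet).
    """
    source = ET.fromstring(xml_text)
    article = ET.Element("article")
    current = article
    for para in source.iter("para"):
        style = para.get("style", "")
        text = (para.text or "").strip()
        if style.startswith("Heading"):
            current = ET.SubElement(article, "section")
            ET.SubElement(current, "title").text = text
        else:
            ET.SubElement(current, "para").text = text
    return article


docbook = stylistic_to_docbook(STYLISTIC_XML)
print(ET.tostring(docbook, encoding="unicode"))
```

Because the formatting survives in the intermediate XML, the structural decision (what counts as a section title) can be made from style names alone. A real mapping would likely need to handle nested heading levels, lists, and tables as well.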
The list below provides just a sampling of the items that the xDoc Java Word Driver lets you identify, parse, and process:

Java Word Driver FAQ

What Microsoft Word formats are supported?
What XML format can I convert my Word documents into?
What formats (PDF, HTML) can I publish a Word document into?
Can I do a two-way conversion back into Word on a server?
© 2002-2006 CambridgeDocs. All Rights Reserved.