Global Companies Relying on Business Critical xDoc Based Applications... 

 

 
 

CambridgeDocs xDoc PDF to HTML Conversions

Questions? Download xDoc View Download Documentation
With xDoc, you can quickly and easily transform your binary PDF files into HTML suitable for website deployment. 

Using leading out-of-the-box J2SE capabilities, xDoc can read a binary PDF file and transform it into HTML content in a multi-threaded fashion on any Windows 2000 / XP, Solaris, or Linux server, with any application server like IBM WebSphere or BEA WebLogic, and without needing to have Adobe Acrobat installed. 

Furthermore, because xDoc uses an open XML-based approach to transform a PDF file, your own application now has tremendous access to the otherwise-inscrutable PDF content, which allows you to index it in a database like Oracle or Documentum, or to even update a PDF with live data before posting it to the web, such as meta-data describing the file's origin, its author, etc.

Java PDF to HTML Conversion Benefits:
Convert PDF documents on Windows, Linux and Solaris machines
Works in multi-threaded server environments
Operates with J2EE application servers like IBM WebSphere and BEA WebLogic
Does not require Adobe Acrobat to be installed on the server
Allows you to index PDF content in an Oracle or Documentum database for quick searches
All document content, pagination, graphical, and layout information is preserved between formats

xDoc transforms Adobe PDF data in two stages, as part of its integrated multi-step approach to handling content transformations.

First, the Java PDF Driver reads in the binary PDF file, and transforms it into a stylistic XML output that preserves the document's content, formatting, layout (including pagination and columns), vector graphics, and images. 

Next, the Java XSLT Driver reads in the stylistic XML, and using a pre-supplied XSLT stylesheet, transforms the XML into HTML content suitable for any internal or external website deployment.