New Release includes Visual Rules by
Example, Microsoft Excel to XML, and XML to RTF
functionality
BOSTON –
September 29, 2003 -
CambridgeDocs, a leader in the emerging market
for XML-based content integration, today
announced the xDoc Converter 1.7, an update to
its flagship xDoc Converter product. The xDoc
Converter is a popular tool for migrating
unstructured content from legacy sources,
including Microsoft Word, HTML, and Adobe PDF
documents into any XML schema (XSD) or DTD for
improved searching and indexing across the
enterprise.
The new release includes the Microsoft Excel
Driver, which can be used to transform Microsoft
Excel 97, 2000, and XP spreadsheets into XML.
Once the data is in XML, customers are free to
use the XML for storage, calculation, or further
transformation using XSLTs. xDoc’s modular
architecture enables , the Microsoft Excel
driver to produce ppXML, which can be then be
further transformed into HTML, XSL:FO, PDF, or
Microsoft RTF.
Visual Rules By Example, which allows an end
user to “highlight” a phrase or some text in a
visually rendered version of the original
document is also found in the newest version.
This highlighting can be used to generate
“rules” for identifying and extracting content
out of the document. Titles, for example can be
highlighted and a set of rules for identifying
titles in similar source documents will be
generated automatically. Multiple examples, from
multiple source documents can be used to
generate the rules.
The ability to convert XML and XSL:FO into
Microsoft’s Rich Text Format is another new
feature of this latest release. This capability
makes it possible, for example, to convert a PDF
file into XML , and then convert the XML into
RTF for editing and opening within Microsoft
Word.
“This is an important release of the xDoc
Converter because it makes it very easy for
anyone to use the xDoc Engine to transform
content into XML,“ said Rizwan Virk,
CambridgeDocs CTO. “Also, our expanding list of
drivers and publishing options goes a long way
to making the xDoc platform ideal for conversion
of documents into and out of XML.”
The xDoc Converter is the first dedicated
platform and IDE specifically designed to help
businesses transform pre-existing unstructured
HTML and Microsoft Word documents into
“meaningful” XML, such as DocBook XML, LegalXML,
NewsML, HR-XML, SCORM, and other
customer-specific schemas. The resulting XML can
be used for content management, multi-channel
publishing, and syndication via Web Services.
The xDoc Converter software application is
available for immediate download.
About CambridgeDocs
CambridgeDocs is a leader in the emerging market for XML-based content
integration. This market deals with the
integration of legacy content with new XML-based
systems (e.g. Content Management, Enterprise
Information Portals, EAI, and Web Services) and
standards (e.g. DocBook, HRXML, RIXML, IRXML,
FPML, DAS-XML, NewsML, any custom XML
schema/DTD’s, etc.).
Towards this end, CambridgeDocs provides a
technology platform & services for taking
existing unstructured and semi-structured
internal and external content (e.g. MS Word,
HTML, PDF, Quark, etc.), and transforming it
into "meaningful XML". Once transformed, the
content can be made available for delivery
through XML-based Web Services, classified and
indexed within Enterprise Information Portals,
and aggregated, assembled and published in
multiple different formats including support for
wireless and mobile devices.
# # #
Contact Info:
|
|
|
|