New tool unlocks value of unstructured
content by introducing XML Structure
BALTIMORE, MD –
December 10, 2002 -
CambridgeDocs, a leader in the emerging market
for XML-based content integration, is
demonstrating its soon to be released xDoc
Converter, a tool for migrating unstructured
content from legacy sources, including Microsoft
Word, HTML, and Adobe PDF documents into any XML
schema (XSD) or DTD for improved searching and
indexing across the enterprise at the XML
Conference.
The xDoc Converter can work in any industry and
can migrate to any DTD or XML Schema, such as
DocBook, LegalXML, HR-XML, NewsML, SCORM, XHTML.
Subsequent products will address other issues of
content interoperability in the enterprise.
Companies are investing millions of dollars to
buy and implement content management systems,
document management systems, and enterprise
portals. However, many of the benefits
associated with these new systems for managing
enterprise content are lost without the ability
to bring old documents into these systems in a
meaningful way. XML provides a
presentation-independent way to represent
content, and is fast becoming the standard way
to create new content.
Unlike other tools, that can only generate
stylistic XML, which looks like HTML, the xDoc
Converter can actually extract meaning out of
the document and assign it to appropriate tags.
For this reason, the xDoc Converter can generate
semantic or "meaningful" XML, which can be very
complex schemas defined within each industry.
To date, there has been a big gap between how
documents exist today - as unstructured
Microsoft Word documents, HTML files, text
files, PDF files - and how they will exist in
the future as part of a enterprise content
management and publishing strategy - which is
predicated on them being structured XML
documents,” said Rizwan Virk, Chairman and CEO
of Cambridge Docs. “This ‘content gap’ was
because all of the existing documents needed to
be re-typed or re-formatted in order to make
them available or people had to write conversion
code. Our goal is to narrow that gap with the
xDoc Converter by making virtually seamless
content integration possible regardless of the
file and its original format.”
Prior to xDoc, companies had to write lots of
custom parsing and formatting code, or had to
manually re-type documents. With the xDoc
Converter, companies can quickly and easily
migrate large amounts of legacy content into
meaningful XML.
Many organizations are moving to manage their
content - documents, memos, reports, intranet
pages, brochures, and other documents- as XML
because of its inherent ability to support the
management and publishing of content.
The benefits of
having documents in XML include:
- Separation
of content from presentation
- Write once,
publish anywhere - from XML to HTML, WML,
PDF, etc.
- Save time
and money from reduced authoring and
publishing costs
- Ability to
assemble new documents from existing pieces
more effectively
About CambridgeDocs
CambridgeDocs is a leader in the emerging market
for XML-based content integration. This market
deals with the integration of legacy content
with new XML-based systems (e.g. Content
Management, Enterprise Information Portals, EAI,
and Web Services) and standards (e.g. DocBook,
HRXML, RIXML, IRXML, FPML, DAS-XML, NewsML, any
custom XML schema/DTD’s, etc.).
Towards this end, CambridgeDocs provides a
technology platform & services for taking
existing unstructured and semi-structured
internal and external content (e.g. MS Word,
HTML, PDF, Quark, etc.), and transforming it
into "meaningful XML". Once transformed, the
content can be made available for delivery
through XML-based Web Services, classified and
indexed within Enterprise Information Portals,
and aggregated, assembled and published in
multiple different formats including support for
wireless and mobile devices.
# # #
Contact Info:
|