RTF Importer Module
This module will get you the best of both worlds: excellent out-of-the-box structural markup in XML and separated style information in form of CSS properties. Proven conversion technology with built-in structure recognition heuristics preserves the document structure to an extent you probably won't be able to find anywhere else – instantly.
Conversion features
- fully recreates the document structure with automatic section nesting (customizable) and support for Word sections
- supports paragraph and character styles
- powerful table translation (HTML 4 or Oasis Exchange Table model - CALS), incl. nested tables, row/column spans, cell properties, borders and backgrounds
- processes footnotes, hyperlinks, references, forms, index entries, annotations, page headers and footers
- supports any combination of nested lists, tables and combination of layout elements possible in RTF documents
- support for document properties (incl. user properties), document template reference and document variables
- Unicode and many two-byte encodings supported
- includes a WMF renderer and image rewriting capabilities
- new inline nesting optimization with intelligent, customizable property hoisting to surrounding container element
- translation of most style and layout information into CSS2
- highlighting and support for non properly nested target regions
- pass-through import filter
- extracting embedded object binary data
- handles large files (only subject to available memory)
- improved support for textbox, image, TOC and fields
Import Formats
- RTF Version 1.6 (Details )
- Microsoft Word binary format (*.doc) (only when executing on Windows 95/98/2000/NT with Word 97 or later installed)
- most Word 95, Word 97, Word 2000, Word XP and Word 2003 RTF documents
- RTF-embedded WMF (Windows Meta File) and EMF
- RTF-embedded images (PICT, PNG, GIF, JPEG)