This section describes how to extract and manipulate data from documents and data-streams.
▪ | How to select, enable/disable and delete existing fields and tags; using the Field Definitions dialog to change the location and content of data fields and tags: see Editing fields and tags |
▪ | Specifying when and how a field may be used, forcing 'front' and 'back' pages, overlays, trays and resetting the sheet count; new files, log-files, fields-files and sets: see Setting field actions |
▪ | Defining a hierarchy of fields and tags so that some are dependent on the processing of others; combining fields: see Fields list/tree |
▪ | How to make field values persistent, select an EE file for reuse and set up a banner page; the specialist PJL field prefix feature: see Setting fields file options |
▪ | Troubleshooting common data extraction problems – tags not found, inappropriate symbol sets, unexpected box characters: see Field problems |
▪ | Using command-line options to set up Author, Title, Subject and Keywords for a PDF document: see PDF document summary |
▪ | How to include information on the presentation of the data output to XML, e.g. fonts, images and style-sheets: see Outputting to XML |
▪ | About MoveText, AddImage, AddFile, Blankout, BarCodes, BC39Reader, QRCodeReader, DMatrix, Tesseract, Rocr, OCRust, Evaluate and Script plugins: see Other plugins |
|
Links
Export data
Composite fields