 |
The Total Integration PDF Text Extractor Application is a command
line application that allow developers to incorporate the ability
to extract text/words from PDF files. The Total Integration PDFTextExtractor
Application provides data in structured format. The application
supports PDF versions 1.2, 1.3 and 1.4. The PDFTextExtractor Application
is written in portable C++ and is available on many platforms including
Mac OS, Mac OS X, Mac OS X Server, Windows 95/98/NT/2000, Linux,
Solaris, IRIX and AIX.
Here is a list of the features of the PDFTextExtractor Library:
| Document page number extraction. |
| Extraction of text on a per word basis. |
| Text/Word property (location) extraction. |

|
|
|