DEPRECATION WARNING
This documentation is not using the current rendering mechanism and is probably outdated. The extension maintainer should switch to the new system. Details on how to use the rendering mechanism can be found here.
textExtract: pdf, doc, xls¶
Author: | Kasper Skårhøj |
---|---|
Created: | 2002-11-01T00:32:00 |
Changed: | 2005-07-20T13:49:52 |
Author: | René Fritz |
Email: | r.fritz@colorcube.de |
Info 3: | |
Info 4: |
textExtract: pdf, doc, xls¶
Extension Key: cc_txtextexec
Copyright 2003-2005, René Fritz, <r.fritz@colorcube.de>
This document is published under the Open Content License
available from http://www.opencontent.org/opl.shtml
The content of this document is related to TYPO3
- a GNU/GPL CMS/Framework available from www.typo3.com
Introduction¶
This extension provides a three services of the type 'textExtract' for text extraction from the file types: pdf, doc, xls.
Depends on external programs: pdftotext, catdoc and xls2csv which have to be installed separately on your server.
Services of the type textExtract are used by the DAM Indexing module.
Changelog¶
Current Version
support for $conf['wantedCharset']in process function
textExtract: pdf, doc, xls - 1