DEPRECATION WARNING

This documentation is not using the current rendering mechanism and is probably outdated. The extension maintainer should switch to the new system. Details on how to use the rendering mechanism can be found here.

textExtract: pdf, doc, xls

Author:Kasper Skårhøj
Created:2002-11-01T00:32:00
Changed:2005-07-20T13:49:52
Author:René Fritz
Email:r.fritz@colorcube.de
Info 3:
Info 4:

textExtract: pdf, doc, xls

Extension Key: cc_txtextexec

Copyright 2003-2005, René Fritz, <r.fritz@colorcube.de>

This document is published under the Open Content License

available from http://www.opencontent.org/opl.shtml

The content of this document is related to TYPO3

- a GNU/GPL CMS/Framework available from www.typo3.com

Table of Contents

textExtract: pdf, doc, dot 1

Introduction 1

Changelog 1

Introduction

This extension provides a three services of the type 'textExtract' for text extraction from the file types: pdf, doc, xls.

Depends on external programs: pdftotext, catdoc and xls2csv which have to be installed separately on your server.

Services of the type textExtract are used by the DAM Indexing module.

Changelog

Current Version

support for $conf['wantedCharset']in process function

img-1 textExtract: pdf, doc, xls - 1