Configuring Tika Services
General information about configuring the Tika services can be found in the official Tika documentation.
Tip
The tika-
can be applied to all variants of the Tika services.
To exclude certain MIME types from being processed by Tika, do the following:
Create the file /etc/
with this content:
<?xml version="1.0" encoding="UTF-8"?>
<properties>
  <parsers>
    <parser class="org.apache.tika.parser.DefaultParser">
      <mime-exclude>application/zip</mime-exclude>
    </parser>
    <parser class="org.apache.tika.parser.EmptyParser">
      <mime>application/zip</mime>
    </parser>
  </parsers>
</properties>
This tells Tika to exclude ZIP files from DefaultParser and to handle them with EmptyParser instead, which does essentially nothing with the file.
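The same pattern can probably be extended to more than one MIME type. The following is a minimal sketch, not a tested configuration: it assumes that your Tika version accepts repeated mime-exclude and mime elements, and application/gzip is chosen only as an illustrative second type.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<properties>
  <parsers>
    <!-- DefaultParser handles everything except the excluded types -->
    <parser class="org.apache.tika.parser.DefaultParser">
      <mime-exclude>application/zip</mime-exclude>
      <!-- application/gzip is an illustrative assumption -->
      <mime-exclude>application/gzip</mime-exclude>
    </parser>
    <!-- EmptyParser takes over the excluded types -->
    <parser class="org.apache.tika.parser.EmptyParser">
      <mime>application/zip</mime>
      <mime>application/gzip</mime>
    </parser>
  </parsers>
</properties>
```

Each type excluded from DefaultParser is also listed under EmptyParser, mirroring the single-type configuration, so that the excluded types are still assigned to a parser.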
Apply tika-config.xml
Tip
The "Using a Tika Configuration XML file" section of the Tika docs explains how to apply the tika-config.xml file; however, pam_env can make things simpler.
Adding the following line to /etc/
makes the TIKA_CONFIG environment variable available globally on the host:
TIKA_CONFIG DEFAULT="/etc/tika/tika-config.xml"