Configuring Tika Services

General information about how to configure the Tika Services can be found in the official Tika documentation

In case you want to exclude certain mime types from being processed by Tika, you can do the following:

Create the file /etc/tika/tika-config.xml with this content:

<?xml version="1.0" encoding="UTF-8"?>
<properties>
  <parsers>
    <parser class="org.apache.tika.parser.DefaultParser">
      <mime-exclude>application/zip</mime-exclude>
    </parser>
    <parser class="org.apache.tika.parser.EmptyParser">
      <mime>application/zip</mime>
    </parser>
  </parsers>
</properties>
Copied!

This tells Tika to exclude zip files from DefaultParser and use EmptyParser instead, who does basically nothing.

Apply tika-config.xml