antispam:plugin:extracttext

Use external tools to extract text from images, PDFs, and other document types.

warden --task=antispam:plugin:extracttext
Option
Value Default Description
--extracttext_maxparts <string> 10 The maximum mime parts number to analyze, a value of 0 means all mime parts will be analyzed.
--extracttext_timeout <string> 5 10 The timeout in seconds of external tool checks, per attachment. The second number speficies maximum total time for all checks.
--extracttext_use <string1,string2> Specifies what tool to use for what message parts. The general syntax is <name> <specifiers>. <name> - the internal name of a tool. <specifiers> - File extension and regular expressions for file names and MIME types. The regular expressions are anchored to beginning and end.
--extracttext_use_add <string> Add a single entry to extracttext_use.
--extracttext_use_del <string> Remove a single entry from extracttext_use.
--extracttext_external <string1,string2> The external tool. The tool must read a document on standard input or from a file and write text to standard output. The special keyword "{}" will be substituted at runtime with the temporary filename to be scanned by the external tool. Environment variables can be defined with "{KEY=VALUE}", these strings will be removed from command line. It is required that command line used outputs result directly to STDOUT. The general syntax is <name> <command> <parameters>. <name> - The internal name of this tool. <command> - The full path to the external command to run. <parameters> - Parameters for the external command. The temporary file name containing the document will be automatically added as last parameter.
--extracttext_external_add <string> Add a single entry to extracttext_use.
--extracttext_external_del <string> Remove a single entry from extracttext_use.
--default <yes> Reset all settings to their default values.
--default_option <option> Reset a specific setting to its default value.
--reload <yes> Reload the service after saving settings.

Examples

// set the extracttext_timeout
warden --task=antispam:plugin:extracttext --extracttext_timeout='7 15' --reload=yes

// reset extracttext_timeout to its default value
warden --task=antispam:plugin:extracttext --default_option=extracttext_timeout --reload=yes

// reset all settings to their default values
warden --task=antispam:plugin:extracttext --default=yes --reload=yes

Related Pages