UserDictionary

A comma-separated list of dictionaries to use in addition to the standard dictionaries.

You can create your own dictionaries to improve OCR performance when the media that you are analyzing contains proper names or technical terms.

Each dictionary file must meet the following requirements:

  • The dictionary must be a text file, in ASCII or UTF-8 encoding.
  • Words must be separated by whitespace.
  • The first two letters of the file name specify the corresponding language code (for example FrenchTownNames.txt would be used with documents written in French). For a list of supported languages with language codes, see OCR Supported Languages.

After creating your dictionary file(s), specify a comma-separated list of one or more of the following:

  • An absolute path to a dictionary file.
  • A relative path to a dictionary file. Relative paths are relative to the UserDataDirectory.
  • A string that matches one of your AdditionalDataLabels, if you use the AdditionalData and AdditionalDataLabels action parameters to upload the dictionary as part of the Process action.

To strengthen security, Media Server can be configured to read files only from specified directories. For information about the directories that Media Server is permitted to read from, see the parameter AllowedInputDirectories.

Type: String (comma-separated list)
Default:  
Required: No
Configuration Section: TaskName
Example: UserDictionary=english_medical_terms.txt
See Also:

KeepOnly

UserDataDirectory