File Content Extraction

File Content Extraction reads data from many different types of file.

File Content Extraction extracts content and metadata from documents, and subfiles from container files (for example ZIP files, e-mails with attachments).

File Content Extraction is embedded in Connectors, CFS, and the View component.

  • CFS uses File Content Extraction to extract data from the files that are retrieved by connectors. CFS uses the data extracted by File Content Extraction to build documents for indexing into the Content component.

  • Connectors use File Content Extraction to extract the contents of container files during the Collect and View actions.

  • The View component uses File Content Extraction to extract the contents of documents and convert it to HTML for display in a Web browser.