Named Entity Recognition

Knowledge Discovery Named Entity Recognition allows you to extract entities from text. You can use Named Entity Recognition as part of your ingest process to enrich the data before you index it into Knowledge Discovery, or you can use it separately.

Named Entity Recognition is available in several different formats and packages, depending on how you want to use it in your Knowledge Discovery architecture:

  • Named Entity Recognition in ingest (CFS task or NiFi processor). This component is part of the Knowledge Discovery Ingest process, and allows you to extract entities from documents, and add the information to document fields before you index. This process can make it easier to search for particular information after you index.

  • Named Entity Recognition SDK. The SDK is available in C, Java, and .NET implementations, which allow you to program your own applications that use Named Entity Recognition for entity extraction.

  • Named Entity Recognition Server. An ACI server, which you can use to perform Named Entity Recognition with standard ACI requests.

The particular setup you use depends on your usage and your wider Knowledge Discovery architecture. For more information about which package to use, refer to the Named Entity Recognition User and Programming Guide.

Named Entity Recognition Server and the Named Entity Recognition SDK are each available as a ZIP package, which you can download and install. See Knowledge Discovery Installation and Setup. The Named Entity Recognition NiFi processor is a standard part of the NiFi Ingest package, and Named Entity Recognition in CFS is available as part of a standard CFS installation. for more information about installing these packages, see Knowledge Discovery Ingest.

In addition to the Named Entity Recognition package, you must install the Named Entity Recognition grammars that you want to use. The grammars define the entities that you want to find. These are all available as a single Named Entity Recognition grammars ZIP package. Your license determines which of the grammars you can use.

For more information about Named Entity Recognition, refer to the Named Entity Recognition User and Programming Guide. This guide includes information about the standard grammars. There is also a guide for the premium grammars, the Named Entity Recognition Grammars User Guide.