Skip to main content

Babel Street Analytics API

Entity types

The entityType identifies the type of name being matched and to select the algorithms to use for matching. Where supported, stop words and override files are specific to an entity type. Parameters can be set for specific languages and entity types. Each of these entity types can be used in conjunction with the rni_name field type.

Important

The entityType should always be specified to utilize all available methods when indexing and matching names. The rni_name field type can be used in conjunction with all entity types, including PERSON, LOCATION, ORGANIZATION, and IDENTIFIER. If you don't specify an entityType, the type PERSON will be used.

Table 1. Entity types

Entity type

Description

Features

PERSON

A human identified by name, nickname, or alias.

Values are tokenized and token pairs are compared.

Stop words, overrides, frequency and gender models are supported.

LOCATION

A city, state, country, region or other location.

Values are tokenized and token pairs are compared.

Stop words, overrides, and frequency models are supported.

ORGANIZATION

A corporation, institution, government agency, or other group of people defined by an established organizational structure.

Values are tokenized and token pairs are compared.

Stop words, overrides, frequency models, and embeddings are supported.

Real World IDs are supported.

IDENTIFIER

IDENTIFIER:DRIVERS_LICENSE

IDENTIFIER:LICENSE_PLATE

IDENTIFIER:NATIONAL_ID_NUM

An alphanumeric identifier.

Values are not tokenized. The entire identifier is treated as a string. Scoring is primarily by string edit distance.