Babel Street Analytics API

Getting Started

Welcome to Babel Street Analytics 

Babel Street Analytics uses natural language processing, statistical modeling, and machine learning to analyze unstructured and semi-structured text across hundreds of language-script combinations, revealing valuable information and actionable data. Analytics provides endpoints for extracting entities and relationships, translating names and comparing their similarity, categorizing text, adding linguistic tags to text, and more.

Each Analytics endpoint processes either names or documents. The structure and type of input data depend on the type of endpoint. The name endpoints match names, addresses, and records; there is also an endpoint to deduplicate names and one to translate names. The text analytics endpoints process unstructured text documents, identifying languages and topics and extracting critical business information.

Server is the on-premises installation of Analytics, providing access to Analytics functions as RESTful web service endpoints. Running on-premises addresses cloud security concerns and allows you to customize models and indexes as needed for your business.

Cloud Limits

The maximum payload size is 600KB, with a maximum character count of 50,000.
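A client can check a document against these limits before sending it. The following is a minimal sketch, not part of the Analytics API: the function name `check_cloud_limits`, the `content` field in the request body, and the reading of "600KB" as 600,000 bytes are all assumptions made for illustration.

```python
import json

# Limits quoted in this section. Reading "600KB" as 600,000 bytes is an
# assumption (the conservative interpretation); adjust if it is 1024-based.
MAX_PAYLOAD_BYTES = 600 * 1000
MAX_CHARACTERS = 50_000


def check_cloud_limits(content: str) -> list[str]:
    """Return a description of each Cloud limit the document would exceed."""
    problems = []

    # Hypothetical request body: the "content" field name is an assumption.
    body = json.dumps({"content": content}, ensure_ascii=False).encode("utf-8")
    if len(body) > MAX_PAYLOAD_BYTES:
        problems.append(f"payload is {len(body)} bytes (limit {MAX_PAYLOAD_BYTES})")

    # len() counts Unicode code points; the service may count differently.
    if len(content) > MAX_CHARACTERS:
        problems.append(
            f"content is {len(content)} characters (limit {MAX_CHARACTERS})"
        )

    return problems
```

An empty list means the document fits both limits under these assumptions. Otherwise, the document can be split or trimmed before the call is made.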

By default, Analytics will only process one call (active HTTP connection) at a time. You can send a second call once you have received a response from the first. Interested in making multiple concurrent calls? Contact us.
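In a multi-threaded client, one way to respect the one-call-at-a-time default is to route every request through a shared lock. The sketch below is illustrative only: `SerialCaller` is a hypothetical helper, and `send` stands in for whatever function your code uses to make the HTTP request.

```python
import threading
from typing import Callable, TypeVar

T = TypeVar("T")


class SerialCaller:
    """Hypothetical helper that lets only one call be in flight at a time."""

    def __init__(self) -> None:
        self._lock = threading.Lock()

    def call(self, send: Callable[[], T]) -> T:
        # The lock is held until send() returns, so a second call cannot
        # start before the response to the first has been received.
        with self._lock:
            return send()
```

Threads that share a single `SerialCaller` instance queue up behind the lock instead of opening concurrent connections.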

The maximum name length for any of the name processing endpoints (address-similarity, name-similarity, name-translation, name-deduplication) is 500 characters.
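When submitting many names, it can help to flag the ones that exceed this limit up front. This is a small sketch with a hypothetical helper, `names_over_limit`; it assumes characters are counted as Unicode code points, which may differ from how the service counts them.

```python
MAX_NAME_CHARACTERS = 500  # limit quoted above for the name processing endpoints


def names_over_limit(names: list[str]) -> list[int]:
    """Return the positions of names longer than the 500-character limit."""
    # len() counts Unicode code points (an assumption about how the
    # service measures name length).
    return [i for i, name in enumerate(names) if len(name) > MAX_NAME_CHARACTERS]
```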

Inter-Version Compatibility

In an effort to improve the accuracy and performance of our software, we often make changes to our machine-learned models. Occasionally, these adjustments may cause unexpected changes to the results our system outputs (for example, the sentiment, entities, and confidence reported by the system on a given document may change). Furthermore, the statistical nature of our software means that the results produced by different versions of Analytics will sometimes be incompatible or non-comparable.