Partners Category 
Software Product Company
Cambridge, MA

Basis Technology was founded in 1995, Their products and services have been used by over two hundred major firms, including, EMC, Endeca/Oracle, Exalead/Dassault, Fujitsu, Google, Hewlett-Packard, Microsoft, Oracle, and governments around the world. Their  language analysis and digital forensics software are widely used in the U.S. defense and intelligence industry by firms such as BBN, CACI, Lockheed Martin, MITRE, Northrop Grumman, and SAIC.

Basis Technology also the top provider of Asian linguistic technology to web search engines, including, Google, Microsoft Bing, and Yahoo!.

  • Our Rosette linguistics platform uses state of the art natural language processing techniques to improve information retrieval, text mining, machine learning, statistics, and computational linguistics.
  • Rosette provides capabilities like identifying the language of incoming text, providing a normalized representation in Unicode, and locating names, places and other key concepts from a body of unstructured text.
  • The digital forensics group are the primary contributors to several open source projects, including Autopsy and The Sleuth Kit, for which they provide commercial add-ons, training, and support.

Product Overview

Text Analytics

Highlight was created by linguistics and text analytics experts at Basis Technology in order to simplify IC-compliant workflow and report generation, greatly reducing the number of name inconsistencies from translator and intelligence analyst reports.

Rosette®, provided by Basis Technology, is a suite of linguistic analysis components that integrate into applications to quickly add multilingual capabilities for mining unstructured data.

Because of the intrinsic nature of unstructured text, standard enterprise data solutions have a very limited ability to understand and utilize this treasure trove of information. 80% of big data is represented by “unstructured” text chunks found in documents, web pages, and databases with all the hallmarks of big data: the three Vs (Volume, Velocity, and Variety).

By combining linguistic analysis, statistical modeling, and machine learning, Text Analytics, enables you to get the most out of your data.