Text Clustering API

Text Clustering is MeaningCloud's solution for automatic document clustering, i.e., the task of grouping a set of texts in such a way that texts in the same group (called a cluster) are more similar to each other than to those in other clusters.

The algorithm receives a set of texts and returns the list of detected clusters. Each cluster is assigned a descriptive name, a relevance value (indicating the relative importance of the cluster with respect to all clusters), its size, and the list of elements that are included in the cluster. Each document may be assigned to one or several clusters.

Text clustering may be used for different tasks, such as grouping similar documents (news, tweets, etc.) and the analysis of customer/employee feedback, discovering meaningful implicit subjects across all documents.

The current supported languages are Spanish, English, French, Italian, Portuguese, Catalan, Danish, Swedish, Norwegian, Finnish, Chinese, Russian and Arabic. The API is configured for a general purpose clustering task and includes software developed by the Carrot2 Project.

Changelog

Version	Date		Status
1.1	12/February/2020
1.1.7 (12/February/2020) Bug associated to the presence of control characters in the text sent has been fixed. 1.1.6 (18/December/2019) Support has been added for Arabic. 1.1.5 (04/December/2019) Support has been added for Russian. 1.1.5 (20/November/2019) Support has been added for Chinese. 1.1.4 (04/September/2019) The engine has been updated to the latest Carrot2 version (3.16.1) Minor bugs have been fixed. 1.1.3 (26/December/2018) Stopwords for the following languages have been reviewed and improved: Portuguese, French, Italian, Danish, Norwegian, Finnish and Swedish. The engine has been updated to the latest Carrot version available. 1.1.2 (19/July/2018) Support added for nordic pack. 1.1.1 (13/November/2017) Support to Catalan has been added. 1.1.0 (26/April/2016) Error messages have changed to allow unified error management with the rest of the APIs.
1.0	07/July/2016
1.0.4 (07/July/2015) Version retired. 1.0.3 (22/December/2015) Error codes restructured to be more coherent with the rest of the APIs. 1.0.2 (06/October/2015) New clustering mode added (`mode=dg`). Review on the stopwords used in the process. 1.0.1 (09/September/2015) Initial public version. 1.0 (15/June/2015) Initial private version.

Click on the version number to see the changelog.

Languages

English
Spanish
French
Italian
Portuguese
Catalan
Danish
Swedish
Norwegian
Finnish
Chinese
Russian
Arabic

Integrations

Contact Us

Do you have any questions? Have you detected a bug? Contact us through our feedback section or at support@meaningcloud.com

Documentation

Test Console

Developer Tools

Changelog

Languages

Integrations

Related Links

Contact Us