Text Clustering is MeaningCloud's solution for automatic document clustering, i.e., the task of grouping a set of texts in such a way that texts in the same group (called a cluster) are more similar to each other than to those in other clusters.

The algorithm receives a set of texts and returns the list of detected clusters. Each cluster is assigned a descriptive name, a relevance value (indicating the relative importance of the cluster with respect to all clusters), its size, and the list of elements that are included in the cluster. Each document may be assigned to one or several clusters.

Text clustering may be used for different tasks, such as grouping similar documents (news, tweets, etc.) and the analysis of customer/employee feedback, discovering meaningful implicit subjects across all documents.

The current supported languages are Spanish, English, French, Italian, Portuguese, Catalan, Danish, Swedish, Norwegian, Finnish, Chinese, Russian and Arabic. The API is configured for a general purpose clustering task and includes software developed by the Carrot2 Project.

Documentation

Everything and anything you need to take advantage of this API full capabilities.

Test Console

Choose an input and a configuration, and immediately check the results!

Developer Tools

Do you want to integrate this API into your environment? Check our Developer Tools!

Changelog

Version Date Status
1.1 12/February/2020

1.1.7 (12/February/2020)

  • Bug associated to the presence of control characters in the text sent has been fixed.

1.1.6 (18/December/2019)

  • Support has been added for Arabic.

1.1.5 (04/December/2019)

  • Support has been added for Russian.

1.1.5 (20/November/2019)

  • Support has been added for Chinese.

1.1.4 (04/September/2019)

  • The engine has been updated to the latest Carrot2 version (3.16.1)
  • Minor bugs have been fixed.

1.1.3 (26/December/2018)

  • Stopwords for the following languages have been reviewed and improved: Portuguese, French, Italian, Danish, Norwegian, Finnish and Swedish.
  • The engine has been updated to the latest Carrot version available.

1.1.2 (19/July/2018)

1.1.1 (13/November/2017)

  • Support to Catalan has been added.

1.1.0 (26/April/2016)

  • Error messages have changed to allow unified error management with the rest of the APIs.
1.0 07/July/2016

1.0.4 (07/July/2015)

  • Version retired.

1.0.3 (22/December/2015)

  • Error codes restructured to be more coherent with the rest of the APIs.

1.0.2 (06/October/2015)

  • New clustering mode added (mode=dg).
  • Review on the stopwords used in the process.

1.0.1 (09/September/2015)

  • Initial public version.

1.0 (15/June/2015)

  • Initial private version.

Click on the version number to see the changelog.

Languages

  • English
  • Spanish
  • French
  • Italian
  • Portuguese
  • Catalan
  • Danish
  • Swedish
  • Norwegian
  • Finnish
  • Chinese
  • Russian
  • Arabic

Integrations

Related Links

Contact Us

Do you have any questions? Have you detected a bug? Contact us through our feedback section or at support@meaningcloud.com