Lemmatization, PoS and Parsing API

Lemmatization, PoS and Parsing is the name of MeaningCloud's API for the different basic linguistic modules.

Even though it is simple in name, the parser offers a wide range of functionality derived from the complete morphosyntactic and semantic analysis it carries out. Instead of exposing a separate API for each feature this analysis provides, features are configured through parameters, so users can enable as many of them as they wish and combine them with other MeaningCloud features, such as Topics Extraction or Sentiment Analysis.

Through this API you will be able to carry out some of the most common tasks in linguistic applications, all of them different aspects of the morphosyntactic and semantic analysis:

Syntactic analysis: obtains a thorough syntactic analysis, giving a complete syntactic tree where the leaves represent the most basic elements and their morphological and semantic analyses.
Lemmatization: obtains the lemmas of the different words in a text.
PoS tagging: obtains not only the grammatical category of a word, but also all the possible grammatical categories in which a word of each specific PoS type can be classified (check the associated tagset). Where applicable, the morphological analysis is linked to a semantic analysis.
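
As an illustration of how these three tasks relate, the following minimal sketch walks a syntactic tree and reads the lemma and PoS tag from each leaf. The field names (`token_list`, `analysis_list`, `form`, `lemma`, `tag`) and the tag values are assumptions made for this example, not a specification of the actual output, and the sample tree is hand-written rather than a real API response. Check the response documentation and the tagset for the real structure.

```python
from typing import Dict, Iterator, List, Tuple


def iter_leaves(node: Dict) -> Iterator[Dict]:
    """Yield the leaves of a tree, depth-first and left to right.

    Assumption: child nodes are stored under a "token_list" key, and a
    node without children is a leaf.
    """
    children = node.get("token_list", [])
    if not children:
        yield node
        return
    for child in children:
        yield from iter_leaves(child)


def lemmas_and_tags(tree: Dict) -> List[Tuple[str, str, str]]:
    """Return (form, lemma, tag) for every leaf, using its first analysis.

    Assumption: each leaf carries an "analysis_list" whose entries have
    "lemma" and "tag" fields. Missing values become empty strings.
    """
    result = []
    for leaf in iter_leaves(tree):
        analyses = leaf.get("analysis_list", [])
        first = analyses[0] if analyses else {}
        result.append(
            (leaf.get("form", ""), first.get("lemma", ""), first.get("tag", ""))
        )
    return result


# Hand-written sample tree; the tag values are placeholders, not real tagset codes.
sample_tree = {
    "form": "Cats sleep",
    "token_list": [
        {
            "form": "Cats",
            "token_list": [
                {
                    "form": "Cats",
                    "analysis_list": [{"lemma": "cat", "tag": "NOUN-PLURAL"}],
                }
            ],
        },
        {"form": "sleep", "analysis_list": [{"lemma": "sleep", "tag": "VERB"}]},
    ],
}

for form, lemma, tag in lemmas_and_tags(sample_tree):
    print(form, lemma, tag)
```

Only the first analysis of each leaf is used here for simplicity; an ambiguous word may carry several analyses, and an application could inspect all of them.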

This API can be configured so that the same topics that are extracted by the Topics Extraction API are included in the corresponding node of the syntactic tree, allowing the user to combine this extraction with syntactic information to detect patterns in a text. Similarly, it is also possible to include the information detected by Sentiment Analysis, making this a very powerful tool that allows you to combine different types of analysis.
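
The sketch below shows one way such a combined request could be assembled: optional topic and sentiment settings are added to the basic parameters only when requested. The parameter names `key`, `txt`, `lang`, `tt` and `sm` are assumptions made for illustration (only `of` is mentioned on this page), and the function `build_parser_request` is hypothetical. The code only builds a form-encoded request body and sends nothing; consult the request documentation for the actual parameters and endpoint.

```python
from urllib.parse import urlencode


def build_parser_request(
    key: str,
    text: str,
    lang: str,
    topic_types: str = "",
    sentiment_model: str = "",
) -> str:
    """Build a form-encoded request body for a parsing call.

    All parameter names except "of" are assumptions for this sketch.
    """
    params = {"key": key, "txt": text, "lang": lang, "of": "json"}
    if topic_types:
        # Assumed parameter: which topic types to attach to the tree nodes.
        params["tt"] = topic_types
    if sentiment_model:
        # Assumed parameter: which sentiment model to apply.
        params["sm"] = sentiment_model
    return urlencode(params)


# Basic morphosyntactic analysis only.
print(build_parser_request("YOUR_KEY", "Cats sleep", "en"))

# Same analysis with topics and sentiment information requested as well.
print(
    build_parser_request(
        "YOUR_KEY", "Cats sleep", "en", topic_types="a", sentiment_model="general"
    )
)
```

Keeping the optional settings out of the request unless they are explicitly set mirrors the idea described above: one analysis, with extra layers enabled through parameters rather than through separate APIs.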

The currently supported languages are Spanish, English, French, Italian, Portuguese and Catalan.

Changelog

Version	Date		Status
2.0	22/September/2020
2.0.26 (22/September/2020) Resources have been updated.
2.0.25 (10/September/2020) Minor bugs have been fixed and resources have been updated.
2.0.24 (05/May/2020) Minor bugs have been fixed and resources have been updated.
2.0.23 (14/January/2020) The ontology value `CASHTAG` (Top>ID>Cashtag) has been renamed to the more generic `TICKER`, and ticker detection has been improved.
2.0.22 (28/October/2019) Negation and how it affects the different tokens has been improved. Minor bugs have been fixed and resources have been updated.
2.0.22 (09/September/2019) Minor bugs have been fixed and resources have been updated.
2.0.21 (03/April/2019) Resources have been updated.
2.0.21 (13/March/2019) Minor bugs have been fixed, and resources have been updated.
2.0.20 (12/December/2018) Bug fix for recursive search of money expressions. Improvements added to the detection of entity variants. Minor bugs have been fixed, and resources have been updated.
2.0.19 (06/November/2018) Improvements added to money expressions detection and to overall performance when processing HTML content. Minor bugs have been fixed, and resources have been updated.
2.0.18 (22/August/2018) Improvements have been added to money expressions detection as well as to company detection heuristics. Minor bugs have been fixed, and resources have been updated.
2.0.17 (28/June/2018) Minor bugs have been fixed, and resources have been updated.
2.0.16 (31/May/2018) Time and quantity expressions have been refactored to follow more coherent criteria. Address detection has been improved, including zip code detection. Minor bugs have been fixed for heuristic detection. Resources have been updated.
2.0.15 (12/April/2018) Resources have been updated.
2.0.14 (03/April/2018) Minor bugs related to the heuristic detection of entities have been fixed.
2.0.13 (18/January/2018) Minor bugs have been fixed and resources have been updated. Relevance calculations have been improved for concept and entity detection.
2.0.12 (13/November/2017) Minor bugs have been fixed and resources have been updated. Service management has been refactored.
2.0.11 (19/September/2017) Minor bugs have been fixed and resources have been updated. Under-the-hood improvements have been added for user dictionaries. Phrasal verb detection has been improved in English. Time and quantity expressions detection has been improved.
2.0.10 (26/June/2017) Several minor bugs have been fixed and resources have been updated.
2.0.9 (27/March/2017) Several minor bugs have been fixed and resources have been updated. Heuristic entity detection has been improved. A new general sentiment model has been added for Portuguese.
2.0.8 (27/October/2016) Several minor bugs have been fixed and resources have been updated. A bug in specific texts with parentheses has been fixed. Improvements added to the syntactic analysis and topics detection. Two new general sentiment models have been added for Italian and Catalan.
2.0.7 (27/July/2016) Several minor bugs have been fixed and resources have been updated.
2.0.6 (13/June/2016) Several minor bugs have been fixed and resources have been updated.
2.0.5 (26/April/2016) Several minor bugs have been fixed and resources have been updated.
2.0.4 (07/April/2016) The output format `of`=img has been improved to show sentiment information. User-defined sentiment models are now supported. Several minor bugs have been fixed and resources have been updated.
2.0.3 (02/March/2016) Several minor bugs have been fixed and resources have been updated.
2.0.2 (02/February/2016) Several minor bugs have been fixed and resources have been updated.
2.0.1 (22/December/2015) Several minor bugs have been fixed and resources have been updated.
2.0.0 (01/December/2015) Sentiment Analysis has been integrated in the morphosyntactic analysis. New changes in the `topic_list`: check out the latest Topics Extraction release. Traceability with user dictionaries has been improved. The disambiguation parameters have been restructured to make clearer what they actually do. The possibility of specifying an interface language has been added, making it easier to work with multilingual sources. Some fields in the output have changed names or been grouped to improve usability. The `mode` parameter has been retired, as all the different modes just gave information already present in the basic morphosyntactic analysis.
1.2	02/March/2016
1.2.14 (02/March/2016) Version retired.
1.2.13 (22/December/2015) Resources have been updated.
1.2.12 (01/December/2015) Resources have been updated.
1.2.11 (06/October/2015) Several minor bugs have been fixed and resources have been updated.
1.2.10 (09/September/2015) Several minor bugs have been fixed and resources have been updated. Significant improvements have been added to the processing of URLs and HTML texts.
1.2.9 (28/July/2015) Resources have been updated.
1.2.8 (14/July/2015) Several minor bugs have been fixed and resources have been updated.
1.2.7 (02/June/2015) Several minor bugs have been fixed and resources have been updated. Python client has been improved. The relaxed typography parameter, `rt`, now has a new value related to `ud`.
1.2.6 (18/May/2015) Several minor bugs have been fixed. CASHTAG has been added as a new node of the ontology. Resources have been updated (including cashtag elements). A memory leak issue related to user dictionaries has been solved. Smart prefix detection has been improved. For English, disambiguation between common and proper nouns and the use of stopwords depending on the typography have been improved. The field `normalized_form` now includes a prefix to indicate the normalized type it will contain.
1.2.5 (15/April/2015) Several minor bugs and minor concurrency problems have been fixed. Resources have been updated. Normalized form now contains the action verb for periphrasis. Filtering children for phrases has been added. Suggestions for unknown words using foreign words have been added. Suggestions for unknown words have been improved, especially for short words, based on typing mistakes and letter repetition. Smart typography detection added.
1.2.4 (24/June/2014) Several minor bugs have been fixed, and resources have been updated.
1.2.3 (20/May/2014) Several minor bugs have been fixed, and resources have been updated.
1.2.2 (17/March/2014) Several bugs have been fixed in entity detection and resources have been updated. Response time has been improved in the documentation pages.
1.2.1 (04/February/2014) Several minor bugs have been fixed, and resources have been updated. Heuristic rules for entity detection have been improved, increasing the quantity and the classification quality of the unknown entities detected.
1.2 (23/September/2013) Attribute naming for semantic information has been standardized so every element that can be an array has '_list' in its name. This allows flexibility when it comes to defining new attributes and ensures the output structure is always the same regardless of how many values a specific case has. The response headers have been updated so the content type is correct for all output formats supported. Resources have been upgraded. Bugs reported through our feedback section have been fixed. Error messages in all APIs have been unified. Related Facebook and Twitter links have been added to the semantic linked data information (`semld`) of known entities and concepts. The documentation has been improved, both in format and contents.

Languages

English
Spanish
French
Italian
Portuguese
Catalan

Contact Us

Do you have any questions? Have you detected a bug? Contact us through our feedback section or at support@meaningcloud.com
