Tag Archives: lemmatization pos and parsing

Posts related to the Lemmatization, PoS and parsing functionality

Applying text analytics to financial compliance

In a previous post we discussed Financial Compliance, FinTech and their relation to Text Analytics. We also showed why mining text for people suspected of financial crimes requires normalized facts, and we proposed the SVO form (subject, verb, object) to represent them.

Financial crime

We defined a clause as the string within a sentence that is capable of conveying an autonomous fact. Finally, we explained how to integrate with the Lemmatization, PoS and Parsing API to get a JSON-formatted tree of the input text, fully enriched with syntactic and semantic information, from which we will extract SVO clauses.

In this post, we continue with the extraction process and look in detail at how to extract those clauses from the response returned by the Parsing API.
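To make the idea concrete, here is a minimal sketch of pulling SVO clauses out of a parsed tree. It does not use the real response format of the Parsing API: the `type`, `role`, `text` and `children` keys, the `SVO` class and both helper functions are hypothetical names chosen for illustration only.

```python
from dataclasses import dataclass
from typing import Iterator, Optional


@dataclass
class SVO:
    """One normalized fact: subject, verb, object (any part may be missing)."""
    subject: Optional[str]
    verb: Optional[str]
    obj: Optional[str]


def find_role(node: dict, role: str) -> Optional[str]:
    """Return the text of the first direct child tagged with the given role."""
    for child in node.get("children", []):
        if child.get("role") == role:
            return child.get("text")
    return None


def extract_svo(node: dict) -> Iterator[SVO]:
    """Walk the tree depth-first and yield one SVO per clause node."""
    if node.get("type") == "clause":
        yield SVO(
            find_role(node, "subject"),
            find_role(node, "verb"),
            find_role(node, "object"),
        )
    for child in node.get("children", []):
        yield from extract_svo(child)


# Hypothetical, hand-written tree; a real API response will look different.
tree = {
    "type": "sentence",
    "children": [
        {
            "type": "clause",
            "children": [
                {"role": "subject", "text": "The manager"},
                {"role": "verb", "text": "transferred"},
                {"role": "object", "text": "the funds"},
            ],
        },
        {
            "type": "clause",
            "children": [
                {"role": "subject", "text": "nobody"},
                {"role": "verb", "text": "noticed"},
            ],
        },
    ],
}

clauses = list(extract_svo(tree))
for clause in clauses:
    print(clause)
```

The second clause has no object, so its `obj` field stays `None`. Keeping incomplete clauses rather than dropping them is a design choice of this sketch; a compliance pipeline might filter them out instead.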

Lemmatization, PoS, Parsing 2.0: Migration guide

We have released a new version of our core linguistic analyzer: Lemmatization, PoS and Parsing. In Lemmatization, PoS and Parsing 2.0:

  • More analysis possibilities have been included to allow you to combine a complete morphosyntactic analysis with other types of analysis such as Sentiment Analysis and Topics Extraction.
  • Configuration options have been changed to provide more flexibility in the analyses and to make the available options easier to understand.
  • We’ve refactored our code to:
    • Improve the quality of the concepts/keywords extraction.
    • Make user dictionaries easier and more flexible to use and to trace.
    • Allow a more complex integrated analysis for scenarios where the standard output is not enough.
  • A new type of topic has been added, quantity expressions, to cover a specific type of information that was hard to obtain with previous versions.
  • Some fields in the output have been modified, either to give them more appropriate names or to make them easier to use and understand.
  • Some use modes have been retired because the information they provided was redundant with what a morphosyntactic analysis already gives.

Because of all these improvements, migrating takes more work than it would for a minor version. Here is what you need to know to migrate your applications from Lemmatization, PoS and Parsing 1.2 to Lemmatization, PoS and Parsing 2.0.