Category Archives: Language Technology

Posts about language technology.

Konplik Health: MeaningCloud splits its business of AI services for healthcare and pharma

As you are a client and friend of MeaningCloud, it is with great pleasure that I share the news that we have established a dedicated business focused exclusively on providing services for the health and pharmaceutical sectors: Konplik Health. This is an exciting step forward to accelerate our growth.

Today we publicly announce the completion of this spin-off of our Artificial Intelligence (AI) business, with its 22 years of experience, into this new, independent company. The spin-off will allow both product and management teams to respond more quickly to their customers’ particular needs and to grow faster through focused, fit-for-purpose operating models.

Continue reading


Communication during the Coronavirus (I): Thematic analysis in Spanish digital news media

The obvious priority during this pandemic is to cure the sick, to prevent new cases and to ensure that economic and social measures are in place to help the people and businesses hit hardest by the current situation. Even so, the coronavirus-related content generated by the media and by social network users will undoubtedly become, in the near future, an object of research for numerous disciplines, such as sociology, philology, linguistics, audiovisual communication and politics, to name a few.

At MeaningCloud we want to do our bit in this area by applying our experience and our Text Analytics solutions to the enormous volume of natural-language information, in Spanish and in other languages, in Spain and in other countries, since this is unfortunately a global crisis.

This first article in the series focuses on the thematic analysis of content generated in Spanish by digital media platforms in Spain over the last month, how that content has evolved during this period, and the news positioning of the main media platforms in Spain.

These other articles (only available, at the moment, in Spanish) analyse conversation topics on Twitter in Spain (both from the hashtags and general topics perspective and also applying a specific thematic categorization) and the linguistic analysis of presidential speeches related to this crisis.

Continue reading


الصيحة! Text Analytics in Arabic

At MeaningCloud we aim to provide the most advanced text analytics product with the broadest language coverage in the market. That’s why before we finished 2019 we worked on launching several new language packs to increase the coverage given by our standard pack — English, Spanish, French, Italian, Portuguese and Catalan — and our Nordic pack — Swedish, Danish, Norwegian and Finnish.

The third pack we launched is the Arabic pack. Arabic, the fifth most spoken language in the world, is the official language in twenty countries and co-official in six others. It is the first language of 280 million speakers, and the second language of another 250 million. Moreover, for religious reasons, several million Muslims living in other countries have knowledge of Arabic.

Its most distinctive characteristic is that it uses its own writing system, written from right to left with the letters joined together, so each letter can have up to four forms depending on its position. It is also interesting that, despite an attempt to introduce them in the 1920s, there are no capital letters in Arabic. Since common nouns can sometimes be confused with proper nouns, the latter are usually enclosed in parentheses or quotes.

MeaningCloud now provides coverage for Arabic for the following functionality:

[Image: functionality covered by the Arabic pack]

This coverage will be extended in successive product releases, depending on market demand. Find detailed information on our new language coverage page.

So, what are these text analytics tasks and what are they used for?
Continue reading


Ура! Text Analytics in Russian

At MeaningCloud we aim to provide the most advanced text analytics product with the broadest language coverage in the market. That’s why before we finished 2019 we worked on launching several new language packs to increase the coverage given by our standard pack — English, Spanish, French, Italian, Portuguese and Catalan — and our Nordic pack — Swedish, Danish, Norwegian and Finnish.

The second pack we launched is the Russian pack. Russian is the official language of the Russian Federation, Belarus, Kazakhstan and Kyrgyzstan. It was the de facto language of the Soviet Union, so its use is also common in the Baltic States, the Caucasus and Central Asia. It’s the most widely spoken of the Slavic languages, with almost 144 million speakers.

Russian is written using the Cyrillic alphabet. Transliteration into the Latin alphabet has been common because of technical restrictions and the unavailability of Cyrillic keyboards abroad, but it’s used less and less thanks to Unicode, which incorporates the Russian alphabet, and to the many free programs that support it.

MeaningCloud now provides coverage for Russian for the following functionality:

[Image: functionality covered by the Russian pack]

This coverage will be extended in successive product releases, depending on market demand. Find detailed information on our new language coverage page.

So, what are these text analytics tasks and what are they used for?
Continue reading


好棒! Text Analytics in Chinese

At MeaningCloud we aim to provide the most advanced text analytics product with the broadest language coverage in the market. That’s why, before the end of 2019, we have been working on launching several new language packs to increase the coverage given by our standard pack (English, Spanish, French, Italian, Portuguese and Catalan) and our Nordic pack (Swedish, Danish, Norwegian and Finnish).

The first pack we are launching is the Chinese pack. Chinese is the official language of the People’s Republic of China. It’s the language with the most native speakers, almost 16% of the global population.

Chinese (in all its varieties) is a group of languages based on ideograms, traditionally arranged in vertical columns, read from top to bottom down a column and right to left across columns. The variety covered by MeaningCloud is simplified Chinese.

MeaningCloud now provides coverage for Chinese for the following functionality:

[Image: functionality covered by the Chinese pack]

This coverage will be extended in successive product releases, depending on market demand. Find detailed information on our new language coverage page.

So, what are these text analytics tasks and what are they used for?
Continue reading


Performance Metrics for Text Categorization

One of the most common and extensively studied knowledge extraction tasks is text categorization. Customers frequently ask how we evaluate the quality of the output of our categorization models, especially in scenarios where each document may belong to several categories.

The idea is to keep track of changes in the continuous improvement cycle of our models and to know whether those changes have been for better or worse, so that we can commit or reject them.

This post answers that question by describing the metrics that we commonly adopt for model quality assessment, depending on the categorization scenario that we are facing.
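As a rough illustration of the multi-category scenario mentioned above, here is a minimal Python sketch. It assumes micro-averaged precision, recall and F1, which are standard choices for multi-label evaluation but are not necessarily the exact metrics described in the full post. The function name `micro_prf` and the sample categories are hypothetical.

```python
from typing import Dict, List, Set


def micro_prf(gold: List[Set[str]], predicted: List[Set[str]]) -> Dict[str, float]:
    """Micro-averaged precision, recall and F1 for multi-label categorization.

    gold[i] and predicted[i] are the sets of categories for document i.
    (Illustrative helper, not part of any MeaningCloud API.)
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    # Count label-level hits and misses across all documents.
    tp = sum(len(g & p) for g, p in zip(gold, predicted))  # correct labels
    fp = sum(len(p - g) for g, p in zip(gold, predicted))  # spurious labels
    fn = sum(len(g - p) for g, p in zip(gold, predicted))  # missed labels

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical sample: three documents, each with one or more categories.
gold = [{"sports", "health"}, {"politics"}, {"economy"}]
predicted = [{"sports"}, {"politics", "economy"}, {"economy"}]
scores = micro_prf(gold, predicted)
print(scores)
```

Comparing these scores before and after a model change is one simple way to decide whether to commit or reject that change.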

 

Continue reading


NLP technologies: state of the art, trends and challenges

This post presents MeaningCloud’s view of the state of Natural Language Processing technology at the end of 2019, based on our work with customers and research projects.

NLP technology has practically reached human-level quality (or even better) in many different tasks. This is mainly thanks to advances in machine learning and deep learning techniques, which make it possible to use large sets of training data to build language models, but also to improvements in core text processing engines and the availability of semantic knowledge databases.

Continue reading


Case Study: Text Analytics against Fake News

Everybody has heard about fake news. Fake news is a neologism that can be formally defined as a type of yellow journalism or propaganda that consists of deliberate disinformation or hoaxes spread via traditional print and broadcast news media or online social media. It is also commonly used to refer to fabricated or junk news, with no basis in fact, but presented as being factually accurate.

The main reason anyone puts effort into creating fake news is to cause financial, political or reputational damage to people, companies or organizations, using sensationalist, dishonest or outright fabricated headlines to increase readership and to spread virally among readers. In addition, clickbait stories, a special type of fake news, earn direct advertising revenue from this activity.

Continue reading


How Artificial Intelligence makes RPA smarter: two use cases

Artificial Intelligence and RPA

Many organizations could gain huge operational efficiencies if they combined Artificial Intelligence and RPA (Robotic Process Automation).

In a previous post (The leading role of Natural Language Processing in Robotic Process Automation) we introduced the subject of NLP in RPA. In this post, we look at two use cases in which Natural Language Processing (also known as Text Analytics), integrated with RPA/BPM software suites, is mature enough to solve typical insight extraction problems conveniently and cost-effectively.

Continue reading


We The Humans: Artificial Intelligence for social good

MeaningCloud partners with the think tank “We the Humans”, sponsoring the challenge “Artificial Intelligence for social good”.

The mission of “We the Humans” consists of:

  • Encouraging the social debate about the correct use and development of Artificial Intelligence.
  • Bringing these concerns to the public agenda.
  • Supporting organizations in the development and adoption of an ethical AI.


Continue reading