This is the second of two tutorials in which we use the MeaningCloud Extension for RapidMiner to extract insights that combine structured data with unstructured text. Read the first one here. To follow these tutorials, you will need RapidMiner Studio and our Extension for RapidMiner installed on your machine (learn how here).
In this tutorial, we will try to extract a rule set that predicts whether a review is positive or negative, based on MeaningCloud’s topics extraction and sentiment analysis features.
More specifically, we will try to answer the following question:
- Which topics have the most impact in a customer review, and how do they affect the sentiment of that review?
For this purpose, we will use a dataset of Amazon food reviews. The dataset can be found here.
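To make the question above more concrete, here is a minimal Python sketch of the kind of relationship we are after: for each topic, how often it appears and what share of the reviews mentioning it are positive. This is not part of the RapidMiner process and does not call MeaningCloud. The records, the field names (`topics`, `polarity`) and the helper `topic_sentiment_summary` are all hypothetical, made up only for illustration, and the real dataset will look different.

```python
from collections import defaultdict

# Hypothetical toy records (NOT taken from the Amazon dataset): each review
# is assumed to already carry a list of extracted topics and a polarity label.
reviews = [
    {"topics": ["coffee", "price"], "polarity": "positive"},
    {"topics": ["coffee"], "polarity": "positive"},
    {"topics": ["packaging"], "polarity": "negative"},
    {"topics": ["price", "packaging"], "polarity": "negative"},
    {"topics": ["coffee", "packaging"], "polarity": "positive"},
]


def topic_sentiment_summary(records):
    """For each topic, count the reviews mentioning it and the positive share."""
    counts = defaultdict(lambda: {"total": 0, "positive": 0})
    for record in records:
        # Use a set so a topic repeated within one review is counted once.
        for topic in set(record["topics"]):
            counts[topic]["total"] += 1
            if record["polarity"] == "positive":
                counts[topic]["positive"] += 1
    return {
        topic: {
            "total": c["total"],
            "positive_share": c["positive"] / c["total"],
        }
        for topic, c in counts.items()
    }


summary = topic_sentiment_summary(reviews)

# List topics from most to least frequent, with their share of positive reviews.
for topic, stats in sorted(summary.items(), key=lambda kv: -kv[1]["total"]):
    print(f"{topic}: {stats['total']} reviews, "
          f"{stats['positive_share']:.0%} positive")
```

A topic that is both frequent and strongly skewed towards one polarity is a natural candidate for a rule, which is roughly what a rule-induction model looks for when trained on topics as attributes and sentiment as the label.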