The Deep Categorization analysis integrates the functionality of the Deep Categorization API: it assigns one or more categories to a text using a very detailed rule-based language. This language lets you identify very specific scenarios and patterns by combining morphological, semantic and text rules.
This is the interface that will appear when you click the Deep Categorization button:
As you can see, there are two sections in the interface: Input, which is covered in its own section, and Analysis Settings.
In Analysis Settings there are two values to select:
As described in the Settings section, there's an advanced settings menu with additional configuration options for the Deep Categorization analysis. These are the available options and their default values:
There are two main aspects to configure: the number of categories you want to see in the results, and which fields you want to output for each category.
There's more information about each one of these fields in the response section of the API documentation.
The results obtained from the categorization will be shown in a new Excel sheet called "Deep Categorization". This sheet will include a column with the source text, a column with the IDs if enabled, and then a column for each of the output fields configured in the advanced settings.
When the analysis is configured to output more than one category, each additional category associated with a text is inserted as a new row, which allows a more flexible use of the results.
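To make the row layout more concrete, here is a minimal Python sketch of how a list of categories per text could be flattened into one row per category. It is not the add-in's actual implementation: the function name `to_rows`, the field names `code` and `label`, the sample texts and categories, and the choice to repeat the source text on every row are all assumptions made for illustration.

```python
def to_rows(results, fields=("code", "label"), max_categories=10):
    """Flatten (text, categories) pairs into one row per text and category.

    Hypothetical helper: `fields` stands in for the output fields chosen in
    the advanced settings, and `max_categories` for the category limit.
    """
    rows = []
    for text, categories in results:
        kept = categories[:max_categories]  # apply the category limit
        if not kept:
            # Assumption: a text with no categories keeps a row with empty cells.
            rows.append([text] + [""] * len(fields))
            continue
        for category in kept:
            # Assumption: the source text is repeated on every row.
            rows.append([text] + [category.get(f, "") for f in fields])
    return rows


# Made-up sample data, not real output of any model.
sample = [
    ("The claim took months to process.",
     [{"code": "C1", "label": "Claims"}, {"code": "C2", "label": "Speed"}]),
    ("Great service.", [{"code": "C3", "label": "Customer service"}]),
]

rows = to_rows(sample)
for row in rows:
    print(row)
```

With this layout, the first sample text produces two rows and the second produces one, so the results can be filtered or pivoted by category without splitting cells.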
This is an example of a possible output of texts in English classified using the VoC model for the insurance domain and without using IDs. The configuration is set to show the output fields configured by default and up to 10 categories: