Deep Categorization in Google Sheets

The Deep Categorization analysis integrates the functionality provided by the Deep Categorization API, that is, assigning one or more categories to a text, using a very detailed rule-based language that allows you to identify very specific scenarios and patterns using a combination of morphological, semantic and text rules.

On the right, you can see the interface that appears when you click the Deep Categorization button.

As you can see, there are three sections: Select cells with texts to analyze, which we have already covered in the corresponding section, Analysis settings and Advanced settings.

In Analysis settings there are two configurable values to set:

  • Language, to select the language of the texts. The possible values are: Spanish, English, French, Catalan, Portuguese, and Italian.
  • Model, to define the model that will be applied to categorize the texts. The list of models also includes the user-defined ones associated to the key that's being used: they will be identified with a lock icon before the name, as follows:
    • Deep Categorization user model

Deep Categorization user interface

Important

To be able to use any of our vertical packs, you need to have access to them! You can request access in the developer home. You can read more about it here.

Advanced settings

The Advanced settings menu contains additional options for the Deep Categorization analysis. There is only one section, Output Configuration, to configure the output of the add-on: you can define both the number of categories you want to see in the results and which fields you want to output for each category.

  • Number of categories shown: set by default to 1 and with a maximum value of 10.
  • Fields:
    • Code: shows the code associated to the category.
    • Label: shows the label of the category; it's non-configurable, so it always appears in the results.
    • Rank: shows the rank or order in which a category has been associated to a text.
    • Relevance: shows the absolute relevance associated to the category.
    • Relative Relevance: shows the relative relevance associated to the category.
    • Polarity: shows the polarity of the category detected.
Deep Categorization advanced settings

There's more information about each one of these fields in the response section of the API documentation.

Output

The results obtained from the categorization will be shown in a new spreadsheet called "Deep Categorization". This sheet will include a column with the source text, a column with the IDs if enabled, and a column for each one of the output fields configured in the advanced settings.

When the analysis is configured to output more than one category, each additional category associated to a text will be inserted as a new row, enabling a more flexible use of the results.

This is an example of a possible output of texts in English categorized using the IAB model and without using IDs. The configuration is set to show the default output fields and up to 3 categories: