The Text Classification analysis integrates the functionality provided by the Text Classification API, that is, it allows to assign one or more categories to any text according to the model selected. The model used to classify the input text may be either one of the models included in the API or one of the models defined by the user.
For every language there's a default model that follows the International Press Telecommunications Council standard (IPTC) for news, which consists of around 1400 categories.
On the right, you can see the interface that appears when you click the Text Classification button.
As you can see, there are three sections: Select cells with texts to analyze, which we have already covered in the corresponding section, Analysis settings and Advanced settings.
In Analysis settings there are two configurable values to set:
The Advanced settings menu contains additional options for the Text Classification analysis. There is only one section, Output Configuration, to configure the output of the add-on: you can define both the number of categories you want to see in the results and which fields you want to output for each category.
There's more information about each one of these fields in the response section of the API documentation.
The results obtained from the classification will be shown in a new spreadsheet called "Text Classification". This sheet will include a column with the source text, a column with the IDs if enabled, and a column for each one of the output fields configured in the advanced settings.
When the analysis is configured to output more than one category, each additional category associated to a text will be inserted as a new row, enabling a more flexible use of the results.
This is an example of a possible output of texts in English classified using the IPTC model and without using IDs. The configuration is set to show the default output fields and up to 3 categories: