MeaningCloud sponsors prize for Author Profiling Research

Author Profiling ResearchCLEF Initiative and Conference

MeaningCloud sponsors the prize to the best team at the 5th International Competition on Author Profiling Research, PAN@CLEF 2017. This competition is part of PAN (Plagiarism, Authorship and Social Software Misuse), a series of scientific events and shared tasks on digital text forensics. The 17th evaluation lab on digital text forensics will be held as part of the CLEF conference in Dublin, Ireland, on September 11-14, 2017.

Authorship Analysis

Authorship analysis deals with the classification of texts into classes based on the stylistic choices of their authors. Beyond the author identification and author verification tasks where the style of individual authors is examined, author profiling distinguishes between classes of authors studying their sociolect aspect, that is, how people use language. This helps in identifying profiling aspects such as gender, age, native language, or personality type. Author profiling is a problem of growing importance in applications in forensics, security, and marketing.

Winners of the 4th Competition on Author Profiling Research

PAN16 Author Profiling Research Award - Malvina Nissim - Paolo Rosso - Francisco Rangel

MeaningCloud has been contributing to this event sponsoring the prize to the best competing team since 2015.

PAN@CLEF 2015

PAN@CLEF 2016

PAN@CLEF 2017

The best performing team at the 2016 edition (4th International Competition on Author Profiling) was:

  • Mart Busger op Vollenbroek, Talvany Carlotto, Tim Kreutz, Maria Medvedeva, Chris Pool, Johannes Bjerva, Hessel Haagsma, Malvina Nissim. University of Groningen, Netherlands.

Authorship Profiling Research with MeaningCloud APIs

MeaningCloud provides building blocks for Authorship Analysis. Check out our complete Lemmatization, Part of Speech and Parsing API. It contains a myriad of functionalities derived from the full morphosyntactic and semantic analysis it carries out:

  • Syntactic analysis: obtains a thorough syntactic analysis, giving a complete syntactic tree where the leaves represent the most basic elements and their morphological and semantic analyses.
  • Lemmatization: obtains the lemmas of the different words in a text.
  • PoS tagging: gives not only the grammatical category of a word but also all the possible grammatical categories in which a word of each particular PoS type can be classified. In the cases it applies, the morphological analysis will refer to a semantic analysis.

References

Overview of the Author Profiling Task at PAN 2013
F. Rangel, P. Rosso, et al. In P. Forner, R. Navigli, and D. Tufis, editors, CLEF 2013 Evaluation Labs and Workshop – Working Notes Papers, 23-26 September, Valencia, Spain, September 2013.

Overview of PAN’16—New Challenges for Authorship Analysis: Cross-genre Profiling, Clustering, Diarization, and Obfuscation.
P. Rosso, F. Rangel, et al. In N. Fuhr et al, editors, Experimental IR Meets Multilinguality, Multimodality, and Interaction. 7th International Conference of the CLEF Initiative (CLEF 16), September 2016, Springer.

 


About Jose C. Gonzalez

CEO at MeaningCloud. PhD in Telecommunication Engineering. Professor at Technical University of Madrid (1985-2015). Member of the Board of Directors of LT-Innovate.

Leave a Reply

Your email address will not be published. Required fields are marked *

*
*