Researchers have used large quantities of online data to study dynamics in novel ways. Consider the specific case of online networked individuals sharing geo-located, multimodal, pieces of information in social media platforms, e.g., users of Twitter, Instagram, Facebook. Can their social dynamics be used to unveil the hidden dimensions that regulate the social life of our cities? To answer this question, our research has focused on understanding how people psychologically experience cities and, as a result, we have created new mapping tools that capture the aesthetic, olfactory and sonic layers of our cities, modeling happiness and use of figurative language, e.g., irony. The work presented in this talk mixes data mining, urban informatics, and computational social science to study how these dimensions relate to demographic, e.g., age or gender, and socio-economic factors, e.g., education, crime, race, or wealth, that characterize the profile of the modern urban fabric.
PAN at CLEF 2017
Shared Tasks
Important Dates
- March 15, 2017: Early bird software submission
- April 15, 2017: TIRA evaluation phase opens
- May 15, 2017: TIRA evaluation phase deadline
- May 25, 2017 (extended): Paper submission: [template] [guidelines] [submission]
- June 16, 2017: Peer review notification
- July 3, 2017: Camera-ready participant papers submission
- July 15, 2017: Early bird conference registration
- September 11-14, 2017: Conference
The timezone of all deadlines is Anywhere on Earth.
Keynotes
Authorship profiling and text-based deception detection have been the focus of attention in recent years, partly because of a rapid growth of Internet communication and necessity to detect fraud in online reviews and dating profiles, reveal suicidal tendencies in authors of texts on social media, as well as to assess who likes/dislikes these or those products and services (male, female, extroverts…) using linguistic analysis of their reviews, etc. Industrial companies are in need of techniques for quick and valid assessment of candidates’ intelligence and personality, and analyzing texts could provide such opportunities. Author profiling and deception detection domain is rapidly developing but not for Slavic languages. They have long been beyond the scope of relevant studies, which is largely due to the fact that there were no corresponding text corpora available and no efficient methods of natural language processing in place. In this keynote we present the results of the research aimed at deception detection and personality and gender recognition in Russian written texts. In this research RusPersonality was used, which is by far the largest corpus of written texts in Slavic languages with rich metadata (information on the authors of the texts – gender, age, occupation, scores on different personality traits, results of neuropsychological assessment, etc. and information on the texts – genre, topic, deceptive/truthful, etc.). The second source of material for research conducted in RusProfiling Lab is social media. In most of the experiments topic-independent features were used. Special attention in the talk will be paid to the estimation of a likelihood of self-destructive (including suicidal as the most severe form) behavior using linguistic analyses of writing. The keynote concludes by encouraging a discussion on the necessity to seek for topic-independent and language-independent features in authorship profiling and for explanation of the estimated correlations between linguistics parameters of written texts and characteristics of their authors.
Program
PAN's program is part of the CLEF conference program.
September 11 | ||
|
Labs Overviews | |
15 min. talk | Overview of PAN'17: Author Identification, Author Profiling, and Author Obfuscation Martin Potthast, Francisco Rangel, Michael Tschuggnall, Efstathios Stamatatos, Paolo Rosso, and Benno Stein |
|
19:00-22:00 | Welcome Reception | |
September 12 | ||
Keynotes, Chair: Paolo Rosso | ||
14:00-15:00 | Profiling the sensorial, emotional and ironic life of a city Rossano Schifanella |
|
15:00-16:00 | RusProfiling: Analyzing Russian written texts to detect deception and identify the author's
personality and gender Tatiana Litvinova |
|
16:00-16:30 | Break | |
Author Profiling, Chair: Francisco Rangel | ||
16:30-16:50 | Overview of the 5th Author Profiling Task at PAN 2017: Gender and Language Variety
Identification in Twitter Francisco Rangel, Paolo Rosso, Martin Potthast, Benno Stein |
|
16:50-17:10 | N-GrAM: New Groningen Author-profiling Model Angelo Basile, Gareth Dwyer, Maria Medvedeva, Josine Rawee, Hessel Haagsma, and Malvina Nissim |
|
17:10-17:30 | Author Profiling with Word+Character Neural Attention Network Yasuhide Miura, Tomoki Taniguchi, Motoki Taniguchi, and Tomoko Ohkuma |
|
17:30-17:50 | PAN 2017: Author Profiling - Gender and Language Variety Prediction Matej Martinc, Iza Škrjanec, Katja Zupan, and Senja Pollak |
|
17:50-18:10 | Convolutional Neural Networks for Author Profiling Sebastián Sierra, Manuel Montes-y-Gómez, Thamar Solorio, and Fabio A. González |
|
18:10-18:30 | Discussion | |
September 13 | ||
Author Identification, Chair: Michael Tschuggnall | ||
13:45-14:05 | Author Clustering using Hierarchical Clustering Analysis Helena Gómez-Adorno, Yuridiana Aleman, Darnes Vilariño, Miguel A. Sanchez-Perez, David Pinto, Grigori Sidorov |
|
14:05-14:25 | Overview of the Author Identification Task at PAN-2017: Style Breach Detection and Author
Clustering Michael Tschuggnall, Efstathios Stamatatos, Ben Verhoeven, Walter Daelemans, Günther Specht, Benno Stein, Martin Potthast |
|
14:25-14:45 | Discovering Author Groups Using a beta-compact Graph-based Clustering Yasmany García, Daniel Castro, Vania Lavielle, and Rafael Muñoz |
|
14:45-15:05 | UniNE at CLEF 2017: Author Clustering Mirco Kocher and Jacques Savoy |
|
15:05-16:15 | Poster Session & Break | |
Style Breach Detection & Author Obfuscation, Chair: Efstathios Stamatatos | ||
16:15-16:25 | Overview of the Author Identification Task at PAN-2017: Style Breach Detection and Author
Clustering Michael Tschuggnall, Efstathios Stamatatos, Ben Verhoeven, Walter Daelemans, Günther Specht, Benno Stein, Martin Potthast | |
16:25-16:45 | OPI-JSA at CLEF 2017: Author Clustering and Style Breach Detection Daniel Karaś, Martyna Śpiewak, and Piotr Sobecki |
|
16:45-17:05 | Overview of the Author Obfuscation Task at PAN 2017: Safety Evaluation Revisited Matthias Hagen, Martin Potthast, Benno Stein |
|
17:05-17:25 | Author Masking using Sequence-to-Sequence Models Oleg Bakhteev, Andrey Khazov |
|
17:25 | Discussion & Closing | |
19:00-22:00 | Conference Dinner |