PAN at CLEF 2013

Shared Tasks
Important Dates
Keynotes
Program
Organizing Committee

Shared Tasks

Important Dates

March 1, 2013: Early bird software submission
April 15, 2013: Final software submission for text alignment and author identification
April 30, 2013: Final software submission for source retrieval and author profiling
June 1, 2013: Paper submission: [template] [guidelines] [submission]
June 30, 2013: Early bird conference registration
September 23-26, 2013: Conference

The timezone of all deadlines is Anywhere on Earth.

Keynotes

Identifying Personality and Psychological States in Words

Cindy K. Chung

The University of Texas at Austin

How do the words we use reflect our personalities, psychological states, and social relationships? Psychologists have applied simple word counting and word patterning methods such as Linguistic Inquiry and Word Count (LIWC) and the Meaning Extraction Method (MEM) across a variety of contexts to understand individual differences in communication. Word use and interaction patterns in everyday conversations and social media exchanges can reveal personality, engagement, and deception. Quantitative text analysis from a psychological perspective can help to uncover how people connect, communicate, and interact in digital spaces.

Linguistic Evidence in Court: Current Issues, Current Trends

Lawrence Solan

Brooklyn Law School

Many forensic sciences have been under attack for their failure to be grounded in adequate research. Even fingerprint evidence is questioned, not for its being generally unreliable, but because there is inadequate research into the extent that accuracy decreases as the quantity and quality of information are each systematically degraded. At the same time, psychological research has revealed biases to which experts are subject, especially confirmation bias ? the propensity to value evidence that confirms one?s hypothesis more than evidence that disconfirms it. Somewhat at odds with these observations are interesting debates about the relative accuracy of expert judgment by individuals based on their experience on the one hand, and algorithmic expertise on the other. This presentation will discuss how legal systems react to offers of linguistic expert evidence in light of these developments.

Program

PAN's program is part of the CLEF conference program.

September 23
14:00-17:30	Lab overviews (incl. Coffee break)
15 min. talk	Recent Trends in Digital Text Forensics and its Evaluation Tim Gollub, Martin Potthast, Anna Beyer, Matthias Busse, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, and Benno Stein
17:30-18:00	Poster Booster Session
18:00-19:30	Poster Session + Welcome Reception
	Approaches for Source Retrieval and Text Alignment of Plagiarism Detection Leilei Kong, Haoliang Qi, Cuixia Du, Mingxing Wang and Zhongyuan Han
	Text Alignment Module in CoReMo 2.1 Plagiarism Detector Diego Antonio Rodríguez-Torrejón and José Manuel Martín Ramos
	Guess again and see if they line up: Surrey's runs at plagiarism detection Lee Gillam
	Unsupervised Ranking for Plagiarism Source Retrieval Kyle Williams, Hung-Hsuan Chen, Sagnik Ray Choudhury and C. Lee Giles
	Plagiarism Candidate Retrieval Using Selective Query Formulation and Discriminative Query Scoring Osama Haggag and Samhaa El-Beltagy
	Diverse Queries and Feature Type Selection for Plagiarism Discovery Šimon Suchomel, Jan Kasprzak and Michal Brandejs
	Using a Variety of n-Grams for the Detection of Different Kinds of Plagiarism Prasha Shrestha and Thamar Solorio
	CopyCaptor: Plagiarized Source Retrieval System using Global word frequency and Local feedback Taemin Lee, Jeongmin Chae, Kinam Park and Soonyoung Jung
	Source Retrieval via Naïve Approach and Passage Selection Heuristics Ondřej Veselý, Tomas Foltynek and Jiří Rybička
	Using Hybrid Similarity Methods for Plagiarism Detection Notebook for PAN at CLEF 2013 Yurii Palkovskii and Alexei Belov
	Grammar Checker Features for Author Identification and Author Profiling Roman Kern
	Distance Learning for Author Verification Paola Ledesma, Gibran Fuentes, Gabriela Jasso, Angel Toledo, and Ivan Meza
	Style-based Distance Features for Author Verification Erwan Moreau and Carl Vogel
	Authorship Identification Using Correlations of Frequent Features Timo Petmanson
	A Textual Modus Operandi: Surrey's Simple System for Author Identification Anna Vartapetiance and Lee Gillam
	Lexical-Syntactic and Graph-Based Features for Authorship Verification Darnes Vilariño, David Pinto, Helena Gómez, Saúl León, and Esteban Castillo
	Intrinsic Author Identification Using Modified Weighted KNN M.R. Ghaeini
	Vector Space Model and Overlap Metric for Author Identification Arun Jayapal and Binayak Goswami
	INAOE's participation at PAN'13: Author Profiling task A. Pastor López-Monroy, Manuel Montes-Y-Gómez, Hugo Jair Escalante, Luis Villaseñor-Pineda and Esaú Villatoro-Tello
	Ensemble-based classification for author profiling using various features Michal Meina, Karolina Brodzińska, Bartosz Celmer, Maja Czoków, Martyna Patera, Jakub Pezacki and Mateusz Wilk
	Author Profiling Using Corpus Statistics, Lexicons and Stylistic Features Maria De-Arteaga, Sergio Jimenez, George Dueñas, Sergio Mancera and Julia Baquero
	Can We Hide in the Web? Large Scale Simultaneous Age and Gender Author Profiling in Social Media Lucie Flekova and Iryna Gurevych
	Readability for author profiling? Lee Gillam
	Automatic Author Profiling Based on Linguistic and Stylistic Features Braja Gopal Patra, Somnath Banerjee, Dipankar Das, Tanik Saikh and Sivaji Bandyopadhyay
	Semantic-based Features for Author Profiling Identification: First insights Delia Irazu Hernandez Farias, Rafael Guzmán-Cabrera, Antonio Reyes and Martha Alicia Rocha
	CNG text classification for authorship profiling task Magdalena Jankowska, Vlado Keselj and Evangelos Milios
	Style-based distance features for author profiling Erwan Moreau and Carl Vogel
	Author Profiling for English and Spanish Text Upendra Sapkota, Thamar Solorio, Manuel Montes-Y-Gómez and Gabriela Ramírez-De-La-Rosa
	Using Simple Content Features for the Author Profiling Task Edson Weren, Viviane P. Moreira and Jose Oliveira
	Two Methodologies Applied to the Author Profiling Task Yuridiana Aleman, Nahun Loya, Darnes Vilariño, and David Pinto
	Content-centric age and gender profiling Wee Yong Lim, Jonathan Goh and Vrizlynn L. L. Thing
September 24
	Keynote & Author Profiling, Chair: Paolo Rosso
11:00-12:00	Identifying Personality and Psychological States in Words Cindy K. Chung
12:00-12:30	Overview of the Author Profiling Task at PAN 2013 Francisco Rangel, Paolo Rosso, Moshe Koppel, Efstathios Stamatatos and Giacomo Inches
12:30-13:00	INAOE's participation at PAN'13: Author Profiling task A. Pastor López Monroy, Manuel Montes y Gómez, Hugo Jair Escalante, Luis Villaseñor Pineda and Esaú Villatoro Tello
12:50-13:00	Award for the best performing author profiling approach at PAN'13. The award is sponsored by the ForensicLab of the Universitat Pompeu Fabra, Barcelona, in memoriam of M. Teresa Turell.
13:00-14:30	Lunch
	Author Profiling & Author Identification, Chair: Patrick Juola
14:30-14:50	Ensemble-based classification for author profiling using various features Michal Meina, Karolina Brodzińska, Bartosz Celmer, Maja Czoków, Martyna Patera, Jakub Pezacki and Mateusz Wilk
14:50-15:10	Readability for author profiling? Lee Gillam
15:10-15:40	Overview of the Author Identification Task at PAN 2013 Patrick Juola and Efstathios Stamatatos
15:40-16:10	Authorship Verification via k-Nearest Neighbor Estimation Oren Halvani, Martin Steinebach and Ralf Zimmermann
16:10-16:30	Discussion
September 25
	Keynote & Author Identification, Chair: Francisco Rangel
11:30-12:30	Linguistic Evidence in Court: Current Issues, Current Trends Lawrence Solan
12:30-12:50	Authorship Verification with Compression Features Cor J. Veenman and Zhenshi Li
12:50-13:10	Proximity based One-class Classification with Common N-Gram Dissimilarity for Authorship Verification Task Magdalena Jankowska, Vlado Keselj and Evangelos Milios
13:10-13:30	Discussion
13:30-14:30	Lunch
	Plagiarism Detection, Chair: Efstathios Stamatatos
14:30-15:00	Overview of the 5th International Competition on Plagiarism Detection Martin Potthast, Matthias Hagen, Tim Gollub, Martin Tippmann, Johannes Kiesel, Paolo Rosso, Efstathios Stamatatos, and Benno Stein
15:00-15:20	Text Alignment Module in CoReMo 2.1 Plagiarism Detector Diego Antonio Rodríguez-Torrejón and José Manuel Martín Ramos
15:20-15:50	Approaches for Source Retrieval and Text Alignment of Plagiarism Detection Leilei Kong, Haoliang Qi, Cuixia Du, Mingxing Wang and Zhongyuan Han
16:50-16:10	Plagiarism Candidate Retrieval Using Selective Query Formulation and Discriminative Query Scoring Osama Haggag and Samhaa El-Beltagy
16:10-16:20	Unsupervised Ranking for Plagiarism Source Retrieval Kyle Williams, Hung-Hsuan Chen, Sagnik Ray Choudhury and C. Lee Giles
16:20-16:30	Discussion
20:30	Social Dinner: Hotel Balneario Las Arenas

Organizing Committee

Martin Potthast

University of Kassel, hessian.AI, and ScaDS.AI

Paolo Rosso

Efstathios Stamatatos

University of the Aegean

Benno Stein

Bauhaus-Universität Weimar