Shared Tasks

Important Dates

  • March 1, 2016: Early bird software submission
  • April 29, 2016: TIRA evaluation phase deadline
  • May 25, 2016 (extended): Paper submission: [template] [guidelines] [submission]
  • June 17, 2016: Peer review notification
  • July 1, 2016: Camera-ready participant papers submission
  • July 24, 2016: Early bird conference registration
  • September 5-8, 2016: Conference

The timezone of all deadlines is Anywhere on Earth.

Keynotes

Rui Sousa-Silva
Detecting translingual plagiarism: A forensic linguistic contribution to computational processing
Universidade do Porto

Plagiarism detection has evolved significantly in recent years, partly in response to the media attention attracted by high-profile plagiarism cases involving journalists and politicians. A culture of control has been establishing itself to guarantee integrity and honesty in all areas related to copyright and authorship, including implementation of policies, codes of conduct, tariffs of penalties, and matching detection software. The latter has dramatically improved alongside the technological developments over the years. Currently instances of linguistic plagiarism can easily be matched to the original, while pointing out differences between the plagiarised and the plagiarising texts. These methods work particularly well with same language texts; however, systematically detecting translingual plagiarism - i.e. where a derivative text copies from a source in another language without attribution - remains a problem area. This is especially so because the possibilities of combining language pairs are immense, thus requiring an enormous data processing power. This session presents illustrative cases of translingual plagiarism and discusses some of the approaches adopted by forensic linguists to investigate and prove that a certain translated text is an instance of plagiarism. The keynote concludes by encouraging a discussion of computational approaches that can be adopted to assist forensic linguists in their own investigation.

Rui Sousa-Silva is assistant professor of the Faculty of Arts and post-doctoral researcher at the Linguistics Centre (CLUP) of the University of Porto, where he is currently conducting his research into Forensic Linguistics and Cybercrime. He has a PhD in Applied Linguistics from Aston University (Birmingham, UK), where he submitted his thesis on Forensic Linguistics: ‘Detecting Plagiarism in the Forensic Linguistics Turn’. He studied cross-cultural attitudes to plagiarism, and proposed an original approach to translingual plagiarism detection. He also authored and co-authored several papers on (computational) authorship analysis, and is co-editor with Professor Malcolm Coulthard of the recently founded international bilingual biennial journal Language and Law / Linguagem e Direito.

Read more… Read less…

Program

PAN's program is part of the CLEF conference program.

September 5
13:30-16:10 Lab Overviews 1
30 min. talk Overview of PAN'16 New Challenges for Authorship Analysis: Cross-genre Profiling, Clustering, Diarization, and Obfuscation
Paolo Rosso, Francisco Rangel, Martin Potthast, Efstathios Stamatatos, Michael Tschuggnall and Benno Stein
18:30-19:30 Welcome Reception
September 6
Keynote + Author Obfuscation, Chair: Paolo Rosso
13:30-14:30 Detecting Translingual Plagiarism: A Forensic Linguistic Contribution to Computational Processing
Rui Sousa-Silva
14:30-15:00 Author Obfuscation: Attacking the State of the Art in Authorship Verification
Martin Potthast, Matthias Hagen, Benno Stein
15:00-15:20 SU@PAN'2016: Author Obfuscation
Tsvetomila Mihaylova, Georgi Karadjov, Yasen Kiprov, Georgi Georgiev, Ivan Koychev, and Preslav Nakov
15:20-15:30 Discussion
15:30-16:15 PAN Poster Session
Authorship clustering using multi-headed recurrent neural networks
Douglas Bagnall
UniNE at CLEF 2016: Author Clustering
Mirco Kocher
Methods for intrinsic plagiarism detection and author diarization
Mikhail Kuznetsov, Anastasia Motrenko, Rita Kuznetsova, and Vadim Strijov
Multi feature space combination for authorship clustering
Muharram Mansoorizadeh, Mohammad Aminiyan, Taher Rahgooy, Mahdy Eskandari
ExploringWord Embeddings and Character N-Grams for Author Clustering
Yunita Sari, Mark Stevenson
Author Diarization Using Cluster-Distance Approach
Abdul Sittar, Hafiz Rizwan Iqbal, and Rao Muhammad Adeel Nawab
A Big Increase in Known Unknowns: from Author Verification to Author Clustering
Anna Vartapetiance and Lee Gillam
Experiments in Authorship-Link Ranking and Complete Author Clustering
Valentin Zmiycharov, Dimitar Alexandrov, Hristo Georgiev, Yasen Kiprov, Georgi Georgiev, Ivan Koychev, and Preslav Nakov
Author Masking through Translation
Yashwant Keswani, Harsh Trivedi, Parth Mehta, and Prasenjit Majumder
Evaluating Safety, Soundness and Sensibleness of Obfuscation Systems
Matthias Liebeck, Pashutan Modaresi, and Stefan Conrad
Author obfuscation using WordNet and language models
Muharram Mansoorizadeh, Taher Rahgooy, Mohammad Aminiyan, Mahdy Eskandari
SU@PAN'2016: Author Obfuscation
Tsvetomila Mihaylova, Georgi Karadjov, Yasen Kiprov, Georgi Georgiev, Ivan Koychev, and Preslav Nakov
Age and Gender Identification using Stacking for Classification
Madhulika Agrawal and Teresa Gonçalves
Cross-Genre Author Profile Prediction Using Stylometry-Based Approach
Shaina Ashraf, Hafiz Rizwan Iqbal, Rao Muhammad Adeel Nawab
Author Profiling Using Support Vector Machines
Rodwan Bakkar Deyab, José Duarte, and Teresa Gonçalves
Author Profiling using SVMs andWord Embedding Averages
Roy Bayot and Teresa Gonçalves
CAPS: A Cross-genre Author Profiling System
Ivan Bilan and Desislava Zhekova
Author Profiling using Complementary Second Order Attributes and Stylometric Features
Konstantinos Bougiatiotis and Anastasia Krithara
GronUP: Groningen User Profiling
Mart Busger op Vollenbroek, Talvany Carlotto, Tim Kreutz, Maria Medvedeva, Chris Pool, Johannes Bjerva, Hessel Haagsma, and Malvina Nissim
Using Machine Learning Algorithms for Author Profiling in Social Media
Daniel Dichiu, Irina Rancea
Statistical Semantics in Context Space: Amrita CEN@Author Profiling
Barathi Ganesh HB, Anand Kumar M, and Soman KP
Profile-based Approach for Age and Gender Identification
Ma. José Garciarena Ucelay, Ma. Paula Villegas, Dario G. Funez, Leticia C. Cagnina, Marcelo L. Errecalde, Gabriela Ramírez-de-la-Rosa, and Esaú Villatoro-Tello
PANcakes Team: A Composite System of Genre-Agnostic Features For Author Profiling
Pepa Gencheva, Martin Boyanov, Elena Deneva, Preslav Nakov, Yasen Kiprov1, Ivan Koychev1, and Georgi Georgiev
UniNE at CLEF 2016: Author Profiling
Mirco Kocher, Jaques Savoy
Adapting Cross-Genre Author Profiling to Language and Corpus
Ilia Markov, Helena Gómez-Adorno, Grigori Sidorov, and Alexander Gelbukh
Exploring the Effects of Cross-Genre Machine Learning for Author Profiling in PAN 2016
Pashutan Modaresi, Matthias Liebeck, and Stefan Conrad
Profiling microblog authors using concreteness and sentiment
Oliver Pimas, Andi Rexha, Mark Kröll, and Roman Kern
Cross-genre Age and Gender Identification in Social Media
Anam Zahid, Aadarsh Sampath, Anindya Dey, Golnoosh Farnadi
Derivative Approach for Plagiarism Source Retrieval
Rhulani Maluleka
19:45 Conference Dinner
September 7
Author Profiling, Chair: Francisco Rangel
13:30-14:00 Overview of the 4th Author Profiling Task at PAN 2016: Cross-Genre Evaluations
Francisco Rangel, Paolo Rosso, Ben Verhoeven, Walter Daelemans, Martin Potthast, Benno Stein
14:00-14:20 GronUP: Groningen User Profiling
Mart Busger op Vollenbroek, Talvany Carlotto, Tim Kreutz, Maria Medvedeva, Chris Pool, Johannes Bjerva, Hessel Haagsma, and Malvina Nissim
14:20-14:40 Exploring the Effects of Cross-Genre Machine Learning for Author Profiling in PAN 2016
Pashutan Modaresi, Matthias Liebeck, Stefan Conrad
14:40-15:00 CAPS: A Cross-Genre Author Profiling System
Ivan Bilan, Desislava Zhekova
15:00-15:20 Author Profiling using Complementary Second Order Attributes and Stylometric Features
Konstantinos Bougiatiotis and Anastasia Krithara
15:20-15:30 Discussion
15:30-16:15 Break
Author Identification, Chair: Michael Tschuggnall
16:15-17:00 Clustering by Authorship Within and Across Documents
Efstathios Stamatatos, Michael Tschuggnall, Ben Verhoeven, Walter Daelemans, Günther Specht, Benno Stein, and Martin Potthast
17:00-17:20 Authorship Clustering using Multi-headed Recurrent Neural Networks
Douglas Bagnall
17:20-17:40 UniNE at CLEF 2016: Author Clustering
Mirco Kocher
17:40-18:00 Methods for Intrinsic Plagiarism Detection and Author Diarization
Mikhail Kuznetsov, Anastasia Motrenko, Rita Kuznetsova, Vadim Strijov
18:00-18:15 Discussion
19:45 "Petiscos" and "Cante Alentejano"

Sponsors

Organizing Committee