Workshop Program

PAN is co-located with the CLEF conference and will be held from September 17 to 20, 2012.

Schedule

.
.
September 17
14:00-16:00Lab overviews (Room Loyola)
15 min. talkPAN'12 - Uncovering Plagiarism, Authorship, and Social Software Misuse
Martin Potthast
16:00-16:30Coffee Break
16:30-17:00Poster Boaster Session
17:00-18:30Poster Session (Room Galleria A)
Encoplot - Tuned for High Recall (also proposing a new plagiarism detection score) - Notebook for PAN at CLEF 2012
Cristian Grozea, Marius Popescu
A Set-Based Approach to Plagiarism Detection - Notebook for PAN at CLEF 2012
Robin Küppers, Stefan Conrad
Applying Specific Clusterization and Fingerprint Density Distribution with Genetic Algorithm Overall Tuning in External Plagiarism Detection - Notebook for PAN at CLEF 2012
Yurii Palkovskii, Alexei Belov
Detailed Comparison Module In CoReMo 1.9 Plagiarism Detector - Notebook for PAN at CLEF 2012
Diego A. Rodríguez Torrejón, José Manuel Martín Ramos
Optimized Fuzzy Text Alignment for Plagiarism Detection - Notebook for PAN at CLEF 2012
Fernando Sánchez-Vega, Manuel Montes-y-Gómez, Luis Villaseñor-Pineda
Bootstrapped Authorship Attribution in Compression Space - Notebook for PAN at CLEF 2012
Ramon de Graaff, Cor J. Veenman
Paragraph Clustering for Intrinsic Plagiarism Detection using a Stylistic Vector-Space Model with Extrinsic Features - Notebook for PAN at CLEF 2012
Julian Brooke, Graeme Hirst
Sub-Profiling by Linguistic Dimensions to Solve the Authorship Attribution Task - Notebook for PAN at CLEF 2012
Upendra Sapkota, Thamar Solorio
Information Retrieval and Classification based Approaches for the Sexual Predator Identification - Notebook for PAN at CLEF 2012
Darnes Vilariño, Esteban Castillo, David Pinto, Iván Olmos, Saul León
September 19 (Room Trilussa)
Quality Flaw Prediction in Wikipedia, Chair: Matthias Hagen
10:30-10:45Overview of the 1st International Competition on Quality Flaw Prediction inWikipedia
Maik Anderka, Benno Stein
10:45-11:15On the Use of PU Learning for Quality Flaw Prediction in Wikipedia - Notebook for PAN at CLEF 2012
Edgardo Ferretti, Donato Hernández Fusilier, Rafael Guzmán Cabrera, Manuel Montes-y-Gómez, Marcelo Errecalde, Paolo Rosso
11:15-11:30FlawFinder: A Modular System for Predicting Quality Flaws in Wikipedia - Notebook for PAN at CLEF 2012
Oliver Ferschke, Iryna Gurevych, Marc Rittberger
Plagiarism Detection, Chair: Matthias Hagen
11:30-12:00Overview of the 4th International Competition on Plagiarism Detection
Martin Potthast, Tim Gollub, Matthias Hagen, Jan Graßegger, Johannes Kiesel, Maximilian Michel, Arnd Oberländer, Martin Tippmann, Alberto Barrón-Cedeño, Parth Gupta, Paolo Rosso, Benno Stein
12:00-12:15Approaches for Candidate Document Retrieval and Detailed Comparison of Plagiarism Detection - Notebook for PAN at CLEF 2012
Kong Leilei, Qi Haoliang, Wang Shuai, Du Cuixia, Wang Suhong, Han Yong
12:15-12:30Educated guesses and equality judgements: using search engines and pairwise match for external plagiarism detection - Notebook for PAN at CLEF 2012
Lee Gillam, Neil Newbold, Neil Cooke
12:30-14:00Lunch
Plagiarism Detection, Chair: Paolo Rosso
14:00-14:15Three way search engine queries with multi-feature document comparison for plagiarism detection - Notebook for PAN at CLEF 2012
Simon Suchomel, Jan Kasprzak, and Michal Brandejs
Cross-Language Plagiarism Detection (Keynotes and Panel Discussion), Chair: Paolo Rosso
14:15-15:00Cross-lingual Similarity Calculation for Plagiarism Detection and More - Tools and Resources
Ralf Steinberger
15:00-15:45Babelplagiarism: What can BabelNet do for Cross-language Plagiarism Detection?
Roberto Navigli
15:45-16:00Panel discussion
September 20 (Room Leopardi)
Traditional Authorship Attribution, Chair: Efstathios Stamatatos
9:30- 10:00An Overview of the Traditional Authorship Attribution Subtask + Mixture of Experts Authorship Attribution
Patrick Juola and Michael Ryan, John Noecker Jr
10:00-10:15Authorship attribution: using rich linguistic features when training data is scarcen - Notebook for PAN at CLEF 2012
Ludovic Tanguy, Franck Sajous, Basilio Calderone, Nabil Hathout
10:15-10:30Feature Bagging for Author Attribution - Notebook for PAN at CLEF 2012
François-Marie Giraud, Thierry Artières
10:30-11:00Break
Traditional Authorship Attribution & Sexual Predator Identification, Chair: Patrick Juola
11:00-11:15Graph-based and Lexical-Syntactic Approaches for the Authorship Attribution Task - Notebook for PAN at CLEF 2011
Esteban Castillo, Darnes Vilariño, David Pinto, Iván Olmos, Jesús A. González, Maya Carrillos
11:15-11:45Overview of the International Sexual Predator Identification Competition at PAN-2012
Giacomo Inches, Fabio Crestani
11:45-12:00Vote/Veto Classification, Ensemble Clustering and Sequence Classification for Author Identification - Notebook for PAN at CLEF 2012
Roman Kern, Stefan Klampfl and Mario Zechner
12:00-12:15Quite Simple Approaches for Authorship Attribution, Intrinsic Plagiarism Detection and Sexual Predator Identification - Notebook for PAN at CLEF 2012
Anna Vartapetiance, Lee Gillam
12:15-12:30Kernel Methods and String Kernels for Authorship Analysis - Notebook for PAN at CLEF 2012
Marius Popescu, Cristian Grozea
12:30-12:45Conversation Level Constraints on Pedophile Detection in Chat Rooms - Notebook for PAN at CLEF 2012
Claudia Peersman, Frederik Vaassen, Vincent Van Asch, Walter Daelemans
12:45-13:00Identifying Predators Using ChatCoder 2.0 - Notebook for PAN at CLEF 2012
April Kontostathis, Will West, Andy Garron, Kelly Reynolds, Lynne Edwards
13:00-14:00Lunch
Sexual Predator Identification, Chair: Giacomo Inches
14:00-14:15A Two-step Approach for Effective Detection of Misbehaving Users in Chats - Notebook for PAN at CLEF 2012
Esaú Villatoro-Tello, Antonio Juárez-González, Hugo Jair Escalante, Manuel Montes-y-Gómez, and Luis Villaseñor-Pineda
14:15-14:30A Learning-Based Approach for the Identification of Sexual Predators in Chat Logs - Notebook for PAN at CLEF 2012
Javier Parapar, David E. Losada, Alvaro Barreiro
14:30-14:45Features for modelling characteristics of conversations - Notebook for PAN at CLEF 2012
Gunnar Eriksson, Jussi Karlgren
14:45-15:00Identifying Sexual Predators by SVM Classification with Lexical and Behavioral Features - Notebook for PAN at CLEF 2012
Colin Morris, Graeme Hirst

The workshop program is an excerpt that pertains to PAN from the CLEF conference program.

View conference program »

closedRegistration
Mar 16, 2012Training data release
May 18, 2012Test data release
Jun 15, 2012Run submission
Jun 22, 2012Abstract submission
Jul 06, 2012Result notification
Jul 06, 2012Review notification
Aug 17, 2012Notebook submission
Sep 17-20, 2012Conference

Registration

To register, fill out and submit this form.

Notebooks

We ask you to submit a notebook paper, and a short abstract for printing. Please prepare the two texts as outlined below. For your convenience, we provide templates for both.

Abstract

Please prepare an abstract to be included in the printed proceedings of the CLEF conference. The abstract should at most 400 words, and be structured as follows:

  • Title, authors, affiliations, e-mail addresses
  • Body of text, which should contain information on:
    • tasks performed
    • main objectives of experiments
    • approach(es) used
    • resources employed
    • results obtained

To prepare your abstract, please use this Microsoft Word template. Please do not use any other document format.

Download templates

Notebook

All participants who submit runs for a task are entitled to publish a notebook describing their approach in the PAN proceedings.

However, before preparing your notebook, we ask you to submit an extended abstract describing your approach first. In your abstract, please describe in detail your approach to solve the task you worked on, describing all the technologies, algorithms, features, and parameters as well as how you adjusted them and how they are combined into a process. The extended abstract must be written in English and it may not exceed 2 pages using the abstract template found below. You may submit one extended abstract for each sub-task you took part in. An introduction or a conclusion may be omitted for brevity. Please submit your notebook by Jun 22, 2012 to our conference managment system.

Your extended abstract will be peer-reviewed for quality. Based upon the review and based upon your performance in the competition, we will decide how long your notebook paper shall be, and what kind of presentation at the conference is expected of you. The length of a notebook may be one of the following (including references):

  • full paper, max. 12 pages
  • short paper, max. 8 pages
  • poster paper, max. 4 pages

The notebook must be written in English and formatted using the notebook template found below.

We encourage you to also submit the source code of your prototype as a ZIP archive alongside your notebook. This will allow for better reproducibility of your results and it will allow those following your approach to better understand how you did it. The source code will be published online alongside your paper. It may be published under any license of your choosing.

We expect that at least one author of each notebook will attend the PAN workshop at the CLEF conference and present their approach.

We have prepared templates for extended abstracts and notebooks:

Download templates

The notebooks are formatted in the LNCS proceedings style. Please do not change the formatting.

Please submit your notebook by Aug 17, 2012.

Submission System

Submission of abstracts and notebooks is to be done electronically through the Easychair system.

Organizing Committee

Martin Potthast

Martin Potthast

Bauhaus-Universität Weimar

Benno Stein

Benno Stein

Bauhaus-Universität Weimar

Tim Gollub

Tim Gollub

Bauhaus-Universität Weimar

Matthias Hagen

Matthias Hagen

Bauhaus-Universität Weimar

Maik Anderka

Maik Anderka

Bauhaus-Universität Weimar

Alberto Barrón-Cedeño

Alberto Barrón-Cedeño

Universitat Politècnica de València

Paolo Rosso

Paolo Rosso

Universitat Politècnica de València

Parth Gupta

Parth Gupta

Universitat Politècnica de València

Efstathios Stamatatos

Efstathios Stamatatos

University of the Aegean

Patrick Juola

Patrick Juola

Duquesne University

Shlomo Argamon

Shlomo Argamon

Illinois Institute of Technology

Moshe Koppel

Moshe Koppel

Bar-Ilan University

Giacomo Inches

Giacomo Inches

University of Lugano

Fabio Crestani

Fabio Crestani

University of Lugano

Program Committee

Eneko Agirre University of the Basque Country
Rafael E. Banchs Institute for Infocomm Research Singapore
Anabela Barreiro Universidade do Porto
Alberto Barrón-Cedeño Universidad Politécnica de Valencia
Steven Burrows Bauhaus-Universität Weimar
Fazli Can Bilkent University
Carole Chaski Institute for Linguistic Evidence
Tommy Chow City University of Hong Kong
Jordi Cicres Bosch Universitat de Girona / Universitat Pompeu Fabra
Paul Clough University of Sheffield
Malcolm Coulthard Aston University
Atsushi Fujita Future University Hakodate
Christian Guetl University of Technology Graz
Graeme Hirst University of Toronto
Alison Johnson University of Leeds
Moshe Koppel Bar-Ilan University
Kim Luyckx University of Antwerp
Antonia Marti Universitat de Barcelona
Sven Meyer zu Eissen Bayer Business Services
Manuel Montes-y-Gómez INAOE-Puebla
George Mikros National and Capodestrian University of Athens
David Pinto Benemerita Universidad Autonoma Puebla
Martin Potthast Bauhaus-Universität Weimar
Mike Reddy Newport Business School
Horacio Rodriguez Universidad Politécnica de Cataluña
Paolo Rosso Universidad Politécnica de Valencia
Thamar Solorio University of Alabama at Birmingham
Efstathios Stamatatos University of the Aegean
Benno Stein Bauhaus-Universität Weimar
Dan Tufis Romanian Academy
M. Teresa Turell Juliá Universitat Pompeu Fabra
Ozlem Uzuner State University of New York
Vasudeva Varma IIIT Hyderabad
Marta Vila Universitat de Barcelona

© pan.webis.de