Shared Tasks

PAN fosters digital text forensics research by organizing shared task evaluations. Shared tasks are computer science events that invite researchers and practitioners to work on a specific problem of interest, the task. You are welcome to take part in any of the tasks shown below.

Register now Next: Data 0 already signed up

Author Identification

Authorship Attribution

Given a document and a set of candidate authors, determine which of them wrote the document.

2011 2012

Authorship Verification

Given a pair of documents, determine whether they are written by the same author.

2013 2014 2015

Author Clustering

Given a set of documents, group them by authorship.

2016

Author Diarization

Given a document, identify and group parts that have been written by the same author.

2016

Author Profiling

Gender Prediction

Given a document, determine its author's gender.

2013 2014 2015 2016

Age Prediction

Given a document, determine its author's age.

2013 2014 2015 2016

Personality Prediction

Given a document, determine its author's personality type.

2015

Author Obfuscation

Author Masking

Given a document and a set of documents from the same author, paraphrase the former so that its author cannot be identified, anymore.

2016

Author Imitation

Given a document and a set of documents from another author, paraphrase the former so that it appears to have been written by the latters' author.

2016

Text Reuse Detection (aka Plagiarism Detection)

Source Retrieval

Given a document and a search engine, retrieve likely candidates from which text might have been reused in the document, while minimizing retrieval costs.

2012 2013 2014 2015

Text Alignment

Given a pair of documents, exracts all pairs of reused passages of maximal length.

2012 2013 2014 2015

Intrinsic Plagiarism Detection

Given a document, determine if parts of it have been written by different authors.

2009 2010 2011

Cross-language Text Reuse Detection

Given a document in one language and a set of documents from another, determine whether parts of the former have been reused by translation from the latter.

External Plagiarism Detection

Given a document and a set of documents, determine whether parts of the former have been reused from the latter.

2009 2010 2011

Source Code Reuse Detection

Given a piece of code and code repository, determine whether parts of the former have been reused from the latter.

Credibility Analysis

Wikipedia Vandalism Detection

Given a an edit on a Wikipedia article, determine whether it is vandalism.

2010 2011

Wikipedia Quality Flaw Prediction

Given a Wikipedia article, predict whether it comprises quality flaws, and if so, which.

2012

Deception Detection

Sexual Predator Identification

Given a chat log between two people, determine whether one pretends to be a child.

2012

Propose a Task

Miss an important task for the digital text forensics? Don't hesitate to contact us and propose the task you have in mind. Organizing a task usually requires a sufficiently large dataset comprising instances of the task's underlying problem, and performance measures.

Send proposal