PAN fosters digital text forensics research by organizing shared task evaluations. Shared tasks are computer science events that invite researchers and practitioners to work on a specific problem of interest, the task. You are welcome to take part in any of the tasks shown below.
Given a document and a search engine, retrieve likely candidates from which text might have been reused in the document, while minimizing retrieval costs.
Given a pair of documents, exracts all pairs of reused passages of maximal length.
Given a document in one language and a set of documents from another, determine whether parts of the former have been reused by translation from the latter.
Given a document and a set of documents, determine whether parts of the former have been reused from the latter.
Given a piece of code and code repository, determine whether parts of the former have been reused from the latter.
Given a news article, predict whether it is hyperpartisan or not.
Given an edit on a Wikipedia article, determine whether it is vandalism.
Given a Wikipedia article, predict whether it comprises quality flaws, and if so, which.
Miss an important task for the digital text forensics? Don't hesitate to contact us and propose the task you have in mind. Organizing a task usually requires a sufficiently large dataset comprising instances of the task's underlying problem, and performance measures.