Shared Tasks

Important Dates

  • January 31, 2024: Training Data Released
  • May 05, 2024: Software submission deadline
  • May 31, 2024: Participant paper submission [submission] [paper template]
  • June 24, 2024: Peer review notification
  • July 08, 2024: Camera-ready participant papers submission
  • September 09-12, 2024: Conference

The time of all deadlines is Midnight CEST.

Keynotes

Julio Gonzalo
Will an A.I. be the Shakespeare of the XXI Century? Experiments and Thoughts on Large Language Models as Creative Text Writers
UNED

One of the most remarkable aspects of Large Language Models is their ease of writing creative texts; under certain conditions, they have been shown to match and improve average human writing skills. But is their writing truly creative, or just a repetition of the clichés they have been pre-trained with? Do they have a distinctive creative writing style? What is the role of (human) prompting in the creation process?

In the talk, I will discuss the creative writing potential of LLMs and their intrinsic limitations, paying special attention to the experiments carried out at UNED. These include a contest between GPT-4 and one of the best contemporary novelists in Spanish, Patricio Pron. The contest is inspired by past AI duels (such as DeepBlue vs Kasparov and AlphaGo vs Lee Sidol), and was designed to test whether LLMs can already challenge a top (rather than an average) fiction writer.

Julio Gonzalo apart from being the sax of Trevithick (see picture) and one of the fastest reviewers of the NLP & IR circuit (aka Julio Speedy Gonzales), he is the director of the UNED Research Center in Natural Language Processing (NLP) and Information Retrieval (IR) in Madrid. Along his career he has worked on topics such as online reputation monitoring, toxicity and misinformation in Social Media, interactive cross-language search, computational creativity and semantic similarity. He has also worked extensively in the design and assessment of evaluation metrics for a wide range of Artificial Intelligence problems, which led to a Google Faculty Research Award (together with Enrique Amigó and Stefano Mizzaro). He has recently been general co-chair of ACM SIGIR 2022 and of IberLEF 2019-2022, the annual evaluation campaign for NLP systems in Spanish and other Iberian languages. He is currently leading ODESIA (odesia.es), a Spanish government initiative to measure the state of the art of language technologies in Spanish.

Read more… Read less…

Program (preliminary)

PAN's program is part of the CLEF 2024 conference program. All times are Central European Summer Time - CEST.

Monday, September 9
11:15-12:45Keynote and Lab Session, Chair: Paolo Rosso
11:15-12:15Keynote: Will an A.I. be the Shakespeare of the XXI Century?
Julio Gonzales
12:15-12:45Overview: Voight-Kampff Generative AI Authorship Verification
Janek Bevendorff, Matti Wiegmann, Efstathios Stamatatos, Benno Stein, and Martin Potthast
15:20-16:30Poster Session
16:30-18:00Lab Session, Chair: tbd.
16:30-17:10Participant Presentations: Authorship Verification
17:10-17:30Overview: Multi-Author Writing Style Analysis
Eva Zangerle, Maximilian Mayerl, Martin Potthast, and Benno Stein
17:30-18:00Participant Presentation Multi-Author Writing Style Analysis
Tuesday, September 10
09:00-10:40CLEF Session: Lab Overviews (ImageCLEF, JokerLab, PAN, iDPP, Touché)
11:15-12:45Lab Session, Chair: tbd.
11:15-11:45Overview: Oppositional Thinking Analysis
Damir Korenčić, Berta Chulvi, Xavier Bonet, Mariona Taulé, Francisco Rangel, and Paolo Rosso
11:45-11:50Best System Award: Oppositional Thinking Analysis
Symanto
11:50-12:45Participant Presentation Oppositional Thinking Analysis
16:30-18:00Lab Session, Chair: tbd.
16:30-17:00Overview: Multilingual Text Detoxification
Daryna Dementieva, Alexander Panchenko
17:00-17:50Participant Presentations Multilingual Text Detoxification
17:50-18:00Closing

Participation Modalities

To participate at PAN, first register for your task of choice at CLEF, then follow the instructions below.

Submission

We use TIRA for all submissions to PAN. Please go to tira.io, create an account, and register for the individual tasks you want to participate in. You need to submit your software or your results via TIRA. You can find all submission guides in TIRA's forum.

Data

You can download PAN's datasets here. For details, please check the individual task's website.

Evaluation and Baseline Code

All code used at PAN is published at GitHub. You can find all validators, evaluators, and baselines there.

Software Submissions

PAN promotes reproducible science with software submissions. Please prepare and submit your software as Docker image(s). You can find guides and examples in the resources linked above. Some tasks allow only software submissions and only release the test data after the conference.

Paper Submission and Presentation

PAN is co-located with CLEF 2024 in Grenoble. Every participant is expected to write a notebook paper describing their approach to CLEF (published at CEUR-WS, which is indexed by DBLP). At the CLEF conference, all submissions will be presented as talks or posters. CLEF will be a hybrid conference.

Organizing Committee