CODI CRAC 2025 : Joint Workshop on Computational Approaches to Discourse (CODI) and Computational Models of Reference, Anaphora and Coreference (CRAC)

posted by user: grupocole || 1832 views || tracked by 4 users: [display]

CODI CRAC 2025 : Joint Workshop on Computational Approaches to Discourse (CODI) and Computational Models of Reference, Anaphora and Coreference (CRAC)

Link: https://sites.google.com/view/codi-crac2025/

When	Nov 5, 2025 - Nov 9, 2025
Where	Suzhou, China
Submission Deadline	Jul 1, 2025
Notification Due	Jul 1, 2026
Final Version Due	Jun 4, 2025

Categories NLP computational linguistics artificial intelligence

Call For Papers

CODI CRAC 2025 Workshop: joint call for papers

November 5-9 2025 - EMNLP 25 - Suzhou, China

We are pleased to announce that we are organizing in 2025 the first joint CODI-CRAC workshop that will be held during EMNLP! More information on: https://sites.google.com/view/codi-crac2025/

We will host 2 shared tasks, the CRAC and the DISRPT shared tasks.
Aims and scope

The last few years have seen a dramatic improvement in the ability of NLP systems and Large Language Models to understand and produce words, sentences and in some cases longer texts. This development has created a renewed interest in discourse problems as researchers move towards the processing of long-form documents and conversations. There is a surge of activity in discourse pretraining tasks, coherence models, summarization for long texts and conversations, corpora for discourse level reading comprehension and formal parsing, as well as discourse related/aided representation learning, to name a few.

Discourse, roughly the interactions of context, form and meaning above the sentence level, is at the intersection of many areas in Computational Linguistics and NLP, since it is concerned with all levels of linguistic representation, allowing the modeling of textual coherence and inference leveraging long-distance links within documents. It thus brings together researchers working on different areas but facing similar issues with coherence and cohesion, document-level structure, long text and long context.

In 2025, we organize the first joint CODI-CRAC workshop. The CODI workshop has been a forum for a broad range of work at the discourse level. The CRAC workshop has been a primary venue for researchers interested in the computational modeling of reference, anaphora, and coreference. Together, these workshops have catalyzed work to advance research on discourse level problems and have served as a forum for the discussion of suitable datasets and reliable evaluation methods.

This joint edition corresponds to the 6th CODI workshop and the 8th CRAC workshop. It will welcome contributions from all the areas below, including state of the art textual NLU and NLG work using LLMs, as well as classic structured work on automatic discourse analysis -- corresponding to challenging tasks such as coreference resolution or discourse parsing -- to encourage interaction between communities. The workshop is set to host the fourth edition of the DISRPT shared task on Discourse Relation Parsing and Treebanking and the fourth edition of the CRAC shared task on Multilingual Coreference Resolution.

The workshop is planned as a 1 day event which brings together different subcommunities. It will feature invited talks and regular papers. We also accept papers accepted at other major conferences for non-archival presentation, including Findings papers.

Topics of interest

We welcome papers on symbolic and probabilistic approaches, corpus development and analysis, as well as machine and deep learning approaches to discourse. We appreciate theoretical contributions as well as practical applications, including demos of systems and tools. The goal of the workshop is to provide a forum for the community of NLP researchers working on all aspects of discourse.

Topics of interest include, but are not limited to:

discourse structure
discourse connectives
discourse relations
annotation tools and schemes for discourse phenomena
corpora annotated with discourse phenomena
discourse parsing
cross-lingual discourse processing
cross-domain discourse processing
anaphora and coreference resolution
event coreference
argument mining
coherence modeling
discourse and semantics
discourse in applications such as machine translation, summarization, etc.
evaluation methodology for discourse processing
discourse pretraining tasks
long-text modeling and generation

Submissions

We solicit three categories of papers: regular (long and short) workshop papers, demos and extended abstracts. Only regular workshop papers and demos will be included in the proceedings as archival publications.

Double submission of papers is allowed, but this information will need to be disclosed at submission time.

Regular papers must describe original unpublished research. Long papers may consist of up to 8 pages of content, plus unlimited pages for references.

Short papers can be up to 4 pages, plus unlimited pages for references.

Demo submissions may describe systems, tools, visualizations, etc., and may consist of up to 4 pages, plus unlimited pages for references.

Each submission can contain unlimited pages for Appendices but the paper submissions need to remain fully self-contained, as these supplementary materials are completely optional, and reviewers are not even asked to review them.

Extended abstracts can describe work in progress. These may be two pages long (without references). Extended abstracts are non-archival. They will be included in the workshop program and handbook, but will not appear in the workshop proceedings.
Paper accepted or rejected at one of the main conferences

We also invite presentations of paper accepted at another main conference, a specific deadline and submission process will be communicated later on. They will be included in the workshop program and handbook, but will not appear in the workshop proceedings.

We also fast-track ARR papers with reviews, with timeline TBA.
Submission website

All submissions must be anonymous and follow the EMNLP 2025 formatting instructions described here: https://aclrollingreview.org/cfp

Submission website will be announced later.
Tentative schedule

2025-07: CODI papers due
2025-?: Direct submission (papers rejected at a main conference)
2025-09: Notification of acceptance
2025-09: Camera ready deadline for main conference and CODI
2025-11: CODI-CRAC workshop

All deadlines are 11.59 pm UTC -12h ("anywhere on Earth").
Invited Speakers

Tanya Goyal, Cornell University.
Nancy F. Chen, Institute for Infocomm Research, Agency for Science, Technology, and Research, Singapore

Organizers

Chloé Braud, CNRS-IRIT
Christian Hardmeier, IT University of Copenhagen
Chuyuan (Lisa) Li, University of British Columbia
Jessy Li, University of Texas, Austin
Sharid Loáiciga, University of Gothenburg
Vincent Ng, University of Texas at Dallas
Michal Novák, Charles University, Prague
Maciej Ogrodniczuk, Institute of Computer Science, Polish Academy of Sciences
Massimo Poesio, Queen Mary University of London and University of Utrecht
Sameer Pradhan, University of Pennsylvania and cemantix
Michael Strube, Heidelberg Institute for Theoretical Studies
Amir Zeldes
Vincent Ng
Maciej Ogrodniczuk
Michal Novák
Massimo Poesio
Sameer Pradhan

To contact the organizers, please send an email to: codi-crac-workshop@googlegroups.com