|   | 
| 
 | |||||||||||||||
| CODI CRAC 2025 : Joint Workshop on Computational Approaches to Discourse (CODI) and Computational Models of Reference, Anaphora and Coreference (CRAC) | |||||||||||||||
| Link: https://sites.google.com/view/codi-crac2025/ | |||||||||||||||
| 
 | |||||||||||||||
| Call For Papers | |||||||||||||||
| 
CODI CRAC 2025 Workshop: joint call for papers
 November 5-9 2025 - EMNLP 25 - Suzhou, China We are pleased to announce that we are organizing in 2025 the first joint CODI-CRAC workshop that will be held during EMNLP! More information on: https://sites.google.com/view/codi-crac2025/ We will host 2 shared tasks, the CRAC and the DISRPT shared tasks. Aims and scope The last few years have seen a dramatic improvement in the ability of NLP systems and Large Language Models to understand and produce words, sentences and in some cases longer texts. This development has created a renewed interest in discourse problems as researchers move towards the processing of long-form documents and conversations. There is a surge of activity in discourse pretraining tasks, coherence models, summarization for long texts and conversations, corpora for discourse level reading comprehension and formal parsing, as well as discourse related/aided representation learning, to name a few. Discourse, roughly the interactions of context, form and meaning above the sentence level, is at the intersection of many areas in Computational Linguistics and NLP, since it is concerned with all levels of linguistic representation, allowing the modeling of textual coherence and inference leveraging long-distance links within documents. It thus brings together researchers working on different areas but facing similar issues with coherence and cohesion, document-level structure, long text and long context. In 2025, we organize the first joint CODI-CRAC workshop. The CODI workshop has been a forum for a broad range of work at the discourse level. The CRAC workshop has been a primary venue for researchers interested in the computational modeling of reference, anaphora, and coreference. Together, these workshops have catalyzed work to advance research on discourse level problems and have served as a forum for the discussion of suitable datasets and reliable evaluation methods. This joint edition corresponds to the 6th CODI workshop and the 8th CRAC workshop. It will welcome contributions from all the areas below, including state of the art textual NLU and NLG work using LLMs, as well as classic structured work on automatic discourse analysis -- corresponding to challenging tasks such as coreference resolution or discourse parsing -- to encourage interaction between communities. The workshop is set to host the fourth edition of the DISRPT shared task on Discourse Relation Parsing and Treebanking and the fourth edition of the CRAC shared task on Multilingual Coreference Resolution. The workshop is planned as a 1 day event which brings together different subcommunities. It will feature invited talks and regular papers. We also accept papers accepted at other major conferences for non-archival presentation, including Findings papers. Topics of interest We welcome papers on symbolic and probabilistic approaches, corpus development and analysis, as well as machine and deep learning approaches to discourse. We appreciate theoretical contributions as well as practical applications, including demos of systems and tools. The goal of the workshop is to provide a forum for the community of NLP researchers working on all aspects of discourse. Topics of interest include, but are not limited to: discourse structure discourse connectives discourse relations annotation tools and schemes for discourse phenomena corpora annotated with discourse phenomena discourse parsing cross-lingual discourse processing cross-domain discourse processing anaphora and coreference resolution event coreference argument mining coherence modeling discourse and semantics discourse in applications such as machine translation, summarization, etc. evaluation methodology for discourse processing discourse pretraining tasks long-text modeling and generation Submissions We solicit three categories of papers: regular (long and short) workshop papers, demos and extended abstracts. Only regular workshop papers and demos will be included in the proceedings as archival publications. Double submission of papers is allowed, but this information will need to be disclosed at submission time. Regular papers must describe original unpublished research. Long papers may consist of up to 8 pages of content, plus unlimited pages for references. Short papers can be up to 4 pages, plus unlimited pages for references. Demo submissions may describe systems, tools, visualizations, etc., and may consist of up to 4 pages, plus unlimited pages for references. Each submission can contain unlimited pages for Appendices but the paper submissions need to remain fully self-contained, as these supplementary materials are completely optional, and reviewers are not even asked to review them. Extended abstracts can describe work in progress. These may be two pages long (without references). Extended abstracts are non-archival. They will be included in the workshop program and handbook, but will not appear in the workshop proceedings. Paper accepted or rejected at one of the main conferences We also invite presentations of paper accepted at another main conference, a specific deadline and submission process will be communicated later on. They will be included in the workshop program and handbook, but will not appear in the workshop proceedings. We also fast-track ARR papers with reviews, with timeline TBA. Submission website All submissions must be anonymous and follow the EMNLP 2025 formatting instructions described here: https://aclrollingreview.org/cfp Submission website will be announced later. Tentative schedule 2025-07: CODI papers due 2025-?: Direct submission (papers rejected at a main conference) 2025-09: Notification of acceptance 2025-09: Camera ready deadline for main conference and CODI 2025-11: CODI-CRAC workshop All deadlines are 11.59 pm UTC -12h ("anywhere on Earth"). Invited Speakers Tanya Goyal, Cornell University. Nancy F. Chen, Institute for Infocomm Research, Agency for Science, Technology, and Research, Singapore Organizers Chloé Braud, CNRS-IRIT Christian Hardmeier, IT University of Copenhagen Chuyuan (Lisa) Li, University of British Columbia Jessy Li, University of Texas, Austin Sharid Loáiciga, University of Gothenburg Vincent Ng, University of Texas at Dallas Michal Novák, Charles University, Prague Maciej Ogrodniczuk, Institute of Computer Science, Polish Academy of Sciences Massimo Poesio, Queen Mary University of London and University of Utrecht Sameer Pradhan, University of Pennsylvania and cemantix Michael Strube, Heidelberg Institute for Theoretical Studies Amir Zeldes Vincent Ng Maciej Ogrodniczuk Michal Novák Massimo Poesio Sameer Pradhan To contact the organizers, please send an email to: codi-crac-workshop@googlegroups.com | 
| 
 |