Eval4SD 2026 : First Workshop on Evaluating LLMs for Specialized Domains

When	Sep 14, 2026 - Sep 17, 2026
Where	Hamburg, Germany
Submission Deadline	Jul 3, 2026
Notification Due	Jul 31, 2026
Final Version Due	Aug 15, 2026

Categories NLP artificial intelligence computational linguistics

Call For Papers

Dear colleagues,
We invite submissions to the First Workshop on Evaluating LLMs for Specialized Domains (Eval4SD), co-located with KONVENS 2026 in Hamburg, Germany (September 14-17, 2026).

The workshop focuses on the evaluation of large language models in specialized domains such as (but not limited to) law, medicine, science, finance, digital humanities, social sciences, education, and politics. In this space, we have identified three core areas, detailed below: LLM Benchmarking, Domain Research Replication, and Metrics and Evaluation Methodology. Work that fits the general theme but none of the focus areas is also welcome!

- **LLM Benchmarking:** We invite contributions that evaluate multiple models, datasets, inference methods, or prompting techniques on existing data, or that introduce novel, specialized benchmarking datasets. Papers in this direction may seek to answer questions like: ‘Which model should I use for my social science project?’, ‘Are open-weight models inferior for specialized tasks?’, or ‘Given a limited budget, what is my best choice of LLM for my digital humanities question?’ We especially encourage submissions that evaluate performance in low- and medium-resource languages.
- **Domain Research Replication:** Does information automatically extracted using a different model or a slightly altered approach still support the same domain conclusions? We invite submissions that attempt to replicate existing domain research using a modified LLM setup. For us, testing open-weight models is especially important for replicability. We are excited to see how robust domain research is to changes in the automation setup, from prompting to model weights and training data.
- **Metrics and Evaluation Methodology:** We invite submissions on methodology for assessing LLM outputs in complex tasks. This includes work on LLM judge setups or novel rule-based metrics for specialized tasks.

We allow submissions in two categories:
- **Long Papers (up to 8 pages + references):** Complete research contributions with novel findings, experimental results, and thorough analysis. Suitable for mature work on LLM evaluation methodology or new benchmark proposals.
- **Short & Position Papers (up to 4 pages + references):** Preliminary results, position papers, system descriptions, and focused contributions. Great for provocative arguments or narrowly scoped empirical studies.

Submissions follow the ACL template; reviews are double-blind and are conducted via OpenReview.
Additionally, we welcome non-archival submissions that present recently published work or seek feedback on work in progress without violating dual-submission policies. Accepted non-archival papers will be presented at the workshop but will not be included in the official proceedings.

Important dates:
- Submission deadline: July 03, 2026 (23:59 CEST)
- Notification of acceptance: July 31, 2026
- Camera-ready deadline: August 15, 2026
- Workshop date: co-located with KONVENS (September 14-17, 2026), exact day TBA

Website:
Contact: eval4sd-organizers@googlegroups.com
