RepSys 2013 : Reproducibility and Replication in Recommender Systems Evaluation

posted by organizer: abellogin || 4739 views || tracked by 7 users: [display]

RepSys 2013 : Reproducibility and Replication in Recommender Systems Evaluation

When	Oct 12, 2013 - Oct 16, 2013
Where	Hong Kong, China
Submission Deadline	Jul 22, 2013
Notification Due	Aug 16, 2013
Final Version Due	Aug 30, 2013

Categories recommender systems evaluation reproducibility replication

Call For Papers

Goals

This workshop aims to gather researchers and practitioners interested in defining clear guidelines for their experimental needs to allow fair comparisons to related work. The workshop will provide an informal setting for exchanging and discussing ideas, sharing experiences and viewpoints. We seek to identify and better understand the current gaps in the implementation of recommender system evaluation methodologies, help lay directions for progress in addressing them, and foster the consolidation and convergence of experimental methods and practice. As a particular focus of interest, the workshop aims to discover which are the main challenges related to reproduction and replication of prior research, along with an exploration of possible directions to overcome these limitations.

Specific questions that the workshop aims to address include the following:

* How important is the reproducibility and replication of experiments for the community?
* What are the challenges for replication of evaluation in the RS field? How could we facilitate easier and more accurate comparison with prior work?
* How can methods and metrics be more clearly and/or formally defined within specific tasks and contexts for which a recommender application is deployed?
* What parts -if any- of an online experiment could be reproducible (and how)?
* How should the academic evaluation methodologies be described to improve their relevance, usefulness, and replicability for industrial settings?
* What type of public resources (data sets, benchmarks) should be available, and how can they be built? Is it possible to have a generic framework for the evaluation (and replication) of recommender systems?
* To what extent is it possible to reuse experimental methodologies across domains and/or businesses?
* How do we envision the evaluation of recommender systems in the future and how does this affect the replicability of said systems?

Scope and topics

Papers explicitly dealing with replication of previously published experimental conditions/algorithms/metrics and the resulting analysis are encouraged. In particular, we seek discussions on the difficulties the authors may find in this process, along with their limitations or successes on reproducing the original results.

Within the broader scope of recommender system evaluation, the presented papers and discussions to be held at the workshop will address –though need not be limited to– the following topics:

* Limitations and challenges of experimental reproducibility and replication
* Reproducible experimental design
* Replicability of algorithms
* Standardization of metrics: definition and computation protocols
* Evaluation software: frameworks, utilities, services
* Reproducibility in user-centric studies
* Datasets and benchmarks
* Recommender software reuse
* Replication of already published work
* Reproducibility within and across domains and organizations
* Reproduction and replication guidelines

Submissions

We invite the submission of papers reporting original research, studies, advances, or experiences in this area. Two submission types are accepted: long papers of up to 8 pages, and short papers up to 4 pages, in the standard ACM SIG proceedings format. Paper submissions and reviews will be handled electronically.

Each paper will be evaluated by at least three reviewers from the Program Committee. The papers will be evaluated for their originality, contribution significance, soundness, clarity, and overall quality. The interest of contributions will be assessed in terms of technical and scientific findings, contribution to the knowledge and understanding of the problem, methodological advancements, or applicative value. Besides, the papers will be evaluated based on their reproducibility in the context of a standard recommender implementation, such as open source frameworks (e.g., LensKit, MyMediaLite, Mahout) and industry products (e.g., Gravity, Mendeley, Plista, Telefonica).

Related Resources

WIDRS 2025 Workshop on Intelligent Decision and Recommender Systems

DSA 2025 The 12th International Conference on Dependability Systems and Their Applications

Bench 2025 The 17th BenchCouncil International Symposium on Evaluation Science and Engineering

ACM RecSys 2025 19th ACM Conference on Recommender Systems

DaQuaMRec 2025 The 1st International Workshop on Data Quality-Aware Multimodal Recommendation

IEEE-DSIS 2025 2025 International Conference on Data Science and Intelligent Systems (DSIS 2025)

ACM REP 2025 3rd ACM Conference on Reproducibility and Replicability (2025)

IEEE ICCCAS 2026 2026 IEEE the 15th International Conference on Communications, Circuits, and Systems (ICCCAS 2026)

IIR 2025 15th Italian Information Retrieval Workshop

ICCCAS 2026 2026 IEEE the 15th International Conference on Communications, Circuits, and Systems (ICCCAS 2026)