posted by organizer: abellogin || 3906 views || tracked by 7 users: [display]

RepSys 2013 : Reproducibility and Replication in Recommender Systems Evaluation

FacebookTwitterLinkedInGoogle

Link: http://repsys.project.cwi.nl
 
When Oct 12, 2013 - Oct 16, 2013
Where Hong Kong, China
Submission Deadline Jul 22, 2013
Notification Due Aug 16, 2013
Final Version Due Aug 30, 2013
Categories    recommender systems   evaluation   reproducibility   replication
 

Call For Papers

Goals

This workshop aims to gather researchers and practitioners interested in defining clear guidelines for their experimental needs to allow fair comparisons to related work. The workshop will provide an informal setting for exchanging and discussing ideas, sharing experiences and viewpoints. We seek to identify and better understand the current gaps in the implementation of recommender system evaluation methodologies, help lay directions for progress in addressing them, and foster the consolidation and convergence of experimental methods and practice. As a particular focus of interest, the workshop aims to discover which are the main challenges related to reproduction and replication of prior research, along with an exploration of possible directions to overcome these limitations.

Specific questions that the workshop aims to address include the following:

* How important is the reproducibility and replication of experiments for the community?
* What are the challenges for replication of evaluation in the RS field? How could we facilitate easier and more accurate comparison with prior work?
* How can methods and metrics be more clearly and/or formally defined within specific tasks and contexts for which a recommender application is deployed?
* What parts -if any- of an online experiment could be reproducible (and how)?
* How should the academic evaluation methodologies be described to improve their relevance, usefulness, and replicability for industrial settings?
* What type of public resources (data sets, benchmarks) should be available, and how can they be built? Is it possible to have a generic framework for the evaluation (and replication) of recommender systems?
* To what extent is it possible to reuse experimental methodologies across domains and/or businesses?
* How do we envision the evaluation of recommender systems in the future and how does this affect the replicability of said systems?

Scope and topics

Papers explicitly dealing with replication of previously published experimental conditions/algorithms/metrics and the resulting analysis are encouraged. In particular, we seek discussions on the difficulties the authors may find in this process, along with their limitations or successes on reproducing the original results.

Within the broader scope of recommender system evaluation, the presented papers and discussions to be held at the workshop will address –though need not be limited to– the following topics:

* Limitations and challenges of experimental reproducibility and replication
* Reproducible experimental design
* Replicability of algorithms
* Standardization of metrics: definition and computation protocols
* Evaluation software: frameworks, utilities, services
* Reproducibility in user-centric studies
* Datasets and benchmarks
* Recommender software reuse
* Replication of already published work
* Reproducibility within and across domains and organizations
* Reproduction and replication guidelines

Submissions

We invite the submission of papers reporting original research, studies, advances, or experiences in this area. Two submission types are accepted: long papers of up to 8 pages, and short papers up to 4 pages, in the standard ACM SIG proceedings format. Paper submissions and reviews will be handled electronically.

Each paper will be evaluated by at least three reviewers from the Program Committee. The papers will be evaluated for their originality, contribution significance, soundness, clarity, and overall quality. The interest of contributions will be assessed in terms of technical and scientific findings, contribution to the knowledge and understanding of the problem, methodological advancements, or applicative value. Besides, the papers will be evaluated based on their reproducibility in the context of a standard recommender implementation, such as open source frameworks (e.g., LensKit, MyMediaLite, Mahout) and industry products (e.g., Gravity, Mendeley, Plista, Telefonica).

Related Resources

HiPEAC SC 2024   HiPEAC Reproducibility Student Challenge
RSsCI 2024   FLINS 2024 Special Session on Recommender systems supported by computational intelligence: emerging topics and applications
Learning 2024   Thirty-First International Conference on Learning
BIAS 2024   International Workshop on Algorithmic Bias in Search and Recommendation
PCDS 2024   The 1st International Symposium on Parallel Computing and Distributed Systems
LREC-COLING 2024   The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation
ACIIDS 2024   16th Asian Conference on Intelligent Information and Database Systems
ASPLOS 2025   The ACM International Conference on Architectural Support for Programming Languages and Operating Systems
EASE 2024   28th International Conference on Evaluation and Assessment in Software Engineering
DDECS 2024   27th International Symposium on Design and Diagnostics of Electronic Circuits and Systems