posted by organizer: ndebard || 2707 views || tracked by 2 users: [display]

FTXS 2015 : The 5th Fault Tolerance for HPC at eXtreme Scale (FTXS) Workshop

FacebookTwitterLinkedInGoogle

Link: https://sites.google.com/site/ftxsworkshop/home/ftxs-2015
 
When Jun 15, 2015 - Jun 15, 2015
Where Portland, OR
Submission Deadline Feb 9, 2015
Notification Due Mar 9, 2015
Categories    resilience   fault tolerance   HPC   supercomputing
 

Call For Papers

CALL FOR PAPERS
5th Workshop on Fault-Tolerance for HPC at eXtreme Scale (FTXS 2015)

In conjunction with
The 24th International ACM Symposium on
High Performance Distributed Computing (HPDC 2015)
Portland, Oregon, USA on June 15 – 19, 2015

Authors are invited to submit original papers on the research and practice of
fault-tolerance in extreme scale (HPC) computing. Resilience and
fault-tolerance remain a major concern for supercomputing and advances in this
area are needed to allow applications to compute accurate (or within error
tolerance) answers in a timely and efficient manner in the presence of
degradations or failures of platform components (both hardware and software).

Topics include, but are not limited to:
* Failure data analysis and field studies
* Power, performance, resilience (PPR) assessments / tradeoffs
* Novel fault-tolerance techniques and implementations
* Emerging hardware and software technology for resilience
* Silent data corruption (SDC) detection / correction techniques
* Advances in reliability monitoring, analysis, and control of highly
complex systems
* Failure prediction, error preemption, and recovery techniques
* Fault-tolerant programming models
* Models for software and hardware reliability
* Metrics and standards for measuring, improving, and enforcing effective
fault-tolerance
* Scalable Byzantine fault-tolerance and security from single-fault and
fail-silent violations
* Atmospheric evaluations relevant to HPC systems (terrestrial neutrons,
temperature, voltage, etc.)
* Near-threshold-voltage implications and evaluations for reliability
* Benchmarks and experimental environments including fault injection
* Frameworks and APIs for fault-tolerance and fault management

See https://sites.google.com/site/ftxsworkshop/home/ftxs-2015 and
http://www.hpdc.org/2015/ for more information.

AMD will sponsor the FTXS 2015 best paper award! The award will be chosen by
the PC and awarded at the workshop.

PAPER SUBMISSIONS
Submissions are solicited in the following categories:
* Regular papers presenting innovative ideas improving the state of the
art.
* Experience papers discussing the issues seen on existing extreme-scale
systems, including some form of analysis and evaluation.
* Extended abstracts proposing disruptive ideas in the field, including
some form of preliminary results.

Submissions shall be sent electronically, must conform to ACM conference
proceedings style and should not exceed eight (8) pages including all text,
appendices, and figures. Position papers should not exceed six (6) pages.

IMPORTANT DATES
Submission of papers: February 9th, 2015
Author notification: March 9th, 2015
Camera-ready papers: April 2015
Workshop: June 15th, 2015

FTXS 2015 PROGRAM CHAIRS
Nathan DeBardeleben – Los Alamos National Laboratory
Franck Cappello – Argonne National Laboratory and UIUC
Robert Clay – Sandia National Laboratories

PROGRAM COMMITTEE
Leonardo Bautista Gomez – Argonne National Laboratory
Aurélien Bouteiller – University of Tennessee Knoxville
Greg Bronevetsky - Lawrence Livermore National Laboratory
John Daly - Department of Defense
Christian Engelmann – Oak Ridge National Laboratory
Kurt Ferreira – Sandia National Laboratories
Ana Gainaru – University of Illinois at Urbana-Champaign
Qiang Guan – Los Alamos National Laboratory
Saurabh Gupta – Oak Ridge National Laboratory
Saurabh Hukerikar – Information Sciences Institute/USC
Hideyuki Jitsumoto – Tokyo Institute of Technology
Zhiling Lan – Illinois Institute of Technology
Scot Levy – University of New Mexico
Naoya Maruyama – RIKEN AICS
Bogdan Nicolae – IBM Research – Ireland
Thomas Ropars - EPFL
Yves Robert - ENS Lyon
Anthony Skjellum - Auburn University
Vilas Sridharan – AMD, Inc.
Devesh Tiwari – Oak Ridge National Laboratory
Abhinav Vishnu - Pacific Northwest National Laboratory

https://sites.google.com/site/ftxsworkshop/home/ftxs-2015

Related Resources

CFP&CFSP-DFT 2024   DFT 2024 | 37th IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems
OpenSuCo @ ISC HPC 2017   2017 International Workshop on Open Source Supercomputing
SSS 2024   The 26th International Symposium on Stabilization, Safety, and Security of Distributed Systems
ISPDC 2024   23rd International Symposium on Parallel and Distributed Computing
PPAM 2024   15th International Conference on Parallel Processing & Applied Mathematics
IEEE TDMR special issue on DFT 2024   CALL FOR PAPERS for the Special Issue on Defect and Fault Tolerance in VLSI and Nanotechnology Systems In the IEEE Transactions on Device and Materials Reliability
AGCA 2024   Workshop on Advancements of Global Challenges Applications
ITADATA 2024   The 3rd Italian Conference on Big Data and Data Science
RISC-V PPAM 2024   First PPAM Workshop on RISC-V
EDCC 2024   19th European Dependable Computing Conference