When viewing the Technical Program schedule, on the far righthand side
is a column labeled "PLANNER." Use this planner to build your own
schedule. Once you select an event and want to add it to your personal
schedule, just click on the calendar icon of your choice (outlook
calendar, ical calendar or google calendar) and that event will be
stored there. As you select events in this manner, you will have your
own schedule to guide you through the week.
You can also create your personal schedule on the SC11 app (Boopsie) on your smartphone. Simply select a session you want to attend and "add" it to your plan. Continue in this manner until you have created your own personal schedule. All your events will appear under "My Event Planner" on your smartphone.
System Implications of Memory Reliability in Exascale Computing
SESSION: Reliability
EVENT TYPE: Paper
TIME: 2:30PM - 3:00PM
AUTHOR(S):Sheng Li, Ke Chen, Ming-Yu Hsieh, Naveen Muralimanohar, Chad Kersey, Jay B. Brockman, Arun F. Rodrigues, Norman P. Jouppi
ROOM:TCC 304
ABSTRACT: Resiliency will be one of the toughest challenges in future exascale systems. Memory errors contribute more than 40% of the total hardware-related failures and are projected to increase in future exascale systems. Error correction code (ECC) and checkpointing are two effective approaches to fault tolerance. While there are numerous studies on ECC or checkpointing in isolation, this is the first paper to investigate the combined effect of both on overall system performance and power. We study the impact of various ECC schemes (SECDEC, BCH, and chipkill) in conjunction with checkpointing on future exascale systems. Our simulation results show that while chipkill is better for computation-intensive applications, BCH has advantage in system energy-delay product (EDP) for memory-intensive applications. We also propose to use BCH in tagged memory systems with commodity DRAMs where chipkill is impractical. The proposed architecture achieves 2X better system EDP and reliability than conventional tagged memory systems.