When viewing the Technical Program schedule, on the far righthand side
is a column labeled "PLANNER." Use this planner to build your own
schedule. Once you select an event and want to add it to your personal
schedule, just click on the calendar icon of your choice (outlook
calendar, ical calendar or google calendar) and that event will be
stored there. As you select events in this manner, you will have your
own schedule to guide you through the week.
You can also create your personal schedule on the SC11 app (Boopsie) on your smartphone. Simply select a session you want to attend and "add" it to your plan. Continue in this manner until you have created your own personal schedule. All your events will appear under "My Event Planner" on your smartphone.
M12: Scalable Heterogeneous Computing on GPU Clusters
SESSION: M12: Scalable Heterogeneous Computing on GPU Clusters
EVENT TYPE: Tutorial
TIME: 8:30AM - 5:00PM
Presenter(s):Jeffrey Vetter, Kyle Spafford, Philip Roth, Allen Malony
ABSTRACT: This tutorial, suitable for attendees with an intermediate-level in Parallel programing in MPI, and GPU programming in CUDA or OpenCL, will provide a comprehensive overview on the optimization techniques to port, analyze, and accelerate applications on scalable heterogeneous computing systems. We will focus on methods, tools, and techniques to migrate existing applications to large scale GPU clusters using MPI and OpenCL/CUDA. First, we will review our methodology for successfully identifying and selecting portions of applications to accelerate with a GPU, motivated with several application case studies. Second, we will present an overview of several performance and correctness tools, which provide performance measurement, profiling, and tracing information about applications running on these systems. Third, we will present a set of best practices for optimizing these applications: GPU and NUDA optimization techniques, optimizing interactions between MPI and GPU programming models. A hands-on session will be conducted on the NSF Keeneland Initial Delivery System, after each part to give participants the opportunity to investigate techniques and performance optimizations on such a system. Existing tutorial codes and benchmark suites will be provided to facilitate individual discovery. Additionally, participants may bring and work on their own applications.
Jeffrey Vetter - Oak Ridge National Laboratory and Georgia Tech