SC is the International Conference for
High Performance Computing, Networking,
Storage and Analysis

SCHEDULE: NOV 12-18, 2011

When viewing the Technical Program schedule, on the far righthand side is a column labeled "PLANNER." Use this planner to build your own schedule. Once you select an event and want to add it to your personal schedule, just click on the calendar icon of your choice (outlook calendar, ical calendar or google calendar) and that event will be stored there. As you select events in this manner, you will have your own schedule to guide you through the week.

You can also create your personal schedule on the SC11 app (Boopsie) on your smartphone. Simply select a session you want to attend and "add" it to your plan. Continue in this manner until you have created your own personal schedule. All your events will appear under "My Event Planner" on your smartphone.

M06: Dealing with MPI Bugs at Scale: Best Practices, Automatic Detection, Debugging, and Formal Verification

SESSION: M06: Dealing with MPI Bugs at Scale: Best Practices, Automatic Detection, Debugging, and Formal Verification

EVENT TYPE: Tutorial

TIME: 8:30AM - 5:00PM

Presenter(s):Matthias S. Mueller, Ganesh Gopalakrishnan, Bronis R. de Supinski, David Lecomber, Tobias Hilbrich


MPI programming is error prone, due to the large number of different MPI-calls, the complex MPI semantics and the general difficulties of parallel programming. Debugging is a cumbersome and time-consuming task that is complicated by the number of parallel processes, non-determinism as well as platform dependent bugs. We start with a summary of common MPI errors and programming techniques to avoid those errors in the first place. Afterwards, tools that automatically detect these errors are introduced. Techniques to apply debuggers for cases where these tools provide no insight are also presented. Finally we present a technique for formal verification. We also focus on how these techniques can work together and complement each other. Workflows and comparisons will guide application developers in applying these tools to real world scenarios. The runtime error detection tool MUST can automatically detect and report deadlocks, a large number of different messaging errors, and resource leaks. For parallel debugging we present DDT, which proved to scale up to 220,000 tasks while offering an intuitive and easy to use user interface. The verification tool ISP achieves guaranteed coverage of non-deterministic matches by enforcing relevant interleavings. After the demonstration of these tools, we will provide hands-on exercises.

Chair/Presenter Details:

Matthias S. Mueller - Technische Universitaet Dresden

Ganesh Gopalakrishnan - University of Utah

Bronis R. de Supinski - Lawrence Livermore National Laboratory

David Lecomber - Allinea Software

Tobias Hilbrich - Technische Universitaet Dresden

Add to iCal  Click here to download .ics calendar file

Add to Outlook  Click here to download .vcs calendar file

Add to Google Calendarss  Click here to add event to your Google Calendar

   Sponsors    ACM    IEEE