When viewing the Technical Program schedule, you will find a column
labeled "PLANNER" on the far right-hand side. Use this planner to build
your own schedule. To add an event to your personal schedule, click the
calendar icon of your choice (Outlook, iCal or Google Calendar) and the
event will be stored in that calendar. As you select events in this
manner, you will build your own schedule to guide you through the week.
You can also create your personal schedule on the SC11 app (Boopsie) on your smartphone. Simply select a session you want to attend and "add" it to your plan. Continue in this manner until you have created your own personal schedule. All your events will appear under "My Event Planner" on your smartphone.
Scalable Fast Multipole Methods on Distributed Heterogeneous Clusters
SESSION: GPU Applications
EVENT TYPE: Paper, Best Student Paper (BSP) Finalist
TIME: 11:00AM - 11:30AM
AUTHOR(S): Qi Hu, Nail A. Gumerov, Ramani Duraiswami
ROOM: TCC 303
ABSTRACT: We reconsider the implementation of the Fast Multipole Method (FMM) on a computing node with a heterogeneous architecture, consisting of multicore CPU(s) and one or more GPUs, as well as on an interconnected cluster of such nodes. The FMM is a divide-and-conquer algorithm that performs an N-body sum using a spatial decomposition. Using the observation that the local-summation and the analysis-based translation parts of the FMM are independent, we map these to the GPUs and CPUs, respectively. Careful analysis of the FMM is performed to distribute work optimally between the CPUs and GPUs. We first develop a single-node version in which the CPU part is parallelized using OpenMP and the GPU part using CUDA. New parallel algorithms for creating FMM data structures are presented, together with load-balancing strategies for the single-node and distributed versions. Our 8-GPU performance is comparable with the 256-GPU results of the 2009 Gordon Bell Prize winner (Hamada et al., 2009).
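For readers unfamiliar with the idea, the following is a minimal sketch of why an N-body sum over a spatial decomposition splits into two independent parts. It is not the authors' implementation and not a full FMM: it assumes a 1-D domain [0, 1), a 1/|x - y| kernel, positive charges, and a single level of equal-width boxes, and it replaces the FMM translation operators with a plain one-term (monopole) approximation per distant box. All function names are hypothetical, and the two Python threads merely stand in for the GPU (near-field) and CPU (far-field) roles described in the abstract.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def box_index(x, num_boxes):
    """Index of the equal-width box on [0, 1) that contains x."""
    return min(int(x * num_boxes), num_boxes - 1)


def direct_sum(points, charges):
    """Reference O(N^2) sum: phi_i = sum over j != i of q_j / |x_i - x_j|."""
    n = len(points)
    return [
        sum(charges[j] / abs(points[i] - points[j]) for j in range(n) if j != i)
        for i in range(n)
    ]


def near_field(points, charges, num_boxes):
    """Exact sum over sources in the target's own box and adjacent boxes.

    Written as a simple scan for clarity; a real code would use box lists.
    """
    boxes = [box_index(x, num_boxes) for x in points]
    n = len(points)
    return [
        sum(
            charges[j] / abs(points[i] - points[j])
            for j in range(n)
            if j != i and abs(boxes[i] - boxes[j]) <= 1
        )
        for i in range(n)
    ]


def far_field(points, charges, num_boxes):
    """Approximate sum over non-adjacent boxes, one monopole per box."""
    total = [0.0] * num_boxes   # total charge in each box
    moment = [0.0] * num_boxes  # charge-weighted position sum in each box
    for x, q in zip(points, charges):
        b = box_index(x, num_boxes)
        total[b] += q
        moment[b] += q * x
    result = []
    for x in points:
        b = box_index(x, num_boxes)
        phi = 0.0
        for k in range(num_boxes):
            if abs(k - b) >= 2 and total[k] > 0.0:
                centroid = moment[k] / total[k]
                phi += total[k] / abs(x - centroid)
        result.append(phi)
    return result


def split_sum(points, charges, num_boxes):
    """Run the two independent parts concurrently and add the results."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        near = pool.submit(near_field, points, charges, num_boxes)
        far = pool.submit(far_field, points, charges, num_boxes)
        return [a + b for a, b in zip(near.result(), far.result())]


if __name__ == "__main__":
    rng = random.Random(0)
    pts = [rng.random() for _ in range(200)]
    qs = [1.0] * len(pts)
    approx = split_sum(pts, qs, num_boxes=8)
    exact = direct_sum(pts, qs)
    worst = max(abs(a - e) / e for a, e in zip(approx, exact))
    print(f"largest relative error: {worst:.3e}")
```

Neither function reads the other's output, which is the property that allows the two parts to be assigned to different processors. The one-term far-field approximation used here is crude; an actual FMM uses hierarchical boxes and higher-order expansions to control the error.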