Papers
The SC11 Technical Papers program received 352 high-quality submissions covering a variety of advanced research topics in HPC, spanning six areas: Applications, Architecture/Networks, Clouds and Grids, Performance, Storage, and Systems Software. After a rigorous peer review process in which every paper received at least three (and most received four) careful reviews, a two-day face-to-face committee meeting was held June 6-7 in Seattle. The meeting was attended by more than 100 technical paper committee members, who discussed every paper and finalized the selections. At the conclusion of the meeting, 74 papers were accepted for presentation. They cover some of today's hottest topics, such as exploiting multicore systems and GPUs for large-scale computing, scalable systems and storage, and programming models and fault tolerance for exascale computing. The result is one of the most exciting Technical Papers programs in the history of SC. With an acceptance rate of 21 percent, SC11 is one of the most competitive technical conferences in high-performance computing.
Among these excellent contributions, one outstanding paper will be presented at the conference as a best paper candidate and four papers as best student paper finalists.
The Best Student Paper and Best Paper awards were announced at the conference awards ceremony on Thursday, Nov. 17. The Best Student Paper Award winner is "Simplified Parallel Domain Traversal" by Wesley Kendall, Jingyuan Wang, Melissa Allen, Tom Peterka, Jian Huang, and David Erickson. The Best Paper Award winner is "Parallel Random Numbers: As Easy as 1, 2, 3" by John K. Salmon, Mark A. Moraes, Ron O. Dror, and David E. Shaw.
Questions: papers@info.supercomputing.org
View the SC11 conference schedule.
Tuesday, November 15th
TIME | PRESENTATION | SPEAKER | LOCATION
10:30AM - 11:00AM | Optimizing Symmetric Dense Matrix-Vector Multiplication on GPUs | Rajib Nath, Stanimire Tomov, Tingxing Dong, Jack Dongarra | TCC 305
10:30AM - 11:00AM | Liszt: A Domain Specific Language for Building Portable Mesh-based PDE Solvers | Zachary DeVito, Niels Joubert, Francisco Palacios, Stephen Oakley, Montserrat Medina, Mike Barrientos, Erich Elsen, Frank Ham, Alex Aiken, Karthik Duraisamy, Eric Darve, Juan Alonso, Pat Hanrahan | TCC 304
10:30AM - 11:00AM | CudaDMA: Optimizing GPU Memory Bandwidth via Warp Specialization | Michael Bauer, Henry Cook, Brucek Khailany | TCC 303
11:00AM - 11:30AM | Tiled QR factorization algorithms | Henricus M. Bouwmeester, Mathias Jacquelin, Julien Langou, Yves Robert | TCC 305
11:00AM - 11:30AM | Simplified Parallel Domain Traversal | Wesley Kendall, Jingyuan Wang, Melissa Allen, Tom Peterka, Jian Huang, David Erickson | TCC 304
11:00AM - 11:30AM | Dymaxion: Optimizing Memory Access Patterns for Heterogeneous Systems | Shuai Che, Jeremy Sheaffer, Kevin Skadron | TCC 303
11:30AM - 12:00PM | Parallel Reduction to Condensed Forms for Symmetric Eigenvalue Problems using Aggregated Fine-Grained and Memory-Aware Kernels | Azzam Haidar, Hatem Ltaief, Jack Dongarra | TCC 305
11:30AM - 12:00PM | Physis: An Implicitly Parallel Programming Model for Stencil Computations on Large-Scale GPU-Accelerated Supercomputers | Naoya Maruyama, Tatsuo Nomura, Kento Sato, Satoshi Matsuoka | TCC 304
11:30AM - 12:00PM | GROPHECY: GPU Performance Projection from CPU Code Skeletons | Jiayuan Meng, Vitali Morozov, Kalyan Kumaran, Venkatram Vishwanath, Thomas Uram | TCC 303
1:30PM - 2:00PM | Server-Side I/O Coordination for Parallel File Systems | Huaiming Song, Yanlong Yin, Xian-He Sun, Rajeev Thakur, Samuel Lang | TCC 305
1:30PM - 2:00PM | GreenSlot: Scheduling Energy Consumption in Green Datacenters | Íñigo Goiri, Kien Le, Md. E. Haque, Ryan Beauchea, Thu D. Nguyen, Jordi Guitart, Jordi Torres, Ricardo Bianchini | TCC 304
1:30PM - 2:15PM | Multithreaded Global Address Space Communication Techniques for Gyrokinetic Fusion Applications on Ultra-Scale Platforms | Robert Preissl, Nathan Wichmann, Bill Long, John Shalf, Stephane Ethier, Alice Koniges | TCC 303
2:00PM - 2:30PM | QoS Support for End Users of I/O-intensive Applications using Shared Storage Systems | Xuechen Zhang, Kei Davis, Song Jiang | TCC 305
2:00PM - 2:30PM | A 'Cool' Load Balancer for Parallel Applications | Osman Sarood, Laxmikant Kale | TCC 304
2:15PM - 3:00PM | Parallel Random Numbers: As Easy as 1, 2, 3 | John K. Salmon, Mark A. Moraes, Ron O. Dror, David E. Shaw | TCC 303
2:30PM - 3:00PM | Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems | Venkatram Vishwanath, Mark Hereld, Vitali Morozov, Michael E. Papka | TCC 305
2:30PM - 3:00PM | Reducing Electricity Cost Through Virtual Machine Placement in High Performance Computing Clouds | Kien Le, Jingru Zhang, Jiandong Meng, Yogesh Jaluria, Thu Nguyen, Ricardo Bianchini | TCC 304
3:30PM - 4:00PM | Gyrokinetic Toroidal Simulations on Leading Multi- and Manycore HPC Systems | Kamesh Madduri, Khaled Z. Ibrahim, Samuel Williams, Eun-Jin Im, Stephane Ethier, John Shalf, Leonid Oliker | TCC 304
3:30PM - 4:00PM | The IBM Blue Gene/Q Interconnection Network and Message Unit | Dong Chen, Noel A. Eisley, Philip Heidelberger, Robert M. Senger, Burkhard Steinmacher-Burow, Yutaka Sugawara, Sameer Kumar, Jeffrey J. Parker, Valentina Salapura, David L. Satterfield | TCC 303
3:30PM - 4:00PM | I/O Streaming Evaluation of Batch Queries for Data-Intensive Computational Turbulence | Kalin Kanov, Eric Perlman, Randal Burns, Yanif Ahmad, Alexander Szalay | TCC 305
4:00PM - 4:30PM | Unitary Qubit Lattice Simulations of Multiscale Phenomena in Quantum Turbulence | George Vahala, Min Soe, Bo Zhang, Jeffrey Yepez, Linda Vahala, Jonathan Carter, Sean Ziegeler | TCC 304
4:00PM - 4:30PM | High-Efficiency Server Design | Eitan Frachtenberg, Ali Heydari, Harry Li, Amir Michael, Jacob Na, Avery Nisbet, Pierluigi Sarti | TCC 303
4:00PM - 4:30PM | Parallel Index and Query for Large Scale Data Analysis | Jerry Chou, Kesheng Wu, Mark Howison, Mr. Prabhat, Oliver Ruebel, Brian Austin, E. Wes Bethel, Ji Qiang, Robert D. Ryne, Arie Shoshani | TCC 305
4:30PM - 5:00PM | An Image Compositing Solution at Scale | Kenneth Moreland, Wesley Kendall, Tom Peterka, Jian Huang | TCC 304
4:30PM - 5:00PM | Using the TOP500 to Trace and Project Technology and Architecture Trends | Peter Michael Kogge, Timothy J. Dysart | TCC 303
4:30PM - 5:00PM | ISABELA-QA: Query-driven Data Analytics over ISABELA-compressed Extreme-Scale Scientific Data | Sriram Lakshminarasimhan, Jonathan Jenkins, Robert Latham, Robert Ross, Nagiza F. Samatova, Isha Arkatkar, Zhenhuan Gong, Hemanth Kolla, Jackie Chen, Seung-Hoe Ku, C.S. Chang, Stephane Ethier, Scott Klasky | TCC 305
Wednesday, November 16th
TIME | PRESENTATION | SPEAKER | LOCATION
10:30AM - 11:00AM | FTI: high performance Fault Tolerance Interface for hybrid systems | Leonardo Arturo Bautista Gomez, Dimitri Komatitsch, Naoya Maruyama, Seiji Tsuboi, Franck Cappello, Satoshi Matsuoka, Takeshi Nakamura | TCC 304
10:30AM - 11:00AM | Fast Implementation of DGEMM on Fermi GPU | Guangming Tan, Linchuan Li, Sean Triechler, Everett Phillips, Yungang Bao, Ninghui Sun | TCC 303
10:30AM - 11:00AM | Virtual I/O Caching: Effective Storage Cache Management for Concurrent Workloads | Michael R. Frasca, Ramya Prabhakar, Padma Raghavan, Mahmut Kandemir | TCC 305
11:00AM - 11:30AM | Checkpointing strategies for parallel jobs | Marin Bougeret, Henri Casanova, Mikael Rabie, Yves Robert, Frédéric Vivien | TCC 304
11:00AM - 11:30AM | Scalable Fast Multipole Methods on Distributed Heterogeneous Clusters | Qi Hu, Nail A. Gumerov, Ramani Duraiswami | TCC 303
11:00AM - 11:30AM | SCMFS: A File System for Storage Class Memory | Xiaojian Wu, Narasimha Reddy | TCC 305
11:30AM - 12:00PM | BlobCR: Efficient Checkpoint-Restart for HPC Applications on IaaS Clouds using Virtual Disk Image Snapshots | Bogdan Nicolae, Franck Cappello | TCC 304
11:30AM - 12:00PM | Multi-Science Applications with Single Codebase - GAMER - for Massively Parallel Architectures | Hemant Shukla, Hsi-Yu Schive, Tak-Pong Woo, Tzihong Chiueh | TCC 303
11:30AM - 12:00PM | Optimized Pre-Copy Live Migration for Memory Intensive Applications | Khaled Z. Ibrahim, Costin Iancu, Steven Hofmeyr, Eric Roman | TCC 305
1:30PM - 2:00PM | Scalable Hashing for Shared Memory Supercomputers | Eric L. Goodman, M. Nicole Lemaster, Edward Jimenez | TCC 303
1:30PM - 2:00PM | Evaluating the Viability of Process Replication Reliability for Exascale Systems | Kurt B. Ferreira, Rolf Riesen, Patrick Bridges, Dorian Arnold, Jon Stearley, James H. Laros, Ron A. Oldfield, Kevin Pedretti, Ron Brightwell | TCC 304
1:30PM - 2:00PM | TRACON: Interference-Aware Scheduling for Data-Intensive Applications in Virtualized Environments | Ron C. Chiang, H. Howie Huang | TCC 305
2:00PM - 2:30PM | An Early Performance Analysis of POWER7-IH HPC Systems | Kevin Barker, Adolfy Hoisie, Darren Kerbyson | TCC 303
2:00PM - 2:30PM | Modeling and Tolerating Heterogeneous Failures in Large Parallel Systems | Eric M. Heien, Derrick Kondo, Ana Gainaru, Dan LaPine, Bill Kramer, Franck Cappello | TCC 304
2:00PM - 2:30PM | Flexible Resource Allocation for Reliable Virtual Cluster Computing Systems | Thomas Hacker, Kanak Mahadik | TCC 305
2:30PM - 3:00PM | A Similarity Measure for Time, Frequency, and Dependencies in Large-Scale Workloads | Mario Lassnig, Thomas Fahringer, Vincent Garonne, Angelos Molfetas, Martin Barisits | TCC 303
2:30PM - 3:00PM | System Implications of Memory Reliability in Exascale Computing | Sheng Li, Ke Chen, Ming-Yu Hsieh, Naveen Muralimanohar, Chad Kersey, Jay B. Brockman, Arun F. Rodrigues, Norman P. Jouppi | TCC 304
2:30PM - 3:00PM | Auto-Scaling to Minimize Cost and Meet Application Deadlines in Cloud Workflows | Ming Mao, Marty Humphrey | TCC 305
3:30PM - 4:00PM | Large Scale Debugging of Parallel Tasks with AutomaDeD | Ignacio Laguna, Todd Gamblin, Bronis R. de Supinski, Saurabh Bagchi, Greg Bronevetsky, Dong H. Ahn, Martin Schulz, Barry Rountree | TCC 304
3:30PM - 4:00PM | Sniper: Exploring the Level of Abstraction for Scalable and Accurate Parallel Multi-Core Simulation | Trevor E. Carlson, Wim Heirman, Lieven Eeckhout | TCC 305
4:00PM - 4:30PM | Efficient Data Race Detection for Distributed Memory Parallel Programs | Chang-Seo Park, Paul Hargrove, Costin Iancu, Koushik Sen | TCC 304
4:00PM - 4:30PM | MAximum Multicore POwer (MAMPO) - An Automatic Multithreaded Synthetic Power Virus Generation Framework for Multicore Systems | Karthik Ganesan, Lizy John | TCC 305
Thursday, November 17th
TIME | PRESENTATION | SPEAKER | LOCATION
10:30AM - 11:00AM | Performance of the Community Earth System Model | Patrick H. Worley, Anthony P. Craig, John M. Dennis, Arthur A. Mirin, Mark A. Taylor, Mariana Vertenstein | TCC 303
10:30AM - 11:00AM | Hadoop Acceleration Through Network Levitated Merge | Yandong Wang, Xinyu Que, Weikuan Yu, Dror Goldenberg, Dhiraj Sehgal | TCC 305
10:30AM - 11:00AM | Copernicus: A New Paradigm for Parallel Adaptive Molecular Dynamics | Sander Pronk, Per Larsson, Iman Pouya, Greg Bowman, Imran Haque, Kyle Beauchamp, Berk Hess, Vijay Pande, Peter Kasson, Erik Lindahl | TCC 304
11:00AM - 11:30AM | Extracting Ultra-Scale Lattice Boltzmann Performance via Hierarchical and Distributed Auto-Tuning | Samuel Williams, Leonid Oliker, Jonathan Carter, John Shalf | TCC 303
11:00AM - 11:30AM | Purlieus: Locality-aware Resource Allocation for MapReduce in a Cloud | Balaji Palanisamy, Aameek Singh, Ling Liu, Bhushan Jain | TCC 305
11:00AM - 11:30AM | Enabling and Scaling Biomolecular Simulations of 100 Million Atoms on Petascale Machines with a Multicore-optimized Message-driven Runtime | Chao Mei, Yanhua Sun, Gengbin Zheng, James C. Phillips, Eric J. Bohm, Chris Harrison, Laxmikant V. Kale | TCC 304
11:30AM - 12:00PM | Highly Scalable Ab Initio Genomic Motif Identification | Benoit Marchand, Vladimir Bajic, Dinesh Kaushik | TCC 303
11:30AM - 12:00PM | A Distributed Look-up Architecture for Text Mining Applications using MapReduce | Atilla S. Balkir, Ian Foster, Andrey Rzhetsky | TCC 305
11:30AM - 12:00PM | Parallelization Design for Multi-core Platforms in Density Matrix Renormalization Group toward 2-D Quantum Strongly-correlated Systems | Susumu Yamada, Toshiyuki Imamura, Masahiko Machida | TCC 304
1:30PM - 2:00PM | A Scalable Eigensolver for Large Scale-Free Graphs Using 2D Partitioning | Andy Yoo, Allison Baker, Roger Pearce, Van Henson | TCC 305
1:30PM - 2:00PM | SciHadoop: Array-based Query Processing in Hadoop | Joe Buck, Noah Watkins, Jeff LeFevre, Kleoni Ioannidou, Carlos Maltzahn, Neoklis Polyzotis, Scott Brandt | TCC 303
1:30PM - 2:00PM | High-Performance Lattice QCD for Multi-core Based Parallel Systems Using a Cache-Friendly Hybrid Threaded-MPI Approach | Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Jee Choi, Balint Joo, Jatin Chhugani, Michael A. Clark, Pradeep Dubey | TCC 304
2:00PM - 2:30PM | Scalable Stochastic Optimization of Complex Energy Systems | Miles Lubin, Cosmin G. Petra, Mihai Anitescu, Victor Zavala | TCC 305
2:00PM - 2:30PM | On the Duality of Data-intensive File System Design: Reconciling HDFS and PVFS | Wittawat Tantisiriroj, Swapnil Patil, Garth Gibson, Seung Son, Samuel Lang, Robert Ross | TCC 303
2:00PM - 2:30PM | Scaling Lattice QCD beyond 100 GPUs | Ronald Babich, Michael A. Clark, Bálint Joó, Guochun Shi, Richard C. Brower, Steven Gottlieb | TCC 304
2:30PM - 3:00PM | Parallel Breadth-First Search on Distributed Memory Systems | Aydin Buluc, Kamesh Madduri | TCC 305
2:30PM - 3:00PM | End-to-End Network QoS via Scheduling of Flexible Resource Reservation Requests | Sushant Sharma, Dimitrios Katramatos, Dantong Yu | TCC 303
2:30PM - 3:00PM | Large Scale Plane Wave Pseudopotential Density Functional Theory Calculations on GPU Clusters | Long Wang, Weile Jia, Xuebin Chi, Yue Wu, Weiguo Gao, Lin-Wang Wang | TCC 304
3:30PM - 4:00PM | Scalable Implementations of Accurate Excited-state Coupled Cluster Theories: Application of High-level Methods to Porphyrin-based Systems | Karol Kowalski, Sriram Krishnamoorthy, Ryan Olson, Vinod Tipparaju, Eduardo Apra | TCC 304
3:30PM - 4:00PM | Optimizing the Barnes-Hut Algorithm in UPC | Junchao Zhang, Babak Behzad, Marc Snir | TCC 305
4:00PM - 4:30PM | Hardware, Software Co-design for Energy Efficient Seismic Modeling | Jens Krueger, David Donofrio, John Shalf, Samuel Williams, Leonid Oliker, Marghoob Mohiyuddin, Franz-Josef Pfreundt | TCC 304
4:00PM - 4:30PM | Avoiding hot-spots on two-level direct networks | Abhinav Bhatele, Nikhil Jain, William D. Gropp, Laxmikant V. Kale | TCC 305
4:30PM - 5:00PM | A Fast Solver for Modeling the Evolution of Virus Populations | Gerhard Niederbrucker, Wilfried N. Gansterer | TCC 304
4:30PM - 5:00PM | Improving Communication Performance in Dense Linear Algebra via Topology Aware Collectives | Edgar Solomonik, Abhinav Bhatele, James Demmel | TCC 305