PLDI 2021
Sun 20 - Sat 26 June 2021 Virtual Conference

This program is tentative and subject to change.

Mon 21 Jun 2021 18:00 - 18:25 at ARRAY - Session 4 (short talks) Chair(s): Jonathan Ragan-Kelley

This paper demonstrates performance enhancements to DGEMM, the dense double-precision matrix-matrix multiply kernel that vendors widely implement in the Basic Linear Algebra Subprograms (BLAS) library. The Mathematics of Arrays (MoA) paradigm due to Mullin (1988) yields contiguous memory accesses, in combination with Church-Rosser complete language constructs optimized for the target processor architecture. Our performance studies show that the MoA implementation of DGEMM, combined with optimal cache-blocking strategies, achieves at least a 25% performance gain on both Intel Xeon Skylake and IBM Power-9 processors over the vendor-supplied Intel MKL and IBM ESSL basic linear algebra libraries.
Results are presented for the NREL Eagle and ORNL Summit supercomputers.
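
To make the cache-blocking idea in the abstract concrete, here is a minimal sketch of a tiled matrix multiply over flat, row-major arrays. It is not the authors' MoA implementation. The function names `blocked_dgemm` and `naive_dgemm`, the `block` parameter, and the loop order are illustrative assumptions, and pure Python is used only for readability, not speed.

```python
def blocked_dgemm(a, b, m, k, n, block=64):
    """Return C = A * B as a flat row-major list.

    a is m x k and b is k x n, both stored as flat row-major lists.
    `block` is a hypothetical tile size; a real kernel would tune it
    to the cache sizes of the target processor.
    """
    c = [0.0] * (m * n)
    # Outer loops walk over tiles so each tile of A, B and C is reused
    # while it is (ideally) still resident in cache.
    for i0 in range(0, m, block):
        i_max = min(i0 + block, m)
        for p0 in range(0, k, block):
            p_max = min(p0 + block, k)
            for j0 in range(0, n, block):
                j_max = min(j0 + block, n)
                for i in range(i0, i_max):
                    row_c = i * n
                    for p in range(p0, p_max):
                        a_ip = a[i * k + p]
                        row_b = p * n
                        # Innermost loop reads B and updates C at
                        # consecutive (contiguous) addresses.
                        for j in range(j0, j_max):
                            c[row_c + j] += a_ip * b[row_b + j]
    return c


def naive_dgemm(a, b, m, k, n):
    """Reference triple loop without tiling, used only for checking."""
    c = [0.0] * (m * n)
    for i in range(m):
        for j in range(n):
            total = 0.0
            for p in range(k):
                total += a[i * k + p] * b[p * n + j]
            c[i * n + j] = total
    return c
```

For example, `blocked_dgemm([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], [7.0, 8.0, 9.0, 10.0, 11.0, 12.0], 2, 3, 2, block=2)` multiplies a 2 x 3 matrix by a 3 x 2 matrix and should agree with `naive_dgemm` on the same inputs. The tile size only changes the order in which memory is visited, not the result.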

Extended abstract (ARRAY_2021_paper_4.pdf), 553 KiB

Conference Day
Mon 21 Jun

Displayed time zone: Eastern Time (US & Canada)

18:00 - 21:00
Session 4 (short talks), ARRAY, at ARRAY
Chair(s): Jonathan Ragan-Kelley (MIT CSAIL)
18:00
25m
Talk
Improving the Performance of DGEMM with MoA and Cache-Blocking
ARRAY
Stephen Thomas (National Renewable Energy Laboratory), Lenore Mullin (SUNY Albany, USA), Kasia Swirydowicz (Pacific Northwest National Laboratory)
File Attached
18:25
25m
Talk
Nested Object Support in a Structure-of-Arrays Dynamic Object Allocator
ARRAY
Jizhe Chenxin (Tokyo Institute of Technology), Hidehiko Masuhara (Tokyo Institute of Technology)
File Attached
18:50
25m
Talk
Data Layouts are Important (Extended Abstract)
ARRAY
Doru Thom Popovici (Lawrence Berkeley National Lab), Andrew Canning (Lawrence Berkeley National Laboratory), Zhengji Zhao (Lawrence Berkeley National Laboratory), Lin-Wang Wang (Lawrence Berkeley National Laboratory), John Shalf (Lawrence Berkeley National Laboratory)
File Attached