ACCL: FPGA-Accelerated Collectives over 100 Gbps TCP-IP


연구 분야: Networking



학회: 2021 IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC)


초록

Collective operations such as scatter, gather, reduce, etc are utilized broadly to implement distributed HPC applications and are the target of extensive optimization in all MPI implementations as well as dedicated collective libraries by accelerator vendors (e.g. NCCL and RCCL by NVidia and AMD respectively). We present ACCL, an open-source FPGA-accelerated collectives library designed to serve applications running primarily in Xilinx FPGAs. Compared to previous collective communication solutions for FPGA, ACCL is flexible and extensible, easily portable, and fast. We evaluate ACCL up to 8 nodes and demonstrate that ACCL outperforms OpenMPI over 100 Gbps TCP-IP for large messages.


Author Profile
Zhenhao He

Systems Group ETH Zurich Zurich Switzerland

Ethiopia
Author Profile
Daniele Parravicini

Research Labs Xilinx Dublin Ireland

Ireland
Author Profile
Lucian Petrica

Research Labs Xilinx Dublin Ireland

Ireland

📄 논문 정보

발행 연도 2021년
인용수 19
출판 국가 Ethiopia, Ireland
사이트 IEEE
좋아요 수 0

연관 논문 목록 (9건)