SUV: Static Analysis Guided Unified Virtual Memory


연구 분야: Strategies



학회: 2024 57th IEEE/ACM International Symposium on Microarchitecture (MICRO)


초록

Unified Virtual Memory (UVM) eases GPU programming and enables oversubscription of the limited GPU memory capacity. Unfortunately, UVM may cause significant slowdowns due to thrashing of the GPU memory and overheads of page faults, especially under memory oversubscription. We propose to leverage high-level memory access patterns of CUDA applications to reduce the overheads of UVM. We create SUV, a hybrid framework that leverages compiler-inferred (static analysis) memory access semantics to make proactive memory management decisions where possible and selectively leverages runtime page migrations where needed. It can pin data structures entirely or partially on the GPU memory or CPU's DRAM based on their inferred usefulness and automatically issue software prefetches at kernel boundaries. It also selectively lets parts of data structures migrate on demand onto reserved HBM capacity at runtime. SUV reduces execution times of a variety of applications by 74% over UVM under memory oversubscription.


Author Profile
Pratheek B

Indian Institute of Science Bengaluru India

India
Author Profile
Guilherme Cox

NVIDIA Santa Clara USA

United States
Author Profile
Jan Vesely

NVIDIA Santa Clara USA

United States

📄 논문 정보

발행 연도 2024년
인용수 649
출판 국가 India, United States
사이트 IEEE
좋아요 수 0

연관 논문 목록 (86건)