Improving the Robustness and Efficiency of PIM-Based Architecture by SW/HW Co-Design


연구 분야: Verification



학회: ASPDAC '23: Proceedings of the 28th Asia and South Pacific Design Automation Conference


초록

Processing-in-memory (PIM) based architecture shows great potential to process several emerging artificial intelligence workloads, including vision and language models. Cross-layer optimizations could bridge the gap between computing density and the available resources by reducing the computation and memory cost of the model and improving the model's robustness against non-ideal hardware effects. We first introduce several hardware-aware training methods to improve the model robustness to the PIM device's non-ideal effects, including stuck-at-fault, process variation, and thermal noise. Then, we further demonstrate a software/hardware (SW/HW) co-design methodology to efficiently process the state-of-the-art attention-based model on PIM-based architecture by performing sparsity exploration for the attention-based model and circuit-architecture co-design to support the sparse processing.


Author Profile
Yiran Chen

Duke University

정보 없음
Author Profile
Xiaoxuan Yang

Duke University

정보 없음
Author Profile
Shiyu Li

Duke University

정보 없음

📄 논문 정보

발행 연도 2023년
인용수 1
출판 국가
사이트 ACM
좋아요 수 0

연관 논문 목록 (323건)