Mobile Foundation Model as Firmware


연구 분야: Analysis



학회: ACM MobiCom '24: Proceedings of the 30th Annual International Conference on Mobile Computing and Networking


초록

In the current AI era, mobile devices such as smartphones are tasked with executing a myriad of deep neural networks (DNNs) locally. It presents a complex landscape, as these models are highly fragmented in terms of architecture, operators, and implementations. Such fragmentation poses significant challenges to the co-optimization of hardware, systems, and algorithms for efficient and scalable mobile AI. Inspired by the recent groundbreaking progress in large foundation models, this work introduces a novel paradigm for mobile AI, where mobile OS and hardware jointly manage a foundation model that is capable of serving a wide array of mobile AI tasks. This foundation model functions akin to firmware, unmodifiable by apps or the OS, exposed as a system service to Apps. They can invoke this foundation model through a small, offline fine-tuned "adapter" for various downstream tasks. We propose a tangible design of this vision called M4, and prototype it from publicly available pre-trained models. To assess its capability, we also build a comprehensive benchmark consisting of 38 mobile AI tasks and 50 datasets, spanning 5 multimodal inputs. Extensive experiments demonstrate M4's remarkable results: it achieves comparable accuracy in 85% of tasks, offers enhanced scalability regarding storage and memory, and has much simpler operations. In broader terms, this work paves a new way towards efficient and scalable mobile AI in the post-LLM era.


Author Profile
Mengwei Xu

Beijing University of Posts and Telecommunications Beijing China

Andorra
Author Profile
Jinliang Yuan

Beijing University of Posts and Telecommunications Beijing China

Andorra
Author Profile
Chen Yang

Beijing University of Posts and Telecommunications Beijing China

Andorra

📄 논문 정보

발행 연도 2024년
인용수 21
출판 국가 Andorra
사이트 ACM
좋아요 수 0

연관 논문 목록 (255건)