Traversing chemical space with active deep learning for low-data drug discovery


연구 분야: Artificial Intelligence



학회: Nature Computational Science


초록

Deep learning is accelerating drug discovery. However, current approaches are often affected by limitations in the available data, in terms of either size or molecular diversity. Active deep learning has high potential for low-data drug discovery, as it allows iterative model improvement during the screening process. However, there are several ‘known unknowns’ that limit the wider adoption of active deep learning in drug discovery: (1) what the best computational strategies are for chemical space exploration, (2) how active learning holds up to traditional, non-iterative, approaches and (3) how it should be used in the low-data scenarios typical of drug discovery. To provide answers, this study simulates a low-data drug discovery scenario, and systematically analyzes six active learning strategies combined with two deep learning architectures, on three large-scale molecular libraries. We identify the most important determinants of success in low-data regimes and show that active learning can achieve up to a sixfold improvement in hit discovery when compared with traditional screening methods.


Author Profile
Derek van Tilborg

Institute for Complex Molecular Systems (ICMS) Department of Biomedical Engineering Eindhoven University of Technology Eindhoven The Netherlands

Netherlands
Author Profile
Francesca Grisoni

Centre for Living Technologies Alliance TU/e WUR UU UMC Utrecht Utrecht The Netherlands

Netherlands

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 Netherlands
사이트 Springer
좋아요 수 0

연관 논문 목록 (62건)