Machine Learning Scales Virtual Screens to >10B Compound Libraries

July 29, 2020
Events, AI Technology, AtomNet, ACS2020

At the American Chemical Society Fall 2020 Virtual Meeting & Expo, several Atomwise members and partners were selected to present their research and work. Learn what our Atoms have been working on below and visit Atomwise at ACS Fall 2020 Virtual Meeting & Expo for other presentation sessions. 

VMVenkatesh Mysore, PhD 

Atomwise Collaborators: Tushita Gupta, Greg Friedland, PhD, Izhar Wallach, PhD

Title: Billion Trawler Answer: A Scalable and Effective Machine Learning Based Solution for the Virtual High-Throughput Screening of Ultra-Large Libraries

Division: COMP



The availability and accessibility of chemistry-on-demand and ultra-large screening libraries (ULSLs) give rise to an opportunity for mining new chemical spaces for novel scaffolds. This combination of cheap à la carte synthesis that covers tens of billions of molecules may mark the inception of a golden age of computational approaches, where a comprehensive physical high-throughput screen is no longer a viable option for early-stage drug discovery. We have previously presented AtomNet® -- our convolutional network based model for predicting binding affinities from the three-dimensional protein-ligand complex. In this talk, we introduce our approach for trawling through billions of compounds using a more performant model (“trawler”) to predict their AtomNet® scores. The most promising compounds found by the trawler are then evaluated on AtomNet®, and these scores are used to improve the trawler model. The implementation exploits the recent advances in machine-learning algorithms and scalable on-demand computing resources. We detail the algorithmic and engineering challenges of deploying ULSLs in a production setting, and the tradeoffs between speed and accuracy. We present some early results from a few projects, and discuss avenues for further improvement.


Poster Presentation

On-demand Poster (click for hi-resolution)


On-demand audio recording


Join our team

Our team is comprised of over 30 PhD scientists who contribute to a high-performance academic-like culture that fosters robust scientific and technical excellence. We strongly believe that data wins over opinions, and aim for as little dogma as possible in our decision making. Learn more about our team and opportunities at Atomwise.