Using the Structure-Activity Landscape Index (SALI) to Analyze Data From the SARS-CoV-2 MPro Screen
Last week the good folks at the COVID Moonshot Project released their first set of screening data for compounds designed based on the fragment crystal structures released by the Diamond Light Source . Like many others in the community, I was eager to see the data. I thought it might be useful to share some of my thoughts and some of the techniques that I use to sift through screening data and decide what to do next. All of the code used to perform the analyses in this post is available in a Jupyter notebook on GitHub . Introduction In this post, I'm going to focus on a metric known as the Structure-Activity Landscape Index (SALI) . This technique was first published in 2008 by Rajarshi Guha and Jonn Van Drie and provides a simple means of identifying pairs of compounds where a small change in chemical structure brings about a large change in biological activity or physical properties. These changes can often help us to identify the parts of the molecule that are most imp