Machine learning approaches for computer aided drug discovery

Moesser, M

Thesis

Machine learning approaches for computer aided drug discovery

Abstract:: Pharmaceutical drug discovery is expensive, time consuming and scientifically challenging. In order to increase efficiency of the pre-clinical drug discovery pathway, computational drug discovery methods and most recently, machine learning-based methods are increasingly used as powerful tools to aid early stage drug discovery.

In this thesis, I present three complementary computer-aided drug discovery methods, with a focus on aiding hit discovery and hit-to-lead optimization. In addition, this thesis particularly focuses on exploring different molecular representations used to featurise machine learning models, in order explore how best to capture valuable information about protein, ligands and 3D protein-ligand complexes to build more robust, more interpretable and more accurate machine learning models.

First, I developed ligand-based models using a Gaussian Process (GP) as an easy-to-implement tool to guide exploration of chemical space for the optimization of protein-ligand binding affinity. I explored different topological fingerprint and autoencoder representations for Bayesian optimisation (BO) and showed that BO is a powerful tool to help medicinal chemists to prioritise which new compounds to make for single-target as well as multi-target optimisation. The algorithm achieved high enrichment of top compounds for both single target and multiobjective optimisation when tested on a well known benchmark dataset of the drug target matrix metalloproteinase-12 and a real, ongoing drug optimisation dataset targeting four bacterial metallo-β-lactamases.

Next, I present the development of a knowledge-based approach to drug design, combining new protein-ligand interaction fingerprints with a fragment-based drug discovery approach to understand SARS-CoV-2 Mpro-substrate specificity and to design novel small molecule inhibitors in silico. In combination with a fragment-based drug discovery approach, I show how this knowledge-based interaction fingerprint-driven approach can reveal fruitful fragment-growth design strategies.

Lastly, I expand on the knowledge-based contact fingerprints to create a ligand-shaped molecular graph representation (Protein Ligand Interaction Graphs, PLIGs) to develop novel graph-based deep learning protein-ligand binding affinity scoring functions. PLIGs encode all intermolecular interactions in a protein-ligand complex within the node features of the graph and are therefore simple and fully interpretable. I explore a variety of Graph Neural Network architectures in combination with PLIGs and found Graph Attention Networks to perform slightly better than other GNN architectures, performing amongst the best known protein-ligand binding affinity scoring functions.

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Cite

Cite this record

APA Style

Moesser, M. (2022). Machine learning approaches for computer aided drug discovery [PhD thesis]. University of Oxford.

MLA Style

Moesser, M. Machine Learning Approaches for Computer Aided Drug Discovery. University of Oxford, 2022.

Chicago Style

Moesser, M. 2022. “Machine Learning Approaches for Computer Aided Drug Discovery.” PhD thesis, University of Oxford.
Share
Print

Access Document

Files:: PhD_Thesis_Marc_Moesser.pdf

(Preview, Dissemination version, pdf, 73.2MB, Terms of use)

Authors

+ Moesser, M More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Statistics
Oxford college:: Green Templeton College
Role:: Author

Contributors

+ Morris, G

Role:: Supervisor
ORCID:: 0000-0003-1731-8405

+ Engineering and Physical Sciences Research Council More from this funder

Funder identifier:: http://dx.doi.org/10.13039/501100000266
Grant:: EP/R513295/1
Programme:: SABS IDC

+ GlaxoSmithKline More from this funder

Funder identifier:: http://dx.doi.org/10.13039/100004330
Programme:: Industrial Sponsorship

DOI:: 10.5287/ora-kexaer6ky
Type of award:: DPhil
Level of award:: Doctoral
Awarding institution:: University of Oxford

Language:: English
Keywords:: structure-based drug discovery

deep learning

drug discovery

cheminformatics

Bayesian optimization

machine learning

molecular docking

graph neural networks
Subjects:: Drug Discovery

Statistics

Machine learning

Bioinformatics

Cheminformatics
Deposit date:: 2023-08-02

Terms of use

Copyright holder:: Marc Moesser

Licence:: CC Attribution (CC BY)

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

Thesis

Machine learning approaches for computer aided drug discovery

Actions

Access Document

Authors

Contributors

Terms of use

Views and Downloads

Altmetrics

Dimensions

Thesis

Machine learning approaches for computer aided drug discovery

Actions

Access Document

Authors

Contributors

Funding

Bibliographic Details

Item Description

Related Items

Terms of use

Metrics

Views and Downloads

Altmetrics

Dimensions