What is it about?
We curated select whole cell and reverse transcriptase datasets from the NIAID ChemDB database and used these with other datasets to build and validate machine learning methods (deep learning, bayesian, support vector machines etc.). We performed 5 fold cross validation and external validation with different test sets.
Featured Image
Why is it important?
We describe how there is really not a huge difference between different machine learning methods. We also describe a new metric that can be used to summarize many different model scores. We demonstrate how Assay Central Bayesian models may be a useful starting point for drug discovery efforts.
Perspectives
Application of machine learning methods to HIV and RT datasets is rare. Also there have been few efforts to use the ChemDB database. This would suggest it is worthy of assessment and further curation to make the data ready for models.
Dr Sean Ekins
Collaborations in Chemistry
Read the Original
This page is a summary of: Multiple Machine Learning Comparisons of HIV Cell-Based and Reverse Transcriptase Datasets, Molecular Pharmaceutics, February 2019, American Chemical Society (ACS),
DOI: 10.1021/acs.molpharmaceut.8b01297.
You can read the full text:
Resources
Contributors
The following have contributed to this page