What is it about?

We curated select whole cell and reverse transcriptase datasets from the NIAID ChemDB database and used these with other datasets to build and validate machine learning methods (deep learning, bayesian, support vector machines etc.). We performed 5 fold cross validation and external validation with different test sets.

Featured Image

Why is it important?

We describe how there is really not a huge difference between different machine learning methods. We also describe a new metric that can be used to summarize many different model scores. We demonstrate how Assay Central Bayesian models may be a useful starting point for drug discovery efforts.

Perspectives

Application of machine learning methods to HIV and RT datasets is rare. Also there have been few efforts to use the ChemDB database. This would suggest it is worthy of assessment and further curation to make the data ready for models.

Dr Sean Ekins
Collaborations in Chemistry

Read the Original

This page is a summary of: Multiple Machine Learning Comparisons of HIV Cell-Based and Reverse Transcriptase Datasets, Molecular Pharmaceutics, February 2019, American Chemical Society (ACS),
DOI: 10.1021/acs.molpharmaceut.8b01297.
You can read the full text:

Read

Resources

Contributors

The following have contributed to this page