What is it about?

The recent advancements of machine learning are making it possible to create systems that automatically mine patterns and learn from data. In chemistry, in particular drug discovery and properties prediction, machine learning is changing the way of creating and testing new molecules. After presenting the principal learning methods, focusing on neural networks and deep learning, a study case is presented. Using a large data set of molecules data tested for the mutagenicity property, three models for predicting this property are derived from data, considering the molecules as graphs, as text, or as images. The knowledge extracted from the networks is analyzed and positively compared with the toxicity rules for mutagenicity.

Featured Image

Why is it important?

There major challenges are considered. • One challenge is to improve the process of constructing predictive models for chemistry. • Another challenge is to avoid, or to reduce, the bias of the expert in modeling. • The last challenge is to provide an explanation of the result obtained by the model.

Perspectives

Neural net have an exploratory power that makes them a powerful tool for further exploring the chemical space in cases when a theory is missing, as in most cases of toxicity. Those neural models can discover statistical patterns that would be very hard to express as simple rules.

Prof Giuseppina Carla Gini
Politecnico di Milano

Read the Original

This page is a summary of: Big data and deep learning: extracting and revising chemical knowledge from data, January 2023, Elsevier,
DOI: 10.1016/b978-0-323-85713-0.00030-x.
You can read the full text:

Read

Contributors

The following have contributed to this page