Learning Molecular Representations for Medicinal Chemistry

J Med Chem. 2020 Aug 27;63(16):8705-8722. doi: 10.1021/acs.jmedchem.0c00385. Epub 2020 May 15.

Abstract

The accurate modeling and prediction of small molecule properties and bioactivities depend on the critical choice of molecular representation. Decades of informatics-driven research have relied on expert-designed molecular descriptors to establish quantitative structure-activity and structure-property relationships for drug discovery. Now, advances in deep learning make it possible to efficiently and compactly learn molecular representations directly from data. In this review, we discuss how active research in molecular deep learning can address limitations of current descriptors and fingerprints while creating new opportunities in cheminformatics and virtual screening. We provide a concise overview of the role of representations in cheminformatics, key concepts in deep learning, and argue that learning representations provides a way forward to improve the predictive modeling of small molecule bioactivities and properties.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Cheminformatics
  • Chemistry, Pharmaceutical / methods*
  • Deep Learning*
  • Models, Molecular
  • Molecular Structure
  • Organic Chemicals / chemistry*
  • Quantitative Structure-Activity Relationship

Substances

  • Organic Chemicals