A suite of designed protein cages using machine learning and protein fragment-based protocols

Structure. 2024 Jun 6;32(6):751-765.e11. doi: 10.1016/j.str.2024.02.017. Epub 2024 Mar 20.

Abstract

Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, but their creation remains challenging. Here, we apply computational approaches to design a suite of tetrahedrally symmetric, self-assembling protein cages. For the generation of docked conformations, we emphasize a protein fragment-based approach, while for sequence design of the de novo interface, a comparison of knowledge-based and machine learning protocols highlights the power and increased experimental success achieved using ProteinMPNN. An analysis of design outcomes provides insights for improving interface design protocols, including prioritizing fragment-based motifs, balancing interface hydrophobicity and polarity, and identifying preferred polar contact patterns. In all, we report five structures for seven protein cages, along with two structures of intermediate assemblies, with the highest resolution reaching 2.0 Å using cryo-EM. This set of designed cages adds substantially to the body of available protein nanoparticles, and to methodologies for their creation.

Keywords: X-ray crystallography; cryoEM; de novo interface design; machine learning; protein cages; protein-protein docking; self-assembly; symmetry.

MeSH terms

  • Cryoelectron Microscopy / methods
  • Hydrophobic and Hydrophilic Interactions
  • Machine Learning*
  • Models, Molecular
  • Molecular Docking Simulation
  • Protein Conformation
  • Proteins* / chemistry

Substances

  • Proteins