📢 Chen et al. recently reported H3-OPT, a model for predicting structures of the heavy chain 3 of antibody (Ab) complementarity-determining regions (CDR-H3) based on AlphaFold2 (AF2). For both monoclonal Abs (having both heavy and light chains) and nanobodies (with a single-domain heavy chain), the CDR-H3 loop plays a key role in antigen binding and is thus the most diverse Ab region in terms of amino acid length and composition.
💉 Given the importance of the CDR-H3, accurate development of Ab-based therapeutics is dependent on experimental structures of candidate Abs, a process which is bottlenecked by both cost and time. Thus, accurate computational prediction methods would facilitate the therapeutic Ab development pipeline. However, existing methods struggle to generate high quality predictions of the CDR-H3 loop regions because of the inherent challenge of predicting loop structures.
🔥 H3-OPT integrates the strengths of AF2 with a pre-trained protein language model (PLM) to predict CDR-H3 structures. Based on the observation that AF2-predicted Ab structures show an overall high quality, H3-OPT extracts the structural features and uses the information to generate refined CDR-H3 structures.
H3-OPT consists of two modules: a template module and a PLM-based structure prediction module (PSPM). The template module dictates whether the PSPM is deployed for a particular query. The PSPM is further comprised of two sub-modules: a confidence-based module that evaluates the quality of the CDR-H3 from AF2, and a template-grafting module which identifies a suitable PDB template to replace low quality AF2 H3 loop.
🔢 On a benchmark dataset, H3-OPT outperforms other state-of-the-art methods, including AF2, IgFold, HelixFold-Single, ESMFold, and OmegaFold, by achieving an RMSD (to experimental structure) of 2.24 Å, compared to 2.85 Å and 2.87 Å for the next best methods, AF2 and IgFold, respectively.
Furthermore, H3-OPT was evaluated using recently deposited PDB structures of three anti-VEGF variants, giving predictions with RMSDs of 1.510 Ã…, 1.541 Ã…, and 1.411 Ã… for the three variants. Predictions by IgFold, however, gave RMSDs of 2.776 Ã…, 2.888 Ã…, and 2.448 Ã….
💪 H3-OPT also exhibited improved performance on other tasks, such as the prediction of Ab CDR-H3 surface properties (including surface amino acids, solvent-accessible surface areas, and surface charge distribution) and antibody-antigen interactions.
H3-OPT stands out as an effective method for overcoming the challenges associated with predicting the structure of antibody CDR-H3 regions. By integrating the robust capabilities of AlphaFold2 with a protein language model, H3-OPT achieves higher accuracy and reliability in its predictions. Its modular design allows for flexible adaptation based on the quality of initial predictions, making it a valuable tool in the field of antibody development.