Deep Learning Docking: Speed and Accuracy Gains, But a Hidden Flaw
Deep learning (DL) has revolutionized many fields, including protein-ligand docking, a crucial step in drug discovery. Recent DL-based docking methods have shown impressive improvements in speed and accuracy, often surpassing classical methods when evaluated using Root Mean Square Deviation (RMSD) to the native pose.
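To make the RMSD metric concrete, here is a minimal sketch in plain Python. It assumes both poses list the same atoms in the same order and are already in the protein's coordinate frame, so no superposition is applied. The function name `pose_rmsd` and the toy coordinates are illustrative. Real evaluations usually also correct for molecular symmetry, which this sketch does not attempt.

```python
import math

def pose_rmsd(pred, ref):
    """RMSD in angstroms between two poses given as lists of (x, y, z) tuples.

    Assumes identical atom ordering and a shared coordinate frame
    (no alignment, no symmetry correction).
    """
    if len(pred) != len(ref) or not pred:
        raise ValueError("poses must be non-empty and have the same number of atoms")
    # Sum of squared coordinate differences over all atoms and all three axes.
    squared = sum((p - r) ** 2 for a, b in zip(pred, ref) for p, r in zip(a, b))
    return math.sqrt(squared / len(pred))

# Toy example: the predicted pose is the reference shifted by 1 angstrom along x.
ref = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
pred = [(x + 1.0, y, z) for x, y, z in ref]

rmsd = pose_rmsd(pred, ref)
print(f"RMSD = {rmsd:.2f} A, within 2 A: {rmsd <= 2.0}")
```

The 2 Å cutoff used in the last line is the conventional success threshold in docking benchmarks. Note that a pose can satisfy it and still contain distorted geometry.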
However, a new study by Buttenschoen et al. reveals a hidden limitation in these DL methods. They introduced PoseBusters, a powerful evaluation suite that goes beyond RMSD and assesses the chemical and physical plausibility of docked ligands using RDKit, a popular cheminformatics toolkit.
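Here's where things get interesting: a pose can sit close to the crystal ligand by RMSD and still be physically implausible, with distorted rings or bond lengths. The following example is one such case: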
Example of a prediction improved by post-docking energy minimization. The initial Uni-Mol prediction (RMSD 2.0 Å) is depicted in white, the optimized prediction (RMSD 1.1 Å) is in pink, and the reference crystal ligand is shown in light blue. The optimization flattens the aromatic rings and shortens the leftmost bond, enabling the prediction to pass all PoseBusters checks.
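The ring flattening described in this example can be illustrated with a simple planarity test. The sketch below is not the PoseBusters implementation. It estimates a ring's mean plane with Newell's method and reports how far the atoms deviate from it. The function name `ring_max_out_of_plane` and the 0.25 Å tolerance are assumptions chosen for illustration, so check the PoseBusters documentation for the criterion it actually applies.

```python
import math

def ring_max_out_of_plane(coords):
    """Largest distance (angstroms) of any ring atom from the ring's mean plane.

    coords: ring atoms in bonded order, each as an (x, y, z) tuple.
    """
    n = len(coords)
    if n < 3:
        raise ValueError("a ring needs at least three atoms")
    cx = sum(c[0] for c in coords) / n
    cy = sum(c[1] for c in coords) / n
    cz = sum(c[2] for c in coords) / n
    # Newell's method: a robust normal vector for a nearly planar polygon.
    nx = ny = nz = 0.0
    for i in range(n):
        x1, y1, z1 = coords[i]
        x2, y2, z2 = coords[(i + 1) % n]
        nx += (y1 - y2) * (z1 + z2)
        ny += (z1 - z2) * (x1 + x2)
        nz += (x1 - x2) * (y1 + y2)
    norm = math.sqrt(nx * nx + ny * ny + nz * nz)
    if norm == 0.0:
        raise ValueError("degenerate ring: cannot define a plane")
    # Distance of each atom from the plane through the centroid.
    return max(abs((x - cx) * nx + (y - cy) * ny + (z - cz) * nz) / norm
               for x, y, z in coords)

def hexagon(radius, pucker):
    """Six-membered ring; atoms alternate +pucker / -pucker along z."""
    return [(radius * math.cos(math.radians(60 * i)),
             radius * math.sin(math.radians(60 * i)),
             pucker if i % 2 == 0 else -pucker) for i in range(6)]

TOLERANCE = 0.25  # angstroms; illustrative assumption, not a confirmed PoseBusters value

flat_ring = hexagon(1.39, 0.0)       # idealised benzene-like ring
distorted_ring = hexagon(1.39, 0.3)  # puckered "aromatic" ring

for name, ring in [("flat", flat_ring), ("distorted", distorted_ring)]:
    deviation = ring_max_out_of_plane(ring)
    print(name, round(deviation, 3), "OK" if deviation <= TOLERANCE else "not planar")
```

An energy minimization step like the one in the example would pull the distorted ring back toward the flat geometry, which is why the optimized pose can pass a check of this kind.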
The PoseBusters Benchmark:
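PoseBusters takes the docked ligand structures, the true (crystal) ligand structure, and the protein structure as input and performs three sets of checks: chemical validity and consistency (the output is a valid molecule that matches the input ligand), intramolecular validity (for example bond lengths, bond angles, planar aromatic rings, and no internal clashes), and intermolecular validity (for example no clashes between the ligand and the protein).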
By introducing PoseBusters, Buttenschoen et al. provide a more comprehensive evaluation method that goes beyond just spatial positioning (RMSD). This highlights a critical area for improvement in DL docking methods – ensuring they not only place ligands in the correct spot but also generate physically realistic poses. Addressing this limitation will be crucial for their continued development and real-world application in drug discovery.
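As an illustration of what "physically realistic" can mean at the protein-ligand interface, the sketch below flags steric clashes: pairs of ligand and protein atoms that sit closer than a fraction of the sum of their van der Waals radii. This is a simplified stand-in, not the PoseBusters code. The function name `find_clashes`, the `scale=0.75` factor, and the toy coordinates are assumptions for illustration. The radii are standard Bondi values.

```python
import math

# Bondi van der Waals radii in angstroms for a few common elements.
VDW_RADII = {"H": 1.20, "C": 1.70, "N": 1.55, "O": 1.52, "S": 1.80}

def find_clashes(ligand, protein, scale=0.75):
    """Return (ligand_index, protein_index, distance) for clashing atom pairs.

    Each atom is (element, (x, y, z)). A pair clashes when its distance is
    below scale * (r_ligand + r_protein). The scale factor is an assumption.
    """
    clashes = []
    for i, (el_l, xyz_l) in enumerate(ligand):
        for j, (el_p, xyz_p) in enumerate(protein):
            distance = math.dist(xyz_l, xyz_p)
            if distance < scale * (VDW_RADII[el_l] + VDW_RADII[el_p]):
                clashes.append((i, j, distance))
    return clashes

# Toy system: the ligand carbon sits too close to a protein nitrogen.
ligand = [("C", (0.0, 0.0, 0.0)), ("O", (1.4, 0.0, 0.0))]
protein = [("N", (0.0, 2.0, 0.0)), ("C", (6.0, 0.0, 0.0))]

for lig_idx, prot_idx, dist in find_clashes(ligand, protein):
    print(f"ligand atom {lig_idx} clashes with protein atom {prot_idx} at {dist:.2f} A")
```

A pose would then count as a success only if it is both close to the crystal ligand by RMSD and free of violations like this one. The all-pairs loop is fine for a sketch, but a real protein would call for a spatial index or a distance cutoff.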
Overall, the message is clear: DL docking methods show promise in speed and RMSD-based accuracy, but they currently struggle to generate physically plausible poses. PoseBusters gives developers a practical way to measure that gap and to work toward more robust and reliable DL docking methods.
Note: AlphaFold 3, released recently, demonstrates exceptional ligand docking capabilities. Evaluated using PoseBusters, it surpasses traditional docking software such as AutoDock Vina and GOLD, as well as other machine learning methods, by generating a significantly higher number of chemically valid poses. Read more about this exciting development in our recent post (link here).
References and Resources
GitHub: https://github.com/maabuu/posebusters
PyPI: https://pypi.org/project/posebusters
Documentation: https://posebusters.readthedocs.io/en/latest