Related papers: Split Optimization for Protein/Ligand Binding Mode…

Most Ligand-Based Classification Benchmarks Reward Memorization Rather than Generalization

Undetected overfitting can occur when there are significant redundancies between training and validation data. We describe AVE, a new measure of training-validation redundancy for ligand-based classification problems that accounts for the…

Quantitative Methods · Quantitative Biology 2018-05-11 Izhar Wallach , Abraham Heifets

DualBind: A Dual-Loss Framework for Protein-Ligand Binding Affinity Prediction

Accurate prediction of protein-ligand binding affinities is crucial for drug development. Recent advances in machine learning show promising results on this task. However, these methods typically rely heavily on labeled data, which can be…

Machine Learning · Computer Science 2024-06-13 Meng Liu , Saee Gopal Paliwal

APObind: A Dataset of Ligand Unbound Protein Conformations for Machine Learning Applications in De Novo Drug Design

Protein-ligand complex structures have been utilised to design benchmark machine learning methods that perform important tasks related to drug design such as receptor binding site detection, small molecule docking and binding affinity…

Biomolecules · Quantitative Biology 2021-08-26 Rishal Aggarwal , Akash Gupta , U Deva Priyakumar

SPIN: SE(3)-Invariant Physics Informed Network for Binding Affinity Prediction

Accurate prediction of protein-ligand binding affinity is crucial for rapid and efficient drug development. Recently, the importance of predicting binding affinity has led to increased attention on research that models the three-dimensional…

Machine Learning · Computer Science 2024-07-17 Seungyeon Choi , Sangmin Seo , Sanghyun Park

Improved prediction of ligand-protein binding affinities by meta-modeling

The accurate screening of candidate drug ligands against target proteins through computational approaches is of prime interest to drug development efforts. Such virtual screening depends in part on methods to predict the binding affinity…

Machine Learning · Computer Science 2024-10-22 Ho-Joon Lee , Prashant S. Emani , Mark B. Gerstein

Rethinking the generalization of drug target affinity prediction algorithms via similarity aware evaluation

Drug-target binding affinity prediction is a fundamental task for drug discovery. It has been extensively explored in literature and promising results are reported. However, in this paper, we demonstrate that the results may be misleading…

Machine Learning · Computer Science 2025-04-15 Chenbin Zhang , Zhiqiang Hu , Chuchu Jiang , Wen Chen , Jie Xu , Shaoting Zhang

Measure Twice, Cut Once: Quantifying Bias and Fairness in Deep Neural Networks

Algorithmic bias is of increasing concern, both to the research community, and society at large. Bias in AI is more abstract and unintuitive than traditional forms of discrimination and can be more difficult to detect and mitigate. A clear…

Machine Learning · Computer Science 2021-10-12 Cody Blakeney , Gentry Atkinson , Nathaniel Huish , Yan Yan , Vangelis Metris , Ziliang Zong

Unsupervised Protein-Ligand Binding Energy Prediction via Neural Euler's Rotation Equation

Protein-ligand binding prediction is a fundamental problem in AI-driven drug discovery. Prior work focused on supervised learning methods using a large set of binding affinity data for small molecules, but it is hard to apply the same…

Biomolecules · Quantitative Biology 2023-12-14 Wengong Jin , Siranush Sarkizova , Xun Chen , Nir Hacohen , Caroline Uhler

Development and evaluation of a deep learning model for protein-ligand binding affinity prediction

Structure based ligand discovery is one of the most successful approaches for augmenting the drug discovery process. Currently, there is a notable shift towards machine learning (ML) methodologies to aid such procedures. Deep learning has…

Machine Learning · Statistics 2018-06-12 Marta M. Stepniewska-Dziubinska , Piotr Zielenkiewicz , Pawel Siedlecki

Binding Affinity Prediction: From Conventional to Machine Learning-Based Approaches

Protein-ligand binding is the process by which a small molecule (drug or inhibitor) attaches to a target protein. Binding affinity, which characterizes the strength of biomolecular interactions, is essential for tackling diverse challenges…

Quantitative Methods · Quantitative Biology 2025-10-08 Xuefeng Liu , Songhao Jiang , Xiaotian Duan , Archit Vasan , Qinan Huang , Chong Liu , Michelle M. Li , Heng Ma , Thomas Brettin , Arvind Ramanathan , Fangfang Xia , Mengdi Wang , Abhishek Pandey , Marinka Zitnik , Ian T. Foster , Jinbo Xu , Rick L. Stevens

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Predicting how a drug-like molecule binds to a specific protein target is a core problem in drug discovery. An extremely fast computational binding method would enable key applications such as fast virtual screening or drug engineering.…

Biomolecules · Quantitative Biology 2022-06-07 Hannes Stärk , Octavian-Eugen Ganea , Lagnajit Pattanaik , Regina Barzilay , Tommi Jaakkola

On Statistical Bias In Active Learning: How and When To Fix It

Active learning is a powerful tool when labelling data is expensive, but it introduces a bias because the training data no longer follows the population distribution. We formalize this bias and investigate the situations in which it can be…

Machine Learning · Statistics 2021-06-01 Sebastian Farquhar , Yarin Gal , Tom Rainforth

Learning to Split for Automatic Bias Detection

Classifiers are biased when trained on biased datasets. As a remedy, we propose Learning to Split (ls), an algorithm for automatic bias detection. Given a dataset with input-label pairs, ls learns to split this dataset so that predictors…

Machine Learning · Computer Science 2022-07-22 Yujia Bao , Regina Barzilay

Evaluating Metrics for Bias in Word Embeddings

Over the last years, word and sentence embeddings have established as text preprocessing for all kinds of NLP tasks and improved the performances significantly. Unfortunately, it has also been shown that these embeddings inherit various…

Computation and Language · Computer Science 2024-09-13 Sarah Schröder , Alexander Schulz , Philip Kenneweg , Robert Feldhans , Fabian Hinder , Barbara Hammer

Cross-Fitting and Averaging for Machine Learning Estimation of Heterogeneous Treatment Effects

We investigate the finite sample performance of sample splitting, cross-fitting and averaging for the estimation of the conditional average treatment effect. Recently proposed methods, so-called meta-learners, make use of machine learning…

Methodology · Statistics 2020-08-27 Daniel Jacob

ViBE: Dressing for Diverse Body Shapes

Body shape plays an important role in determining what garments will best suit a given person, yet today's clothing recommendation methods take a "one shape fits all" approach. These body-agnostic vision methods and datasets are a barrier…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Wei-Lin Hsiao , Kristen Grauman

Automatic Differentiation Variational Inference with Mixtures

Automatic Differentiation Variational Inference (ADVI) is a useful tool for efficiently learning probabilistic models in machine learning. Generally approximate posteriors learned by ADVI are forced to be unimodal in order to facilitate use…

Machine Learning · Computer Science 2020-06-25 Warren R. Morningstar , Sharad M. Vikram , Cusuh Ham , Andrew Gallagher , Joshua V. Dillon

Sampling Bias Correction for Supervised Machine Learning: A Bayesian Inference Approach with Practical Applications

Given a supervised machine learning problem where the training set has been subject to a known sampling bias, how can a model be trained to fit the original dataset? We achieve this through the Bayesian inference framework by altering the…

Machine Learning · Statistics 2022-03-16 Max Sklar

TVAE: Triplet-Based Variational Autoencoder using Metric Learning

Deep metric learning has been demonstrated to be highly effective in learning semantic representation and encoding information that can be used to measure data similarity, by relying on the embedding learned from metric learning. At the…

Machine Learning · Statistics 2023-02-09 Haque Ishfaq , Assaf Hoogi , Daniel Rubin

Using Attribution to Decode Dataset Bias in Neural Network Models for Chemistry

Deep neural networks have achieved state of the art accuracy at classifying molecules with respect to whether they bind to specific protein targets. A key breakthrough would occur if these models could reveal the fragment pharmacophores…

Machine Learning · Computer Science 2020-02-12 Kevin McCloskey , Ankur Taly , Federico Monti , Michael P. Brenner , Lucy Colwell