English
Related papers

Related papers: Diffusion Boosted Trees

200 papers

Fraud detection is to identify, monitor, and prevent potentially fraudulent activities from complex data. The recent development and success in AI, especially machine learning, provides a new data-driven way to deal with fraud. From a…

Machine Learning · Statistics 2023-05-19 Biao Xu , Yao Wang , Xiuwu Liao , Kaidong Wang

Gradient boosted decision trees are a popular machine learning technique, in part because of their ability to give good accuracy with small models. We describe two extensions to the standard tree boosting algorithm designed to increase this…

Machine Learning · Statistics 2017-11-01 Natalia Ponomareva , Thomas Colthurst , Gilbert Hendry , Salem Haykal , Soroush Radpour

We present DiffusionBERT, a new generative masked language model based on discrete diffusion models. Diffusion models and many pre-trained language models have a shared training objective, i.e., denoising, making it possible to combine the…

Computation and Language · Computer Science 2022-12-02 Zhengfu He , Tianxiang Sun , Kuanning Wang , Xuanjing Huang , Xipeng Qiu

TF Boosted Trees (TFBT) is a new open-sourced frame-work for the distributed training of gradient boosted trees. It is based on TensorFlow, and its distinguishing features include a novel architecture, automatic loss differentiation,…

Multi-layered representation is believed to be the key ingredient of deep neural networks especially in cognitive tasks like computer vision. While non-differentiable models such as gradient boosting decision trees (GBDTs) are the dominant…

Machine Learning · Computer Science 2020-07-07 Ji Feng , Yang Yu , Zhi-Hua Zhou

Boosted decision trees are a very powerful machine learning technique. After introducing specific concepts of machine learning in the high-energy physics context and describing ways to quantify the performance and training quality of…

Data Analysis, Statistics and Probability · Physics 2022-06-22 Yann Coadou

Despite the rise to dominance of deep learning in unstructured data domains, tree-based methods such as Random Forests (RF) and Gradient Boosted Decision Trees (GBDT) are still the workhorses for handling discriminative tasks on tabular…

Machine Learning · Computer Science 2025-04-21 João Bravo

The gradient boosting machine is a powerful ensemble-based machine learning method for solving regression problems. However, one of the difficulties of its using is a possible discontinuity of the regression function, which arises when…

Machine Learning · Computer Science 2020-06-22 Andrei V. Konstantinov , Lev V. Utkin

Score-based generative models can effectively learn the distribution of data by estimating the gradient of the distribution. Due to the multi-step denoising characteristic, researchers have recently considered combining score-based…

Machine Learning · Computer Science 2024-12-17 Changyuan Zhao , Hongyang Du , Guangyuan Liu , Dusit Niyato

Random forest is widely exploited as an ensemble learning method. In many practical applications, however, there is still a significant challenge to learn from imbalanced data. To alleviate this limitation, we propose a deep dynamic boosted…

Machine Learning · Computer Science 2022-03-08 Haixin Wang , Xingzhang Ren , Jinan Sun , Wei Ye , Long Chen , Muzhi Yu , Shikun Zhang

Decision trees and diffusion models are ostensibly disparate model classes, one discrete and hierarchical, the other continuous and dynamic. This work unifies the two by establishing a crisp mathematical correspondence between hierarchical…

Machine Learning · Computer Science 2026-05-22 Sai Niranjan Ramachandran , Suvrit Sra

Diffusion Transformer (DiT) has emerged as the new trend of generative diffusion models on image generation. In view of extremely slow convergence in typical DiT, recent breakthroughs have been driven by mask strategy that significantly…

Computer Vision and Pattern Recognition · Computer Science 2024-03-26 Rui Zhu , Yingwei Pan , Yehao Li , Ting Yao , Zhenglong Sun , Tao Mei , Chang Wen Chen

Additive models, such as produced by gradient boosting, and full interaction models, such as classification and regression trees (CART), are widely used algorithms that have been investigated largely in isolation. We show that these models…

Tree ensembles such as XGBoost are often preferred for discriminative tasks in mixed-type tabular data, due to their inductive biases, minimal hyperparameter tuning, and training efficiency. We argue that these qualities, when leveraged…

Machine Learning · Computer Science 2026-03-10 Jim Achterberg , Marcel Haas , Bram van Dijk , Marco Spruit

Deep learning model effectiveness in classification tasks is often challenged by the quality and quantity of training data whenever they are affected by strong spurious correlations between specific attributes and target labels. This…

In many applications of supervised learning, multiple classification or regression outputs have to be predicted jointly. We consider several extensions of gradient boosting to address such problems. We first propose a straightforward…

Machine Learning · Statistics 2019-05-21 Arnaud Joly , Louis Wehenkel , Pierre Geurts

Neural networks have proved to be very robust at processing unstructured data like images, text, videos, and audio. However, it has been observed that their performance is not up to the mark in tabular data; hence tree-based models are…

Machine Learning · Computer Science 2022-04-25 Tushar Sarkar

Diffusion models demonstrate remarkable capabilities in capturing complex data distributions and have achieved compelling results in many generative tasks. While they have recently been extended to dense prediction tasks such as depth…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Haorui Ji , Taojun Lin , Hongdong Li

Object detectors often suffer a decrease in performance due to the large domain gap between the training data (source domain) and real-world data (target domain). Diffusion-based generative models have shown remarkable abilities in…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Boyong He , Yuxiang Ji , Zhuoyue Tan , Liaoni Wu

Tree ensemble algorithms as RandomForest and GradientBoosting are currently the dominant methods for modeling discrete or tabular data, however, they are unable to perform a hierarchical representation learning from raw data as…

Machine Learning · Computer Science 2024-02-07 Ángel Delgado-Panadero , José Alberto Benítez-Andrades , María Teresa García-Ordás
‹ Prev 1 2 3 10 Next ›