Related papers: Efficient and Accurate In-Database Machine Learnin…

Enhanced prediction accuracy with uncertainty quantification in monitoring CO2 sequestration using convolutional neural networks

Monitoring changes inside a reservoir in real time is crucial for the success of CO2 injection and long-term storage. Machine learning (ML) is well-suited for real-time CO2 monitoring because of its computational efficiency. However, most…

Geophysics · Physics 2022-12-12 Yanhua Liu, Xitong Zhang, Ilya Tsvankin, Youzuo Lin

Towards Expectation-Maximization by SQL in RDBMS

Integrating machine learning techniques into RDBMSs is an important task since there are many real applications that require modeling (e.g., business intelligence, strategic analysis) as well as querying data in RDBMSs. In this paper, we…

Databases · Computer Science 2021-01-25 Kangfei Zhao, Jeffrey Xu Yu, Yu Rong, Ming Liao, Junzhou Huang

Binary Code based Hash Embedding for Web-scale Applications

Nowadays, deep learning models are widely adopted in web-scale applications such as recommender systems and online advertising. In these applications, embedding learning of categorical features is crucial to the success of deep learning…

Information Retrieval · Computer Science 2021-09-07 Bencheng Yan, Pengjie Wang, Jinquan Liu, Wei Lin, Kuang-Chih Lee, Jian Xu, Bo Zheng

iPDB -- Optimizing Semantic SQL Queries

Structured Query Language (SQL) has remained the standard query language for databases. SQL is highly optimized for processing structured data laid out in relations. Meanwhile, in the present application development landscape, it is highly…

Databases · Computer Science 2026-04-24 Udesh Kumarasinghe, Tyler Liu, Ahmed R. Mahmood, Chunwei Liu, Walid G. Aref

Optimizing Code Embeddings and ML Classifiers for Python Source Code Vulnerability Detection

In recent years, the growing complexity and scale of source code have rendered manual software vulnerability detection increasingly impractical. To address this challenge, automated approaches leveraging machine learning and code embeddings…

Software Engineering · Computer Science 2025-09-17 Talaya Farasat, Joachim Posegga

Efficient end-to-end learning for quantizable representations

Embedding representation learning via neural networks is at the core foundation of modern similarity based search. While much effort has been put in developing algorithms for learning binary hamming code representations for search…

Machine Learning · Computer Science 2018-06-13 Yeonwoo Jeong, Hyun Oh Song

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Post-hoc multi-class calibration is a common approach for providing high-quality confidence estimates of deep neural network predictions. Recent work has shown that widely used scaling methods underestimate their calibration error, while…

Machine Learning · Computer Science 2022-11-28 Kanil Patel, William Beluch, Bin Yang, Michael Pfeiffer, Dan Zhang

Hybrid Quantum Neural Networks with Amplitude Encoding: Advancing Recovery Rate Predictions

Recovery rate prediction plays a pivotal role in bond investment strategies by enhancing risk assessment, optimizing portfolio allocation, improving pricing accuracy, and supporting effective credit risk management. However, accurate…

Computational Finance · Quantitative Finance 2026-01-27 Ying Chen, Paul Griffin, Paolo Recchia, Lei Zhou, Hongrui Zhang

Projection Bank: From High-dimensional Data to Medium-length Binary Codes

Recently, very high-dimensional feature representations, e.g., Fisher Vector, have achieved excellent performance for visual recognition and retrieval. However, these lengthy representations always cause extremely heavy computational and…

Computer Vision and Pattern Recognition · Computer Science 2015-09-17 Li Liu, Mengyang Yu, Ling Shao

Data-efficient LLM Fine-tuning for Code Generation

Large language models (LLMs) have demonstrated significant potential in code generation tasks. However, there remains a performance gap between open-source and closed-source models. To address this gap, existing approaches typically…

Computation and Language · Computer Science 2025-04-18 Weijie Lv, Xuan Xia, Sheng-Jun Huang

$\text{M}^3\text{PDB}$: A Multimodal, Multi-Label, Multilingual Prompt Database for Speech Generation

Recent advancements in zero-shot speech generation have enabled models to synthesize speech that mimics speaker identity and speaking style from speech prompts. However, these models' effectiveness is significantly limited in real-world…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-14 Boyu Zhu, Cheng Gong, Muyang Wu, Ruihao Jing, Fan Liu, Xiaolei Zhang, Chi Zhang, Xuelong Li

SEHFS: Structural Entropy-Guided High-Order Correlation Learning for Multi-View Multi-Label Feature Selection

In recent years, multi-view multi-label learning (MVML) has attracted extensive attention due to its close alignment to real-world scenarios. Information-theoretic methods have gained prominence for learning nonlinear correlations. However,…

Machine Learning · Computer Science 2026-03-04 Cheng Peng, Yonghao Li, Wanfu Gao, Jie Wen, Weiping Ding

Introspective Deep Metric Learning for Image Retrieval

This paper proposes an introspective deep metric learning (IDML) framework for uncertainty-aware comparisons of images. Conventional deep metric learning methods produce confident semantic distances between images regardless of the…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Wenzhao Zheng, Chengkun Wang, Jie Zhou, Jiwen Lu

A Subsequence Interleaving Model for Sequential Pattern Mining

Recent sequential pattern mining methods have used the minimum description length (MDL) principle to define an encoding scheme which describes an algorithm for mining the most compressing patterns in a database. We present a novel…

Machine Learning · Statistics 2016-11-14 Jaroslav Fowkes, Charles Sutton

Efficient Syndrome Decoder for Heavy Hexagonal QECC via Machine Learning

Error syndromes for heavy hexagonal code and other topological codes such as surface code have typically been decoded by using Minimum Weight Perfect Matching (MWPM) based methods. Recent advances have shown that topological codes can be…

Information Theory · Computer Science 2024-04-03 Debasmita Bhoumik, Ritajit Majumdar, Dhiraj Madan, Dhinakaran Vinayagamurthy, Shesha Raghunathan, Susmita Sur-Kolay

SINDybrid: automatic generation of hybrid models for dynamic systems

Hybrid modelling enhances the accuracy and predictive capability of dynamic models by integrating first principles with data-driven methods, effectively mitigating epistemic uncertainties inherent in mechanistic approaches. However, hybrid…

Dynamical Systems · Mathematics 2025-06-17 Ulderico Di Caprio, M. Enis Leblebici

IDEL: In-Database Entity Linking with Neural Embeddings

We present a novel architecture, In-Database Entity Linking (IDEL), in which we integrate the analytics-optimized RDBMS MonetDB with neural text mining abilities. Our system design abstracts core tasks of most neural entity linking systems…

Databases · Computer Science 2018-03-14 Torsten Kilias, Alexander Löser, Felix A. Gers, Richard Koopmanschap, Ying Zhang, Martin Kersten

Hardness-Aware Deep Metric Learning

This paper presents a hardness-aware deep metric learning (HDML) framework. Most previous deep metric learning methods employ the hard negative mining strategy to alleviate the lack of informative samples for training. However, this mining…

Computer Vision and Pattern Recognition · Computer Science 2019-12-05 Wenzhao Zheng, Zhaodong Chen, Jiwen Lu, Jie Zhou

High-Dimensional Probability Estimation with Deep Density Models

One of the fundamental problems in machine learning is the estimation of a probability distribution from data. Many techniques have been proposed to study the structure of data, most often building around the assumption that observations…

Machine Learning · Statistics 2013-02-22 Oren Rippel, Ryan Prescott Adams

Scalable Angular Discriminative Deep Metric Learning for Face Recognition

With the development of deep learning, Deep Metric Learning (DML) has achieved great improvements in face recognition. Specifically, the widely used softmax loss in the training process often brings large intra-class variations, and feature…

Computer Vision and Pattern Recognition · Computer Science 2018-05-02 Bowen Wu, Huaming Wu, Monica M. Y. Zhang