English
Related papers

Related papers: Multi-Objective Hyperparameter Selection via Hypot…

200 papers

Hyperparameter selection is a critical step in the deployment of artificial intelligence (AI) models, particularly in the current era of foundational, pre-trained, models. By framing hyperparameter selection as a multiple hypothesis testing…

Machine Learning · Computer Science 2025-02-07 Amirmohammad Farzaneh , Osvaldo Simeone

Recent work in graph models has found that probabilistic hyperedge replacement grammars (HRGs) can be extracted from graphs and used to generate new random graphs with graph properties and substructures close to the original. In this paper,…

Social and Information Networks · Computer Science 2018-06-22 Xinyi Wang , Salvador Aguinaga , Tim Weninger , David Chiang

Deep reinforcement learning (deep RL) has achieved tremendous success on various domains through a combination of algorithmic design and careful selection of hyper-parameters. Algorithmic improvements are often the result of iterative…

Machine Learning · Computer Science 2024-12-02 Johan Obando-Ceron , João G. M. Araújo , Aaron Courville , Pablo Samuel Castro

Schema matching is the process of identifying correspondences between the elements of two given schemata, essential for database management systems, data integration, and data warehousing. For datasets across different scenarios, the…

Databases · Computer Science 2025-03-07 Longyu Feng , Huahang Li , Chen Jason Zhang

Large Language Models (LLMs) have shown promising results on various language and vision tasks. Recently, there has been growing interest in applying LLMs to graph-based tasks, particularly on Text-Attributed Graphs (TAGs). However, most…

Machine Learning · Computer Science 2024-06-10 Zhongmou He , Jing Zhu , Shengyi Qian , Joyce Chai , Danai Koutra

Large Language Models (LLMs) have garnered considerable interest within both academic and industrial. Yet, the application of LLMs to graph data remains under-explored. In this study, we evaluate the capabilities of four LLMs in addressing…

Artificial Intelligence · Computer Science 2023-09-12 Chang Liu , Bo Wu

Uncertain, or probabilistic, graphs have been increasingly used to represent noisy linked data in many emerging applications, and have recently attracted the attention of the database research community. A fundamental problem on uncertain…

Social and Information Networks · Computer Science 2019-04-11 Xiangyu Ke , Arijit Khan , Leroy Lim Hong Quan

Informative data selection is a key requirement for large language models (LLMs) to minimize the amount of data required for fine-tuning, network distillation, and token pruning, enabling fast and efficient deployment, especially under…

Machine Learning · Computer Science 2026-02-03 Ahmad Sarlak , Abolfazl Razi

Multi-target tracking (MTT) serves as a cornerstone technology in information fusion, yet faces significant challenges in robustness and efficiency when dealing with model uncertainties, clutter interference, and target interactions.…

Systems and Control · Electrical Eng. & Systems 2025-07-21 Ming Lei , Shufan Wu

Prospect Theory (PT) models human decision-making behaviour under uncertainty, among which linguistic uncertainty is commonly adopted in real-world scenarios. Although recent studies have developed some frameworks to test PT parameters for…

Artificial Intelligence · Computer Science 2026-04-13 Rui Wang , Qihan Lin , Jiayu Liu , Qing Zong , Tianshi Zheng , Dadi Guo , Haochen Shi , Weiqi Wang , Yangqiu Song

Long Chain-of-Thought (LCoT), achieved by Reinforcement Learning with Verifiable Rewards (RLVR), has proven effective in enhancing the reasoning capabilities of Large Language Models (LLMs). However, reasoning in current LLMs is primarily…

We introduce adaptive learn-then-test (aLTT), an efficient hyperparameter selection procedure that provides finite-sample statistical guarantees on the population risk of AI models. Unlike the existing learn-then-test (LTT) technique, which…

Machine Learning · Statistics 2025-02-03 Matteo Zecchin , Sangwoo Park , Osvaldo Simeone

There is an increasing interest in leveraging Large Language Models (LLMs) for managing structured data and enhancing data science processes. Despite the potential benefits, this integration poses significant questions regarding their…

Artificial Intelligence · Computer Science 2023-11-21 Nathalia Nascimento , Cristina Tavares , Paulo Alencar , Donald Cowan

Routing problems such as Hamiltonian Path Problem (HPP), seeks a path to visit all the vertices in a graph while minimizing the path cost. This paper studies a variant, HPP with Probabilistic Terminals (HPP-PT), where each vertex has a…

Robotics · Computer Science 2026-01-21 Yunpeng Lyu , Chao Cao , Ji Zhang , Howie Choset , Zhongqiang Ren

Large language models (LLMs) have become mainstream technology with their versatile use cases and impressive performance. Despite the countless out-of-the-box applications, LLMs are still not reliable. A lot of work is being done to improve…

Computation and Language · Computer Science 2023-06-13 Aisha Khatun , Daniel G. Brown

While Retrieval Augmented Generation (RAG) has emerged as a popular technique for improving Large Language Model (LLM) systems, it introduces a large number of choices, parameters and hyperparameters that must be made or tuned. This…

Machine Learning · Computer Science 2025-05-09 Matthew Barker , Andrew Bell , Evan Thomas , James Carr , Thomas Andrews , Umang Bhatt

High-quality training data is critical to the performance of large language models (LLMs). Recent work has explored using LLMs to rate and select data based on a small set of human-designed criteria (rules), but these approaches often rely…

Computation and Language · Computer Science 2025-11-12 Xiaomin Li , Mingye Gao , Zhiwei Zhang , Chang Yue , Hong Hu

Large Language Models (LLMs) are increasingly adopted for complex scientific text generation tasks, yet they often suffer from limitations in accuracy, consistency, and hallucination control. This thesis introduces a Parameter-Efficient…

Computation and Language · Computer Science 2024-11-12 Daniil Sulimov

The maximum likelihood threshold (MLT) of a graph $G$ is the minimum number of samples to almost surely guarantee existence of the maximum likelihood estimate in the corresponding Gaussian graphical model. We give a new characterization of…

Ranking documents using Large Language Models (LLMs) by directly feeding the query and candidate documents into the prompt is an interesting and practical problem. However, researchers have found it difficult to outperform fine-tuned…

Information Retrieval · Computer Science 2024-03-29 Zhen Qin , Rolf Jagerman , Kai Hui , Honglei Zhuang , Junru Wu , Le Yan , Jiaming Shen , Tianqi Liu , Jialu Liu , Donald Metzler , Xuanhui Wang , Michael Bendersky
‹ Prev 1 2 3 10 Next ›