English
Related papers

Related papers: CRAFT: ClusteR-specific Assorted Feature selecTion

200 papers

Selecting a small, high-quality subset from a large corpus for fine-tuning is increasingly important as corpora grow to tens of millions of datapoints, making full fine-tuning expensive and often unnecessary. We propose CRAFT (Clustered…

Computation and Language · Computer Science 2026-04-27 Parthasarathi Panda , Asheswari Swain , Subhrakanta Panda

Building high-quality datasets for specialized tasks is a time-consuming and resource-intensive process that often requires specialized domain knowledge. We propose Corpus Retrieval and Augmentation for Fine-Tuning (CRAFT), a method for…

Computation and Language · Computer Science 2025-12-08 Ingo Ziegler , Abdullatif Köksal , Desmond Elliott , Hinrich Schütze

Graph clustering is an unsupervised machine learning method that partitions the nodes in a graph into different groups. Despite achieving significant progress in exploiting both attributed and structured data information, graph clustering…

Machine Learning · Computer Science 2025-01-03 Rui Zhang , Xiaoyang Hou , Zhihua Tian , Yan he , Enchao Gong , Jian Liu , Qingbiao Wu , Kui Ren

We introduce CRAFT, a neuro-symbolic framework for interpretable affordance grounding, which identifies the objects in a scene that enable a given action (e.g., "cut"). CRAFT integrates structured commonsense priors from ConceptNet and…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Zhou Chen , Joe Lin , Sathyanarayanan N. Aakur

We present a structural clustering algorithm for large-scale datasets of small labeled graphs, utilizing a frequent subgraph sampling strategy. A set of representatives provides an intuitive description of each cluster, supports the…

Databases · Computer Science 2016-10-03 Till Schäfer , Petra Mutzel

Clustering is one of the fundamental tasks in computer vision and pattern recognition. Recently, deep clustering methods (algorithms based on deep learning) have attracted wide attention with their impressive performance. Most of these…

Computer Vision and Pattern Recognition · Computer Science 2021-06-14 Yanhai Gan , Xinghui Dong , Huiyu Zhou , Feng Gao , Junyu Dong

With the growing volume of diverse information, the demand for classifying arbitrary topics has become increasingly critical. To address this challenge, we introduce DRAFT, a simple framework designed to train a classifier for few-shot…

Information Retrieval · Computer Science 2023-12-06 Keonwoo Kim , Younggun Lee

Clustering algorithms are fundamental tools across many fields, with density-based methods offering particular advantages in identifying arbitrarily shaped clusters and handling noise. However, their effectiveness is often limited by the…

Machine Learning · Computer Science 2025-12-01 Meysam Shirdel Bilehsavar , Razieh Ghaedi , Samira Seyed Taheri , Xinqi Fan , Christian O'Reilly

Aligning Diffusion models has achieved remarkable breakthroughs in generating high-quality, human preference-aligned images. Existing techniques, such as supervised fine-tuning (SFT) and DPO-style preference optimization, have become…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Zening Sun , Zhengpeng Xie , Lichen Bai , Shitong Shao , Shuo Yang , Zeke Xie

Graph-based clustering methods have demonstrated the effectiveness in various applications. Generally, existing graph-based clustering methods first construct a graph to represent the input data and then partition it to generate the…

Machine Learning · Computer Science 2019-12-17 Yuheng Jia , Hui Liu , Junhui Hou , Sam Kwong

Clustering is a widely used technique in data mining applications for discovering patterns in underlying data. Most traditional clustering algorithms are limited to handling datasets that contain either numeric or categorical attributes.…

Artificial Intelligence · Computer Science 2007-05-23 Zengyou He , Xiaofei Xu , Shengchun Deng

A major limitation of clustering approaches is their lack of explainability: methods rarely provide insight into which features drive the grouping of similar observations. To address this limitation, we propose an ensemble-based clustering…

Machine Learning · Statistics 2026-03-23 Federico Maria Quetti , Elena Ballante , Silvia Figini , Paolo Giudici

With the explosive growth of multi-source data, multi-view clustering has attracted great attention in recent years. Most existing multi-view methods operate in raw feature space and heavily depend on the quality of original feature…

Machine Learning · Computer Science 2022-05-20 Liang Liu , Peng Chen , Guangchun Luo , Zhao Kang , Yonggang Luo , Sanchu Han

Clustering is a core task in machine learning with wide-ranging applications in data mining and pattern recognition. However, its unsupervised nature makes it inherently challenging. Many existing clustering algorithms suffer from critical…

Machine Learning · Computer Science 2025-07-29 Ahmed Shokry , Ayman Khalafallah

Grounded multi-video question answering over real-world news events requires systems to surface query-relevant evidence across heterogeneous video archives while attributing every claim to its supporting source. We introduce CRAFT…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 Mahesh Bhosale , Abdul Wasi , Vishvesh Trivedi , Pengyu Yan , Akhil Gorugantu , David Doermann

Attributed graph clustering is challenging as it requires joint modelling of graph structures and node attributes. Recent progress on graph convolutional networks has proved that graph convolution is effective in combining structural and…

Machine Learning · Computer Science 2019-06-05 Xiaotong Zhang , Han Liu , Qimai Li , Xiao-Ming Wu

We propose a novel methodology for feature screening in clustering massive datasets, in which both the number of features and the number of observations can potentially be very large. Taking advantage of a fusion penalization based convex…

Methodology · Statistics 2017-10-05 Trambak Banerjee , Gourab Mukherjee , Peter Radchenko

Object detection is a fundamental problem in image understanding. One popular solution is the R-CNN framework and its fast versions. They decompose the object detection problem into two cascaded easier tasks: 1) generating object proposals…

Computer Vision and Pattern Recognition · Computer Science 2016-04-13 Bin Yang , Junjie Yan , Zhen Lei , Stan Z. Li

We develop a new density-based clustering algorithm named CRAD which is based on a new neighbor searching function with a robust data depth as the dissimilarity measure. Our experiments prove that the new CRAD is highly competitive at…

Computation · Statistics 2019-04-09 Xin Huang , Yulia R. Gel

Clustering is one of the most common unsupervised learning tasks in machine learning and data mining. Clustering algorithms have been used in a plethora of applications across several scientific fields. However, there has been limited…

Machine Learning · Computer Science 2017-02-09 Quang N. Tran , Ba-Ngu Vo , Dinh Phung , Ba-Tuong Vo
‹ Prev 1 2 3 10 Next ›