Related papers: MarkMatch: Same-Hand Stuffing Detection

BoundMatch: Boundary detection applied to semi-supervised segmentation

Semi-supervised semantic segmentation (SS-SS) aims to mitigate the heavy annotation burden of dense pixel labeling by leveraging abundant unlabeled images alongside a small labeled set. While current consistency regularization methods…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Haruya Ishikawa , Yoshimitsu Aoki

SimMatch: Semi-supervised Learning with Similarity Matching

Learning with few labeled data has been a longstanding problem in the computer vision and machine learning research community. In this paper, we introduced a new semi-supervised learning framework, SimMatch, which simultaneously considers…

Computer Vision and Pattern Recognition · Computer Science 2022-03-18 Mingkai Zheng , Shan You , Lang Huang , Fei Wang , Chen Qian , Chang Xu

SelfMatch: Combining Contrastive Self-Supervision and Consistency for Semi-Supervised Learning

This paper introduces SelfMatch, a semi-supervised learning method that combines the power of contrastive self-supervised learning and consistency regularization. SelfMatch consists of two stages: (1) self-supervised pre-training based on…

Machine Learning · Computer Science 2021-01-19 Byoungjip Kim , Jinho Choo , Yeong-Dae Kwon , Seongho Joe , Seungjai Min , Youngjune Gwon

MultiMatch: Multihead Consistency Regularization Matching for Semi-Supervised Text Classification

We introduce MultiMatch, a novel semi-supervised learning (SSL) algorithm combining the paradigms of co-training and consistency regularization with pseudo-labeling. At its core, MultiMatch features a pseudo-label weighting module designed…

Computation and Language · Computer Science 2025-11-04 Iustin Sirbu , Robert-Adrian Popovici , Cornelia Caragea , Stefan Trausan-Matu , Traian Rebedea

MATCHA: Matching Text via Contrastive Semantic Alignment

Reliable evaluation is essential for understanding large language model (LLM) performance, yet today's go-to metrics, namely token-overlap scores (e.g., ROUGE) and embedding-based measures (e.g., BERTScore), often misjudge semantic…

Computation and Language · Computer Science 2026-05-27 Siran Li , Ece Sena Etoglu , Carsten Eickhoff , Seyed Ali Bahrainian

compare-mt: A Tool for Holistic Comparison of Language Generation Systems

In this paper, we describe compare-mt, a tool for holistic analysis and comparison of the results of systems for language generation tasks such as machine translation. The main goal of the tool is to give the user a high-level and coherent…

Computation and Language · Computer Science 2019-09-20 Graham Neubig , Zi-Yi Dou , Junjie Hu , Paul Michel , Danish Pruthi , Xinyi Wang , John Wieting

JointMatch: A Unified Approach for Diverse and Collaborative Pseudo-Labeling to Semi-Supervised Text Classification

Semi-supervised text classification (SSTC) has gained increasing attention due to its ability to leverage unlabeled data. However, existing approaches based on pseudo-labeling suffer from the issues of pseudo-label bias and error…

Computation and Language · Computer Science 2023-10-24 Henry Peng Zou , Cornelia Caragea

MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins

We introduce MarginMatch, a new SSL approach combining consistency regularization and pseudo-labeling, with its main novelty arising from the use of unlabeled data training dynamics to measure pseudo-label quality. Instead of using only the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Tiberiu Sosea , Cornelia Caragea

RB2: Robotic Manipulation Benchmarking with a Twist

Benchmarks offer a scientific way to compare algorithms using objective performance metrics. Good benchmarks have two features: (a) they should be widely useful for many research groups; (b) and they should produce reproducible findings. In…

Robotics · Computer Science 2022-11-01 Sudeep Dasari , Jianren Wang , Joyce Hong , Shikhar Bahl , Yixin Lin , Austin Wang , Abitha Thankaraj , Karanbir Chahal , Berk Calli , Saurabh Gupta , David Held , Lerrel Pinto , Deepak Pathak , Vikash Kumar , Abhinav Gupta

Matching Handwritten Document Images

We address the problem of predicting similarity between a pair of handwritten document images written by different individuals. This has applications related to matching and mining in image collections containing handwritten content. A…

Computer Vision and Pattern Recognition · Computer Science 2016-05-20 Praveen Krishnan , C. V. Jawahar

They may look and look, yet not see: BMDs cannot be tested adequately

Bugs, misconfiguration, and malware can cause ballot-marking devices (BMDs) to print incorrect votes. Several approaches to testing BMDs have been proposed. In logic and accuracy testing (LAT) and parallel or live testing, auditors input…

Applications · Statistics 2022-07-27 Philip B. Stark , Ran Xie

I Know You Did Not Write That! A Sampling Based Watermarking Method for Identifying Machine Generated Text

Potential harms of Large Language Models such as mass misinformation and plagiarism can be partially mitigated if there exists a reliable way to detect machine generated text. In this paper, we propose a new watermarking method to detect…

Computation and Language · Computer Science 2023-12-12 Kaan Efe Keleş , Ömer Kaan Gürbüz , Mucahid Kutlu

DualMatch: Robust Semi-Supervised Learning with Dual-Level Interaction

Semi-supervised learning provides an expressive framework for exploiting unlabeled data when labels are insufficient. Previous semi-supervised learning methods typically match model predictions of different data-augmented views in a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Cong Wang , Xiaofeng Cao , Lanzhe Guo2 , Zenglin Shi

CoMatch: Semi-supervised Learning with Contrastive Graph Regularization

Semi-supervised learning has been an effective paradigm for leveraging unlabeled data to reduce the reliance on labeled data. We propose CoMatch, a new semi-supervised learning method that unifies dominant approaches and addresses their…

Machine Learning · Computer Science 2021-03-04 Junnan Li , Caiming Xiong , Steven Hoi

ReRankMatch: Semi-Supervised Learning with Semantics-Oriented Similarity Representation

This paper proposes integrating semantics-oriented similarity representation into RankingMatch, a recently proposed semi-supervised learning method. Our method, dubbed ReRankMatch, aims to deal with the case in which labeled and unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2021-09-28 Trung Quang Tran , Mingu Kang , Daeyoung Kim

PatentMatch: A Dataset for Matching Patent Claims & Prior Art

Patent examiners need to solve a complex information retrieval task when they assess the novelty and inventive step of claims made in a patent application. Given a claim, they search for prior art, which comprises all relevant publicly…

Information Retrieval · Computer Science 2020-12-29 Julian Risch , Nicolas Alder , Christoph Hewel , Ralf Krestel

Supervised machine learning techniques for data matching based on similarity metrics

Businesses, governmental bodies and NGO's have an ever-increasing amount of data at their disposal from which they try to extract valuable information. Often, this needs to be done not only accurately but also within a short time frame.…

Machine Learning · Computer Science 2021-09-16 Pim Verschuuren , Serena Palazzo , Tom Powell , Steve Sutton , Alfred Pilgrim , Michele Faucci Giannelli

SilkMoth: An Efficient Method for Finding Related Sets with Maximum Matching Constraints

Determining if two sets are related - that is, if they have similar values or if one set contains the other - is an important problem with many applications in data cleaning, data integration, and information retrieval. A particularly…

Databases · Computer Science 2017-12-19 Dong Deng , Albert Kim , Samuel Madden , Michael Stonebraker

RankingMatch: Delving into Semi-Supervised Learning with Consistency Regularization and Ranking Loss

Semi-supervised learning (SSL) has played an important role in leveraging unlabeled data when labeled data is limited. One of the most successful SSL approaches is based on consistency regularization, which encourages the model to produce…

Computer Vision and Pattern Recognition · Computer Science 2021-10-12 Trung Q. Tran , Mingu Kang , Daeyoung Kim

Benchmarks Saturate When The Model Gets Smarter Than The Judge

Benchmarks are important tools to track progress in the development of Large Language Models (LLMs), yet inaccuracies in datasets and evaluation methods consistently undermine their effectiveness. Here, we present Omni-MATH-2, a manually…

Artificial Intelligence · Computer Science 2026-01-28 Marthe Ballon , Andres Algaba , Brecht Verbeken , Vincent Ginis