Related papers: MarkMatch: Same-Hand Stuffing Detection

200 papers

Semi-supervised semantic segmentation (SS-SS) aims to mitigate the heavy annotation burden of dense pixel labeling by leveraging abundant unlabeled images alongside a small labeled set. While current consistency regularization methods…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Haruya Ishikawa , Yoshimitsu Aoki

Learning with few labeled data has been a longstanding problem in the computer vision and machine learning research community. In this paper, we introduced a new semi-supervised learning framework, SimMatch, which simultaneously considers…

Computer Vision and Pattern Recognition · Computer Science 2022-03-18 Mingkai Zheng , Shan You , Lang Huang , Fei Wang , Chen Qian , Chang Xu

This paper introduces SelfMatch, a semi-supervised learning method that combines the power of contrastive self-supervised learning and consistency regularization. SelfMatch consists of two stages: (1) self-supervised pre-training based on…

Machine Learning · Computer Science 2021-01-19 Byoungjip Kim , Jinho Choo , Yeong-Dae Kwon , Seongho Joe , Seungjai Min , Youngjune Gwon

We introduce MultiMatch, a novel semi-supervised learning (SSL) algorithm combining the paradigms of co-training and consistency regularization with pseudo-labeling. At its core, MultiMatch features a pseudo-label weighting module designed…

Computation and Language · Computer Science 2025-11-04 Iustin Sirbu , Robert-Adrian Popovici , Cornelia Caragea , Stefan Trausan-Matu , Traian Rebedea

Reliable evaluation is essential for understanding large language model (LLM) performance, yet today's go-to metrics, namely token-overlap scores (e.g., ROUGE) and embedding-based measures (e.g., BERTScore), often misjudge semantic…

Computation and Language · Computer Science 2026-05-27 Siran Li , Ece Sena Etoglu , Carsten Eickhoff , Seyed Ali Bahrainian

In this paper, we describe compare-mt, a tool for holistic analysis and comparison of the results of systems for language generation tasks such as machine translation. The main goal of the tool is to give the user a high-level and coherent…

Computation and Language · Computer Science 2019-09-20 Graham Neubig , Zi-Yi Dou , Junjie Hu , Paul Michel , Danish Pruthi , Xinyi Wang , John Wieting

Semi-supervised text classification (SSTC) has gained increasing attention due to its ability to leverage unlabeled data. However, existing approaches based on pseudo-labeling suffer from the issues of pseudo-label bias and error…

Computation and Language · Computer Science 2023-10-24 Henry Peng Zou , Cornelia Caragea

We introduce MarginMatch, a new SSL approach combining consistency regularization and pseudo-labeling, with its main novelty arising from the use of unlabeled data training dynamics to measure pseudo-label quality. Instead of using only the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Tiberiu Sosea , Cornelia Caragea

Benchmarks offer a scientific way to compare algorithms using objective performance metrics. Good benchmarks have two features: (a) they should be widely useful for many research groups; (b) and they should produce reproducible findings. In…

We address the problem of predicting similarity between a pair of handwritten document images written by different individuals. This has applications related to matching and mining in image collections containing handwritten content. A…

Computer Vision and Pattern Recognition · Computer Science 2016-05-20 Praveen Krishnan , C. V. Jawahar

Bugs, misconfiguration, and malware can cause ballot-marking devices (BMDs) to print incorrect votes. Several approaches to testing BMDs have been proposed. In logic and accuracy testing (LAT) and parallel or live testing, auditors input…

Applications · Statistics 2022-07-27 Philip B. Stark , Ran Xie

Potential harms of Large Language Models such as mass misinformation and plagiarism can be partially mitigated if there exists a reliable way to detect machine generated text. In this paper, we propose a new watermarking method to detect…

Computation and Language · Computer Science 2023-12-12 Kaan Efe Keleş , Ömer Kaan Gürbüz , Mucahid Kutlu

Semi-supervised learning provides an expressive framework for exploiting unlabeled data when labels are insufficient. Previous semi-supervised learning methods typically match model predictions of different data-augmented views in a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Cong Wang , Xiaofeng Cao , Lanzhe Guo , Zenglin Shi

Semi-supervised learning has been an effective paradigm for leveraging unlabeled data to reduce the reliance on labeled data. We propose CoMatch, a new semi-supervised learning method that unifies dominant approaches and addresses their…

Machine Learning · Computer Science 2021-03-04 Junnan Li , Caiming Xiong , Steven Hoi

This paper proposes integrating semantics-oriented similarity representation into RankingMatch, a recently proposed semi-supervised learning method. Our method, dubbed ReRankMatch, aims to deal with the case in which labeled and unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2021-09-28 Trung Quang Tran , Mingu Kang , Daeyoung Kim

Patent examiners need to solve a complex information retrieval task when they assess the novelty and inventive step of claims made in a patent application. Given a claim, they search for prior art, which comprises all relevant publicly…

Information Retrieval · Computer Science 2020-12-29 Julian Risch , Nicolas Alder , Christoph Hewel , Ralf Krestel

Businesses, governmental bodies and NGOs have an ever-increasing amount of data at their disposal from which they try to extract valuable information. Often, this needs to be done not only accurately but also within a short time frame.…

Machine Learning · Computer Science 2021-09-16 Pim Verschuuren , Serena Palazzo , Tom Powell , Steve Sutton , Alfred Pilgrim , Michele Faucci Giannelli

Determining if two sets are related - that is, if they have similar values or if one set contains the other - is an important problem with many applications in data cleaning, data integration, and information retrieval. A particularly…

Databases · Computer Science 2017-12-19 Dong Deng , Albert Kim , Samuel Madden , Michael Stonebraker

Semi-supervised learning (SSL) has played an important role in leveraging unlabeled data when labeled data is limited. One of the most successful SSL approaches is based on consistency regularization, which encourages the model to produce…

Computer Vision and Pattern Recognition · Computer Science 2021-10-12 Trung Q. Tran , Mingu Kang , Daeyoung Kim

Benchmarks are important tools to track progress in the development of Large Language Models (LLMs), yet inaccuracies in datasets and evaluation methods consistently undermine their effectiveness. Here, we present Omni-MATH-2, a manually…

Artificial Intelligence · Computer Science 2026-01-28 Marthe Ballon , Andres Algaba , Brecht Verbeken , Vincent Ginis