Related papers: Browserbite: Cross-Browser Testing via Image Proce…

XBIDetective: Leveraging Vision Language Models for Identifying Cross-Browser Visual Inconsistencies

Browser rendering bugs can be challenging to detect for browser developers, as they may be triggered by very specific conditions that are exhibited on only a very small subset of websites. Cross-browser inconsistencies (XBIs), variations in…

Software Engineering · Computer Science 2025-12-19 Balreet Grewal , James Graham , Jeff Muizelaar , Jan Honza Odvarko , Suhaib Mujahid , Marco Castelluccio , Cor-Paul Bezemer

Page Segmentation using Visual Adjacency Analysis

Page segmentation is a web page analysis process that divides a page into cohesive segments, such as sidebars, headers, and footers. Current page segmentation approaches use either the DOM, textual content, or rendering style information of…

Computer Vision and Pattern Recognition · Computer Science 2021-12-23 Mohammad Bajammal , Ali Mesbah

Cross-Domain Document Object Detection: Benchmark Suite and Method

Decomposing images of document pages into high-level semantic regions (e.g., figures, tables, paragraphs), document object detection (DOD) is fundamental for downstream tasks like intelligent document editing and understanding. DOD remains…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Kai Li , Curtis Wigington , Chris Tensmeyer , Handong Zhao , Nikolaos Barmpalios , Vlad I. Morariu , Varun Manjunatha , Tong Sun , Yun Fu

Cross-Domain Object Matching with Model Selection

The goal of cross-domain object matching (CDOM) is to find correspondence between two sets of objects in different domains in an unsupervised way. Photo album summarization is a typical application of CDOM, where photos are automatically…

Machine Learning · Statistics 2010-12-08 Makoto Yamada , Masashi Sugiyama

Semantic Cross-View Matching

Matching cross-view images is challenging because the appearance and viewpoints are significantly different. While low-level features based on gradient orientations or filter responses can drastically vary with such changes in viewpoint,…

Computer Vision and Pattern Recognition · Computer Science 2015-11-03 Francesco Castaldo , Amir Zamir , Roland Angst , Francesco Palmieri , Silvio Savarese

Approach for Document Detection by Contours and Contrasts

This paper considers arbitrary document detection performed on a mobile device. The classical contour-based approach often fails in cases featuring occlusion, complex background, or blur. The region-based approach, which relies on the…

Computer Vision and Pattern Recognition · Computer Science 2021-07-02 Daniil V. Tropin , Sergey A. Ilyuhin , Dmitry P. Nikolaev , Vladimir V. Arlazarov

Cross-Domain Face Verification: Matching ID Document and Self-Portrait Photographs

Cross-domain biometrics has been emerging as a new necessity, which poses several additional challenges, including harsh illumination changes, noise, pose variation, among others. In this paper, we explore approaches to cross-domain face…

Computer Vision and Pattern Recognition · Computer Science 2016-11-18 Guilherme Folego , Marcus A. Angeloni , José Augusto Stuchi , Alan Godoy , Anderson Rocha

Caption-Matching: A Multimodal Approach for Cross-Domain Image Retrieval

Cross-Domain Image Retrieval (CDIR) is a challenging task in computer vision, aiming to match images across different visual domains such as sketches, paintings, and photographs. Existing CDIR methods rely either on supervised learning with…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Lucas Iijima , Nikolaos Giakoumoglou , Tania Stathaki

Mere Contrastive Learning for Cross-Domain Sentiment Analysis

Cross-domain sentiment analysis aims to predict the sentiment of texts in the target domain using the model trained on the source domain to cope with the scarcity of labeled data. Previous studies are mostly cross-entropy-based methods for…

Computation and Language · Computer Science 2022-08-19 Yun Luo , Fang Guo , Zihan Liu , Yue Zhang

Webpage Segmentation for Extracting Images and Their Surrounding Contextual Information

Web images come in hand with valuable contextual information. Although this information has long been mined for various uses such as image annotation, clustering of images, inference of image semantic content, etc., insufficient attention…

Multimedia · Computer Science 2020-05-21 F. Fauzi , H. J. Long , M. Belkhatir

SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap

Inspired by the great success of language model (LM)-based pre-training, recent studies in visual document understanding have explored LM-based pre-training methods for modeling text within document images. Among them, pre-training that…

Computer Vision and Pattern Recognition · Computer Science 2023-09-25 Daehee Kim , Yoonsik Kim , DongHyun Kim , Yumin Lim , Geewook Kim , Taeho Kil

Don't read, just look: Main content extraction from web pages using visual features

Extracting main content from web pages provides primary informative blocks that remove a web page's minor areas like navigation menu, ads, and site templates. The main content extraction has various applications: information retrieval,…

Information Retrieval · Computer Science 2022-01-26 Geunseong Jung , Sungjae Han , Hansung Kim , Kwanguk Kim , Jaehyuk Cha

Cross-domain Human Parsing via Adversarial Feature and Label Adaptation

Human parsing has been extensively studied recently due to its wide applications in many important scenarios. Mainstream fashion parsing models focus on parsing the high-resolution and clean images. However, directly applying the parsers…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Si Liu , Yao Sun , Defa Zhu , Guanghui Ren , Yu Chen , Jiashi Feng , Jizhong Han

Cross-Domain Visual Matching via Generalized Similarity Measure and Feature Learning

Cross-domain visual data matching is one of the fundamental problems in many real-world vision tasks, e.g., matching persons across ID photos and surveillance videos. Conventional approaches to this problem usually involves two steps: i)…

Computer Vision and Pattern Recognition · Computer Science 2016-11-17 Liang Lin , Guangrun Wang , Wangmeng Zuo , Xiangchu Feng , Lei Zhang

Web Based Cross Language Plagiarism Detection

As the Internet help us cross language and cultural border by providing different types of translation tools, cross language plagiarism, also known as translation plagiarism are bound to arise. Especially among the academic works, such…

Other Computer Science · Computer Science 2009-12-22 Chow Kok Kent , Naomie Salim

Bi-Dimensional Feature Alignment for Cross-Domain Object Detection

Recently the problem of cross-domain object detection has started drawing attention in the computer vision community. In this paper, we propose a novel unsupervised cross-domain detection model that exploits the annotated data in a source…

Computer Vision and Pattern Recognition · Computer Science 2020-11-17 Zhen Zhao , Yuhong Guo , Jieping Ye

A Review on Near Duplicate Detection of Images using Computer Vision Techniques

Nowadays, digital content is widespread and simply redistributable, either lawfully or unlawfully. For example, after images are posted on the internet, other web users can modify them and then repost their versions, thereby generating…

Computer Vision and Pattern Recognition · Computer Science 2020-09-08 K. K. Thyagharajan , G. Kalaiarasi

Identity documents recognition and detection using semantic segmentation with convolutional neural network

Object recognition and detection are well-studied problems with a developed set of almost standard solutions. Identity documents recognition, classification, detection, and localization are the tasks required in a number of applications,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Mykola Kozlenko , Volodymyr Sendetskyi , Oleksiy Simkiv , Nazar Savchenko , Andy Bosyi

DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning

Document image segmentation is crucial for document analysis and recognition but remains challenging due to the diversity of document formats and segmentation tasks. Existing methods often address these tasks separately, resulting in…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Xiao-Hui Li , Fei Yin , Cheng-Lin Liu

Face Verification Using Boosted Cross-Image Features

This paper proposes a new approach for face verification, where a pair of images needs to be classified as belonging to the same person or not. This problem is relatively new and not well-explored in the literature. Current methods mostly…

Computer Vision and Pattern Recognition · Computer Science 2013-10-01 Dong Zhang , Omar Oreifej , Mubarak Shah