English
Related papers

Related papers: Similar Document Template Matching Algorithm

200 papers

Precise homography estimation between multiple images is a pre-requisite for many computer vision applications. One application that is particularly relevant in today's digital era is the alignment of scanned or camera-captured document…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Kushagra Mahajan , Monika Sharma , Lovekesh Vig

Retrieving accurate details from documents is a crucial task, especially when handling a combination of scanned images and native digital formats. This document presents a combined framework for text extraction that merges Optical Character…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Rasha Sinha , Rekha B S

Information extraction from copy-heavy documents, characterized by massive volumes of structurally similar content, represents a critical yet understudied challenge in enterprise document processing. We present a systematic framework that…

Computation and Language · Computer Science 2025-10-14 Zilong Wang , Xiaoyu Shen

Document comparison typically relies on optical character recognition (OCR) as its core technology. However, OCR requires the selection of appropriate language models for each document and the performance of multilingual or hybrid models…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Doyoung Park , Naresh Reddy Yarram , Sunjin Kim , Minkyu Kim , Seongho Cho , Taehee Lee

Information representation as tables are compact and concise method that eases searching, indexing, and storage requirements. Extracting and cloning tables from parsable documents is easier and widely used, however industry still faces…

Information Retrieval · Computer Science 2020-10-20 Smita Pallavi , Raj Ratn Pranesh , Sumit Kumar

In this paper, we propose a novel one-shot template-matching algorithm to automatically capture data from business documents with an aim to minimize manual data entry. Given one annotated document, our algorithm can automatically extract…

Information Retrieval · Computer Science 2019-10-23 Pranjal Dhakal , Manish Munikar , Bikram Dahal

Detecting manipulations in digital documents is becoming increasingly important for information verification purposes. Due to the proliferation of image editing software, altering key information in documents has become widely accessible.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-27 Hailey Joren , Otkrist Gupta , Dan Raviv

Document similarity is an important part of Natural Language Processing and is most commonly used for plagiarism-detection and text summarization. Thus, finding the overall most effective document similarity algorithm could have a major…

Computation and Language · Computer Science 2023-04-05 Nicholas Gahman , Vinayak Elangovan

Objective:Develop and validate an algorithm for analyzing the layout of PDF clinical documents to improve the performance of downstream natural language processing tasks. Materials and Methods: We designed an algorithm to process clinical…

Computation and Language · Computer Science 2023-05-24 Christel Gérardin , Perceval Wajsbürt , Basile Dura , Alice Calliger , Alexandre Moucher , Xavier Tannier , Romain Bey

Financial documents are essential sources of information for regulators, auditors, and financial institutions, particularly for assessing the wealth and compliance of Small and Medium-sized Businesses. However, SMB documents are often…

Information Retrieval · Computer Science 2025-10-28 Yichao Jin , Yushuo Wang , Qishuai Zhong , Kent Chiu Jin-Chun , Kenneth Zhu Ke , Donald MacDonald

We propose a novel measure for template matching named Deformable Diversity Similarity -- based on the diversity of feature matches between a target image window and the template. We rely on both local appearance and geometric information…

Computer Vision and Pattern Recognition · Computer Science 2017-04-19 Itamar Talmi , Roey Mechrez , Lihi Zelnik-Manor

Measuring similarity between texts is an important task for several applications. Available approaches to measure document similarity are inadequate for document pairs that have non-comparable lengths, such as a long document and its…

Computation and Language · Computer Science 2019-03-27 Hongyu Gong , Tarek Sakakini , Suma Bhat , Jinjun Xiong

Claims documents are fundamental to healthcare and insurance operations, serving as the basis for reimbursement, auditing, and compliance. However, these documents are typically not born digital; they often exist as scanned PDFs or…

Information Retrieval · Computer Science 2026-01-06 Lilu Cheng , Jingjun Lu , Yi Xuan Chan , Quoc Khai Nguyen , John Bi , Sean Ho

Document alignment and registration play a crucial role in numerous real-world applications, such as automated form processing, anomaly detection, and workflow automation. Traditional methods for document alignment rely on image-based…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Ross Greer , Alisha Ukani , Katherine Izhikevich , Earlence Fernandes , Stefan Savage , Alex C. Snoeren

As the number of digital documents requiring investigation increases, it has become more important to identify relevant documents to a given case. There have been continual demands for finding relevant files in order to overcome this kind…

Information Retrieval · Computer Science 2018-10-18 Hyunji Chung

In most computer vision and image analysis problems, it is necessary to define a similarity measure between two or more different objects or images. Template matching is a classic and fundamental method used to score similarities between…

Computer Vision and Pattern Recognition · Computer Science 2016-10-25 Nazanin Sadat Hashemi , Roya Babaie Aghdam , Atieh Sadat Bayat Ghiasi , Parastoo Fatemi

Optical Character Recognition (OCR) for data extraction from documents is essential to intelligent informatics, such as digitizing medical records and recognizing road signs. Multi-modal Large Language Models (LLMs) can solve this task and…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Hyakka Nakada , Yoshiyasu Tanaka

When medical researchers conduct a systematic review (SR), screening studies is the most time-consuming process: researchers read several thousands of medical literature and manually label them relevant or irrelevant. Screening…

Information Retrieval · Computer Science 2021-12-30 Grace E. Lee , Aixin Sun

Similar Case Matching (SCM) plays a pivotal role in the legal system by facilitating the efficient identification of similar cases for legal professionals. While previous research has primarily concentrated on enhancing the performance of…

Computation and Language · Computer Science 2023-08-17 Nankai Lin , Haonan Liu , Jiajun Fang , Dong Zhou , Aimin Yang

This study explores three approaches to processing table data in scientific papers to enhance extractive question answering and develop a software tool for the systematic review process. The methods evaluated include: (1) Optical Character…

Information Retrieval · Computer Science 2025-08-27 Dongyoun Kim , Hyung-do Choi , Youngsun Jang , John Kim
‹ Prev 1 2 3 10 Next ›