Related papers: Optical Script Identification for multi-lingual In…

Handwritten Script Identification from Text Lines

In a multilingual country like India where 12 different official scripts are in use, automatic identification of handwritten script facilitates many important applications such as automatic transcription of multilingual documents, searching…

Computer Vision and Pattern Recognition · Computer Science 2020-09-17 Pawan Kumar Singh , Iman Chatterjee , Ram Sarkar , Mita Nasipuri

Script-Agnostic Language Identification

Language identification is used as the first step in many data collection and crawling efforts because it allows us to sort online text into language-specific buckets. However, many modern languages, such as Konkani, Kashmiri, Punjabi etc.,…

Computation and Language · Computer Science 2024-06-27 Milind Agarwal , Joshua Otten , Antonios Anastasopoulos

Handwritten Character Recognition of South Indian Scripts: A Review

Handwritten character recognition is always a frontier area of research in the field of pattern recognition and image processing and there is a large demand for OCR on hand written documents. Even though, sufficient studies have performed…

Computer Vision and Pattern Recognition · Computer Science 2021-09-07 John Jomy , K. V. Pramod , Balakrishnan Kannan

Kannada Character Recognition System A Review

Intensive research has been done on optical character recognition ocr and a large number of articles have been published on this topic during the last few decades. Many commercial OCR systems are now available in the market, but most of…

Computer Vision and Pattern Recognition · Computer Science 2016-09-08 K. Indira , S. Sethu Selvi

Discrimination of English to other Indian languages (Kannada and Hindi) for OCR system

India is a multilingual multi-script country. In every state of India there are two languages one is state local language and the other is English. For example in Andhra Pradesh, a state in India, the document may contain text words in…

Computer Vision and Pattern Recognition · Computer Science 2012-05-11 Ankit Kumar , Tushar Patnaik , Vivek Kr Verma

Recognition of Indian Sign Language in Live Video

Sign Language Recognition has emerged as one of the important area of research in Computer Vision. The difficulty faced by the researchers is that the instances of signs vary with both motion and appearance. Thus, in this paper a novel…

Computer Vision and Pattern Recognition · Computer Science 2013-06-07 Joyeeta Singha , Karen Das

Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script

India is a multi-lingual country where Roman script is often used alongside different Indic scripts in a text document. To develop a script specific handwritten Optical Character Recognition (OCR) system, it is therefore necessary to…

Machine Learning · Computer Science 2010-03-25 Ram Sarkar , Nibaran Das , Subhadip Basu , Mahantapas Kundu , Mita Nasipuri , Dipak Kumar Basu

MDIW-13: a New Multi-Lingual and Multi-Script Database and Benchmark for Script Identification

Script identification plays a vital role in applications that involve handwriting and document analysis within a multi-script and multi-lingual environment. Moreover, it exhibits a profound connection with human cognition. This paper…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Miguel A. Ferrer , Abhijit Das , Moises Diaz , Aythami Morales , Cristina Carmona-Duarte , Umapada Pal

Ancient Script Image Recognition and Processing: A Review

Ancient scripts, e.g., Egyptian hieroglyphs, Oracle Bone Inscriptions, and Ancient Greek inscriptions, serve as vital carriers of human civilization, embedding invaluable historical and cultural information. Automating ancient script image…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Xiaolei Diao , Rite Bo , Yanling Xiao , Lida Shi , Zhihan Zhou , Hao Xu , Chuntao Li , Xiongfeng Tang , Massimo Poesio , Cédric M. John , Daqian Shi

Indic Handwritten Script Identification using Offline-Online Multimodal Deep Network

In this paper, we propose a novel approach of word-level Indic script identification using only character-level data in training stage. The advantages of using character level data for training have been outlined in section I. Our method…

Computer Vision and Pattern Recognition · Computer Science 2019-10-17 Ayan Kumar Bhunia , Subham Mukherjee , Aneeshan Sain , Ankan Kumar Bhunia , Partha Pratim Roy , Umapada Pal

A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples

Recognition of ancient Tamil characters has always been a challenge for epigraphers. This is primarily because the language has evolved over the several centuries and the character set over this time has both expanded and diversified. This…

Computer Vision and Pattern Recognition · Computer Science 2019-07-12 Lalitha Giridhar , Aishwarya Dharani and , Velmathi Guruviah

Cross-language Framework for Word Recognition and Spotting of Indic Scripts

Handwritten word recognition and spotting of low-resource scripts are difficult as sufficient training data is not available and it is often expensive for collecting data of such scripts. This paper presents a novel cross language platform…

Computer Vision and Pattern Recognition · Computer Science 2018-02-06 Ayan Kumar Bhunia , Partha Pratim Roy , Akash Mohta , Umapada Pal

Visual Script and Language Identification

In this paper we introduce a script identification method based on hand-crafted texture features and an artificial neural network. The proposed pipeline achieves near state-of-the-art performance for script identification of video-text and…

Computer Vision and Pattern Recognition · Computer Science 2016-01-11 Anguelos Nicolaou , Andrew Bagdanov , Lluis Gomez-Bigorda , Dimosthenis Karatzas

Confronting the Constraints for Optical Character Segmentation from Printed Bangla Text Image

In a world of digitization, optical character recognition holds the automation to written history. Optical character recognition system basically converts printed images into editable texts for better storage and usability. To be completely…

Computer Vision and Pattern Recognition · Computer Science 2021-01-06 Abu Saleh Md. Abir , Sanjana Rahman , Samia Ellin , Maisha Farzana , Md Hridoy Manik , Chowdhury Rafeed Rahman

Handwritten Character Recognition In Malayalam Scripts- A Review

Handwritten character recognition is one of the most challenging and ongoing areas of research in the field of pattern recognition. HCR research is matured for foreign languages like Chinese and Japanese but the problem is much more complex…

Computer Vision and Pattern Recognition · Computer Science 2014-02-11 Anitha Mary M. O. Chacko , P. M Dhanya

Automatic Script Identification in the Wild

With the rapid increase of transnational communication and cooperation, people frequently encounter multilingual scenarios in various situations. In this paper, we are concerned with a relatively new problem: script identification at word…

Computer Vision and Pattern Recognition · Computer Science 2015-05-13 Baoguang Shi , Cong Yao , Chengquan Zhang , Xiaowei Guo , Feiyue Huang , Xiang Bai

A survey of modern optical character recognition techniques

This report explores the latest advances in the field of digital document recognition. With the focus on printed document imagery, we discuss the major developments in optical character recognition (OCR) and document image…

Computer Vision and Pattern Recognition · Computer Science 2014-12-16 Eugene Borovikov

Optical Character Recognition (OCR) for Telugu: Database, Algorithm and Application

Telugu is a Dravidian language spoken by more than 80 million people worldwide. The optical character recognition (OCR) of the Telugu script has wide ranging applications including education, health-care, administration etc. The beautiful…

Computer Vision and Pattern Recognition · Computer Science 2018-12-27 Chandra Prakash Konkimalla , Manikanta Srikar Yellapragada , Trishal Gayam , Souraj Mandal , Sumohana S. Channappayya

Recurrent neural networks based Indic word-wise script identification using character-wise training

This paper presents a novel methodology of Indic handwritten script recognition using Recurrent Neural Networks and addresses the problem of script recognition in poor data scenarios, such as when only character level online data is…

Computer Vision and Pattern Recognition · Computer Science 2018-12-31 Rohun Tripathi , Aman Gill , Riccha Tripati

Morphological Reconstruction for Word Level Script Identification

A line of a bilingual document page may contain text words in regional language and numerals in English. For Optical Character Recognition (OCR) of such a document page, it is necessary to identify different script forms before running an…

Computer Vision and Pattern Recognition · Computer Science 2011-07-05 B. V. Dhandra , Mallikarjun Hangarge