Related papers: Efficient Urdu Caption Generation using Attention …

Using Deep Learning to Generate Semantically Correct Hindi Captions

Automated image captioning using the content from the image is very appealing when done by harnessing the capability of computer vision and natural language processing. Extensive research has been done in the field with a major focus on the…

Computer Vision and Pattern Recognition · Computer Science 2026-02-17 Wasim Akram Khan , Anil Kumar Vuppala

Exploration of Deep Learning Based Recognition for Urdu Text

Urdu is a cursive script language and has similarities with Arabic and many other South Asian languages. Urdu is difficult to classify due to its complex geometrical and morphological structure. Character classification can be processed…

Computer Vision and Pattern Recognition · Computer Science 2025-08-20 Sumaiya Fazal , Sheeraz Ahmed

COCO-Urdu: A Large-Scale Urdu Image-Caption Dataset with Multimodal Quality Estimation

Urdu, spoken by over 250 million people, remains critically under-served in multimodal and vision-language research. The absence of large-scale, high-quality datasets has limited the development of Urdu-capable systems and reinforced biases…

Computer Vision and Pattern Recognition · Computer Science 2025-09-12 Umair Hassan

Urdu Poetry Generated by Using Deep Learning Techniques

This study provides Urdu poetry generated using different deep-learning techniques and algorithms. The data was collected through the Rekhta website, containing 1341 text files with several couplets. The data on poetry was not from any…

Computation and Language · Computer Science 2023-09-26 Muhammad Shoaib Farooq , Ali Abbas

Attention based Bidirectional GRU hybrid model for inappropriate content detection in Urdu language

With the increased use of the internet and social networks for online discussions, the spread of toxic and inappropriate content on social networking sites has also increased. Several studies have been conducted in different languages.…

Computation and Language · Computer Science 2025-01-17 Ezzah Shoukat , Rabia Irfan , Iqra Basharat , Muhammad Ali Tahir , Sameen Shaukat

Large Scale Font Independent Urdu Text Recognition System

OCR algorithms have received a significant improvement in performance recently, mainly due to the increase in the capabilities of artificial intelligence algorithms. However, this advancement is not evenly distributed over all languages.…

Computer Vision and Pattern Recognition · Computer Science 2020-05-15 Atique Ur Rehman , Sibt Ul Hussain

Sentiment Analysis for YouTube Comments in Roman Urdu

Sentiment analysis is a vast area in the Machine learning domain. A lot of work is done on datasets and their analysis of the English Language. In Pakistan, a huge amount of data is in roman Urdu language, it is scattered all over the…

Computation and Language · Computer Science 2021-02-22 Tooba Tehreem

Document-Level Sentiment Analysis of Urdu Text Using Deep Learning Techniques

Document level Urdu Sentiment Analysis (SA) is a challenging Natural Language Processing (NLP) task as it deals with large documents in a resource-poor language. In large documents, there are ample amounts of words that exhibit different…

Computation and Language · Computer Science 2025-01-30 Ammarah Irum , M. Ali Tahir

AI-Generated Text Detection in Low-Resource Languages: A Case Study on Urdu

Large Language Models (LLMs) are now capable of generating text that closely resembles human writing, making them powerful tools for content creation, but this growing ability has also made it harder to tell whether a piece of text was…

Computation and Language · Computer Science 2025-10-21 Muhammad Ammar , Hadiya Murad Hadi , Usman Majeed Butt

Chittron: An Automatic Bangla Image Captioning System

Automatic image caption generation aims to produce an accurate description of an image in natural language automatically. However, Bangla, the fifth most widely spoken language in the world, is lagging considerably in the research and…

Computation and Language · Computer Science 2018-09-10 Motiur Rahman , Nabeel Mohammed , Nafees Mansoor , Sifat Momen

Qalb: Largest State-of-the-Art Urdu Large Language Model for 230M Speakers with Systematic Continued Pre-training

Despite remarkable progress in large language models, Urdu-a language spoken by over 230 million people-remains critically underrepresented in modern NLP systems. Existing multilingual models demonstrate poor performance on Urdu-specific…

Computation and Language · Computer Science 2026-01-14 Muhammad Taimoor Hassan , Jawad Ahmed , Muhammad Awais

Deep Learning for Lip Reading using Audio-Visual Information for Urdu Language

Human lip-reading is a challenging task. It requires not only knowledge of underlying language but also visual clues to predict spoken words. Experts need certain level of experience and understanding of visual expressions learning to…

Computer Vision and Pattern Recognition · Computer Science 2018-02-16 M Faisal , Sanaullah Manzoor

Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines

Multifarious intent detection predictors are developed for different languages, including English, Chinese and French, however, the field remains underdeveloped for Urdu, the 10th most spoken language. In the realm of well-known languages,…

Computation and Language · Computer Science 2025-05-14 Faiza Hassan , Summra Saleem , Kashif Javed , Muhammad Nabeel Asim , Abdur Rehman , Andreas Dengel

PronouncUR: An Urdu Pronunciation Lexicon Generator

State-of-the-art speech recognition systems rely heavily on three basic components: an acoustic model, a pronunciation lexicon and a language model. To build these components, a researcher needs linguistic as well as technical expertise,…

Computation and Language · Computer Science 2018-03-06 Haris Bin Zia , Agha Ali Raza , Awais Athar

Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation

Image captioning involves generating textual descriptions from input images, bridging the gap between computer vision and natural language processing. Recent advancements in transformer-based models have significantly improved caption…

Computer Vision and Pattern Recognition · Computer Science 2025-06-09 Israa A. Albadarneh , Bassam H. Hammo , Omar S. Al-Kadi

Neural Attention for Image Captioning: Review of Outstanding Methods

Image captioning is the task of automatically generating sentences that describe an input image in the best way possible. The most successful techniques for automatically generating image captions have recently used attentive deep learning…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Zanyar Zohourianshahzadi , Jugal K. Kalita

Co-occurrences using Fasttext embeddings for word similarity tasks in Urdu

Urdu is a widely spoken language in South Asia. Though immoderate literature exists for the Urdu language still the data isn't enough to naturally process the language by NLP techniques. Very efficient language models exist for the English…

Computation and Language · Computer Science 2021-02-23 Usama Khalid , Aizaz Hussain , Muhammad Umair Arshad , Waseem Shahzad , Mirza Omer Beg

A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism

Image captioning is a fast-growing research field of computer vision and natural language processing that involves creating text explanations for images. This study aims to develop a system that uses a pre-trained convolutional neural…

Computation and Language · Computer Science 2022-03-04 Rashid Khan , M Shujah Islam , Khadija Kanwal , Mansoor Iqbal , Md. Imran Hossain , Zhongfu Ye

CALText: Contextual Attention Localization for Offline Handwritten Text

Recognition of Arabic-like scripts such as Persian and Urdu is more challenging than Latin-based scripts. This is due to the presence of a two-dimensional structure, context-dependent character shapes, spaces and overlaps, and placement of…

Computer Vision and Pattern Recognition · Computer Science 2021-11-09 Tayaba Anjum , Nazar Khan

Boost Image Captioning with Knowledge Reasoning

Automatically generating a human-like description for a given image is a potential research in artificial intelligence, which has attracted a great of attention recently. Most of the existing attention methods explore the mapping…

Computer Vision and Pattern Recognition · Computer Science 2020-11-03 Feicheng Huang , Zhixin Li , Haiyang Wei , Canlong Zhang , Huifang Ma