English
Related papers

Related papers: Audio Description Customization

200 papers

Audio descriptions (AD) make videos accessible for blind and low vision (BLV) users by describing visual elements that cannot be understood from the main audio track. AD created by professionals or novice describers is time-consuming and…

Human-Computer Interaction · Computer Science 2025-05-29 Maryam Cheema , Hasti Seifi , Pooyan Fazli

While audio description (AD) is the standard approach for making videos accessible to blind and low vision (BLV) people, existing AD guidelines do not consider BLV users' varied preferences across viewing scenarios. These scenarios range…

Human-Computer Interaction · Computer Science 2024-03-19 Lucy Jiang , Crescentia Jung , Mahika Phutane , Abigale Stangl , Shiri Azenkot

Customization is crucial for making visualizations accessible to blind and low-vision (BLV) people with widely-varying needs. But what makes for usable or useful customization? We identify four design goals for how BLV people should be able…

Human-Computer Interaction · Computer Science 2024-03-01 Shuli Jones , Isabella Pedraza Pineros , Daniel Hajas , Jonathan Zong , Arvind Satyanarayan

Advances in multimodal large language models enable automatic video narration and question answering (VQA), offering scalable alternatives to labor-intensive, human-authored audio descriptions (ADs) for blind and low vision (BLV) viewers.…

Human-Computer Interaction · Computer Science 2026-03-17 Maryam Cheema , Sina Elahimanesh , Pooyan Fazli , Hasti Seifi

Audio Description (AD) provides essential access to visual media for blind and low vision (BLV) audiences. Yet current AD production tools remain largely inaccessible to BLV video creators, who possess valuable expertise but face barriers…

Human-Computer Interaction · Computer Science 2026-02-10 Franklin Mingzhe Li , Michael Xieyang Liu , Cynthia L. Bennett , Shaun K. Kane

Audio description (AD) makes video content accessible to blind and low-vision (BLV) audiences, but producing high-quality descriptions is resource-intensive. Automated AD offers scalability, and prior studies show human-in-the-loop editing…

Human-Computer Interaction · Computer Science 2026-02-04 Lana Do , Shasta Ihorn , Charity Pitcher-Cooper , Juvenal Francisco Barajas , Gio Jung , Xuan Duy Anh Nguyen , Sanjay Mirani , Ilmi Yoon

While videos have become increasingly prevalent in delivering information across different educational and professional contexts, individuals with ADHD often face attention challenges when watching informational videos due to the dynamic,…

Human-Computer Interaction · Computer Science 2025-07-18 Hanxiu 'Hazel' Zhu , Ruijia Chen , Yuhang Zhao

While videos have become increasingly prevalent in delivering information across different educational and professional contexts, individuals with ADHD often face attention challenges when watching informational videos due to the dynamic,…

Human-Computer Interaction · Computer Science 2025-11-25 Hanxiu 'Hazel' Zhu , Ruijia Chen , Yuhang Zhao

Audio descriptions (ADs) function as acoustic commentaries designed to assist blind persons and persons with visual impairments in accessing digital media content on television and in movies, among other settings. As an accessibility…

Computation and Language · Computer Science 2024-10-14 Yingqiang Gao , Lukas Fischer , Alexa Lintner , Sarah Ebling

The Audio Description (AD) task aims to generate descriptions of visual elements for visually impaired individuals to help them access long-form video content, like movies. With video feature, text, character bank and context information as…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Hanlin Wang , Zhan Tong , Kecheng Zheng , Yujun Shen , Limin Wang

Blind or Low-Vision (BLV) users often rely on audio descriptions (AD) to access video content. However, conventional static ADs can leave out detailed information in videos, impose a high mental load, neglect the diverse needs and…

Human-Computer Interaction · Computer Science 2024-02-28 Zheng Ning , Brianna L. Wimer , Kaiwen Jiang , Keyi Chen , Jerrick Ban , Yapeng Tian , Yuhang Zhao , Toby Jia-Jun Li

Audio descriptions (ADs) narrate important visual details in movies, enabling Blind and Low Vision (BLV) users to understand narratives and appreciate visual details. Existing works in automatic AD generation mostly focus on few-second…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Divy Kala , Eshika Khandelwal , Makarand Tapaswi

Audio description (AD) makes video content accessible to millions of blind and low vision (BLV) users. However, creating high-quality AD involves a trade-off between the precision of human-crafted descriptions and the efficiency of…

Human-Computer Interaction · Computer Science 2025-08-05 Maryam Cheema , Sina Elahimanesh , Samuel Martin , Pooyan Fazli , Hasti Seifi

Video descriptions are crucial for blind and low vision (BLV) users to access visual content. However, current artificial intelligence models for generating descriptions often fall short due to limitations in the quality of human…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Chaoyu Li , Sid Padmanabhuni , Maryam Cheema , Hasti Seifi , Pooyan Fazli

Music has been identified as a promising medium to enhance the accessibility and experience of visual art for people who are blind or have low vision (BLV). However, composing music and designing soundscapes for visual art is a…

Human-Computer Interaction · Computer Science 2024-05-24 Stephen James Krol , Maria Teresa Llano , Matthew Butler , Cagatay Goncu

Video content remains largely inaccessible to blind and low-vision (BLV) users. To address this, we introduce a prototype that leverages a multimodal agent - powered by a novel conversational architecture using a multimodal large language…

Movie Audio Description (AD) aims to narrate visual content during dialogue-free segments, particularly benefiting blind and visually impaired (BVI) audiences. Compared with general video captioning, AD demands plot-relevant narration with…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Xiaojun Ye , Chun Wang , Yiren Song , Sheng Zhou , Liangcheng Li , Jiajun Bu

Purpose: Autonomous vehicles (AVs) are becoming a promising transportation solution for blind and low-vision (BLV) travelers, offering the potential for greater independent mobility. This paper explores the information needs of BLV users…

"Scene description" applications that describe visual content in a photo are useful daily tools for blind and low vision (BLV) people. Researchers have studied their use, but they have only explored those that leverage remote sighted…

Human-Computer Interaction · Computer Science 2025-03-13 Ricardo Gonzalez , Jazmin Collins , Shiri Azenkot , Cynthia Bennett

For individuals with blindness or low vision (BLV), navigating complex environments can pose serious risks. Large Vision-Language Models (LVLMs) show promise for generating scene descriptions, but their effectiveness for BLV users remains…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Na Min An , Eunki Kim , Wan Ju Kang , Sangryul Kim , James Thorne , Hyunjung Shim
‹ Prev 1 2 3 10 Next ›