Related papers: Pathological Visual Question Answering

PathVQA: 30000+ Questions for Medical Visual Question Answering

Is it possible to develop an "AI Pathologist" to pass the board-certified examination of the American Board of Pathology? To achieve this goal, the first step is to create a visual question answering (VQA) dataset where the AI agent is…

Computation and Language · Computer Science 2020-03-24 Xuehai He , Yichen Zhang , Luntian Mou , Eric Xing , Pengtao Xie

Recommendations on test datasets for evaluating AI solutions in pathology

Artificial intelligence (AI) solutions that automatically extract information from digital histology images have shown great promise for improving pathological diagnosis. Prior to routine use, it is important to evaluate their predictive…

Image and Video Processing · Electrical Eng. & Systems 2022-09-15 André Homeyer , Christian Geißler , Lars Ole Schwen , Falk Zakrzewski , Theodore Evans , Klaus Strohmenger , Max Westphal , Roman David Bülow , Michaela Kargl , Aray Karjauv , Isidre Munné-Bertran , Carl Orge Retzlaff , Adrià Romero-López , Tomasz Sołtysiński , Markus Plass , Rita Carvalho , Peter Steinbach , Yu-Chia Lan , Nassim Bouteldja , David Haber , Mateo Rojas-Carulla , Alireza Vafaei Sadr , Matthias Kraft , Daniel Krüger , Rutger Fick , Tobias Lang , Peter Boor , Heimo Müller , Peter Hufnagl , Norman Zerbe

The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering

Visual Question Answering (VQA) is an interdisciplinary field that bridges the gap between computer vision (CV) and natural language processing (NLP), enabling Artificial Intelligence (AI) systems to answer questions about images. Since its…

Computer Vision and Pattern Recognition · Computer Science 2025-01-14 Anupam Pandey , Deepjyoti Bodo , Arpan Phukan , Asif Ekbal

Survey of Recent Advances in Visual Question Answering

Visual Question Answering (VQA) presents a unique challenge as it requires the ability to understand and encode the multi-modal inputs - in terms of image processing and natural language processing. The algorithm further needs to learn how…

Computer Vision and Pattern Recognition · Computer Science 2017-09-26 Supriya Pandhre , Shagun Sodhani

Medical Visual Question Answering: A Survey

Medical Visual Question Answering (VQA) is a combination of medical artificial intelligence and popular VQA challenges. Given a medical image and a clinically relevant question in natural language, the medical VQA system is expected to…

Computer Vision and Pattern Recognition · Computer Science 2023-06-12 Zhihong Lin , Donghao Zhang , Qingyi Tao , Danli Shi , Gholamreza Haffari , Qi Wu , Mingguang He , Zongyuan Ge

Towards the Augmented Pathologist: Challenges of Explainable-AI in Digital Pathology

Digital pathology is not only one of the most promising fields of diagnostic medicine, but at the same time a hot topic for fundamental research. Digital pathology is not just the transfer of histopathological slides into digital…

Artificial Intelligence · Computer Science 2017-12-20 Andreas Holzinger , Bernd Malle , Peter Kieseberg , Peter M. Roth , Heimo Müller , Robert Reihs , Kurt Zatloukal

From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities

The multimodal task of Visual Question Answering (VQA) encompassing elements of Computer Vision (CV) and Natural Language Processing (NLP), aims to generate answers to questions on any visual input. Over time, the scope of VQA has expanded…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Md Farhan Ishmam , Md Sakib Hossain Shovon , M. F. Mridha , Nilanjan Dey

Visual Question Answering in Ophthalmology: A Progressive and Practical Perspective

Accurate diagnosis of ophthalmic diseases relies heavily on the interpretation of multimodal ophthalmic images, a process often time-consuming and expertise-dependent. Visual Question Answering (VQA) presents a potential interdisciplinary…

Image and Video Processing · Electrical Eng. & Systems 2024-10-23 Xiaolan Chen , Ruoyu Chen , Pusheng Xu , Weiyi Zhang , Xianwen Shang , Mingguang He , Danli Shi

Visual Question Answering: A Survey of Methods and Datasets

Visual Question Answering (VQA) is a challenging task that has received increasing attention from both the computer vision and the natural language processing communities. Given an image and a question in natural language, it requires…

Computer Vision and Pattern Recognition · Computer Science 2016-07-21 Qi Wu , Damien Teney , Peng Wang , Chunhua Shen , Anthony Dick , Anton van den Hengel

PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding

Previous advancements in pathology image understanding primarily involved developing models tailored to specific tasks. Recent studies have demonstrated that large vision-language models can enhance the performance of various…

Artificial Intelligence · Computer Science 2024-08-20 Dawei Dai , Yuanhui Zhang , Long Xu , Qianlan Yang , Xiaojing Shen , Shuyin Xia , Guoyin Wang

Visual Question Answering: Datasets, Algorithms, and Future Challenges

Visual Question Answering (VQA) is a recent problem in computer vision and natural language processing that has garnered a large amount of interest from the deep learning, computer vision, and natural language processing communities. In…

Computer Vision and Pattern Recognition · Computer Science 2017-06-16 Kushal Kafle , Christopher Kanan

Interpretable Medical Image Visual Question Answering via Multi-Modal Relationship Graph Learning

Medical visual question answering (VQA) aims to answer clinically relevant questions regarding input medical images. This technique has the potential to improve the efficiency of medical professionals while relieving the burden on the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Xinyue Hu , Lin Gu , Kazuma Kobayashi , Qiyuan An , Qingyu Chen , Zhiyong Lu , Chang Su , Tatsuya Harada , Yingying Zhu

Visual Question Answering using Deep Learning: A Survey and Performance Analysis

The Visual Question Answering (VQA) task combines challenges for processing data with both Visual and Linguistic processing, to answer basic 'common sense' questions about given images. Given an image and a question in natural language, the…

Computer Vision and Pattern Recognition · Computer Science 2020-12-24 Yash Srivastava , Vaishnav Murali , Shiv Ram Dubey , Snehasis Mukherjee

A survey on VQA_Datasets and Approaches

Visual question answering (VQA) is a task that combines both the techniques of computer vision and natural language processing. It requires models to answer a text-based question according to the information contained in a visual. In recent…

Computer Vision and Pattern Recognition · Computer Science 2021-05-04 Yeyun Zou , Qiyu Xie

A Comprehensive Survey on Visual Question Answering Datasets and Algorithms

Visual question answering (VQA) refers to the problem where, given an image and a natural language question about the image, a correct natural language answer has to be generated. A VQA model has to demonstrate both the visual understanding…

Computer Vision and Pattern Recognition · Computer Science 2024-11-19 Raihan Kabir , Naznin Haque , Md Saiful Islam , Marium-E-Jannat

Exploring the Feasibility of Multimodal Chatbot AI as Copilot in Pathology Diagnostics: Generalist Model's Pitfall

Pathology images are crucial for diagnosing and managing various diseases by visualizing cellular and tissue-level abnormalities. Recent advancements in artificial intelligence (AI), particularly multimodal models like ChatGPT, have shown…

Human-Computer Interaction · Computer Science 2024-09-25 Mianxin Liu , Jianfeng Wu , Fang Yan , Hongjun Li , Wei Wang , Shaoting Zhang , Zhe Wang

Visual Question Answering in the Medical Domain

Medical visual question answering (Med-VQA) is a machine learning task that aims to create a system that can answer natural language questions based on given medical images. Although there has been rapid progress on the general VQA task,…

Computer Vision and Pattern Recognition · Computer Science 2023-09-21 Louisa Canepa , Sonit Singh , Arcot Sowmya

TM-PATHVQA:90000+ Textless Multilingual Questions for Medical Visual Question Answering

In healthcare and medical diagnostics, Visual Question Answering (VQA) may emerge as a pivotal tool in scenarios where analysis of intricate medical images becomes critical for accurate diagnoses. Current text-based VQA systems limit their…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Tonmoy Rajkhowa , Amartya Roy Chowdhury , Sankalp Nagaonkar , Achyut Mani Tripathi

RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

In this work, we introduce RadImageNet-VQA, a large-scale dataset designed to advance radiologic visual question answering (VQA) on CT and MRI exams. Existing medical VQA datasets are limited in scale, dominated by X-ray imaging or…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Léo Butsanets , Charles Corbière , Julien Khlaut , Pierre Manceron , Corentin Dancette

Visuo-Linguistic Question Answering (VLQA) Challenge

Understanding images and text together is an important aspect of cognition and building advanced Artificial Intelligence (AI) systems. As a community, we have achieved good benchmarks over language and vision domains separately, however…

Computer Vision and Pattern Recognition · Computer Science 2020-11-19 Shailaja Keyur Sampat , Yezhou Yang , Chitta Baral