English
Related papers

Related papers: Pathological Visual Question Answering

200 papers

Is it possible to develop an "AI Pathologist" to pass the board-certified examination of the American Board of Pathology? To achieve this goal, the first step is to create a visual question answering (VQA) dataset where the AI agent is…

Computation and Language · Computer Science 2020-03-24 Xuehai He , Yichen Zhang , Luntian Mou , Eric Xing , Pengtao Xie

Artificial intelligence (AI) solutions that automatically extract information from digital histology images have shown great promise for improving pathological diagnosis. Prior to routine use, it is important to evaluate their predictive…

Visual Question Answering (VQA) is an interdisciplinary field that bridges the gap between computer vision (CV) and natural language processing(NLP), enabling Artificial Intelligence(AI) systems to answer questions about images. Since its…

Computer Vision and Pattern Recognition · Computer Science 2025-01-14 Anupam Pandey , Deepjyoti Bodo , Arpan Phukan , Asif Ekbal

Visual Question Answering (VQA) presents a unique challenge as it requires the ability to understand and encode the multi-modal inputs - in terms of image processing and natural language processing. The algorithm further needs to learn how…

Computer Vision and Pattern Recognition · Computer Science 2017-09-26 Supriya Pandhre , Shagun Sodhani

Medical Visual Question Answering~(VQA) is a combination of medical artificial intelligence and popular VQA challenges. Given a medical image and a clinically relevant question in natural language, the medical VQA system is expected to…

Computer Vision and Pattern Recognition · Computer Science 2023-06-12 Zhihong Lin , Donghao Zhang , Qingyi Tao , Danli Shi , Gholamreza Haffari , Qi Wu , Mingguang He , Zongyuan Ge

Digital pathology is not only one of the most promising fields of diagnostic medicine, but at the same time a hot topic for fundamental research. Digital pathology is not just the transfer of histopathological slides into digital…

Artificial Intelligence · Computer Science 2017-12-20 Andreas Holzinger , Bernd Malle , Peter Kieseberg , Peter M. Roth , Heimo Müller , Robert Reihs , Kurt Zatloukal

The multimodal task of Visual Question Answering (VQA) encompassing elements of Computer Vision (CV) and Natural Language Processing (NLP), aims to generate answers to questions on any visual input. Over time, the scope of VQA has expanded…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Md Farhan Ishmam , Md Sakib Hossain Shovon , M. F. Mridha , Nilanjan Dey

Accurate diagnosis of ophthalmic diseases relies heavily on the interpretation of multimodal ophthalmic images, a process often time-consuming and expertise-dependent. Visual Question Answering (VQA) presents a potential interdisciplinary…

Image and Video Processing · Electrical Eng. & Systems 2024-10-23 Xiaolan Chen , Ruoyu Chen , Pusheng Xu , Weiyi Zhang , Xianwen Shang , Mingguang He , Danli Shi

Visual Question Answering (VQA) is a challenging task that has received increasing attention from both the computer vision and the natural language processing communities. Given an image and a question in natural language, it requires…

Computer Vision and Pattern Recognition · Computer Science 2016-07-21 Qi Wu , Damien Teney , Peng Wang , Chunhua Shen , Anthony Dick , Anton van den Hengel

The previous advancements in pathology image understanding primarily involved developing models tailored to specific tasks. Recent studies has demonstrated that the large vision-language model can enhance the performance of various…

Artificial Intelligence · Computer Science 2024-08-20 Dawei Dai , Yuanhui Zhang , Long Xu , Qianlan Yang , Xiaojing Shen , Shuyin Xia , Guoyin Wang

Visual Question Answering (VQA) is a recent problem in computer vision and natural language processing that has garnered a large amount of interest from the deep learning, computer vision, and natural language processing communities. In…

Computer Vision and Pattern Recognition · Computer Science 2017-06-16 Kushal Kafle , Christopher Kanan

Medical visual question answering (VQA) aims to answer clinically relevant questions regarding input medical images. This technique has the potential to improve the efficiency of medical professionals while relieving the burden on the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Xinyue Hu , Lin Gu , Kazuma Kobayashi , Qiyuan An , Qingyu Chen , Zhiyong Lu , Chang Su , Tatsuya Harada , Yingying Zhu

The Visual Question Answering (VQA) task combines challenges for processing data with both Visual and Linguistic processing, to answer basic `common sense' questions about given images. Given an image and a question in natural language, the…

Computer Vision and Pattern Recognition · Computer Science 2020-12-24 Yash Srivastava , Vaishnav Murali , Shiv Ram Dubey , Snehasis Mukherjee

Visual question answering (VQA) is a task that combines both the techniques of computer vision and natural language processing. It requires models to answer a text-based question according to the information contained in a visual. In recent…

Computer Vision and Pattern Recognition · Computer Science 2021-05-04 Yeyun Zou , Qiyu Xie

Visual question answering (VQA) refers to the problem where, given an image and a natural language question about the image, a correct natural language answer has to be generated. A VQA model has to demonstrate both the visual understanding…

Computer Vision and Pattern Recognition · Computer Science 2024-11-19 Raihan Kabir , Naznin Haque , Md Saiful Islam , Marium-E-Jannat

Pathology images are crucial for diagnosing and managing various diseases by visualizing cellular and tissue-level abnormalities. Recent advancements in artificial intelligence (AI), particularly multimodal models like ChatGPT, have shown…

Human-Computer Interaction · Computer Science 2024-09-25 Mianxin Liu , Jianfeng Wu , Fang Yan , Hongjun Li , Wei Wang , Shaoting Zhang , Zhe Wang

Medical visual question answering (Med-VQA) is a machine learning task that aims to create a system that can answer natural language questions based on given medical images. Although there has been rapid progress on the general VQA task,…

Computer Vision and Pattern Recognition · Computer Science 2023-09-21 Louisa Canepa , Sonit Singh , Arcot Sowmya

In healthcare and medical diagnostics, Visual Question Answering (VQA) mayemergeasapivotal tool in scenarios where analysis of intricate medical images becomes critical for accurate diagnoses. Current text-based VQA systems limit their…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Tonmoy Rajkhowa , Amartya Roy Chowdhury , Sankalp Nagaonkar , Achyut Mani Tripathi

In this work, we introduce RadImageNet-VQA, a large-scale dataset designed to advance radiologic visual question answering (VQA) on CT and MRI exams. Existing medical VQA datasets are limited in scale, dominated by X-ray imaging or…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Léo Butsanets , Charles Corbière , Julien Khlaut , Pierre Manceron , Corentin Dancette

Understanding images and text together is an important aspect of cognition and building advanced Artificial Intelligence (AI) systems. As a community, we have achieved good benchmarks over language and vision domains separately, however…

Computer Vision and Pattern Recognition · Computer Science 2020-11-19 Shailaja Keyur Sampat , Yezhou Yang , Chitta Baral
‹ Prev 1 2 3 10 Next ›