Related papers: Deep learning evaluation using deep linguistic pro…

A Deep Learning Approach to Analyzing Continuous-Time Systems

Scientists often use observational time series data to study complex natural processes, but regression analyses often assume simplistic dynamics. Recent advances in deep learning have yielded startling improvements to the performance of…

Machine Learning · Computer Science 2023-04-21 Cory Shain, William Schuler

Challenges and Prospects in Vision and Language Research

Language grounded image understanding tasks have often been proposed as a method for evaluating progress in artificial intelligence. Ideally, these tasks should test a plethora of capabilities that integrate computer vision, reasoning, and…

Machine Learning · Computer Science 2019-05-28 Kushal Kafle, Robik Shrestha, Christopher Kanan

Beyond Static Datasets: A Deep Interaction Approach to LLM Evaluation

Large Language Models (LLMs) have made progress in various real-world tasks, which stimulates requirements for the evaluation of LLMs. Existing LLM evaluation methods are mainly supervised signal-based which depends on static datasets and…

Computation and Language · Computer Science 2023-09-11 Jiatong Li, Rui Li, Qi Liu

Evaluating the Capabilities of Multi-modal Reasoning Models with Synthetic Task Data

The impressive advances and applications of large language and joint language-and-visual understanding models have led to an increased need for methods of probing their potential reasoning capabilities. However, the difficulty of gather…

Machine Learning · Computer Science 2023-06-05 Nathan Vaska, Victoria Helus

Multimodal Deep Learning for Scientific Imaging Interpretation

In the domain of scientific imaging, interpreting visual data often demands an intricate combination of human expertise and deep comprehension of the subject materials. This study presents a novel methodology to linguistically emulate and…

Machine Learning · Computer Science 2023-09-27 Abdulelah S. Alshehri, Franklin L. Lee, Shihu Wang

Deep Learning Approaches on Image Captioning: A Review

Image captioning is a research area of immense importance, aiming to generate natural language descriptions for visual content in the form of still images. The advent of deep learning and more recently vision-language pre-training…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Taraneh Ghandi, Hamidreza Pourreza, Hamidreza Mahyar

A Review on Explainability in Multimodal Deep Neural Nets

Artificial Intelligence techniques powered by deep neural nets have achieved much success in several application domains, most significantly and notably in the Computer Vision applications and Natural Language Processing tasks. Surpassing…

Artificial Intelligence · Computer Science 2021-05-19 Gargi Joshi, Rahee Walambe, Ketan Kotecha

Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

With the availability of large databases and recent improvements in deep learning methodology, the performance of AI systems is reaching or even exceeding the human level on an increasing number of complex tasks. Impressive examples of this…

Artificial Intelligence · Computer Science 2017-08-29 Wojciech Samek, Thomas Wiegand, Klaus-Robert Müller

V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices

One of the primary challenges faced by deep learning is the degree to which current methods exploit superficial statistics and dataset bias, rather than learning to generalise over the specific representations they have experienced. This is…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Damien Teney, Peng Wang, Jiewei Cao, Lingqiao Liu, Chunhua Shen, Anton van den Hengel

Survey of Visual Question Answering: Datasets and Techniques

Visual question answering (or VQA) is a new and exciting problem that combines natural language processing and computer vision techniques. We present a survey of the various datasets and models that have been used to tackle this task. The…

Computation and Language · Computer Science 2017-05-12 Akshay Kumar Gupta

Problems and Countermeasures in Natural Language Processing Evaluation

Evaluation in natural language processing guides and promotes research on models and methods. In recent years, new evaluation data sets and evaluation tasks have been continuously proposed. At the same time, a series of problems exposed by…

Computation and Language · Computer Science 2021-04-21 Qingxiu Dong, Zhifang Sui, Weidong Zhan, Baobao Chang

Levels of Analysis for Large Language Models

Modern artificial intelligence systems, such as large language models, are increasingly powerful but also increasingly hard to understand. Recognizing this problem as analogous to the historical difficulties in understanding the human mind,…

Computation and Language · Computer Science 2026-03-24 Alexander Y. Ku, Declan Campbell, Xuechunzi Bai, Jiayi Geng, Ryan Liu, Raja Marjieh, R. Thomas McCoy, Andrew Nam, Ilia Sucholutsky, Veniamin Veselovsky, Liyi Zhang, Jian-Qiao Zhu, Thomas L. Griffiths

Visual Question Answering using Deep Learning: A Survey and Performance Analysis

The Visual Question Answering (VQA) task combines challenges for processing data with both Visual and Linguistic processing, to answer basic 'common sense' questions about given images. Given an image and a question in natural language, the…

Computer Vision and Pattern Recognition · Computer Science 2020-12-24 Yash Srivastava, Vaishnav Murali, Shiv Ram Dubey, Snehasis Mukherjee

Deep Learning Embeddings for Discontinuous Linguistic Units

Deep learning embeddings have been successfully used for many natural language processing problems. Embeddings are mostly computed for word forms although a number of recent papers have extended this to other linguistic units like morphemes…

Computation and Language · Computer Science 2013-12-20 Wenpeng Yin, Hinrich Schütze

Deep Neural Networks for Visual Reasoning

Visual perception and language understanding are fundamental components of human intelligence, enabling them to understand and reason about objects and their interactions. It is crucial for machines to have this capacity to reason using…

Computer Vision and Pattern Recognition · Computer Science 2022-09-27 Thao Minh Le

Deep Learning Models to Study Sentence Comprehension in the Human Brain

Recent artificial neural networks that process natural language achieve unprecedented performance in tasks requiring sentence-level understanding. As such, they could be interesting models of the integration of linguistic information in the…

Computation and Language · Computer Science 2023-02-17 Sophie Arana, Jacques Pesnot Lerousseau, Peter Hagoort

A Survey of the Usages of Deep Learning in Natural Language Processing

Over the last several years, the field of natural language processing has been propelled forward by an explosion in the use of deep learning models. This survey provides a brief introduction to the field and a quick overview of deep…

Computation and Language · Computer Science 2019-12-24 Daniel W. Otter, Julian R. Medina, Jugal K. Kalita

Perceiving Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models

Achieving deep alignment between vision and language remains a central challenge for Multimodal Large Language Models (MLLMs). These models often fail to fully leverage visual input, defaulting to strong language priors. Our approach first…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Aarti Ghatkesar, Ganesh Venkatesh

Visual Analytics for Explainable Deep Learning

Recently, deep learning has been advancing the state of the art in artificial intelligence to a new level, and humans rely on artificial intelligence techniques more than ever. However, even with such unprecedented advancements, the lack of…

Human-Computer Interaction · Computer Science 2018-04-10 Jaegul Choo, Shixia Liu

VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and Challenges

Artificial Intelligence (AI) and its applications have sparked extraordinary interest in recent years. This achievement can be ascribed in part to advances in AI subfields including Machine Learning (ML), Computer Vision (CV), and Natural…

Computer Vision and Pattern Recognition · Computer Science 2023-01-02 Rufai Yusuf Zakari, Jim Wilson Owusu, Hailin Wang, Ke Qin, Zaharaddeen Karami Lawal, Yuezhou Dong