Related papers: Multi-modal Sentiment Analysis using Deep Canonica…

Learning Relationships between Text, Audio, and Video via Deep Canonical Correlation for Multimodal Language Analysis

Multimodal language analysis often considers relationships between features based on text and those based on acoustical and visual properties. Text features typically outperform non-text features in sentiment analysis or emotion recognition…

Machine Learning · Computer Science 2019-12-03 Zhongkai Sun , Prathusha Sarma , William Sethares , Yingyu Liang

Multimodal Emotion Recognition Using Deep Canonical Correlation Analysis

Multimodal signals are more powerful than unimodal data for emotion recognition since they can represent emotions more comprehensively. In this paper, we introduce deep canonical correlation analysis (DCCA) to multimodal emotion…

Machine Learning · Computer Science 2019-08-16 Wei Liu , Jie-Lin Qiu , Wei-Long Zheng , Bao-Liang Lu

Multimodal Representation Learning using Deep Multiset Canonical Correlation

We propose Deep Multiset Canonical Correlation Analysis (dMCCA) as an extension to representation learning using CCA when the underlying signal is observed across multiple (more than two) modalities. We use deep learning framework to learn…

Machine Learning · Computer Science 2023-02-09 Krishna Somandepalli , Naveen Kumar , Ruchir Travadi , Shrikanth Narayanan

Deep Tensor CCA for Multi-view Learning

We present Deep Tensor Canonical Correlation Analysis (DTCCA), a method to learn complex nonlinear transformations of multiple views (more than two) of data such that the resulting representations are linearly correlated in high order. The…

Machine Learning · Computer Science 2020-05-26 Hok Shing Wong , Li Wang , Raymond Chan , Tieyong Zeng

Discriminative Multiple Canonical Correlation Analysis for Information Fusion

In this paper, we propose the Discriminative Multiple Canonical Correlation Analysis (DMCCA) for multimodal information analysis and fusion. DMCCA is capable of extracting more discriminative characteristics from multimodal information…

Machine Learning · Computer Science 2021-03-02 Lei Gao , Lin Qi , Enqing Chen , Ling Guan

Dynamic Multimodal Sentiment Analysis: Leveraging Cross-Modal Attention for Enabled Classification

This paper explores the development of a multimodal sentiment analysis model that integrates text, audio, and visual data to enhance sentiment classification. The goal is to improve emotion detection by capturing the complex interactions…

Computation and Language · Computer Science 2025-01-15 Hui Lee , Singh Suniljit , Yong Siang Ong

Deep Generalized Canonical Correlation Analysis

We present Deep Generalized Canonical Correlation Analysis (DGCCA) -- a method for learning nonlinear transformations of arbitrarily many views of data, such that the resulting transformations are maximally informative of each other. While…

Machine Learning · Computer Science 2017-06-16 Adrian Benton , Huda Khayrallah , Biman Gujral , Dee Ann Reisinger , Sheng Zhang , Raman Arora

A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis

Multimodal sentiment analysis has attracted increasing attention with broad application prospects. The existing methods focuses on single modality, which fails to capture the social media content for multiple modalities. Moreover, in…

Multimedia · Computer Science 2022-05-11 Ashima Yadav , Dinesh Kumar Vishwakarma

Category-Based Deep CCA for Fine-Grained Venue Discovery from Multimodal Data

In this work, travel destination and business location are taken as venues. Discovering a venue by a photo is very important for context-aware applications. Unfortunately, few efforts paid attention to complicated real images such as venue…

Computer Vision and Pattern Recognition · Computer Science 2018-05-09 Yi Yu , Suhua Tang , Kiyoharu Aizawa , Akiko Aizawa

Audio-Visual Embedding for Cross-Modal MusicVideo Retrieval through Supervised Deep CCA

Deep learning has successfully shown excellent performance in learning joint representations between different data modalities. Unfortunately, little research focuses on cross-modal correlation learning where temporal structures of…

Multimedia · Computer Science 2019-08-13 Donghuo Zeng , Yi Yu , Keizo Oyama

Citation Recommendation using Deep Canonical Correlation Analysis

Recent advances in citation recommendation have improved accuracy by leveraging multi-view representation learning to integrate the various modalities present in scholarly documents. However, effectively combining multiple data views…

Information Retrieval · Computer Science 2025-07-24 Conor McNamara , Effirul Ramlan

An Enhanced Dual Transformer Contrastive Network for Multimodal Sentiment Analysis

Multimodal Sentiment Analysis (MSA) seeks to understand human emotions by jointly analyzing data from multiple modalities typically text and images offering a richer and more accurate interpretation than unimodal approaches. In this paper,…

Machine Learning · Computer Science 2025-10-29 Phuong Q. Dao , Mark Roantree , Vuong M. Ngo

Variational Interpretable Learning from Multi-view Data

The main idea of canonical correlation analysis (CCA) is to map different views onto a common latent space with maximum correlation. We propose a deep interpretable variational canonical correlation analysis (DICCA) for multi-view learning.…

Machine Learning · Statistics 2022-03-03 Lin Qiu , Lynn Lin , Vernon M. Chinchilli

Multimodal Emotion Recognition and Sentiment Analysis in Multi-Party Conversation Contexts

Emotion recognition and sentiment analysis are pivotal tasks in speech and language processing, particularly in real-world scenarios involving multi-party, conversational data. This paper presents a multimodal approach to tackle these…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Aref Farhadipour , Hossein Ranjbar , Masoumeh Chapariniya , Teodora Vukovic , Sarah Ebling , Volker Dellwo

Multimodal Sentiment Analysis on CMU-MOSEI Dataset using Transformer-based Models

This project performs multimodal sentiment analysis using the CMU-MOSEI dataset, using transformer-based models with early fusion to integrate text, audio, and visual modalities. We employ BERT-based encoders for each modality, extracting…

Computation and Language · Computer Science 2025-07-16 Jugal Gajjar , Kaustik Ranaware

Deep Variational Canonical Correlation Analysis

We present deep variational canonical correlation analysis (VCCA), a deep multi-view learning model that extends the latent variable model interpretation of linear CCA to nonlinear observation models parameterized by deep neural networks.…

Machine Learning · Computer Science 2017-02-28 Weiran Wang , Xinchen Yan , Honglak Lee , Karen Livescu

The Labeled Multiple Canonical Correlation Analysis for Information Fusion

The objective of multimodal information fusion is to mathematically analyze information carried in different sources and create a new representation which will be more effectively utilized in pattern recognition and other multimedia…

Computer Vision and Pattern Recognition · Computer Science 2021-03-02 Lei Gao , Rui Zhang , Lin Qi , Enqing Chen , Ling Guan

Enhancing Sentiment Analysis through Multimodal Fusion: A BERT-DINOv2 Approach

Multimodal sentiment analysis enhances conventional sentiment analysis, which traditionally relies solely on text, by incorporating information from different modalities such as images, text, and audio. This paper proposes a novel…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Taoxu Zhao , Meisi Li , Kehao Chen , Liye Wang , Xucheng Zhou , Kunal Chaturvedi , Mukesh Prasad , Ali Anaissi , Ali Braytee

Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis

We study the problem of acoustic feature learning in the setting where we have access to another (non-acoustic) modality for feature learning but not at test time. We use deep variational canonical correlation analysis (VCCA), a recently…

Computer Vision and Pattern Recognition · Computer Science 2017-09-01 Qingming Tang , Weiran Wang , Karen Livescu

Multi-Modal Opinion Integration for Financial Sentiment Analysis using Cross-Modal Attention

In recent years, financial sentiment analysis of public opinion has become increasingly important for market forecasting and risk assessment. However, existing methods often struggle to effectively integrate diverse opinion modalities and…

Machine Learning · Computer Science 2025-12-04 Yujing Liu , Chen Yang