Related papers: Multinomial Inverse Regression for Text Analysis

Distributed multinomial regression

This article introduces a model-based approach to distributed computing for multinomial logistic (softmax) regression. We treat counts for each response category as independent Poisson regressions via plug-in estimates for fixed effects…

Applications · Statistics 2015-11-06 Matt Taddy

Measuring political sentiment on Twitter: factor-optimal design for multinomial inverse regression

This article presents a short case study in text analysis: the scoring of Twitter posts for positive, negative, or neutral sentiment directed towards particular US politicians. The study requires selection of a sub-sample of representative…

Applications · Statistics 2013-03-05 Matt Taddy

Sentence-Level Sentiment Analysis of Financial News Using Distributed Text Representations and Multi-Instance Learning

Researchers and financial professionals require robust computerized tools that allow users to rapidly operationalize and assess the semantic textual content in financial news. However, existing methods commonly work at the document-level…

Information Retrieval · Computer Science 2019-01-03 Bernhard Lutz , Nicolas Pröllochs , Dirk Neumann

Language Independent Sentiment Analysis

Social media platforms and online forums generate rapid and increasing amount of textual data. Businesses, government agencies, and media organizations seek to perform sentiment analysis on this rich text data. The results of these…

Computation and Language · Computer Science 2020-09-29 Muhammad Haroon Shakeel , Turki Alghamidi , Safi Faizullah , Imdadullah Khan

Exploring The Contribution of Unlabeled Data in Financial Sentiment Analysis

With the proliferation of its applications in various industries, sentiment analysis by using publicly available web data has become an active research area in text classification during these years. It is argued by researchers that…

Computation and Language · Computer Science 2013-08-06 Jimmy SJ. Ren , Wei Wang , Jiawei Wang , Stephen Shaoyi Liao

Semisupervised Autoencoder for Sentiment Analysis

In this paper, we investigate the usage of autoencoders in modeling textual data. Traditional autoencoders suffer from at least two aspects: scalability with the high dimensionality of vocabulary size and dealing with task-irrelevant words.…

Machine Learning · Computer Science 2015-12-15 Shuangfei Zhai , Zhongfei Zhang

Prediction regions through Inverse Regression

Predict a new response from a covariate is a challenging task in regression, which raises new question since the era of high-dimensional data. In this paper, we are interested in the inverse regression method from a theoretical viewpoint.…

Statistics Theory · Mathematics 2018-07-10 Emilie Devijver , Emeline Perthame

A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification

With the advance of large language models (LLMs), LLMs have been utilized for the various tasks. However, the issues of variability and reproducibility of results from each trial of LLMs have been largely overlooked in existing literature…

Computation and Language · Computer Science 2025-05-08 Junichiro Niimi

Large Language Models for Statistical Inference: Context Augmentation with Applications to the Two-Sample Problem and Regression

We introduce context augmentation, a data-augmentation approach that uses large language models (LLMs) to generate contexts around observed strings as a means of facilitating valid frequentist inference. These generated contexts serve to…

Methodology · Statistics 2025-07-01 Marc Ratkovic

Logistic regression models for aggregated data

Logistic regression models are a popular and effective method to predict the probability of categorical response data. However inference for these models can become computationally prohibitive for large datasets. Here we adapt ideas from…

Methodology · Statistics 2020-08-25 Tom Whitaker , Boris Beranger , Scott A. Sisson

Ensembling Multilingual Transformers for Robust Sentiment Analysis of Tweets

Sentiment analysis is a very important natural language processing activity in which one identifies the polarity of a text, whether it conveys positive, negative, or neutral sentiment. Along with the growth of social media and the Internet,…

Computation and Language · Computer Science 2025-09-30 Meysam Shirdel Bilehsavar , Negin Mahmoudi , Mohammad Jalili Torkamani , Kiana Kiashemshaki

Target-oriented Multimodal Sentiment Classification with Counterfactual-enhanced Debiasing

Target-oriented multimodal sentiment classification seeks to predict sentiment polarity for specific targets from image-text pairs. While existing works achieve competitive performance, they often over-rely on textual content and fail to…

Computation and Language · Computer Science 2025-09-12 Zhiyue Liu , Fanrong Ma , Xin Ling

Practical Text Classification With Large Pre-Trained Language Models

Multi-emotion sentiment classification is a natural language processing (NLP) problem with valuable use cases on real-world data. We demonstrate that large-scale unsupervised language modeling combined with finetuning offers a practical…

Computation and Language · Computer Science 2018-12-05 Neel Kant , Raul Puri , Nikolai Yakovenko , Bryan Catanzaro

Learning Sentiment Memories for Sentiment Modification without Parallel Data

The task of sentiment modification requires reversing the sentiment of the input and preserving the sentiment-independent content. However, aligned sentences with the same content but different sentiments are usually unavailable. Due to the…

Computation and Language · Computer Science 2018-08-23 Yi Zhang , Jingjing Xu , Pengcheng Yang , Xu Sun

Generative Sentiment Analysis via Latent Category Distribution and Constrained Decoding

Fine-grained sentiment analysis involves extracting and organizing sentiment elements from textual data. However, existing approaches often overlook issues of category semantic inclusion and overlap, as well as inherent structural patterns…

Computation and Language · Computer Science 2024-08-01 Jun Zhou , Dongyang Yu , Kamran Aziz , Fangfang Su , Qing Zhang , Fei Li , Donghong Ji

Towards Lossless Encoding of Sentences

A lot of work has been done in the field of image compression via machine learning, but not much attention has been given to the compression of natural language. Compressing text into lossless representations while making features easily…

Computation and Language · Computer Science 2019-08-05 Gabriele Prato , Mathieu Duchesneau , Sarath Chandar , Alain Tapp

Low-dimensional Semantic Space: from Text to Word Embedding

This article focuses on the study of Word Embedding, a feature-learning technique in Natural Language Processing that maps words or phrases to low-dimensional vectors. Beginning with the linguistic theories concerning contextual…

Computation and Language · Computer Science 2019-11-05 Xiaolei Lu , Bin Ni

Multimodal Sentiment Analysis To Explore the Structure of Emotions

We propose a novel approach to multimodal sentiment analysis using deep neural networks combining visual analysis and natural language processing. Our goal is different than the standard sentiment analysis goal of predicting whether a…

Machine Learning · Statistics 2018-05-28 Anthony Hu , Seth Flaxman

Machine Learning Sentiment Prediction based on Hybrid Document Representation

Automated sentiment analysis and opinion mining is a complex process concerning the extraction of useful subjective information from text. The explosion of user generated content on the Web, especially the fact that millions of users, on a…

Computation and Language · Computer Science 2015-12-01 Panagiotis Stalidis , Maria Giatsoglou , Konstantinos Diamantaras , George Sarigiannidis , Konstantinos Ch. Chatzisavvas

Data Selection Strategies for Multi-Domain Sentiment Analysis

Domain adaptation is important in sentiment analysis as sentiment-indicating words vary between domains. Recently, multi-domain adaptation has become more pervasive, but existing approaches train on all available source domains including…

Computation and Language · Computer Science 2017-02-09 Sebastian Ruder , Parsa Ghaffari , John G. Breslin