Related papers: Methods to integrate a language model with semanti…

Applying Part-of-Speech Enhanced LSA to Automatic Essay Grading

Latent Semantic Analysis (LSA) is a widely used Information Retrieval method based on the "bag-of-words" assumption. However, according to general conception, syntax plays a role in representing the meaning of sentences. Thus, enhancing LSA with…

Information Retrieval · Computer Science 2007-05-23 Tuomo Kakkonen, Niko Myller, Erkki Sutinen

An Enhanced Latent Semantic Analysis Approach for Arabic Document Summarization

The fast-growing amount of information on the Internet makes the research in automatic document summarization very urgent. It is an effective solution for information overload. Many approaches have been proposed based on different…

Computation and Language · Computer Science 2018-08-01 Kamal Al-Sabahi, Zuping Zhang, Jun Long, Khaled Alwesabi

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis is a novel statistical technique for the analysis of two-mode and co-occurrence data, which has applications in information retrieval and filtering, natural language processing, machine learning from…

Machine Learning · Computer Science 2013-01-30 Thomas Hofmann

How Does Latent Semantic Analysis Work? A Visualisation Approach

By using a small example, an analogy to photographic compression, and a simple visualization using heatmaps, we show that latent semantic analysis (LSA) is able to extract what appears to be semantic meaning of words from a set of documents…

Computation and Language · Computer Science 2014-02-05 Jan Koeman, William Rea

Spoken Grammar Assessment Using LLM

Spoken language assessment (SLA) systems restrict themselves to evaluating the pronunciation and oral fluency of a speaker by analysing the read and spontaneous spoken utterances respectively. The assessment of language grammar or…

Computation and Language · Computer Science 2024-10-03 Sunil Kumar Kopparapu, Chitralekha Bhat, Ashish Panda

Evaluating statistical language models as pragmatic reasoners

The relationship between communicated language and intended meaning is often probabilistic and sensitive to context. Numerous strategies attempt to estimate such a mapping, often leveraging recursive Bayesian models of communication. In…

Computation and Language · Computer Science 2023-05-03 Benjamin Lipkin, Lionel Wong, Gabriel Grand, Joshua B Tenenbaum

Predictive Prompt Analysis

Large Language Models (LLMs) are machine learning models that have seen widespread adoption due to their capability of handling previously difficult tasks. LLMs, due to their training, are sensitive to how exactly a question is presented,…

Software Engineering · Computer Science 2025-12-22 Jae Yong Lee, Sungmin Kang, Shin Yoo

Probabilistic Latent Semantic Analysis (PLSA) untuk Klasifikasi Dokumen Teks Berbahasa Indonesia

One task that is included in managing documents is how to find substantial information inside. Topic modeling is a technique that has been developed to produce document representation in the form of keywords. The keywords will be used in the…

Computation and Language · Computer Science 2015-12-03 Derwin Suhartono

Neural Latent Relational Analysis to Capture Lexical Semantic Relations in a Vector Space

Capturing the semantic relations of words in a vector space contributes to many natural language processing tasks. One promising approach exploits lexico-syntactic patterns as features of word pairs. In this paper, we propose a novel model…

Computation and Language · Computer Science 2018-09-11 Koki Washio, Tsuneaki Kato

Structured Language Modeling for Speech Recognition

A new language model for speech recognition is presented. The model develops hidden hierarchical syntactic-like structure incrementally and uses it to extract meaningful information from the word history, thus complementing the locality of…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba, Frederick Jelinek

Human-Level Performance on Word Analogy Questions by Latent Relational Analysis

This paper introduces Latent Relational Analysis (LRA), a method for measuring relational similarity. LRA has potential applications in many areas, including information extraction, word sense disambiguation, machine translation, and…

Computation and Language · Computer Science 2007-05-23 Peter D. Turney

Diseño de un espacio semántico sobre la base de la Wikipedia. Una propuesta de análisis de la semántica latente para el idioma español

Latent Semantic Analysis (LSA) was initially conceived within cognitive psychology in the 1990s. Since its emergence, LSA has been used to model cognitive processes, pointing out academic texts, compare literature works and analyse…

Computation and Language · Computer Science 2019-02-07 Dalina Aidee Villa, Igor Barahona, Luis Javier Álvarez

Always Keep your Target in Mind: Studying Semantics and Improving Performance of Neural Lexical Substitution

Lexical substitution, i.e. generation of plausible words that can replace a particular target word in a given context, is an extremely powerful technology that can be used as a backbone of various NLP applications, including word sense…

Computation and Language · Computer Science 2023-05-24 Nikolay Arefyev, Boris Sheludko, Alexander Podolskiy, Alexander Panchenko

Latent Tree Language Model

In this paper we introduce Latent Tree Language Model (LTLM), a novel approach to language modeling that encodes syntax and semantics of a given sentence as a tree of word roles. The learning phase iteratively updates the trees by moving…

Computation and Language · Computer Science 2016-09-06 Tomas Brychcin

Semantic-preserved Augmentation with Confidence-weighted Fine-tuning for Aspect Category Sentiment Analysis

Large language models (LLMs) are an effective approach to addressing data scarcity in low-resource scenarios. Recent research designs hand-crafted prompts to guide LLMs for data augmentation. We introduce a data augmentation strategy…

Computation and Language · Computer Science 2025-06-10 Yaping Chai, Haoran Xie, Joe S. Qin

N-gram-Based Low-Dimensional Representation for Document Classification

The bag-of-words (BOW) model is the common approach for classifying documents, where words are used as features for training a classifier. This generally involves a huge number of features. Some techniques, such as Latent Semantic Analysis…

Computation and Language · Computer Science 2015-04-13 Rémi Lebret, Ronan Collobert

How to Make the Most of LLMs' Grammatical Knowledge for Acceptability Judgments

The grammatical knowledge of language models (LMs) is often measured using a benchmark of linguistic minimal pairs, where the LMs are presented with a pair of acceptable and unacceptable sentences and required to judge which is more…

Computation and Language · Computer Science 2025-02-10 Yusuke Ide, Yuto Nishida, Justin Vasselli, Miyu Oba, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe

Latent Relation Language Models

In this paper, we propose Latent Relation Language Models (LRLMs), a class of language models that parameterizes the joint distribution over the words in a document and the entities that occur therein via knowledge graph relations. This…

Computation and Language · Computer Science 2019-08-22 Hiroaki Hayashi, Zecong Hu, Chenyan Xiong, Graham Neubig

Latent Semantic Analysis Approach for Document Summarization Based on Word Embeddings

Since the amount of information on the internet is growing rapidly, it is not easy for a user to find relevant information for his/her query. To tackle this issue, much attention has been paid to Automatic Document Summarization. The key…

Computation and Language · Computer Science 2019-02-05 Kamal Al-Sabahi, Zhang Zuping, Yang Kang

Exploiting Syntactic Structure for Language Modeling

The paper presents a language model that develops syntactic structure and uses it to extract meaningful information from the word history, thus enabling the use of long distance dependencies. The model assigns probability to every joint…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba, Frederick Jelinek