Related papers: Graphmax for Text Generation

F^2-Softmax: Diversifying Neural Text Generation via Frequency Factorized Softmax

Despite recent advances in neural text generation, encoding the rich diversity in human language remains elusive. We argue that the sub-optimal text generation is mainly attributable to the imbalanced token distribution, which particularly…

Computation and Language · Computer Science 2020-10-06 Byung-Ju Choi , Jimin Hong , David Keetae Park , Sang Wan Lee

Graph Neural Networks (GNNs) are powerful deep learning models designed for graph-structured data, demonstrating effectiveness across a wide range of applications.The softmax function is the most commonly used classifier for semi-supervised…

Machine Learning · Computer Science 2024-09-23 Yiming Yang , Jun Liu , Wei Wan

GraphMatch: Fusing Language and Graph Representations in a Dynamic Two-Sided Work Marketplace

Recommending matches in a text-rich, dynamic two-sided marketplace presents unique challenges due to evolving content and interaction graphs. We introduce GraphMatch, a new large-scale recommendation framework that fuses pre-trained…

Machine Learning · Computer Science 2025-12-03 Mikołaj Sacha , Hammad Jafri , Mattie Terzolo , Ayan Sinha , Andrew Rabinovich

Evaluating and Improving Graph to Text Generation with Large Language Models

Large language models (LLMs) have demonstrated immense potential across various tasks. However, research for exploring and improving the capabilities of LLMs in interpreting graph structures remains limited. To address this gap, we conduct…

Computation and Language · Computer Science 2025-02-17 Jie He , Yijun Yang , Wanqiu Long , Deyi Xiong , Victor Gutierrez-Basulto , Jeff Z. Pan

Efficient softmax approximation for GPUs

We propose an approximate strategy to efficiently train neural network based language models over very large vocabularies. Our approach, called adaptive softmax, circumvents the linear dependency on the vocabulary size by exploiting the…

Computation and Language · Computer Science 2017-06-20 Edouard Grave , Armand Joulin , Moustapha Cissé , David Grangier , Hervé Jégou

GraphText: Graph Reasoning in Text Space

Large Language Models (LLMs) have gained the ability to assimilate human knowledge and facilitate natural language interactions with both humans and other LLMs. However, despite their impressive achievements, LLMs have not made significant…

Computation and Language · Computer Science 2023-10-03 Jianan Zhao , Le Zhuo , Yikang Shen , Meng Qu , Kai Liu , Michael Bronstein , Zhaocheng Zhu , Jian Tang

Planning with Logical Graph-based Language Model for Instruction Generation

Despite the superior performance of large language models to generate natural language texts, it is hard to generate texts with correct logic according to a given task, due to the difficulties for neural models to capture implied rules from…

Computation and Language · Computer Science 2024-07-08 Fan Zhang , Kebing Jin , Hankz Hankui Zhuo

GraphMind: Theorem Selection and Conclusion Generation Framework with Dynamic GNN for LLM Reasoning

Large language models (LLMs) have demonstrated impressive capabilities in natural language understanding and generation, including multi-step reasoning such as mathematical proving. However, existing approaches often lack an explicit and…

Computation and Language · Computer Science 2026-05-19 Yutong Li , Yitian Zhou , Xudong Wang , GuoChen , Caiyan Qin

Graph-based Approach to Automatic Taxonomy Generation (GraBTax)

We propose a novel graph-based approach for constructing concept hierarchy from a large text corpus. Our algorithm, GraBTax, incorporates both statistical co-occurrences and lexical similarity in optimizing the structure of the taxonomy. To…

Information Retrieval · Computer Science 2014-04-30 Pucktada Treeratpituk , Madian Khabsa , C. Lee Giles

Automatic Text Extractive Summarization Based on Graph and Pre-trained Language Model Attention

Representing a text as a graph for obtaining automatic text summarization has been investigated for over ten years. With the development of attention or Transformer on natural language processing (NLP), it is possible to make a connection…

Computation and Language · Computer Science 2022-07-27 Yuan-Ching Lin , Jinwen Ma

Speeding Up Entmax

Softmax is the de facto standard in modern neural networks for language processing when it comes to normalizing logits. However, by producing a dense probability distribution each token in the vocabulary has a nonzero chance of being…

Computation and Language · Computer Science 2022-05-20 Maxat Tezekbayev , Vassilina Nikoulina , Matthias Gallé , Zhenisbek Assylbekov

Learning to Screen for Fast Softmax Inference on Large Vocabulary Neural Networks

Neural language models have been widely used in various NLP tasks, including machine translation, next word prediction and conversational agents. However, it is challenging to deploy these models on mobile devices due to their slow…

Machine Learning · Computer Science 2018-10-31 Patrick H. Chen , Si Si , Sanjiv Kumar , Yang Li , Cho-Jui Hsieh

Task Grouping for Multilingual Text Recognition

Most existing OCR methods focus on alphanumeric characters due to the popularity of English and numbers, as well as their corresponding datasets. On extending the characters to more languages, recent methods have shown that training…

Computer Vision and Pattern Recognition · Computer Science 2022-10-17 Jing Huang , Kevin J Liang , Rama Kovvuri , Tal Hassner

Attention Scheme Inspired Softmax Regression

Large language models (LLMs) have made transformed changes for human society. One of the key computation in LLMs is the softmax unit. This operation is important in LLMs because it allows the model to generate a distribution over possible…

Machine Learning · Computer Science 2023-04-27 Yichuan Deng , Zhihang Li , Zhao Song

Graph Linearization Methods for Reasoning on Graphs with Large Language Models

Large language models have evolved to process multiple modalities beyond text, such as images and audio, which motivates us to explore how to effectively leverage them for graph reasoning tasks. The key question, therefore, is how to…

Computation and Language · Computer Science 2025-06-26 Christos Xypolopoulos , Guokan Shang , Xiao Fei , Giannis Nikolentzos , Hadi Abdine , Iakovos Evdaimon , Michail Chatzianastasis , Giorgos Stamou , Michalis Vazirgiannis

Robust Document Representations using Latent Topics and Metadata

Task specific fine-tuning of a pre-trained neural language model using a custom softmax output layer is the de facto approach of late when dealing with document classification problems. This technique is not adequate when labeled examples…

Computation and Language · Computer Science 2020-10-27 Natraj Raman , Armineh Nourbakhsh , Sameena Shah , Manuela Veloso

Discourse-Aware Soft Prompting for Text Generation

Current efficient fine-tuning methods (e.g., adapters, prefix-tuning, etc.) have optimized conditional text generation via training a small set of extra parameters of the neural language model, while freezing the rest for efficiency. While…

Computation and Language · Computer Science 2022-05-24 Marjan Ghazvininejad , Vladimir Karpukhin , Vera Gor , Asli Celikyilmaz

Generating Diverse Descriptions from Semantic Graphs

Text generation from semantic graphs is traditionally performed with deterministic methods, which generate a unique description given an input graph. However, the generation problem admits a range of acceptable textual outputs, exhibiting…

Computation and Language · Computer Science 2021-08-16 Jiuzhou Han , Daniel Beck , Trevor Cohn

Multiplex Graph Neural Network for Extractive Text Summarization

Extractive text summarization aims at extracting the most representative sentences from a given document as its summary. To extract a good summary from a long text document, sentence embedding plays an important role. Recent studies have…

Computation and Language · Computer Science 2021-09-10 Baoyu Jing , Zeyu You , Tao Yang , Wei Fan , Hanghang Tong

Learning Implicit Text Generation via Feature Matching

Generative feature matching network (GFMN) is an approach for training implicit generative models for images by performing moment matching on features from pre-trained neural networks. In this paper, we present new GFMN formulations that…

Computation and Language · Computer Science 2020-05-12 Inkit Padhi , Pierre Dognin , Ke Bai , Cicero Nogueira dos Santos , Vijil Chenthamarakshan , Youssef Mroueh , Payel Das