Related papers: Discrete Diffusion Language Model for Efficient Te…

Efficient Extractive Summarization with MAMBA-Transformer Hybrids for Low-Resource Scenarios

Extractive summarization of long documents is bottlenecked by quadratic complexity, often forcing truncation and limiting deployment in resource-constrained settings. We introduce the first Mamba-Transformer hybrid for extractive…

Computation and Language · Computer Science 2026-03-03 Nisrine Ait Khayi

DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone

Diffusion language models (DLMs) have emerged as a promising alternative to autoregressive (AR) generation, yet their reliance on Transformer backbones limits inference efficiency due to quadratic attention or KV-cache overhead. We…

Machine Learning · Computer Science 2026-03-02 Vaibhav Singh , Oleksiy Ostapenko , Pierre-André Noël , Eugene Belilovsky , Torsten Scholak

Survey on Abstractive Text Summarization: Dataset, Models, and Metrics

The advancements in deep learning, particularly the introduction of transformers, have been pivotal in enhancing various natural language processing (NLP) tasks. These include text-to-text applications such as machine translation, text…

Artificial Intelligence · Computer Science 2024-12-24 Gospel Ozioma Nnadi , Flavio Bertini

Improving Abstractive Text Summarization with History Aggregation

Recent neural sequence to sequence models have provided feasible solutions for abstractive summarization. However, such models are still hard to tackle long text dependency in the summarization task. A high-quality summarization system…

Computation and Language · Computer Science 2019-12-25 Pengcheng Liao , Chuang Zhang , Xiaojun Chen , Xiaofei Zhou

Cross-lingual Transfer of Abstractive Summarizer to Less-resource Language

Automatic text summarization extracts important information from texts and presents the information in the form of a summary. Abstractive summarization approaches progressed significantly by switching to deep neural networks, but results…

Computation and Language · Computer Science 2021-09-03 Aleš Žagar , Marko Robnik-Šikonja

Diffusion Features to Bridge Domain Gap for Semantic Segmentation

Pre-trained diffusion models have demonstrated remarkable proficiency in synthesizing images across a wide range of scenarios with customizable prompts, indicating their effective capacity to capture universal features. Motivated by this,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Yuxiang Ji , Boyong He , Chenyuan Qu , Zhuoyue Tan , Chuan Qin , Liaoni Wu

TransFusion: Transcribing Speech with Multinomial Diffusion

Diffusion models have shown exceptional scaling properties in the image synthesis domain, and initial attempts have shown similar benefits for applying diffusion to unconditional text synthesis. Denoising diffusion models attempt to…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-17 Matthew Baas , Kevin Eloff , Herman Kamper

DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Diffusion models have achieved great success in image generation, with the backbone evolving from U-Net to Vision Transformers. However, the computational cost of Transformers is quadratic to the number of tokens, leading to significant…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Yao Teng , Yue Wu , Han Shi , Xuefei Ning , Guohao Dai , Yu Wang , Zhenguo Li , Xihui Liu

Diffusion Model as a Generalist Segmentation Learner

Diffusion models are primarily trained for image synthesis, yet their denoising trajectories encode rich, spatially aligned visual priors. In this paper, we demonstrate that these priors can be utilized for text-conditioned semantic and…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Haoxiao Wang , Antao Xiang , Haiyang Sun , Peilin Sun , Changhao Pan , Yifu Chen , Minjie Hong , Weijie Wang , Shuang Chen , Yue Chen , Zhou Zhao

Contextualized Diffusion Models for Text-Guided Image and Video Generation

Conditional diffusion models have exhibited superior performance in high-fidelity text-guided visual generation and editing. Nevertheless, prevailing text-guided visual diffusion models primarily focus on incorporating text-visual…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Ling Yang , Zhilong Zhang , Zhaochen Yu , Jingwei Liu , Minkai Xu , Stefano Ermon , Bin Cui

MASV: Speaker Verification with Global and Local Context Mamba

Deep learning models like Convolutional Neural Networks and transformers have shown impressive capabilities in speech verification, gaining considerable attention in the research community. However, CNN-based approaches struggle with…

Audio and Speech Processing · Electrical Eng. & Systems 2024-12-17 Yang Liu , Li Wan , Yiteng Huang , Ming Sun , Yangyang Shi , Florian Metze

TextMamba: Scene Text Detector with Mamba

In scene text detection, Transformer-based methods have addressed the global feature extraction limitations inherent in traditional convolution neural network-based methods. However, most directly rely on native Transformer attention layers…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Qiyan Zhao , Yue Yan , Da-Han Wang

LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba

Recent Transformer-based diffusion models have shown remarkable performance, largely attributed to the ability of the self-attention mechanism to accurately capture both global and local contexts by computing all-pair interactions among…

Computer Vision and Pattern Recognition · Computer Science 2024-09-20 Yunxiang Fu , Chaoqi Chen , Yizhou Yu

Self-conditioned Embedding Diffusion for Text Generation

Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of text data, we can simply project tokens in a continuous space of embeddings, as…

Computation and Language · Computer Science 2022-11-09 Robin Strudel , Corentin Tallec , Florent Altché , Yilun Du , Yaroslav Ganin , Arthur Mensch , Will Grathwohl , Nikolay Savinov , Sander Dieleman , Laurent Sifre , Rémi Leblond

A Cheaper and Better Diffusion Language Model with Soft-Masked Noise

Diffusion models that are based on iterative denoising have been recently proposed and leveraged in various generation tasks like image generation. Whereas, as a way inherently built for continuous data, existing diffusion models still have…

Computation and Language · Computer Science 2023-04-11 Jiaao Chen , Aston Zhang , Mu Li , Alex Smola , Diyi Yang

Windowing Models for Abstractive Summarization of Long Texts

Neural summarization models suffer from the fixed-size input limitation: if text length surpasses the model's maximal number of input tokens, some document content (possibly summary-relevant) gets truncated Independently summarizing windows…

Computation and Language · Computer Science 2020-04-08 Leon Schüller , Florian Wilhelm , Nico Kreiling , Goran Glavaš

Curriculum-Guided Abstractive Summarization

Recent Transformer-based summarization models have provided a promising approach to abstractive summarization. They go beyond sentence selection and extractive strategies to deal with more complicated tasks such as novel word generation and…

Computation and Language · Computer Science 2023-02-09 Sajad Sotudeh , Hanieh Deilamsalehy , Franck Dernoncourt , Nazli Goharian

ReMamba: Equip Mamba with Effective Long-Sequence Modeling

While the Mamba architecture demonstrates superior inference efficiency and competitive performance on short-context natural language processing (NLP) tasks, empirical evidence suggests its capacity to comprehend long contexts is limited…

Computation and Language · Computer Science 2025-01-03 Danlong Yuan , Jiahao Liu , Bei Li , Huishuai Zhang , Jingang Wang , Xunliang Cai , Dongyan Zhao

Adaptive Multi Scale Document Binarisation Using Vision Mamba

Enhancing and preserving the readability of document images, particularly historical ones, is crucial for effective document image analysis. Numerous models have been proposed for this task, including convolutional-based, transformer-based,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-31 Mohd. Azfar , Siddhant Bharadwaj , Ashwin Sasikumar

Dual-Rate Diffusion: Accelerating diffusion models with an interleaved heavy-light network

Diffusion models achieve state-of-the-art generative performance but suffer from high computational costs during inference due to the repeated evaluation of a heavy neural network. In this work, we propose Dual-Rate Diffusion, a method to…

Machine Learning · Computer Science 2026-05-19 Grigory Bartosh , David Ruhe , Emiel Hoogeboom , Jonathan Heek , Thomas Mensink , Tim Salimans