Related papers: Discrete Diffusion Language Model for Efficient Te…

200 papers

Extractive summarization of long documents is bottlenecked by quadratic complexity, often forcing truncation and limiting deployment in resource-constrained settings. We introduce the first Mamba-Transformer hybrid for extractive…

Computation and Language · Computer Science 2026-03-03 Nisrine Ait Khayi

Diffusion language models (DLMs) have emerged as a promising alternative to autoregressive (AR) generation, yet their reliance on Transformer backbones limits inference efficiency due to quadratic attention or KV-cache overhead. We…

Machine Learning · Computer Science 2026-03-02 Vaibhav Singh, Oleksiy Ostapenko, Pierre-André Noël, Eugene Belilovsky, Torsten Scholak

The advancements in deep learning, particularly the introduction of transformers, have been pivotal in enhancing various natural language processing (NLP) tasks. These include text-to-text applications such as machine translation, text…

Artificial Intelligence · Computer Science 2024-12-24 Gospel Ozioma Nnadi, Flavio Bertini

Recent neural sequence-to-sequence models have provided feasible solutions for abstractive summarization. However, such models still struggle with long-range text dependencies in the summarization task. A high-quality summarization system…

Computation and Language · Computer Science 2019-12-25 Pengcheng Liao, Chuang Zhang, Xiaojun Chen, Xiaofei Zhou

Automatic text summarization extracts important information from texts and presents the information in the form of a summary. Abstractive summarization approaches progressed significantly by switching to deep neural networks, but results…

Computation and Language · Computer Science 2021-09-03 Aleš Žagar, Marko Robnik-Šikonja

Pre-trained diffusion models have demonstrated remarkable proficiency in synthesizing images across a wide range of scenarios with customizable prompts, indicating their effective capacity to capture universal features. Motivated by this,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Yuxiang Ji, Boyong He, Chenyuan Qu, Zhuoyue Tan, Chuan Qin, Liaoni Wu

Diffusion models have shown exceptional scaling properties in the image synthesis domain, and initial attempts have shown similar benefits for applying diffusion to unconditional text synthesis. Denoising diffusion models attempt to…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-17 Matthew Baas, Kevin Eloff, Herman Kamper

Diffusion models have achieved great success in image generation, with the backbone evolving from U-Net to Vision Transformers. However, the computational cost of Transformers is quadratic in the number of tokens, leading to significant…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Yao Teng, Yue Wu, Han Shi, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu

Diffusion models are primarily trained for image synthesis, yet their denoising trajectories encode rich, spatially aligned visual priors. In this paper, we demonstrate that these priors can be utilized for text-conditioned semantic and…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Haoxiao Wang, Antao Xiang, Haiyang Sun, Peilin Sun, Changhao Pan, Yifu Chen, Minjie Hong, Weijie Wang, Shuang Chen, Yue Chen, Zhou Zhao

Conditional diffusion models have exhibited superior performance in high-fidelity text-guided visual generation and editing. Nevertheless, prevailing text-guided visual diffusion models primarily focus on incorporating text-visual…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Ling Yang, Zhilong Zhang, Zhaochen Yu, Jingwei Liu, Minkai Xu, Stefano Ermon, Bin Cui

Deep learning models like Convolutional Neural Networks and transformers have shown impressive capabilities in speech verification, gaining considerable attention in the research community. However, CNN-based approaches struggle with…

Audio and Speech Processing · Electrical Eng. & Systems 2024-12-17 Yang Liu, Li Wan, Yiteng Huang, Ming Sun, Yangyang Shi, Florian Metze

In scene text detection, Transformer-based methods have addressed the global feature extraction limitations inherent in traditional convolutional neural network-based methods. However, most directly rely on native Transformer attention layers…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Qiyan Zhao, Yue Yan, Da-Han Wang

Recent Transformer-based diffusion models have shown remarkable performance, largely attributed to the ability of the self-attention mechanism to accurately capture both global and local contexts by computing all-pair interactions among…

Computer Vision and Pattern Recognition · Computer Science 2024-09-20 Yunxiang Fu, Chaoqi Chen, Yizhou Yu

Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of text data, we can simply project tokens in a continuous space of embeddings, as…

Diffusion models based on iterative denoising have recently been proposed and applied to various generation tasks such as image generation. However, because they are inherently built for continuous data, existing diffusion models still have…

Computation and Language · Computer Science 2023-04-11 Jiaao Chen, Aston Zhang, Mu Li, Alex Smola, Diyi Yang

Neural summarization models suffer from the fixed-size input limitation: if text length surpasses the model's maximal number of input tokens, some document content (possibly summary-relevant) gets truncated. Independently summarizing windows…

Computation and Language · Computer Science 2020-04-08 Leon Schüller, Florian Wilhelm, Nico Kreiling, Goran Glavaš

Recent Transformer-based summarization models have provided a promising approach to abstractive summarization. They go beyond sentence selection and extractive strategies to deal with more complicated tasks such as novel word generation and…

Computation and Language · Computer Science 2023-02-09 Sajad Sotudeh, Hanieh Deilamsalehy, Franck Dernoncourt, Nazli Goharian

While the Mamba architecture demonstrates superior inference efficiency and competitive performance on short-context natural language processing (NLP) tasks, empirical evidence suggests its capacity to comprehend long contexts is limited…

Computation and Language · Computer Science 2025-01-03 Danlong Yuan, Jiahao Liu, Bei Li, Huishuai Zhang, Jingang Wang, Xunliang Cai, Dongyan Zhao

Enhancing and preserving the readability of document images, particularly historical ones, is crucial for effective document image analysis. Numerous models have been proposed for this task, including convolutional-based, transformer-based,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-31 Mohd. Azfar, Siddhant Bharadwaj, Ashwin Sasikumar

Diffusion models achieve state-of-the-art generative performance but suffer from high computational costs during inference due to the repeated evaluation of a heavy neural network. In this work, we propose Dual-Rate Diffusion, a method to…

Machine Learning · Computer Science 2026-05-19 Grigory Bartosh, David Ruhe, Emiel Hoogeboom, Jonathan Heek, Thomas Mensink, Tim Salimans