English
Related papers

Related papers: Learning Sparse Prototypes for Text Generation

200 papers

Current state-of-the-art text generators build on powerful language models such as GPT-2, achieving impressive performance. However, to avoid degenerate text, they require sampling from a modified softmax, via temperature parameters or…

Computation and Language · Computer Science 2020-10-06 Pedro Henrique Martins , Zita Marinho , André F. T. Martins

Modern large-scale Pre-trained Language Models (PLMs) have achieved tremendous success on a wide range of downstream tasks. However, most of the LM pre-training objectives only focus on text reconstruction, but have not sought to learn…

Computation and Language · Computer Science 2022-10-28 Liliang Ren , Zixuan Zhang , Han Wang , Clare R. Voss , Chengxiang Zhai , Heng Ji

It is still a challenging task to learn a neural text generation model under the framework of generative adversarial networks (GANs) since the entire training process is not differentiable. The existing training strategies either suffer…

Computation and Language · Computer Science 2023-07-25 Liping Yuan , Jiehang Zeng , Xiaoqing Zheng

Neural text generation models conditioning on given input (e.g. machine translation and image captioning) are usually trained by maximum likelihood estimation of target text. However, the trained models suffer from various types of errors…

Computation and Language · Computer Science 2020-12-29 Keisuke Shirai , Kazuma Hashimoto , Akiko Eriguchi , Takashi Ninomiya , Shinsuke Mori

We propose a new generative model of sentences that first samples a prototype sentence from the training corpus and then edits it into a new sentence. Compared to traditional models that generate from scratch either left-to-right or by…

Computation and Language · Computer Science 2018-09-10 Kelvin Guu , Tatsunori B. Hashimoto , Yonatan Oren , Percy Liang

Current approaches in paraphrase generation and detection heavily rely on a single general similarity score, ignoring the intricate linguistic properties of language. This paper introduces two new tasks to address this shortcoming by…

Computation and Language · Computer Science 2024-07-17 Jan Philip Wahle , Bela Gipp , Terry Ruas

Generic generation and manipulation of text is challenging and has limited success compared to recent deep generative modeling in visual domain. This paper aims at generating plausible natural language sentences, whose attributes are…

Machine Learning · Computer Science 2018-09-14 Zhiting Hu , Zichao Yang , Xiaodan Liang , Ruslan Salakhutdinov , Eric P. Xing

Sparsity is a desirable attribute. It can lead to more efficient and more effective representations compared to the dense model. Meanwhile, learning sparse latent representations has been a challenging problem in the field of computer…

Computer Vision and Pattern Recognition · Computer Science 2022-09-22 Hanao Li , Tian Han

Recent neural approaches to data-to-text generation have mostly focused on improving content fidelity while lacking explicit control over writing styles (e.g., word choices, sentence structures). More traditional systems use templates to…

Computation and Language · Computer Science 2020-10-12 Shuai Lin , Wentao Wang , Zichao Yang , Xiaodan Liang , Frank F. Xu , Eric Xing , Zhiting Hu

Past work on story generation has demonstrated the usefulness of conditioning on a generation plan to generate coherent stories. However, these approaches have used heuristics or off-the-shelf models to first tag training stories with the…

Computation and Language · Computer Science 2020-10-08 Harsh Jhamtani , Taylor Berg-Kirkpatrick

Auto-regressive text generation models usually focus on local fluency, and may cause inconsistent semantic meaning in long text generation. Further, automatically generating words with similar semantics is challenging, and hand-crafted…

Computation and Language · Computer Science 2020-05-05 Ruiyi Zhang , Changyou Chen , Zhe Gan , Wenlin Wang , Dinghan Shen , Guoyin Wang , Zheng Wen , Lawrence Carin

Deep generative modeling of natural languages has achieved many successes, such as producing fluent sentences and translating from one language into another. However, the development of generative modeling techniques for paraphrase…

Computation and Language · Computer Science 2023-11-28 Haotian Luo , Yixin Liu , Peidong Liu , Xianggen Liu

Current efficient fine-tuning methods (e.g., adapters, prefix-tuning, etc.) have optimized conditional text generation via training a small set of extra parameters of the neural language model, while freezing the rest for efficiency. While…

Computation and Language · Computer Science 2022-05-24 Marjan Ghazvininejad , Vladimir Karpukhin , Vera Gor , Asli Celikyilmaz

In this thesis we discuss machine learning methods performing automated variable selection for learning sparse predictive models. There are multiple reasons for promoting sparsity in the predictive models. By relying on a limited set of…

Machine Learning · Computer Science 2019-03-27 Magda Gregorova

Neural language models often fail to generate diverse and informative texts, limiting their applicability in real-world problems. While previous approaches have proposed to address these issues by identifying and penalizing undesirable…

Computation and Language · Computer Science 2023-09-25 Jimin Hong , ChaeHun Park , Jaegul Choo

The dominant text generation models compose the output by sequentially selecting words from a fixed vocabulary. In this paper, we formulate text generation as progressively copying text segments (e.g., words or phrases) from an existing…

Computation and Language · Computer Science 2023-07-17 Tian Lan , Deng Cai , Yan Wang , Heyan Huang , Xian-Ling Mao

With the advances of deep learning techniques, text generation is attracting increasing interest in the artificial intelligence (AI) community, because of its wide applications and because it is an essential component of AI. Traditional…

Computation and Language · Computer Science 2023-09-19 Lili Mou

Diffusion models have been successfully adapted to text generation tasks by mapping the discrete text into the continuous space. However, there exist nonnegligible gaps between training and inference, owing to the absence of the forward…

Computation and Language · Computer Science 2023-05-09 Zecheng Tang , Pinzheng Wang , Keyan Zhou , Juntao Li , Ziqiang Cao , Min Zhang

Providing pretrained language models with simple task descriptions in natural language enables them to solve some tasks in a fully unsupervised fashion. Moreover, when combined with regular learning from examples, this idea yields…

Computation and Language · Computer Science 2021-10-05 Timo Schick , Hinrich Schütze

Much literature has shown that prompt-based learning is an efficient method to make use of the large pre-trained language model. Recent works also exhibit the possibility of steering a chatbot's output by plugging in an appropriate prompt.…

Computation and Language · Computer Science 2022-10-14 Hsuan Su , Pohan Chi , Shih-Cheng Huang , Chung Ho Lam , Saurav Sahay , Shang-Tse Chen , Hung-yi Lee
‹ Prev 1 2 3 10 Next ›