Related papers: Adapter-TST: A Parameter Efficient Method for Mult…

200 papers

Adapter modules were recently introduced as an efficient alternative to fine-tuning in NLP. Adapter tuning consists in freezing pretrained parameters of a model and injecting lightweight modules between layers, resulting in the addition of…

Computation and Language · Computer Science 2021-07-14 Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier

Parameter-efficient transfer learning (PETL) methods have emerged as a solid alternative to the standard full fine-tuning approach. They only train a few extra parameters for each downstream task, without sacrificing performance and…

Audio and Speech Processing · Electrical Eng. & Systems 2024-07-16 Umberto Cappellazzo, Daniele Falavigna, Alessio Brutti, Mirco Ravanelli

Different languages have distinct phonetic systems and vary in their prosodic features, making it challenging to develop a Text-to-Speech (TTS) model that can effectively synthesise speech in multilingual settings. Furthermore, TTS…

Computation and Language · Computer Science 2024-06-26 Yingting Li, Ambuj Mehrish, Bryan Chew, Bo Cheng, Soujanya Poria

Sentiment transfer aims at revising the input text to satisfy a given sentiment polarity while retaining the original semantic content. The nucleus of sentiment transfer lies in precisely separating the sentiment information from the…

Computation and Language · Computer Science 2023-02-24 Yingze Xie, Jie Xu, LiQiang Qiao, Yun Liu, Feiren Huang, Chaozhuo Li

Fine-tuning of self-supervised models is a powerful transfer learning method in a variety of fields, including speech processing, since it can utilize generic feature representations obtained from large amounts of unlabeled data.…

Multimedia · Computer Science 2022-12-07 Shinta Otake, Rei Kawakami, Nakamasa Inoue

Text Style Transfer (TST) is a pivotal task in natural language generation to manipulate text style attributes while preserving style-independent content. The attributes targeted in TST can vary widely, including politeness, authorship,…

Computation and Language · Computer Science 2024-07-23 Sourabrata Mukherjee, Ondřej Dušek

Fine-tuning is a popular method for adapting text-to-speech (TTS) models to new speakers. However, this approach has some challenges. Usually, fine-tuning requires several hours of high-quality speech per speaker. There is also that…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-02 Cheng-Ping Hsieh, Subhankar Ghosh, Boris Ginsburg

Capitalizing on large pre-trained models for various downstream tasks of interest has recently emerged with promising performance. Due to the ever-growing model size, the standard full fine-tuning based task adaptation strategy becomes…

Computer Vision and Pattern Recognition · Computer Science 2022-10-14 Junting Pan, Ziyi Lin, Xiatian Zhu, Jing Shao, Hongsheng Li

Pre-trained vision-language models provide a robust foundation for efficient transfer learning across various downstream tasks. In the field of video action recognition, mainstream approaches often introduce additional modules to capture…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Haoxing Chen, Zizheng Huang, Yan Hong, Yanshuo Wang, Zhongcai Lyu, Zhuoer Xu, Jun Lan, Zhangxuan Gu

The dominant approach to unsupervised "style transfer" in text is based on the idea of learning a latent representation, which is independent of the attributes specifying its "style". In this paper, we show that this condition is not…

Computation and Language · Computer Science 2019-09-23 Sandeep Subramanian, Guillaume Lample, Eric Michael Smith, Ludovic Denoyer, Marc'Aurelio Ranzato, Y-Lan Boureau

The style transfer task in Text-to-Speech refers to the process of transferring style information into text content to generate corresponding speech with a specific style. However, most existing style transfer approaches are either based on…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-01 Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong

The pretrain-then-finetune paradigm has been widely used in various unimodal and multimodal tasks. However, finetuning all the parameters of a pre-trained model becomes prohibitive as the model size grows exponentially. To address this…

Multimedia · Computer Science 2023-08-29 Hongye Liu, Xianhai Xie, Yang Gao, Size Li, Zhou YU

Text style transfer (TST) involves altering the linguistic style of a text while preserving its core content. This paper focuses on sentiment transfer, a popular TST subtask, across a spectrum of Indian languages: Hindi, Magahi, Malayalam,…

Computation and Language · Computer Science 2024-08-28 Sourabrata Mukherjee, Atul Kr. Ojha, Akanksha Bansal, Deepak Alok, John P. McCrae, Ondřej Dušek

Text Style Transfer (TST) can be performed through approaches such as latent space disentanglement, cycle-consistency losses, and prototype editing. The prototype editing approach, which is known to be quite successful in TST, involves two…

Computation and Language · Computer Science 2022-10-13 Sharan Narasimhan, Pooja Shekar, Suvodip Dey, Maunendra Sankar Desarkar

Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various downstream tasks. However, most fine-tuning approaches update all the parameters of the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-31 Junyi Peng, Themos Stafylakis, Rongzhi Gu, Oldřich Plchot, Ladislav Mošner, Lukáš Burget, Jan Černocký

Recent neural style transfer frameworks have obtained astonishing visual quality and flexibility in Single-style Transfer (SST), but little attention has been paid to Multi-style Transfer (MST) which refers to simultaneously transferring…

Computer Vision and Pattern Recognition · Computer Science 2019-10-30 Zixuan Huang, Jinghuai Zhang, Jing Liao

In the era of information explosion, spatio-temporal data mining serves as a critical part of urban management. Considering the various fields demanding attention, e.g., traffic state, human activity, and social event, predicting multiple…

Artificial Intelligence · Computer Science 2023-09-19 Zijian Zhang, Xiangyu Zhao, Qidong Liu, Chunxu Zhang, Qian Ma, Wanyu Wang, Hongwei Zhao, Yiqi Wang, Zitao Liu

There are significant challenges for speaker adaptation in text-to-speech for languages that are not widely spoken or for speakers with accents or dialects that are not well-represented in the training data. To address this issue, we…

Sound · Computer Science 2023-05-30 Ambuj Mehrish, Abhinav Ramesh Kashyap, Li Yingting, Navonil Majumder, Soujanya Poria

This paper addresses the issues of parameter redundancy, rigid structure, and limited task adaptability in the fine-tuning of large language models. It proposes an adapter-based fine-tuning method built on a structure-learnable mechanism.…

Computation and Language · Computer Science 2025-09-04 Ming Gong, Yingnan Deng, Nia Qi, Yujun Zou, Zhihao Xue, Yun Zi

This paper proposes a novel method for Text Style Transfer (TST) based on parameter-efficient fine-tuning of Large Language Models (LLMs). Addressing the scarcity of parallel corpora that map between styles, the study employs roundtrip…

Computation and Language · Computer Science 2026-02-17 Ruoxi Liu, Philipp Koehn