English
Related papers

Related papers: fairseq: A Fast, Extensible Toolkit for Sequence M…

200 papers

We introduce fairseq S2T, a fairseq extension for speech-to-text (S2T) modeling tasks such as end-to-end speech recognition and speech-to-text translation. It follows fairseq's careful design for scalability and extensibility. We provide…

Computation and Language · Computer Science 2022-06-15 Changhan Wang , Yun Tang , Xutai Ma , Anne Wu , Sravya Popuri , Dmytro Okhonko , Juan Pino

Modern deep learning systems like PyTorch and Tensorflow are able to train enormous models with billions (or trillions) of parameters on a distributed infrastructure. These systems require that the internal nodes have the same memory…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-01 Yifan Ding , Nicholas Botzer , Tim Weninger

This paper presents fairseq S^2, a fairseq extension for speech synthesis. We implement a number of autoregressive (AR) and non-AR text-to-speech models, and their multi-speaker variants. To enable training speech synthesis models with less…

Audio and Speech Processing · Electrical Eng. & Systems 2021-09-16 Changhan Wang , Wei-Ning Hsu , Yossi Adi , Adam Polyak , Ann Lee , Peng-Jen Chen , Jiatao Gu , Juan Pino

Transformer-based models have made tremendous impacts in natural language generation. However the inference speed is a bottleneck due to large model size and intensive computing involved in auto-regressive decoding process. We develop…

Computation and Language · Computer Science 2021-07-14 Yu Yan , Fei Hu , Jiusheng Chen , Nikhil Bhendawade , Ting Ye , Yeyun Gong , Nan Duan , Desheng Cui , Bingyu Chi , Ruofei Zhang

We present NaturalCC, an efficient and extensible toolkit to bridge the gap between natural language and programming language, and facilitate the research on big code analysis. Using NaturalCC, researchers both from natural language or…

Software Engineering · Computer Science 2020-12-08 Yao Wan , Yang He , Jian-Guo Zhang , Yulei Sui , Hai Jin , Guandong Xu , Caiming Xiong , Philip S. Yu

Past work in natural language processing interpretability focused mainly on popular classification tasks while largely overlooking generation settings, partly due to a lack of dedicated tools. In this work, we introduce Inseq, a Python…

Computation and Language · Computer Science 2023-09-08 Gabriele Sarti , Nils Feldhus , Ludwig Sickert , Oskar van der Wal , Malvina Nissim , Arianna Bisazza

Transformer, BERT and their variants have achieved great success in natural language processing. Since Transformer models are huge in size, serving these models is a challenge for real industrial applications. In this paper, we propose…

Mathematical Software · Computer Science 2021-04-23 Xiaohui Wang , Ying Xiong , Yang Wei , Mingxuan Wang , Lei Li

We present OpenSeq2Seq - a TensorFlow-based toolkit for training sequence-to-sequence models that features distributed and mixed-precision training. Benchmarks on machine translation and speech recognition tasks show that models built using…

Computation and Language · Computer Science 2018-11-22 Oleksii Kuchaiev , Boris Ginsburg , Igor Gitman , Vitaly Lavrukhin , Jason Li , Huyen Nguyen , Carl Case , Paulius Micikevicius

Transformer-based neural models are used in many AI applications. Training these models is expensive, as it takes huge GPU resources and long duration. It is challenging because typical data like sentences have variable lengths, and…

Computation and Language · Computer Science 2022-06-17 Xiaohui Wang , Yang Wei , Ying Xiong , Guyue Huang , Xian Qian , Yufei Ding , Mingxuan Wang , Lei Li

We present FairX, an open-source Python-based benchmarking tool designed for the comprehensive analysis of models under the umbrella of fairness, utility, and eXplainability (XAI). FairX enables users to train benchmarking bias-mitigation…

Machine Learning · Computer Science 2024-09-04 Md Fahim Sikder , Resmi Ramachandranpillai , Daniel de Leng , Fredrik Heintz

Recently, diffusion models have emerged as a new paradigm for generative models. Despite the success in domains using continuous signals such as vision and audio, adapting diffusion models to natural language is under-explored due to the…

Computation and Language · Computer Science 2023-02-15 Shansan Gong , Mukai Li , Jiangtao Feng , Zhiyong Wu , Lingpeng Kong

This paper describes NCRF++, a toolkit for neural sequence labeling. NCRF++ is designed for quick implementation of different neural sequence labeling models with a CRF inference layer. It provides users with an inference for building the…

Computation and Language · Computer Science 2018-06-19 Jie Yang , Yue Zhang

Automatic evaluation of text generation tasks (e.g. machine translation, text summarization, image captioning and video description) usually relies heavily on task-specific metrics, such as BLEU and ROUGE. They, however, are abstract…

Computation and Language · Computer Science 2019-12-06 Changhan Wang , Anirudh Jain , Danlu Chen , Jiatao Gu

We introduce pyGSL, a Python library that provides efficient implementations of state-of-the-art graph structure learning models along with diverse datasets to evaluate them on. The implementations are written in GPU-friendly ways, allowing…

Machine Learning · Computer Science 2022-11-08 Max Wasserman , Gonzalo Mateos

Empowering models to dynamically accomplish tasks specified through natural language instructions represents a promising path toward more capable and general artificial intelligence. In this work, we introduce InstructSeq, an…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Rongyao Fang , Shilin Yan , Zhaoyang Huang , Jingqiu Zhou , Hao Tian , Jifeng Dai , Hongsheng Li

SparseChem provides fast and accurate machine learning models for biochemical applications. Especially, the package supports very high-dimensional sparse inputs, e.g., millions of features and millions of compounds. It is possible to train…

Machine Learning · Statistics 2022-03-10 Adam Arany , Jaak Simm , Martijn Oldenhof , Yves Moreau

We present AutoNMT, a framework to streamline the research of seq-to-seq models by automating the data pipeline (i.e., file management, data preprocessing, and exploratory analysis), automating experimentation in a toolkit-agnostic manner,…

Computation and Language · Computer Science 2023-02-13 Salvador Carrión , Francisco Casacuberta

We introduce an open-source toolkit, i.e., the deep Self End-to-end Learning Framework (deepSELF), as a toolkit of deep self end-to-end learning framework for multi-modal signals. To the best of our knowledge, it is the first public toolkit…

Machine Learning · Computer Science 2020-05-15 Tomoya Koike , Kun Qian , Björn W. Schuller , Yoshiharu Yamamoto

Sequential structure is a key feature of multiple domains of natural cognition and behavior, such as language, movement and decision-making. Likewise, it is also a central property of tasks to which we would like to apply artificial…

Neurons and Cognition · Quantitative Biology 2026-01-01 Barna Zajzon , Younes Bouhadjar , Maxime Fabre , Felix Schmidt , Noah Ostendorf , Emre Neftci , Abigail Morrison , Renato Duarte

In this paper we present our open-source neural machine translation (NMT) toolkit called "Yet Another Neural Machine Translation Toolkit" abbreviated as YANMTT which is built on top of the Transformers library. Despite the growing…

Computation and Language · Computer Science 2021-08-26 Raj Dabre , Eiichiro Sumita
‹ Prev 1 2 3 10 Next ›