Related papers: MelGlow: Efficient Waveform Generative Network Bas…

200 papers

In this paper, we propose a novel conditional convolution network, named location-variable convolution, to model the dependencies of the waveform sequence. Different from the use of unified convolution kernels in WaveNet to capture the…

Audio and Speech Processing · Electrical Eng. & Systems · 2021-02-23 · Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao

Depthwise convolution and grouped convolution have been successfully applied to improve the efficiency of convolutional neural networks (CNNs). We suggest that these models can be considered as special cases of a generalized convolution…

Computer Vision and Pattern Recognition · Computer Science · 2018-03-28 · Dong-Qing Zhang

Deep convolutional neural networks are used to address many computer vision problems, including video prediction. The task of video prediction requires analyzing the video frames, temporally and spatially, and constructing a model of how…

Computer Vision and Pattern Recognition · Computer Science · 2018-10-17 · Niloofar Azizi, Hafez Farazi, Sven Behnke

Spatio-temporal forecasting is challenging owing to the high nonlinearity in temporal dynamics as well as complex location-characterized patterns in spatial domains, especially in fields like weather forecasting. Graph convolutions…

Machine Learning · Computer Science · 2021-12-14 · Haitao Lin, Zhangyang Gao, Yongjie Xu, Lirong Wu, Ling Li, Stan. Z. Li

Convolutional neural networks (CNNs) with dilated filters such as the Wavenet or the Temporal Convolutional Network (TCN) have shown good results in a variety of sequence modelling tasks. However, efficiently modelling long-term…

Machine Learning · Computer Science · 2019-11-18 · Daniel Stoller, Mi Tian, Sebastian Ewert, Simon Dixon

Flow-based generative models, conceptually attractive due to tractability of both the exact log-likelihood computation and latent-variable inference, and efficiency of both training and sampling, have led to a number of impressive empirical…

Machine Learning · Computer Science · 2019-10-29 · Xuezhe Ma, Xiang Kong, Shanghang Zhang, Eduard Hovy

In this paper we propose WaveGlow: a flow-based network capable of generating high quality speech from mel-spectrograms. WaveGlow combines insights from Glow and WaveNet in order to provide fast, efficient and high-quality audio synthesis,…

Sound · Computer Science · 2018-11-02 · Ryan Prenger, Rafael Valle, Bryan Catanzaro

In this paper, we are committed to establishing a unified and end-to-end multi-modal network by exploring language-guided visual recognition. To approach this target, we first propose a novel multi-modal convolution module called…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-15 · Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Yongjian Wu, Yue Gao, Rongrong Ji

In recent years, there have been attempts to increase the kernel size of Convolutional Neural Nets (CNNs) to mimic the global receptive field of Vision Transformers' (ViTs) self-attention blocks. That approach, however, quickly hit an upper…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-16 · Shahaf E. Finder, Roy Amoyal, Eran Treister, Oren Freifeld

In this study, we employ a convolutional neural network to classify gravitational waves originating from core-collapse supernovae. Training is conducted using spectrograms derived from three-dimensional numerical simulations of waveforms,…

Instrumentation and Methods for Astrophysics · Physics · 2023-12-21 · Seiya Sasaoka, Naoki Koyama, Diego Dominguez, Yusuke Sakai, Kentaro Somiya, Yuto Omae, Hirotaka Takahashi

Deep neural networks are increasingly used on mobile devices, where computational resources are limited. In this paper we develop CondenseNet, a novel network architecture with unprecedented efficiency. It combines dense connectivity with a…

Computer Vision and Pattern Recognition · Computer Science · 2018-06-08 · Gao Huang, Shichen Liu, Laurens van der Maaten, Kilian Q. Weinberger

Graph neural networks (GNNs) have significantly improved the representation power for graph-structured data. Despite the recent success of GNNs, the graph convolution in most GNNs has two limitations. Since the graph convolution is…

Machine Learning · Computer Science · 2021-12-30 · Jinyoung Park, Sungdong Yoo, Jihwan Park, Hyunwoo J. Kim

We present a logarithmic-scale efficient convolutional neural network architecture for edge devices, named WaveletNet. Our model is based on the well-known depthwise convolution, and on two new layers, which we introduce in this work: a…

Machine Learning · Computer Science · 2018-11-29 · Li Jing, Rumen Dangovski, Marin Soljacic

Convolutional neural networks (CNNs) have greatly improved speech recognition performance by exploiting localized time-frequency patterns. But these patterns are assumed to appear in symmetric and rigid kernels by the conventional CNN…

Audio and Speech Processing · Electrical Eng. & Systems · 2025-06-19 · Jiamin Xie, John H. L. Hansen

Deep learning with 3D data has progressed significantly since the introduction of convolutional neural networks that can handle point order ambiguity in point cloud data. While being able to achieve good accuracies in various scene…

Computer Vision and Pattern Recognition · Computer Science · 2019-08-20 · Zhiyuan Zhang, Binh-Son Hua, Sai-Kit Yeung

Long-range contextual information is essential for achieving high-performance semantic segmentation. Previous feature re-weighting methods demonstrate that using global context for re-weighting feature channels can effectively improve the…

Computer Vision and Pattern Recognition · Computer Science · 2020-08-27 · Jianbo Liu, Junjun He, Jimmy S. Ren, Yu Qiao, Hongsheng Li

Modern wake word detection systems usually rely on neural networks for acoustic modeling. Transformers have recently shown superior performance over LSTM and convolutional networks in various sequence modeling tasks with their better…

Computation and Language · Computer Science · 2021-02-10 · Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur

Time Delay Neural Network (TDNN)-based methods are widely used in dialect identification. However, previous applications of TDNNs have neglected subtle variations across different feature scales. To address this issue, we propose a…

Computation and Language · Computer Science · 2021-08-18 · Tianlong Kong, Shouyi Yin, Dawei Zhang, Wang Geng, Xin Wang, Dandan Song, Jinwen Huang, Huiyu Shi, Xiaorui Wang

In this paper, we propose a novel network design mechanism for efficient embedded computing. Inspired by the limited computing patterns, we propose to fix the number of channels in a group convolution, instead of the existing practice that…

Computer Vision and Pattern Recognition · Computer Science · 2020-05-01 · Qian Zhang, Jianjun Li, Meng Yao, Liangchen Song, Helong Zhou, Zhichao Li, Wenming Meng, Xuezhi Zhang, Guoli Wang

It has been widely proven that modelling long-range dependencies in fully convolutional networks (FCNs) via global aggregation modules is critical for complex scene understanding tasks such as semantic segmentation and object detection.…

Computer Vision and Pattern Recognition · Computer Science · 2019-09-17 · Xiangtai Li, Li Zhang, Ansheng You, Maoke Yang, Kuiyuan Yang, Yunhai Tong