Related papers: Fast Spectrogram Inversion using Multi-head Convol…

ConvTransformer: A Convolutional Transformer Network for Video Frame Synthesis

Deep Convolutional Neural Networks (CNNs) are powerful models that have achieved excellent performance on difficult computer vision tasks. Although CNNs perform well whenever large labeled training samples are available, they work badly on…

Computer Vision and Pattern Recognition · Computer Science 2021-06-03 Zhouyong Liu , Shun Luo , Wubin Li , Jingben Lu , Yufan Wu , Shilei Sun , Chunguo Li , Luxi Yang

Convolutional Learning on Multigraphs

Graph convolutional learning has led to many exciting discoveries in diverse areas. However, in some applications, traditional graphs are insufficient to capture the structure and intricacies of the data. In such scenarios, multigraphs…

Machine Learning · Computer Science 2023-04-26 Landon Butler , Alejandro Parada-Mayorga , Alejandro Ribeiro

A Multi-Head Convolutional Neural Network With Multi-path Attention improves Image Denoising

Recently, convolutional neural networks (CNNs) and attention mechanisms have been widely used in image denoising and achieved satisfactory performance. However, the previous works mostly use a single head to receive the noisy image,…

Computer Vision and Pattern Recognition · Computer Science 2022-11-07 Jiahong Zhang , Meijun Qu , Ye Wang , Lihong Cao

Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition

In this paper, we propose a novel Convolutional Neural Network (CNN) architecture for learning multi-scale feature representations with good tradeoffs between speed and accuracy. This is achieved by using a multi-branch network, which has…

Computer Vision and Pattern Recognition · Computer Science 2019-08-01 Chun-Fu Chen , Quanfu Fan , Neil Mallinar , Tom Sercu , Rogerio Feris

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning

This paper aims to address two issues existing in the current speech enhancement methods: 1) the difficulty of phase estimations; 2) a single objective function cannot consider multiple metrics simultaneously. To solve the first problem, we…

Machine Learning · Statistics 2017-09-12 Szu-Wei Fu , Ting-yao Hu , Yu Tsao , Xugang Lu

Learning with Multigraph Convolutional Filters

In this paper, we introduce a convolutional architecture to perform learning when information is supported on multigraphs. Exploiting algebraic signal processing (ASP), we propose a convolutional signal processing model on multigraphs…

Signal Processing · Electrical Eng. & Systems 2022-10-31 Landon Butler , Alejandro Parada-Mayorga , Alejandro Ribeiro

Fast PET reconstruction using Multi-scale Fully Convolutional Neural Networks

Reconstruction of PET images is an ill-posed inverse problem and often requires iterative algorithms to achieve good image quality for reliable clinical use in practice, at huge computational costs. In this paper, we consider the PET…

Computer Vision and Pattern Recognition · Computer Science 2017-04-25 Jieqing Jiao , Sebastien Ourselin

LPCNet: Improving Neural Speech Synthesis Through Linear Prediction

Neural speech synthesis models have recently demonstrated the ability to synthesize high quality speech for text-to-speech and compression applications. These new models often require powerful GPUs to achieve real-time operation, so being…

Audio and Speech Processing · Electrical Eng. & Systems 2019-02-20 Jean-Marc Valin , Jan Skoglund

Parallel Multi Channel Convolution using General Matrix Multiplication

Convolutional neural networks (CNNs) have emerged as one of the most successful machine learning technologies for image and video processing. The most computationally intensive parts of CNNs are the convolutional layers, which convolve…

Computer Vision and Pattern Recognition · Computer Science 2017-07-04 Aravind Vasudevan , Andrew Anderson , David Gregg

Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks

Deep learning and Convolutional Neural Networks (CNNs) have driven major transformations in diverse research areas. However, their limitations in handling low-frequency information present obstacles in certain tasks like interpreting global…

Computer Vision and Pattern Recognition · Computer Science 2024-03-14 Fuzhi Wu , Jiasong Wu , Youyong Kong , Chunfeng Yang , Guanyu Yang , Huazhong Shu , Guy Carrault , Lotfi Senhadji

Speaker Recognition from Raw Waveform with SincNet

Deep learning is progressively gaining popularity as a viable alternative to i-vectors for speaker recognition. Promising results have been recently obtained with Convolutional Neural Networks (CNNs) when fed by raw speech samples directly.…

Audio and Speech Processing · Electrical Eng. & Systems 2019-08-12 Mirco Ravanelli , Yoshua Bengio

Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis

In this work, we propose a new mathematical vocoder algorithm(modified spectral inversion) that generates a waveform from acoustic features without phase estimation. The main benefit of using our proposed method is that it excludes the…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-17 Hyun Gon Ryu , Jeong-Hoon Kim , Simon See

GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation

Time, cost, and energy efficiency are critical considerations in Deep-Learning (DL), particularly when processing long texts. Transformers, which represent the current state of the art, exhibit quadratic computational complexity relative to…

Computation and Language · Computer Science 2025-07-11 Fardin Rastakhiz

Accelerating the Super-Resolution Convolutional Neural Network

As a successful deep model applied in image super-resolution (SR), the Super-Resolution Convolutional Neural Network (SRCNN) has demonstrated superior performance to the previous hand-crafted models either in speed and restoration quality.…

Computer Vision and Pattern Recognition · Computer Science 2016-08-02 Chao Dong , Chen Change Loy , Xiaoou Tang

Convolutional Neural Networks with Recurrent Neural Filters

We introduce a class of convolutional neural networks (CNNs) that utilize recurrent neural networks (RNNs) as convolution filters. A convolution filter is typically implemented as a linear affine transformation followed by a non-linear…

Computation and Language · Computer Science 2018-08-29 Yi Yang

Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks

The present paper describes singing voice synthesis based on convolutional neural networks (CNNs). Singing voice synthesis systems based on deep neural networks (DNNs) are currently being proposed and are improving the naturalness of…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-23 Kazuhiro Nakamura , Shinji Takaki , Kei Hashimoto , Keiichiro Oura , Yoshihiko Nankaku , Keiichi Tokuda

Toward end-to-end interpretable convolutional neural networks for waveform signals

This paper introduces a novel convolutional neural networks (CNN) framework tailored for end-to-end audio deep learning models, presenting advancements in efficiency and explainability. By benchmarking experiments on three standard speech…

Sound · Computer Science 2024-05-06 Linh Vu , Thu Tran , Wern-Han Lim , Raphael Phan

Convolutional Neural Networks In Convolution

Currently, increasingly deeper neural networks have been applied to improve their accuracy. In contrast, We propose a novel wider Convolutional Neural Networks (CNN) architecture, motivated by the Multi-column Deep Neural Networks and the…

Computer Vision and Pattern Recognition · Computer Science 2018-10-10 Xiaobo Huang

Multi-level Wavelet Convolutional Neural Networks

In computer vision, convolutional networks (CNNs) often adopts pooling to enlarge receptive field which has the advantage of low computational complexity. However, pooling can cause information loss and thus is detrimental to further…

Computer Vision and Pattern Recognition · Computer Science 2019-07-09 Pengju Liu , Hongzhi Zhang , Wei Lian , Wangmeng Zuo

Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition

Long short-term memory (LSTM) recurrent neural networks (RNNs) have been shown to give state-of-the-art performance on many speech recognition tasks, as they are able to provide the learned dynamically changing contextual window of all…

Computation and Language · Computer Science 2016-10-12 Xiangang Li , Xihong Wu