Related papers: Virtual Width Networks

Wide Residual Networks

Deep residual networks were shown to be able to scale up to thousands of layers and still have improving performance. However, each fraction of a percent of improved accuracy costs nearly doubling the number of layers, and so training very…

Computer Vision and Pattern Recognition · Computer Science 2017-06-15 Sergey Zagoruyko, Nikos Komodakis

Software Defined Networking Enabled Wireless Network Virtualization: Challenges and Solutions

Next generation (5G) wireless networks are expected to support the massive data and accommodate a wide range of services/use cases with distinct requirements in a cost-effective, flexible, and agile manner. As a promising solution, wireless…

Networking and Internet Architecture · Computer Science 2017-04-06 Ning Zhang, Peng Yang, Shan Zhang, Dajiang Chen, Weihua Zhuang, Ben Liang, Xuemin Shen

Differentiable Weightless Neural Networks

We introduce the Differentiable Weightless Neural Network (DWN), a model based on interconnected lookup tables. Training of DWNs is enabled by a novel Extended Finite Difference technique for approximate differentiation of binary values. We…

Machine Learning · Computer Science 2025-03-04 Alan T. L. Bacellar, Zachary Susskind, Mauricio Breternitz, Eugene John, Lizy K. John, Priscila M. V. Lima, Felipe M. G. França

Volumetric Transformer Networks

Existing techniques to encode spatial invariance within deep convolutional neural networks (CNNs) apply the same warping field to all the feature channels. This does not account for the fact that the individual feature channels can…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Seungryong Kim, Sabine Süsstrunk, Mathieu Salzmann

Efficient Large-Scale Visual Representation Learning And Evaluation

Efficiently learning visual representations of items is vital for large-scale recommendations. In this article we compare several pretrained efficient backbone architectures, both in the convolutional neural network (CNN) and in the vision…

Computer Vision and Pattern Recognition · Computer Science 2023-08-03 Eden Dolev, Alaa Awad, Denisa Roberts, Zahra Ebrahimzadeh, Marcin Mejran, Vaibhav Malpani, Mahir Yavuz

Benchmarking Wireless Representations: High-Dimensional vs. Compressed Embeddings for Efficiency and Robustness

Building on recent advances in representation learning for wireless channels, this work investigates the cost-benefit trade-offs of high-dimensional channel embeddings in practical systems. We benchmark multiple wireless representations:…

Signal Processing · Electrical Eng. & Systems 2026-05-05 Murilo Batista, Shirin Salehi, Saeed Mashdour, Paul Zheng, Rodrigo C. de Lamare, Anke Schmeink

VNN: Verification-Friendly Neural Networks with Hard Robustness Guarantees

Machine learning techniques often lack formal correctness guarantees, evidenced by the widespread adversarial examples that plague most deep-learning applications. This lack of formal guarantees resulted in several research efforts that aim…

Machine Learning · Computer Science 2024-06-11 Anahita Baninajjar, Ahmed Rezine, Amir Aminifar

Value Prediction Network

This paper proposes a novel deep reinforcement learning (RL) architecture, called Value Prediction Network (VPN), which integrates model-free and model-based RL methods into a single neural network. In contrast to typical model-based RL…

Artificial Intelligence · Computer Science 2017-11-08 Junhyuk Oh, Satinder Singh, Honglak Lee

The Backpropagation of the Wave Network

This paper provides an in-depth analysis of Wave Network, a novel token representation method derived from the Wave Network, designed to capture both global and local semantics of input text through wave-inspired complex vectors. In complex…

Computation and Language · Computer Science 2025-01-14 Xin Zhang, Victor S. Sheng

Width Transfer: On the (In)variance of Width Optimization

Optimizing the channel counts for different layers of a CNN has shown great promise in improving the efficiency of CNNs at test-time. However, these methods often introduce large computational overhead (e.g., an additional 2x FLOPs of…

Computer Vision and Pattern Recognition · Computer Science 2021-04-28 Ting-Wu Chin, Diana Marculescu, Ari S. Morcos

Weightless Neural Networks for Efficient Edge Inference

Weightless Neural Networks (WNNs) are a class of machine learning model which use table lookups to perform inference. This is in contrast with Deep Neural Networks (DNNs), which use multiply-accumulate operations. State-of-the-art WNN…

Hardware Architecture · Computer Science 2022-03-04 Zachary Susskind, Aman Arora, Igor Dantas Dos Santos Miranda, Luis Armando Quintanilla Villon, Rafael Fontella Katopodis, Leandro Santiago de Araujo, Diego Leonel Cadette Dutra, Priscila Machado Vieira Lima, Felipe Maia Galvao Franca, Mauricio Breternitz, Lizy K. John

Towards Better Accuracy-efficiency Trade-offs: Divide and Co-training

The width of a neural network matters since increasing the width will necessarily increase the model capacity. However, the performance of a network does not improve linearly with the width and soon gets saturated. In this case, we argue…

Computer Vision and Pattern Recognition · Computer Science 2022-09-07 Shuai Zhao, Liguang Zhou, Wenxiao Wang, Deng Cai, Tin Lun Lam, Yangsheng Xu

Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos

Vision-and-Language Navigation (VLN) has long been constrained by the limited diversity and scalability of simulator-curated datasets, which fail to capture the complexity of real-world environments. To overcome this limitation, we…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Mingfei Han, Haihong Hao, Liang Ma, Kamila Zhumakhanova, Ekaterina Radionova, Jingyi Zhang, Xiaojun Chang, Xiaodan Liang, Ivan Laptev

Any-Width Networks

Despite remarkable improvements in speed and accuracy, convolutional neural networks (CNNs) still typically operate as monolithic entities at inference time. This poses a challenge for resource-constrained practical applications, where both…

Computer Vision and Pattern Recognition · Computer Science 2020-12-08 Thanh Vu, Marc Eder, True Price, Jan-Michael Frahm

Understanding Virtual Nodes: Oversquashing and Node Heterogeneity

While message passing neural networks (MPNNs) have convincing success in a range of applications, they exhibit limitations such as the oversquashing problem and their inability to capture long-range interactions. Augmenting MPNNs with a…

Machine Learning · Computer Science 2025-04-08 Joshua Southern, Francesco Di Giovanni, Michael Bronstein, Johannes F. Lutzeyer

PPT: Token Pruning and Pooling for Efficient Vision Transformers

Vision Transformers (ViTs) have emerged as powerful models in the field of computer vision, delivering superior performance across various vision tasks. However, the high computational complexity poses a significant barrier to their…

Computer Vision and Pattern Recognition · Computer Science 2024-02-06 Xinjian Wu, Fanhu Zeng, Xiudong Wang, Xinghao Chen

MedNeXt-v2: Scaling 3D ConvNeXts for Large-Scale Supervised Representation Learning in Medical Image Segmentation

Large-scale supervised pretraining is rapidly reshaping 3D medical image segmentation. However, existing efforts focus primarily on increasing dataset size and overlook the question of whether the backbone network is an effective…

Image and Video Processing · Electrical Eng. & Systems 2025-12-22 Saikat Roy, Yannick Kirchhoff, Constantin Ulrich, Maximillian Rokuss, Tassilo Wald, Fabian Isensee, Klaus Maier-Hein

BLoad: Enhancing Neural Network Training with Efficient Sequential Data Handling

The increasing complexity of modern deep neural network models and the expanding sizes of datasets necessitate the development of optimized and scalable training methods. In this white paper, we addressed the challenge of efficiently…

Machine Learning · Computer Science 2024-04-29 Raphael Ruschel, A. S. M. Iftekhar, B. S. Manjunath, Suya You

Representation Learning via Variational Bayesian Networks

We present Variational Bayesian Network (VBN) - a novel Bayesian entity representation learning model that utilizes hierarchical and relational side information and is particularly useful for modeling entities in the "long-tail", where…

Machine Learning · Computer Science 2023-06-29 Oren Barkan, Avi Caciularu, Idan Rejwan, Ori Katz, Jonathan Weill, Itzik Malkiel, Noam Koenigstein

Variable Length Embeddings

In this work, we introduce a novel deep learning architecture, Variable Length Embeddings (VLEs), an autoregressive model that can produce a latent representation composed of an arbitrary number of tokens. As a proof of concept, we…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Johnathan Chiu, Andi Gu, Matt Zhou