Related papers: Optimization Planning for 3D ConvNets

Learning to Generate Parameters of ConvNets for Unseen Image Data

Typical Convolutional Neural Networks (ConvNets) depend heavily on large amounts of image data and resort to an iterative optimization algorithm (e.g., SGD or Adam) to learn network parameters, which makes training very time- and…

Computer Vision and Pattern Recognition · Computer Science 2024-08-12 Shiye Wang , Kaituo Feng , Changsheng Li , Ye Yuan , Guoren Wang

Towards Effective Low-bitwidth Convolutional Neural Networks

This paper tackles the problem of training a deep convolutional neural network with both low-precision weights and low-bitwidth activations. Optimizing a low-precision network is very challenging since the training process can easily get…

Computer Vision and Pattern Recognition · Computer Science 2021-06-05 Bohan Zhuang , Chunhua Shen , Mingkui Tan , Lingqiao Liu , Ian Reid

3D Topology Optimization using Convolutional Neural Networks

Topology optimization is computationally demanding that requires the assembly and solution to a finite element problem for each material distribution hypothesis. As a complementary alternative to the traditional physics-based topology…

Machine Learning · Computer Science 2018-08-23 Saurabh Banga , Harsh Gehani , Sanket Bhilare , Sagar Patel , Levent Kara

Learning to Estimate 3D Human Pose and Shape from a Single Color Image

This work addresses the problem of estimating the full body 3D human pose and shape from a single color image. This is a task where iterative optimization-based solutions have typically prevailed, while Convolutional Networks (ConvNets)…

Computer Vision and Pattern Recognition · Computer Science 2018-05-11 Georgios Pavlakos , Luyang Zhu , Xiaowei Zhou , Kostas Daniilidis

Real-Time Topology Optimization in 3D via Deep Transfer Learning

The published literature on topology optimization has exploded over the last two decades to include methods that use shape and topological derivatives or evolutionary algorithms formulated on various geometric representations and…

Machine Learning · Computer Science 2021-02-16 MohammadMahdi Behzadi , Horea T. Ilies

A Multigrid Method for Efficiently Training Video Models

Training competitive deep video models is an order of magnitude slower than training their counterpart image models. Slow training causes long research cycles, which hinders progress in video understanding research. Following standard…

Computer Vision and Pattern Recognition · Computer Science 2020-06-11 Chao-Yuan Wu , Ross Girshick , Kaiming He , Christoph Feichtenhofer , Philipp Krähenbühl

An Efficient 3D Convolutional Neural Network with Channel-wise, Spatial-grouped, and Temporal Convolutions

There has been huge progress on video action recognition in recent years. However, many works focus on tweaking existing 2D backbones due to the reliance of ImageNet pretraining, which restrains the models from achieving higher efficiency…

Computer Vision and Pattern Recognition · Computer Science 2025-03-05 Zhe Wang , Xulei Yang

Learning Algorithm Hyperparameters for Fast Parametric Convex Optimization

We introduce a machine-learning framework to learn the hyperparameter sequence of first-order methods (e.g., the step sizes in gradient descent) to quickly solve parametric convex optimization problems. Our computational architecture…

Optimization and Control · Mathematics 2024-12-23 Rajiv Sambharya , Bartolomeo Stellato

Towards Better Exploiting Convolutional Neural Networks for Remote Sensing Scene Classification

We present an analysis of three possible strategies for exploiting the power of existing convolutional neural networks (ConvNets) in different scenarios from the ones they were trained: full training, fine tuning, and using ConvNets as…

Computer Vision and Pattern Recognition · Computer Science 2016-11-08 Keiller Nogueira , Otávio A. B. Penatti , Jefersson A. dos Santos

Design Light-weight 3D Convolutional Networks for Video Recognition Temporal Residual, Fully Separable Block, and Fast Algorithm

Deep 3-dimensional (3D) Convolutional Network (ConvNet) has shown promising performance on video recognition tasks because of its powerful spatio-temporal information fusion ability. However, the extremely intensive requirements on memory…

Computer Vision and Pattern Recognition · Computer Science 2019-06-03 Haonan Wang , Jun Lin , Zhongfeng Wang

Learned Vertex Descent: A New Direction for 3D Human Model Fitting

We propose a novel optimization-based paradigm for 3D human model fitting on images and scans. In contrast to existing approaches that directly regress the parameters of a low-dimensional statistical body model (e.g. SMPL) from input…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Enric Corona , Gerard Pons-Moll , Guillem Alenyà , Francesc Moreno-Noguer

Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose

This paper addresses the challenge of 3D human pose estimation from a single color image. Despite the general success of the end-to-end learning paradigm, top performing approaches employ a two-step solution consisting of a Convolutional…

Computer Vision and Pattern Recognition · Computer Science 2017-07-27 Georgios Pavlakos , Xiaowei Zhou , Konstantinos G. Derpanis , Kostas Daniilidis

Multi-Scale Networks for 3D Human Pose Estimation with Inference Stage Optimization

Estimating 3D human poses from a monocular video is still a challenging task. Many existing methods' performance drops when the target person is occluded by other objects, or the motion is too fast/slow relative to the scale and speed of…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Cheng Yu , Bo Wang , Bo Yang , Robby T. Tan

Learning Spatiotemporal Features with 3D Convolutional Networks

We propose a simple, yet effective approach for spatiotemporal feature learning using deep 3-dimensional convolutional networks (3D ConvNets) trained on a large scale supervised video dataset. Our findings are three-fold: 1) 3D ConvNets are…

Computer Vision and Pattern Recognition · Computer Science 2015-10-08 Du Tran , Lubomir Bourdev , Rob Fergus , Lorenzo Torresani , Manohar Paluri

Holistic Planimetric prediction to Local Volumetric prediction for 3D Human Pose Estimation

We propose a novel approach to 3D human pose estimation from a single depth map. Recently, convolutional neural network (CNN) has become a powerful paradigm in computer vision. Many of computer vision tasks have benefited from CNNs,…

Computer Vision and Pattern Recognition · Computer Science 2017-07-11 Gyeongsik Moon , Ju Yong Chang , Yumin Suh , Kyoung Mu Lee

Regularization and Optimization strategies in Deep Convolutional Neural Network

Convolution Neural Networks, known as ConvNets exceptionally perform well in many complex machine learning tasks. The architecture of ConvNets demands the huge and rich amount of data and involves with a vast number of parameters that leads…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Pushparaja Murugan , Shanmugasundaram Durairaj

Harvesting Multiple Views for Marker-less 3D Human Pose Annotations

Recent advances with Convolutional Networks (ConvNets) have shifted the bottleneck for many computer vision tasks to annotated data collection. In this paper, we present a geometry-driven approach to automatically collect annotations for…

Computer Vision and Pattern Recognition · Computer Science 2017-04-18 Georgios Pavlakos , Xiaowei Zhou , Konstantinos G. Derpanis , Kostas Daniilidis

Learning with Batch-wise Optimal Transport Loss for 3D Shape Recognition

Deep metric learning is essential for visual recognition. The widely used pair-wise (or triplet) based loss objectives cannot make full use of semantical information in training samples or give enough attention to those hard samples during…

Computer Vision and Pattern Recognition · Computer Science 2019-03-22 Lin Xu , Han Sun , Yuai Liu

MoViNets: Mobile Video Networks for Efficient Video Recognition

We present Mobile Video Networks (MoViNets), a family of computation and memory efficient video networks that can operate on streaming video for online inference. 3D convolutional neural networks (CNNs) are accurate at video recognition but…

Computer Vision and Pattern Recognition · Computer Science 2021-04-20 Dan Kondratyuk , Liangzhe Yuan , Yandong Li , Li Zhang , Mingxing Tan , Matthew Brown , Boqing Gong

Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification

The work in this paper is driven by the question how to exploit the temporal cues available in videos for their accurate classification, and for human action recognition in particular? Thus far, the vision community has focused on…

Computer Vision and Pattern Recognition · Computer Science 2017-11-23 Ali Diba , Mohsen Fayyaz , Vivek Sharma , Amir Hossein Karami , Mohammad Mahdi Arzani , Rahman Yousefzadeh , Luc Van Gool