Related papers: Learning to Approximate: Auto Direction Vector Set…

H2C: Hippocampal Circuit-inspired Continual Learning for Lifelong Trajectory Prediction in Autonomous Driving

Deep learning (DL) has shown state-of-the-art performance in trajectory prediction, which is critical to safe navigation in autonomous driving (AD). However, most DL-based methods suffer from catastrophic forgetting, where adapting to a new…

Artificial Intelligence · Computer Science 2025-08-12 Yunlong Lin , Zirui Li , Guodong Du , Xiaocong Zhao , Cheng Gong , Xinwei Wang , Chao Lu , Jianwei Gong

HV-Net: Hypervolume Approximation based on DeepSets

In this letter, we propose HV-Net, a new method for hypervolume approximation in evolutionary multi-objective optimization. The basic idea of HV-Net is to use DeepSets, a deep neural network with permutation invariant property, to…

Neural and Evolutionary Computing · Computer Science 2022-03-07 Ke Shang , Weiyu Chen , Weiduo Liao , Hisao Ishibuchi

Towards Vector Optimization on Low-Dimensional Vector Symbolic Architecture

Vector Symbolic Architecture (VSA) is emerging in machine learning due to its efficiency, but they are hindered by issues of hyperdimensionality and accuracy. As a promising mitigation, the Low-Dimensional Computing (LDC) method…

Machine Learning · Computer Science 2025-03-18 Shijin Duan , Yejia Liu , Gaowen Liu , Ramana Rao Kompella , Shaolei Ren , Xiaolin Xu

Learning to Instruct for Visual Instruction Tuning

We propose L2T, an advancement of visual instruction tuning (VIT). While VIT equips Multimodal LLMs (MLLMs) with promising multimodal capabilities, the current design choices for VIT often result in overfitting and shortcut learning,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Zhihan Zhou , Feng Hong , Jiaan Luo , Jiangchao Yao , Dongsheng Li , Bo Han , Ya Zhang , Yanfeng Wang

MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

Reinforcement learning from human feedback (RLHF) with reward models has advanced alignment of generative models to human aesthetic and perceptual preferences. However, jointly optimizing multiple rewards often incurs an alignment tax,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Chieh-Yun Chen , Zhonghao Wang , Qi Chen , Zhifan Ye , Min Shi , Yue Zhao , Yinan Zhao , Hui Qu , Wei-An Lin , Yiru Shen , Ajinkya Kale , Irfan Essa , Humphrey Shi

Learning to Drive by Imitating Surrounding Vehicles

Imitation learning is a promising approach for training autonomous vehicles (AV) to navigate complex traffic environments by mimicking expert driver behaviors. While existing imitation learning frameworks focus on leveraging expert…

Robotics · Computer Science 2025-09-25 Yasin Sonmez , Hanna Krasowski , Murat Arcak

End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression

Conventional video compression (VC) methods are based on motion compensated transform coding, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the…

Image and Video Processing · Electrical Eng. & Systems 2021-12-20 M. Akın Yılmaz , A. Murat Tekalp

Learning to Coordinate: Distributed Meta-Trajectory Optimization Via Differentiable ADMM-DDP

Distributed trajectory optimization via ADMM-DDP is a powerful approach for coordinating multi-agent systems, but it requires extensive tuning of tightly coupled hyperparameters that jointly govern local task performance and global…

Machine Learning · Computer Science 2025-09-08 Bingheng Wang , Yichao Gao , Tianchen Sun , Lin Zhao

Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs

Activation steering has emerged as a promising approach for efficiently adapting large language models (LLMs) to downstream behaviors. However, most existing steering methods rely on a single static direction per task or concept, making…

Artificial Intelligence · Computer Science 2026-02-10 Pengrui Han , Xueqiang Xu , Keyang Xuan , Peiyang Song , Siru Ouyang , Runchu Tian , Yuqing Jiang , Cheng Qian , Pengcheng Jiang , Jiashuo Sun , Junxia Cui , Ming Zhong , Ge Liu , Jiawei Han , Jiaxuan You

Multimodal LLMs as Customized Reward Models for Text-to-Image Generation

We introduce LLaVA-Reward, an efficient reward model designed to automatically evaluate text-to-image (T2I) generations across multiple perspectives, leveraging pretrained multimodal large language models (MLLMs). Existing MLLM-based…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Shijie Zhou , Ruiyi Zhang , Huaisheng Zhu , Branislav Kveton , Yufan Zhou , Jiuxiang Gu , Jian Chen , Changyou Chen

Balancing Rewards in Text Summarization: Multi-Objective Reinforcement Learning via HyperVolume Optimization

Text summarization is a crucial task that requires the simultaneous optimization of multiple objectives, including consistency, coherence, relevance, and fluency, which presents considerable challenges. Although large language models (LLMs)…

Computation and Language · Computer Science 2025-10-23 Junjie Song , Yiwen Liu , Dapeng Li , Yin Sun , Shukun Fu , Siqi Chen , Yuji Cao

MeTA-LoRA: Data-Efficient Multi-Task Fine-Tuning for Large Language Models

Low-Rank Adaptation (LoRA) has emerged as one of the most widely used parameter-efficient fine-tuning (PEFT) methods for adapting large language models (LLMs) to downstream tasks. While highly effective in single-task settings, it struggles…

Computation and Language · Computer Science 2025-10-14 Bo Cheng , Xu Wang , Jinda Liu , Yi Chang , Yuan Wu

Steadily Learn to Drive with Virtual Memory

Reinforcement learning has shown great potential in developing high-level autonomous driving. However, for high-dimensional tasks, current RL methods suffer from low data efficiency and oscillation in the training process. This paper…

Machine Learning · Computer Science 2021-02-17 Yuhang Zhang , Yao Mu , Yujie Yang , Yang Guan , Shengbo Eben Li , Qi Sun , Jianyu Chen

From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning

Efficient Visual Instruction Fine-Tuning (EVIT) seeks to adapt Multimodal Large Language Models (MLLMs) to downstream tasks with minimal computational overhead. However, as task diversity and complexity increase, EVIT faces significant…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Pengkun Jiao , Bin Zhu , Jingjing Chen , Chong-Wah Ngo , Yu-Gang Jiang

Text-to-Vector Generation with Neural Path Representation

Vector graphics are widely used in digital art and highly favored by designers due to their scalability and layer-wise properties. However, the process of creating and editing vector graphics requires creativity and design expertise, making…

Computer Vision and Pattern Recognition · Computer Science 2024-05-21 Peiying Zhang , Nanxuan Zhao , Jing Liao

Learning from Hypervectors: A Survey on Hypervector Encoding

Hyperdimensional computing (HDC) is an emerging computing paradigm that imitates the brain's structure to offer a powerful and efficient processing and learning model. In HDC, the data are encoded with long vectors, called hypervectors,…

Machine Learning · Computer Science 2023-08-02 Sercan Aygun , Mehran Shoushtari Moghadam , M. Hassan Najafi , Mohsen Imani

Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by Local Quadratic Approximation

In deep learning tasks, the learning rate determines the update step size in each iteration, which plays a critical role in gradient-based optimization. However, the determination of the appropriate learning rate in practice typically…

Machine Learning · Statistics 2020-04-08 Yingqiu Zhu , Yu Chen , Danyang Huang , Bo Zhang , Hansheng Wang

Learning variational autoencoders via MCMC speed measures

Variational autoencoders (VAEs) are popular likelihood-based generative models which can be efficiently trained by maximizing an Evidence Lower Bound (ELBO). There has been much progress in improving the expressiveness of the variational…

Machine Learning · Statistics 2023-08-29 Marcel Hirt , Vasileios Kreouzis , Petros Dellaportas

The Hypervolume Indicator: Problems and Algorithms

The hypervolume indicator is one of the most used set-quality indicators for the assessment of stochastic multiobjective optimizers, as well as for selection in evolutionary multiobjective optimization algorithms. Its theoretical properties…

Data Structures and Algorithms · Computer Science 2022-04-14 Andreia P. Guerreiro , Carlos M. Fonseca , Luís Paquete

Unifying Language-Action Understanding and Generation for Autonomous Driving

Vision-Language-Action (VLA) models are emerging as a promising paradigm for end-to-end autonomous driving, valued for their potential to leverage world knowledge and reason about complex driving scenes. However, existing methods suffer…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Xinyang Wang , Qian Liu , Wenjie Ding , Zhao Yang , Wei Li , Chang Liu , Bailin Li , Kun Zhan , Xianpeng Lang , Wei Chen