Machine Learning · Computer Science
Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing
Yudong Liu, Jingwei Sun, Yueqian Lin, Jingyang Zhang +5
2025-04-25
Computation and Language · Computer Science
LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models
Yizheng Sun, Yanze Xin, Hao Li, Jingyuan Sun +2
2025-03-11
Computer Vision and Pattern Recognition · Computer Science
VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation
Hanning Chen, Yang Ni, Wenjun Huang, Yezi Liu +5
2024-12-02
Computer Vision and Pattern Recognition · Computer Science
ViTCoP: Accelerating Large Vision-Language Models via Visual and Textual Semantic Collaborative Pruning
Wen Luo, Peng Chen, Xiaotao Huang, LiQun Huang
2026-01-27
Computer Vision and Pattern Recognition · Computer Science
Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization
Kaiyuan Li, Xiaoyue Chen, Chen Gao, Yong Li +1
2025-10-24
Computer Vision and Pattern Recognition · Computer Science
Pyramid Token Pruning for High-Resolution Large Vision-Language Models via Region, Token, and Instruction-Guided Importance
Yuxuan Liang, Xu Li, Xiaolei Chen, Yi Zheng +3
2026-02-17
Computer Vision and Pattern Recognition · Computer Science
StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding
Xinqi Jin, Hanxun Yu, Bohan Yu, Kebin Liu +7
2025-12-16
Computer Vision and Pattern Recognition · Computer Science
AdaTP: Attention-Debiased Token Pruning for Video Large Language Models
Fengyuan Sun, Leqi Shen, Hui Chen, Sicheng Zhao +2
2025-05-27
Computer Vision and Pattern Recognition · Computer Science
ST$^3$: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
Jiedong Zhuang, Lu Lu, Ming Dai, Rui Hu +3
2024-12-31
Computer Vision and Pattern Recognition · Computer Science
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs
Jianrui Zhang, Yue Yang, Rohun Tripathi, Winson Han +4
2026-03-19
Computer Vision and Pattern Recognition · Computer Science
EchoPrune: Interpreting Redundancy as Temporal Echoes for Efficient VideoLLMs
Jiameng Li, Minye Wu, Jiezhang Cao, Aleksei Tiulpin +1
2026-05-12
Computer Vision and Pattern Recognition · Computer Science
V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models
Xinying Lin, Xuyang Liu, Yiyu Wang, Teng Ma +1
2026-03-31
Computer Vision and Pattern Recognition · Computer Science
RedVTP: Training-Free Acceleration of Diffusion Vision-Language Models Inference via Masked Token-Guided Visual Token Pruning
Jingqi Xu, Jingxi Lu, Chenghao Li, Sreetama Sarkar +2
2025-11-18
Computer Vision and Pattern Recognition · Computer Science
Object-Centric Vision Token Pruning for Vision Language Models
Guangyuan Li, Rongzhen Zhao, Jinhong Deng, Yanbo Wang +1
2026-05-28
Computer Vision and Pattern Recognition · Computer Science
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer
Jianjian Cao, Peng Ye, Shengze Li, Chong Yu +3
2024-03-06
Computer Vision and Pattern Recognition · Computer Science
Parallel Vision Token Scheduling for Fast and Accurate Multimodal LMMs Inference
Wengyi Zhan, Mingbao Lin, Zhihang Lin, Rongrong Ji
2025-11-25
Computer Vision and Pattern Recognition · Computer Science
GroundVTS: Visual Token Sampling in Multimodal Large Language Models for Video Temporal Grounding
Rong Fan, Kaiyan Xiao, Minghao Zhu, Liuyi Wang +2
2026-04-03
Computer Vision and Pattern Recognition · Computer Science
Back to Fundamentals: Low-Level Visual Features Guided Progressive Token Pruning
Yuanbing Ouyang, Yizhuo Liang, Qingpeng Li, Xinfei Guo +4
2025-04-28
Computer Vision and Pattern Recognition · Computer Science
ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models
Xubing Ye, Yukang Gan, Yixiao Ge, Xiao-Ping Zhang +1
2024-12-03
Computer Vision and Pattern Recognition · Computer Science
HoliTom: Holistic Token Merging for Fast Video Large Language Models
Kele Shao, Keda Tao, Can Qin, Haoxuan You +2
2025-10-13
Computer Vision and Pattern Recognition · Computer Science
TrimTokenator-LC: Towards Adaptive Visual Token Pruning for Large Multimodal Models with Long Contexts
Hao Zhang, Mengsi Lyu, Bo Huang, Yulong Ao +1
2026-01-01
Computer Vision and Pattern Recognition · Computer Science
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs
Qizhe Zhang, Aosong Cheng, Ming Lu, Renrui Zhang +5
2025-05-13
Computer Vision and Pattern Recognition · Computer Science
Recurrent Attention-based Token Selection for Efficient Streaming Video-LLMs
Vaggelis Dorovatas, Soroush Seifi, Gunshi Gupta, Rahaf Aljundi
2025-10-21
Computer Vision and Pattern Recognition · Computer Science
A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models
Quan-Sheng Zeng, Yunheng Li, Qilong Wang, Peng-Tao Jiang +3
2025-08-05