Image and Video Processing

Enhanced Neural Video Representation Compression across Extreme Complexity and Quality Scales

Implicit neural representations (INRs) have recently emerged as a promising approach to video compression, delivering competitive rate-distortion performance alongside rapid decoding. However, existing neural video codecs struggle to…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-26 Ho Man Kwan, Tianhao Peng, Fan Zhang, Mike Nilsson, Andrew Gower, David Bull

MLVC: Multi-platform Learned Video Codec for Real-World Deployment

Neural video codecs have surpassed classical codecs in coding efficiency but remain impractical for deployment due to cross-platform incompatibility and high computational cost. Existing quantization-based solutions fail to produce…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-26 Tanel Pärnamaa, Martin Lumiste, Ardi Loot, Evgenii Indenbom, Andrei Znobishchev, Ando Saabas

DFM: Difference Feature Modeling with Text-Guided Gated Contrastive Loss for Remote Sensing Image Change Captioning

The primary goal of Remote Sensing Image Change Captioning (RSICC) is to automatically generate descriptions of changes between remote sensing images captured at different time points. Existing models still rely on a single autoregressive…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-25 Yelin Wang, Zijia Song, Chuanguang Yang, Miaoyu Wang, Zhulin An, Libo Huang, Yongjun Xu

Automated brain tumor detection in MRI images using CNN and ResNet architectures

Deep learning has shown significant potential in medical image analysis, particularly for disease detection using MRI scans. Accurate and early diagnosis of brain tumors remains challenging due to the complexity of brain structures and…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-25 Annapurna V K, Asha N, K Paramesha, Shabana Sultana, Kirankumar Humse

Enabling self-supervised learned primal dual with Noise2Inverse

X-ray computed tomography reconstruction is an ill-posed inverse problem, particularly in low-dose and sparse-angle settings where measurements are noisy and incomplete. While learned reconstruction methods such as the Learned Primal-Dual…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-25 Antti Sällinen, Siiri Rautio, Santeri Kaupinmäki, Andreas Hauptmann

Dual-Prior Guided Null-Space Learning with Mixture-of-Splines for Arbitrary Medical Slice Super-Resolution

Arbitrary slice super-resolution reconstructs isotropic volumes from anisotropic clinical acquisitions by synthesizing intermediate slices at arbitrary scales. However, treating this ill-posed inverse problem as unconstrained residual-based…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-25 Haofei Song, Siyuan Xu, Xintian Mao, Shaojie Guo, Qingli Li, Yan Wang

MLFFM-SegDiff: A Multi-Level Feature Fusion Diffusion Model for Skin Lesion Segmentation

Skin lesion segmentation is a key task in computer-aided dermatological diagnosis, where accuracy directly impacts downstream analysis and disease classification. However, dermoscopic images are challenging due to blurred boundaries, low…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-25 Jingjun Gu, Chaojie Shen, Yifeng Cao, Wei Zhang, Yiliu Li, Aobo Fan

Revealing Mammographic Phenotypes in Deep Learning Breast Cancer Risk Models

Mammogram-based deep learning models have improved breast cancer risk prediction, but the learned imaging patterns remain underexplored. Existing interpretability methods rely on single-image saliency maps, failing to identify recurring…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-24 Ruiyu Jia, Yanqi Xu, Yuxuan Chen, Yiqiu Shen, Laura Heacock

An Evaluation of ABR Switching for Time-Shifted Clients in MoQ

Media over QUIC enables ultra low latency video streaming over QUIC, but its default quality-switching semantics risk introducing playback gaps during periods of network congestion. The in-progress SWITCH specification for MOQ Transport…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-24 Abanisenioluwa Orojo, Tanvir Redoy, Samira Afzal, Andrew C. Freeman

Rendering Novel Views of MRI Using 3D Gaussian Splatting

The objective of this paper is to improve radiological gradings measured on MRIs of spines, by resampling scans so that the new view planes are better aligned with the target anatomy than the original sparse images. To this end, we adapt 3D…

Image and Video Processing · Electrical Engineering and Systems Science 2026-06-24 Robin Y. Park, Mark C. Eid, Rhydian Windsor, Amir Jamaludin, Ana I. L. Namburete, João F. Henriques, Andrew Zisserman

Absorption and Phase-Contrast Microtomography Using Direct X-ray Detection With COTS CMOS Sensors

This work presents a high-resolution X-ray microtomography system that uses commercial off-the-shelf (COTS) CMOS image sensors as direct detectors, relying on the sensor's intrinsic resolution to achieve tomographic reconstructions without…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Damian L. Corzi, Jose Lipovetzky, Fabricio Alcalde Bessia, German Mato, Andres Cicuttin, Maria L. Crespo, Martin Perez, Mariano Gomez Berisso

A unified deeplearning framework for contrast-phase-specific virtual monochromatic imaging

Dual-energy CT (DECT) enables virtual monochromatic imaging (VMI) and improved contrast resolution, but its clinical adoption is limited by hardware complexity and cost. In this work, we propose a unified deep learning framework that…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Antony Jerald, Hemant K Aggarwal, Brian Nett, Avinash Gopal, Phaneendra K Yalavarthy, Bipul Das, Rajesh Langoju

Constructing efficient channels for ideal observers using the conjugate gradient method

Task-based assessment of image quality (IQ) is critically important for the design and optimization of medical imaging systems. Ideal observers, including the Bayesian Ideal Observer (IO) and the ideal linear observer, i.e., the Hotelling…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Weimin Zhou

BCER Agent: Reliable Long-Horizon MRI Workflow Execution via Compilation, Artifact Binding, and Bounded Local Recovery

Many recent medical VLM and agent studies are benchmarked on 2D images or comparatively short tool-calling exchanges, whereas real MRI analysis typically demands long, interdependent pipelines that operate on 3D/4D volumetric data. Under…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Ziyang Long, Xinqi Li, Junzhou Chen, Yifan Gao, Debiao Li, Hsin-Jung Yang

Accelerating HEVC Intra Partitioning via a CNN-Hierarchical Attention Transformer Hybrid

The recursive quad-tree partitioning in High Efficiency Video Coding (HEVC) incurs considerable computational overhead, with exhaustive rate-distortion optimization for CTU partition prediction consuming the dominant share of encoding time.…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Krishna Kumar Sharma, Somdyuti Paul

FRAPPE: Full Input, Residual Output Autoencoding with Projection Pursuit Encoder

Media compression standards have reached a plateau in terms of the rate-distortion-complexity trade-off, limiting the ability to offload expensive AI perception to the cloud in applications like robotics, wearables, and remote sensing.…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Dan Jacobellis, Neeraja J. Yadwadkar

Prospective evaluation of multimodal respiratory failure prediction: Do chest X-rays improve performance beyond EHR signals?

Early prediction of respiratory failure is critical for timely clinical intervention in intensive care units. Existing electronic health record (EHR)-based models can continuously monitor physiologic deterioration, but they may not fully…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Xiaolei Lu, Shamim Nemati

CTseg: A Tool for Brain CT Segmentation, Spatial Normalisation, and Volumetrics

This paper presents and validates CTseg, a freely available software for brain CT segmentation, spatial normalisation, and volumetrics. CTseg builds on the Multi-Brain generative modelling framework, providing a CT-specific pipeline that…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Mikael Brudfors

LUMINA: A Multi-Vendor Mammography Benchmark with Energy Harmonization Protocol

Publicly available full-field digital mammography (FFDM) datasets remain limited in size, clinical annotations, and vendor diversity, hindering the development of robust models. We introduce LUMINA, a curated, multi-vendor FFDM dataset that…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Hongyi Pan, Gorkem Durak, Halil Ertugrul Aktas, Andrea M. Bejar, Baver Tutun, Emre Uysal, Ezgi Bulbul, Mehmet Fatih Dogan, Berrin Erok, Berna Akkus Yildirim, Sukru Mehmet Erturk, Ulas Bagci

Bayesian model selection and misspecification testing in imaging inverse problems only from noisy and partial measurements

Modern imaging techniques heavily rely on Bayesian statistical models to address difficult image reconstruction and restoration tasks. This paper addresses the objective evaluation of such models in settings where ground truth is…

Image and Video Processing · Electrical Engineering and Systems Science 2026-05-29 Tom Sprunck, Marcelo Pereyra, Tobias Liaudat