Kin-Man Lam — Scifaro

Emotion Diffusion Classifier with Adaptive Margin Discrepancy Training for Facial Expression Recognition

Facial Expression Recognition (FER) is essential for human-machine interaction, as it enables machines to interpret human emotions and internal states from facial affective behaviors. Although deep learning has significantly advanced FER…

Computer Vision and Pattern Recognition · Computer Science 2026-04-01 Rongkang Dong , Cuixin Yang , Cong Zhang , Yushen Zuo , Kin-Man Lam

Multi-level distortion-aware deformable network for omnidirectional image super-resolution

As augmented reality and virtual reality applications gain popularity, image processing for OmniDirectional Images (ODIs) has attracted increasing attention. OmniDirectional Image Super-Resolution (ODISR) is a promising technique for…

Computer Vision and Pattern Recognition · Computer Science 2025-12-22 Cuixin Yang , Rongkang Dong , Kin-Man Lam , Yuhang Zhang , Guoping Qiu

Vision-Language Model Guided Image Restoration

Many image restoration (IR) tasks require both pixel-level fidelity and high-level semantic understanding to recover realistic photos with fine-grained details. However, previous approaches often struggle to effectively leverage both the…

Computer Vision and Pattern Recognition · Computer Science 2025-12-22 Cuixin Yang , Rongkang Dong , Kin-Man Lam

HFS: Holistic Query-Aware Frame Selection for Efficient Video Reasoning

Key frame selection in video understanding presents significant challenges. Traditional top-K selection methods, which score frames independently, often fail to optimize the selection as a whole. This independent scoring frequently results…

Computer Vision and Pattern Recognition · Computer Science 2025-12-15 Yiqing Yang , Kin-Man Lam

SA$^{2}$Net: Scale-Adaptive Structure-Affinity Transformation for Spine Segmentation from Ultrasound Volume Projection Imaging

Spine segmentation, based on ultrasound volume projection imaging (VPI), plays a vital role in intelligent scoliosis diagnosis in clinical applications. However, this task faces several significant challenges. Firstly, the global…

Computer Vision and Pattern Recognition · Computer Science 2025-10-31 Hao Xie , Zixun Huang , Yushen Zuo , Yakun Ju , Frank H. F. Leung , N. F. Law , Kin-Man Lam , Yong-Ping Zheng , Sai Ho Ling

Enhancing Technical Documents Retrieval for RAG

In this paper, we introduce Technical-Embeddings, a novel framework designed to optimize semantic retrieval in technical documentation, with applications in both hardware and software development. Our approach addresses the challenges of…

Information Retrieval · Computer Science 2025-09-05 Songjiang Lai , Tsun-Hin Cheung , Ka-Chun Fung , Kaiwen Xue , Kwan-Ho Lin , Yan-Ming Choi , Vincent Ng , Kin-Man Lam

Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework

3D Gaussian Splatting (3DGS) has demonstrated remarkable real-time performance in novel view synthesis, yet its effectiveness relies heavily on dense multi-view inputs with precisely known camera poses, which are rarely available in…

Computer Vision and Pattern Recognition · Computer Science 2025-08-22 Zongqi He , Hanmin Li , Kin-Chung Chan , Yushen Zuo , Hao Xie , Zhe Xiao , Jun Xiao , Kin-Man Lam

Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks

Vision-Language Models (VLMs) extend the capabilities of Large Language Models (LLMs) by incorporating visual information, yet they remain vulnerable to jailbreak attacks, especially when processing noisy or corrupted images. Although…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Jiawei Wang , Yushen Zuo , Yuanjun Chai , Zhendong Liu , Yicheng Fu , Yichun Feng , Kin-Man Lam

Deep Learning-Driven Ultra-High-Definition Image Restoration: A Survey

Ultra-high-definition (UHD) image restoration aims to specifically solve the problem of quality degradation in ultra-high-resolution images. Recent advancements in this field are predominantly driven by deep learning-based innovations,…

Computer Vision and Pattern Recognition · Computer Science 2025-05-23 Liyan Wang , Weixiang Zhou , Cong Wang , Kin-Man Lam , Zhixun Su , Jinshan Pan

See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization

3D Gaussian Splatting (3DGS) has shown remarkable performance in novel view synthesis. However, its rendering quality deteriorates with sparse input views, leading to distorted content and reduced details. This limitation hinders its…

Computer Vision and Pattern Recognition · Computer Science 2025-01-22 Zongqi He , Zhe Xiao , Kin-Chung Chan , Yushen Zuo , Jun Xiao , Kin-Man Lam

Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution

As virtual and augmented reality applications gain popularity, omnidirectional image (ODI) super-resolution has become increasingly important. Unlike 2D plain images that are formed on a plane, ODIs are projected onto spherical surfaces.…

Image and Video Processing · Electrical Eng. & Systems 2025-01-17 Cuixin Yang , Rongkang Dong , Jun Xiao , Cong Zhang , Kin-Man Lam , Fei Zhou , Guoping Qiu

HAAT: Hybrid Attention Aggregation Transformer for Image Super-Resolution

In the research area of image super-resolution, Swin-transformer-based models are favored for their global spatial modeling and shifting window attention mechanism. However, existing methods often limit self-attention to non-overlapping…

Image and Video Processing · Electrical Eng. & Systems 2024-12-11 Song-Jiang Lai , Tsun-Hin Cheung , Ka-Chun Fung , Kai-wen Xue , Kin-Man Lam

Residual Attention Single-Head Vision Transformer Network for Rolling Bearing Fault Diagnosis in Noisy Environments

Rolling bearings play a crucial role in industrial machinery, directly influencing equipment performance, durability, and safety. However, harsh operating conditions, such as high speeds and temperatures, often lead to bearing malfunctions,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Songjiang Lai , Tsun-Hin Cheung , Jiayi Zhao , Kaiwen Xue , Ka-Chun Fung , Kin-Man Lam

Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection

Identifying defects and anomalies in industrial products is a critical quality control task. Traditional manual inspection methods are slow, subjective, and error-prone. In this work, we propose a novel zero-shot training-free approach for…

Computer Vision and Pattern Recognition · Computer Science 2024-12-02 Tsun-Hin Cheung , Ka-Chun Fung , Songjiang Lai , Kwan-Ho Lin , Vincent Ng , Kin-Man Lam

An End-to-End Two-Stream Network Based on RGB Flow and Representation Flow for Human Action Recognition

With the rapid advancements in deep learning, computer vision tasks have seen significant improvements, making two-stream neural networks a popular focus for video-based action recognition. Traditional models using RGB and optical flow…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Song-Jiang Lai , Tsun-Hin Cheung , Ka-Chun Fung , Tian-Shan Liu , Kin-Man Lam

Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models

Blind image restoration remains a significant challenge in low-level vision tasks. Recently, denoising diffusion models have shown remarkable performance in image synthesis. Guided diffusion models, leveraging the potent generative priors…

Computer Vision and Pattern Recognition · Computer Science 2024-11-20 Jun Xiao , Zihang Lyu , Hao Xie , Cong Zhang , Yakun Ju , Changjian Shui , Kin-Man Lam

Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning

The stylization of 3D scenes is an increasingly attractive topic in 3D vision. Although image style transfer has been extensively researched with promising results, directly applying 2D style transfer methods to 3D scenes often fails to…

Computer Vision and Pattern Recognition · Computer Science 2024-11-18 Yushen Zuo , Jun Xiao , Kin-Chung Chan , Rongkang Dong , Cuixin Yang , Zongqi He , Hao Xie , Kin-Man Lam

AIM 2024 Sparse Neural Rendering Challenge: Methods and Results

This paper reviews the challenge on Sparse Neural Rendering that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ECCV 2024. This manuscript focuses on the competition set-up, the proposed methods and…

Computer Vision and Pattern Recognition · Computer Science 2024-09-24 Michal Nazarczuk , Sibi Catley-Chandar , Thomas Tanay , Richard Shaw , Eduardo Pérez-Pellitero , Radu Timofte , Xing Yan , Pan Wang , Yali Guo , Yongxin Wu , Youcheng Cai , Yanan Yang , Junting Li , Yanghong Zhou , P. Y. Mok , Zongqi He , Zhe Xiao , Kin-Chung Chan , Hana Lebeta Goshu , Cuixin Yang , Rongkang Dong , Jun Xiao , Kin-Man Lam , Jiayao Hao , Qiong Gao , Yanyan Zu , Junpei Zhang , Licheng Jiao , Xu Liu , Kuldeep Purohit

Deep Learning Methods for Calibrated Photometric Stereo and Beyond

Photometric stereo recovers the surface normals of an object from multiple images with varying shading cues, i.e., modeling the relationship between surface orientation and intensity at each pixel. Photometric stereo prevails in superior…

Computer Vision and Pattern Recognition · Computer Science 2024-02-02 Yakun Ju , Kin-Man Lam , Wuyuan Xie , Huiyu Zhou , Junyu Dong , Boxin Shi

AMSP-UOD: When Vortex Convolution and Stochastic Perturbation Meet Underwater Object Detection

In this paper, we present a novel Amplitude-Modulated Stochastic Perturbation and Vortex Convolutional Network, AMSP-UOD, designed for underwater object detection. AMSP-UOD specifically addresses the impact of non-ideal imaging factors on…

Computer Vision and Pattern Recognition · Computer Science 2024-01-19 Jingchun Zhou , Zongxin He , Kin-Man Lam , Yudong Wang , Weishi Zhang , ChunLe Guo , Chongyi Li