Fakhri Karray — Scifaro

NL-MambaXCT: Self-Supervised Nested-Learning Mamba for Nomex Honeycomb X-ray CT Defect Classification

X-ray computed tomography (XCT) is widely used for non-destructive testing of Nomex honeycomb structures in aerospace manufacturing, but industrial inspection still relies heavily on manual interpretation and supervised models trained on…

Image and Video Processing · Electrical Eng. & Systems · 2026-05-28 · Ghaleb Aldoboni, Lobna Nassar, Fakhri Karray, Reem Alshamsi

Convex Compositional Reasoning Models

Compositional energy-based models can generalize to larger combinatorial reasoning problems by reusing a learned factor energy across many local constraints. In our paper, we show that a key bottleneck in compositional reasoning is not…

Machine Learning · Computer Science · 2026-05-26 · Meir Roketlishvili, Semyon Semenov, Maksim Bobrin, Viktor Kovalchuk, Albert Baichorov, Abduragim Shtanchaev, Fakhri Karray, Dmitry V. Dylov, Martin Takáč, Arip Asadulaev

In-Context Learning Operates as Concept Subspace Learning

Regression and Bayesian accounts of in-context learning (ICL) explain how demonstrations can induce predictors, while mechanistic analyses often identify compact activation directions that steer prompted behavior. However, it remains…

Machine Learning · Computer Science · 2026-05-20 · Wei Tang, Xinyan Jiang, Fakhri Karray, Lijie Hu

CAST: Channel-Aware Spatial Transfer Learning with Pseudo-Image Radar for Sign Language Recognition

We propose CAST, a dual-stream architecture that utilizes channel-aware spatial transfer learning for isolated sign language recognition, addressing the challenges of magnitude-only 60 GHz radar Range-Time Maps (RTM). The proposed framework…

Computer Vision and Pattern Recognition · Computer Science · 2026-05-12 · Md. Shakhoyat Rahman Shujon, Sheikh Md. Galib Mahim, Md. Milon Islam, Md Rezwanul Haque, Md Rabiul Islam, Hamdi Altaheri, Fakhri Karray

Projected Gradient Unlearning for Text-to-Image Diffusion Models: Defending Against Concept Revival Attacks

Machine unlearning for text-to-image diffusion models aims to selectively remove undesirable concepts from pre-trained models without costly retraining. Current unlearning methods share a common weakness: erased concepts return when the…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-24 · Aljalila Aladawi, Mohammed Talha Alam, Fakhri Karray

CodeMMR: Bridging Natural Language, Code, and Image for Unified Retrieval

Code search, framed as information retrieval (IR), underpins modern software engineering and increasingly powers retrieval-augmented generation (RAG), improving code discovery, reuse, and the reliability of LLM-based coding. Yet existing…

Software Engineering · Computer Science · 2026-04-20 · Jiahui Geng, Qing Li, Fengyu Cai, Fakhri Karray

VulnScout-C: A Lightweight Transformer for C Code Vulnerability Detection

Vulnerability detection in C programs is a critical challenge in software security. Although large language models (LLMs) achieve strong detection performance, their multi-billion-parameter scale makes them impractical for integration into…

Cryptography and Security · Computer Science · 2026-03-31 · Aymen Lassoued, Nacef Mbarek, Bechir Dardouri, Bassem Ouni, Qing Li, Fakhri Karray

KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition

In this paper, we introduce a novel kinematics-rich vision-language-action (VLA) task, in which language commands densely encode diverse kinematic attributes (such as direction, trajectory, orientation, and relative displacement) from…

Robotics · Computer Science · 2026-03-19 · Gaoge Han, Zhengqing Gao, Ziwen Li, Jiaxin Huang, Shaoli Huang, Fakhri Karray, Mingming Gong, Tongliang Liu

Your Latent Reasoning is Secretly Policy Improvement Operator

Recently, small models with latent recursion have obtained promising results on complex reasoning tasks. These results are typically explained by the theory that such recursion increases a network's depth, allowing it to compactly emulate…

Computation and Language · Computer Science · 2026-02-06 · Arip Asadulaev, Rayan Banerjee, Fakhri Karray, Martin Takac

Y-Shaped Generative Flows

Modern continuous-time generative models typically induce V-shaped flows: each sample travels independently along a nearly straight trajectory from the prior to the data. Although effective, this independent movement overlooks the…

Machine Learning · Computer Science · 2026-02-05 · Arip Asadulaev, Semyon Semenov, Abduragim Shtanchaev, Eric Moulines, Fakhri Karray, Martin Takac

Zero-Shot Off-Policy Learning

Off-policy learning methods seek to derive an optimal policy directly from a fixed dataset of prior interactions. This objective presents significant challenges, primarily due to the inherent distributional shift and value function…

Machine Learning · Computer Science · 2026-02-03 · Arip Asadulaev, Maksim Bobrin, Salem Lahlou, Dmitry Dylov, Fakhri Karray, Martin Takac

AdaptPrompt: Parameter-Efficient Adaptation of VLMs for Generalizable Deepfake Detection

Recent advances in image generation have led to the widespread availability of highly realistic synthetic media, increasing the difficulty of reliable deepfake detection. A key challenge is generalization, as detectors trained on a narrow…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-22 · Yichen Jiang, Mohammed Talha Alam, Sohail Ahmed Khan, Duc-Tien Dang-Nguyen, Fakhri Karray

PosA-VLA: Enhancing Action Generation via Pose-Conditioned Anchor Attention

Vision-Language-Action (VLA) models have demonstrated remarkable performance on embodied tasks and shown promising potential for real-world applications. However, current VLAs still struggle to produce consistent and precise…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-09 · Ziwen Li, Xin Wang, Hanlue Zhang, Runnan Chen, Runqi Lin, Xiao He, Han Huang, Yandong Guo, Fakhri Karray, Tongliang Liu, Mingming Gong

Vision Language Models for Dynamic Human Activity Recognition in Healthcare Settings

As generative AI continues to evolve, Vision Language Models (VLMs) have emerged as promising tools in various healthcare applications. One area that remains relatively underexplored is their use in human activity recognition (HAR) for…

Computation and Language · Computer Science · 2025-11-18 · Abderrazek Abid, Thanh-Cong Ho, Fakhri Karray

REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language Models for Enhanced Remote Health Monitoring

With the widespread adoption of wearable devices in our daily lives, the demand and appeal for remote patient monitoring have significantly increased. Most research in this field has concentrated on collecting sensor data, visualizing it,…

Computation and Language · Computer Science · 2025-10-27 · Thanh Cong Ho, Farah Kharrat, Abderrazek Abid, Fakhri Karray

Expert or not? assessing data quality in offline reinforcement learning

Offline reinforcement learning (RL) learns exclusively from static datasets, without further interaction with the environment. In practice, such datasets vary widely in quality, often mixing expert, suboptimal, and even random trajectories.…

Machine Learning · Computer Science · 2025-10-15 · Arip Asadulaev, Fakhri Karray, Martin Takac

CoQuIR: A Comprehensive Benchmark for Code Quality-Aware Information Retrieval

Code retrieval is essential in modern software development, as it boosts code reuse and accelerates debugging. However, current benchmarks primarily emphasize functional relevance while neglecting critical dimensions of software quality.…

Software Engineering · Computer Science · 2025-08-28 · Jiahui Geng, Fengyu Cai, Shaobo Cui, Qing Li, Liangwei Chen, Chenyang Lyu, Haonan Li, Derui Zhu, Walter Pretschner, Heinz Koeppl, Fakhri Karray

A Signer-Invariant Conformer and Multi-Scale Fusion Transformer for Continuous Sign Language Recognition

Continuous Sign Language Recognition (CSLR) faces multiple challenges, including significant inter-signer variability and poor generalization to novel sentence structures. Traditional solutions frequently fail to handle these issues…

Computer Vision and Pattern Recognition · Computer Science · 2025-08-14 · Md Rezwanul Haque, Md. Milon Islam, S M Taslim Uddin Raju, Fakhri Karray

FusionEnsemble-Net: An Attention-Based Ensemble of Spatiotemporal Networks for Multimodal Sign Language Recognition

Accurate recognition of sign language in healthcare communication poses a significant challenge, requiring frameworks that can accurately interpret complex multimodal gestures. To deal with this, we propose FusionEnsemble-Net, a novel…

Computer Vision and Pattern Recognition · Computer Science · 2025-08-14 · Md. Milon Islam, Md Rezwanul Haque, S M Taslim Uddin Raju, Fakhri Karray

MDD-Net: Multimodal Depression Detection through Mutual Transformer

Depression is a major mental health condition that severely impacts the emotional and physical well-being of individuals. The simple nature of data collection from social media platforms has attracted significant interest in properly…

Computer Vision and Pattern Recognition · Computer Science · 2025-08-12 · Md Rezwanul Haque, Md. Milon Islam, S M Taslim Uddin Raju, Hamdi Altaheri, Lobna Nassar, Fakhri Karray