Abhinav Java — Scifaro

Understanding Task Transfer in Vision-Language Models

Vision-Language Models (VLMs) perform well on multimodal benchmarks but lag behind humans and specialized models on visual perception tasks like depth estimation or object counting. Finetuning on one task can unpredictably affect…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Bhuvan Sachdeva , Karan Uppal , Abhinav Java , Vineeth N. Balasubramanian

FrugalRAG: Less is More in RL Finetuning for Multi-Hop Question Answering

Reinforcement learning (RL) based on the final answer's reward has driven recent progress in small language models (SLMs) on reasoning-heavy tasks such as math and code. However, applying the same techniques to retrieval-augmented…

Computation and Language · Computer Science 2026-03-03 Abhinav Java , Srivathsan Koundinyan , Nagarajan Natarajan , Amit Sharma

Characterizing Deep Research: A Benchmark and Formal Definition

Information tasks such as writing surveys or analytical reports require complex search and reasoning, and have recently been grouped under the umbrella of \textit{deep research} -- a term also adopted by recent models targeting these…

Computation and Language · Computer Science 2025-08-07 Abhinav Java , Ashmit Khandelwal , Sukruta Midigeshi , Aaron Halfaker , Amit Deshpande , Navin Goyal , Ankur Gupta , Nagarajan Natarajan , Amit Sharma

Towards Efficient Exemplar Based Image Editing with Multimodal VLMs

Text-to-Image Diffusion models have enabled a wide array of image editing applications. However, capturing all types of edits through text alone can be challenging and cumbersome. The ambiguous nature of certain image edits is better…

Computer Vision and Pattern Recognition · Computer Science 2025-06-26 Avadhoot Jadhav , Ashutosh Srivastava , Abhinav Java , Silky Singh , Tarun Ram Menta , Surgan Jandial , Balaji Krishnamurthy

LEAST: "Local" text-conditioned image style transfer

Text-conditioned style transfer enables users to communicate their desired artistic styles through text descriptions, offering a new and expressive means of achieving stylization. In this work, we evaluate the text-conditioned image editing…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Silky Singh , Surgan Jandial , Simra Shahid , Abhinav Java

Towards Operationalizing Right to Data Protection

The widespread practice of indiscriminate data scraping to fine-tune language models (LMs) raises significant legal and ethical concerns, particularly regarding compliance with data protection laws such as the General Data Protection…

Machine Learning · Computer Science 2024-11-19 Abhinav Java , Simra Shahid , Chirag Agarwal

ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models

Modern Text-to-Image (T2I) Diffusion models have revolutionized image editing by enabling the generation of high-quality photorealistic images. While the de facto method for performing edits with T2I models is through text instructions,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Ashutosh Srivastava , Tarun Ram Menta , Abhinav Java , Avadhoot Jadhav , Silky Singh , Surgan Jandial , Balaji Krishnamurthy

Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models

Existing debiasing techniques are typically training-based or require access to the model's internals and output distributions, so they are inaccessible to end-users looking to adapt LLM outputs for their particular needs. In this study, we…

Computation and Language · Computer Science 2024-05-20 Shaz Furniturewala , Surgan Jandial , Abhinav Java , Pragyan Banerjee , Simra Shahid , Sumit Bhatia , Kokil Jaidka

All Should Be Equal in the Eyes of Language Models: Counterfactually Aware Fair Text Generation

Fairness in Language Models (LMs) remains a longstanding challenge, given the inherent biases in training data that can be perpetuated by models and affect the downstream tasks. Recent methods employ expensive retraining or attempt…

Computation and Language · Computer Science 2023-11-10 Pragyan Banerjee , Abhinav Java , Surgan Jandial , Simra Shahid , Shaz Furniturewala , Balaji Krishnamurthy , Sumit Bhatia

One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text

Active consumption of digital documents has yielded scope for research in various applications, including search. Traditionally, searching within a document has been cast as a text matching problem ignoring the rich layout and visual cues…

Computer Vision and Pattern Recognition · Computer Science 2022-09-15 Abhinav Java , Shripad Deshmukh , Milan Aggarwal , Surgan Jandial , Mausoom Sarkar , Balaji Krishnamurthy

Learning to Censor by Noisy Sampling

Point clouds are an increasingly ubiquitous input modality and the raw signal can be efficiently processed with recent progress in deep learning. This signal may, often inadvertently, capture sensitive information that can leak semantic and…

Computer Vision and Pattern Recognition · Computer Science 2022-03-24 Ayush Chopra , Abhinav Java , Abhishek Singh , Vivek Sharma , Ramesh Raskar

Introducing Self-Attention to Target Attentive Graph Neural Networks

Session-based recommendation systems suggest relevant items to users by modeling user behavior and preferences using short-term anonymous sessions. Existing methods leverage Graph Neural Networks (GNNs) that propagate and aggregate…

Information Retrieval · Computer Science 2022-01-10 Sai Mitheran , Abhinav Java , Surya Kant Sahu , Arshad Shaikh

AdaSplit: Adaptive Trade-offs for Resource-constrained Distributed Deep Learning

Distributed deep learning frameworks like federated learning (FL) and its variants are enabling personalized experiences across a wide range of web clients and mobile/IoT devices. However, FL-based frameworks are constrained by…

Machine Learning · Computer Science 2021-12-06 Ayush Chopra , Surya Kant Sahu , Abhishek Singh , Abhinav Java , Praneeth Vepakomma , Vivek Sharma , Ramesh Raskar

Rethinking Neural Networks With Benford's Law

Benford's Law (BL) or the Significant Digit Law defines the probability distribution of the first digit of numerical values in a data sample. This Law is observed in many naturally occurring datasets. It can be seen as a measure of…

Machine Learning · Computer Science 2021-10-25 Surya Kant Sahu , Abhinav Java , Arshad Shaikh , Yannic Kilcher