Qichen Fu — Scifaro

Apple Intelligence Foundation Language Models

We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute.…

Artificial Intelligence · Computer Science 2026-05-28 Tom Gunter , Zirui Wang , Chong Wang , Ruoming Pang , Andy Narayanan , Aonan Zhang , Bowen Zhang , Chen Chen , Chung-Cheng Chiu , David Qiu , Deepak Gopinath , Dian Ang Yap , Dong Yin , Feng Nan , Floris Weers , Guoli Yin , Haoshuo Huang , Jianyu Wang , Jiarui Lu , John Peebles , Ke Ye , Mark Lee , Nan Du , Qibin Chen , Quentin Keunebroek , Sam Wiseman , Syd Evans , Tao Lei , Vivek Rathod , Xiang Kong , Xianzhi Du , Yanghao Li , Yongqiang Wang , Yuan Gao , Zaid Ahmed , Zhaoyang Xu , Zhiyun Lu , Al Rashid , Albin Madappally Jose , Alec Doane , Alfredo Bencomo , Allison Vanderby , Andrew Hansen , Ankur Jain , Anupama Mann Anupama , Areeba Kamal , Bugu Wu , Carolina Brum , Charlie Maalouf , Chinguun Erdenebileg , Chris Dulhanty , Daniel Parilla , Dominik Moritz , Doug Kang , Eduardo Jimenez , Evan Ladd , Fangping Shi , Felix Bai , Frank Chu , Fred Hohman , Hadas Kotek , Hannah Gillis Coleman , Jane Li , Jeffrey Bigham , Jeffery Cao , Jeff Lai , Jessica Cheung , Jiulong Shan , Joe Zhou , John Li , Jun Qin , Karanjeet Singh , Karla Vega , Kelvin Zou , Laura Heckman , Lauren Gardiner , Margit Bowler , Maria Cordell , Meng Cao , Nicole Hay , Nilesh Shahdadpuri , Otto Godwin , Pranay Dighe , Pushyami Rachapudi , Ramsey Tantawi , Roman Frigg , Sam Davarnia , Sanskruti Shah , Saptarshi Guha , Sasha Sirovica , Shen Ma , Shuang Ma , Simon Wang , Sulgi Kim , Suma Jayaram , Vaishaal Shankar , Varsha Paidi , Vivek Kumar , Xin Wang , Xin Zheng , Walker Cheng , Yael Shrager , Yang Ye , Yasu Tanaka , Yihao Guo , Yunsong Meng , Zhao Tang Luo , Zhi Ouyang , Alp Aygar , Alvin Wan , Andrew Walkingshaw , Andy Narayanan , Antonie Lin , Arsalan Farooq , Brent Ramerth , Colorado Reed , Chris Bartels , Chris Chaney , David Riazati , Eric Liang Yang , Erin Feldman , Gabriel Hochstrasser , Guillaume Seguin , Irina Belousova , Joris Pelemans , Karen Yang , Keivan Alizadeh Vahid , Liangliang Cao , Mahyar Najibi , Marco Zuliani , Max Horton , Minsik Cho , Nikhil Bhendawade , Patrick Dong , Piotr Maj , Pulkit Agrawal , Qi Shan , Qichen Fu , Regan Poston , Sam Xu , Shuangning Liu , Sushma Rao , Tashweena Heeramun , Thomas Merth , Uday Rayala , Victor Cui , Vivek Rangarajan Sridhar , Wencong Zhang , Wenqi Zhang , Wentao Wu , Xingyu Zhou , Xinwen Liu , Yang Zhao , Yin Xia , Zhile Ren , Zhongzheng Ren

Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registers

Recent advancements in vision-language models (VLMs) have expanded their potential for real-world applications, enabling these models to perform complex reasoning on images. In the widely used fully autoregressive transformer-based models…

Computer Vision and Pattern Recognition · Computer Science 2024-10-21 Yuxin Wen , Qingqing Cao , Qichen Fu , Sachin Mehta , Mahyar Najibi

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

The inference of transformer-based large language models consists of two sequential stages: 1) a prefilling stage to compute the KV cache of prompts and generate the first token, and 2) a decoding stage to generate subsequent tokens. For…

Computation and Language · Computer Science 2024-07-22 Qichen Fu , Minsik Cho , Thomas Merth , Sachin Mehta , Mohammad Rastegari , Mahyar Najibi

Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation

Despite the successes of large language models (LLMs), they exhibit significant drawbacks, particularly when processing long contexts. Their inference cost scales quadratically with respect to sequence length, making it expensive for…

Computation and Language · Computer Science 2024-07-22 Thomas Merth , Qichen Fu , Mohammad Rastegari , Mahyar Najibi

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Speculative decoding is a prominent technique to speed up the inference of a large target language model based on predictions of an auxiliary draft model. While effective, in application-specific settings, it often involves fine-tuning both…

Computation and Language · Computer Science 2024-02-20 Nikhil Bhendawade , Irina Belousova , Qichen Fu , Henry Mason , Mohammad Rastegari , Mahyar Najibi

FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline

Super-resolution (SR) techniques have recently been proposed to upscale the outputs of neural radiance fields (NeRF) and generate high-quality images with enhanced inference speeds. However, existing NeRF+SR methods increase training…

Computer Vision and Pattern Recognition · Computer Science 2023-12-25 Chien-Yu Lin , Qichen Fu , Thomas Merth , Karren Yang , Anurag Ranjan

eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models

Since Large Language Models or LLMs have demonstrated high-quality performance on many complex language tasks, there is a great interest in bringing these LLMs to mobile devices for faster responses and better privacy protection. However,…

Machine Learning · Computer Science 2023-09-15 Minsik Cho , Keivan A. Vahid , Qichen Fu , Saurabh Adya , Carlo C Del Mundo , Mohammad Rastegari , Devang Naik , Peter Zatloukal

Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation

Accurately estimating 3D hand pose is crucial for understanding how humans interact with the world. Despite remarkable progress, existing methods often struggle to generate plausible hand poses when the hand is heavily occluded or blurred.…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Qichen Fu , Xingyu Liu , Ran Xu , Juan Carlos Niebles , Kris M. Kitani

Domain Adaptive Hand Keypoint and Pixel Localization in the Wild

We aim to improve the performance of regressing hand keypoints and segmenting pixel-level hand masks under new imaging conditions (e.g., outdoors) when we only have labeled images taken under very different conditions (e.g., indoors). In…

Computer Vision and Pattern Recognition · Computer Science 2022-07-15 Takehiko Ohkawa , Yu-Jhe Li , Qichen Fu , Ryosuke Furuta , Kris M. Kitani , Yoichi Sato

Sequential Voting with Relational Box Fields for Active Object Detection

A key component of understanding hand-object interactions is the ability to identify the active object -- the object that is being manipulated by the human hand. In order to accurately localize the active object, any method must reason…

Computer Vision and Pattern Recognition · Computer Science 2022-06-03 Qichen Fu , Xingyu Liu , Kris M. Kitani

Ego4D: Around the World in 3,000 Hours of Egocentric Video

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Kristen Grauman , Andrew Westbury , Eugene Byrne , Zachary Chavis , Antonino Furnari , Rohit Girdhar , Jackson Hamburger , Hao Jiang , Miao Liu , Xingyu Liu , Miguel Martin , Tushar Nagarajan , Ilija Radosavovic , Santhosh Kumar Ramakrishnan , Fiona Ryan , Jayant Sharma , Michael Wray , Mengmeng Xu , Eric Zhongcong Xu , Chen Zhao , Siddhant Bansal , Dhruv Batra , Vincent Cartillier , Sean Crane , Tien Do , Morrie Doulaty , Akshay Erapalli , Christoph Feichtenhofer , Adriano Fragomeni , Qichen Fu , Abrham Gebreselasie , Cristina Gonzalez , James Hillis , Xuhua Huang , Yifei Huang , Wenqi Jia , Weslie Khoo , Jachym Kolar , Satwik Kottur , Anurag Kumar , Federico Landini , Chao Li , Yanghao Li , Zhenqiang Li , Karttikeya Mangalam , Raghava Modhugu , Jonathan Munro , Tullie Murrell , Takumi Nishiyasu , Will Price , Paola Ruiz Puentes , Merey Ramazanova , Leda Sari , Kiran Somasundaram , Audrey Southerland , Yusuke Sugano , Ruijie Tao , Minh Vo , Yuchen Wang , Xindi Wu , Takuma Yagi , Ziwei Zhao , Yunyi Zhu , Pablo Arbelaez , David Crandall , Dima Damen , Giovanni Maria Farinella , Christian Fuegen , Bernard Ghanem , Vamsi Krishna Ithapu , C. V. Jawahar , Hanbyul Joo , Kris Kitani , Haizhou Li , Richard Newcombe , Aude Oliva , Hyun Soo Park , James M. Rehg , Yoichi Sato , Jianbo Shi , Mike Zheng Shou , Antonio Torralba , Lorenzo Torresani , Mingfei Yan , Jitendra Malik