Related papers: Apple Intelligence Foundation Language Models

Apple Intelligence Foundation Language Models: Tech Report 2025

We introduce two multilingual, multimodal foundation language models that power Apple Intelligence features across Apple devices and services: (i) a 3B-parameter on-device model optimized for Apple silicon through architectural innovations…

Machine Learning · Computer Science 2025-08-28 Ethan Li , Anders Boesen Lindbo Larsen , Chen Zhang , Xiyou Zhou , Jun Qin , Dian Ang Yap , Narendran Raghavan , Xuankai Chang , Margit Bowler , Eray Yildiz , John Peebles , Hannah Gillis Coleman , Matteo Ronchi , Peter Gray , Keen You , Anthony Spalvieri-Kruse , Ruoming Pang , Reed Li , Yuli Yang , Emad Soroush , Zhiyun Lu , Crystal Xiao , Rong Situ , Jordan Huffaker , David Griffiths , Zaid Ahmed , Peng Zhang , Daniel Parilla , Asaf Liberman , Jennifer Mallalieu , Parsa Mazaheri , Qibin Chen , Manjot Bilkhu , Aonan Zhang , Eric Wang , Dave Nelson , Michael FitzMaurice , Thomas Voice , Jeremy Liu , Josh Shaffer , Shiwen Zhao , Prasanth Yadla , Farzin Rasteh , Pengsheng Guo , Arsalan Farooq , Jeremy Snow , Stephen Murphy , Tao Lei , Minsik Cho , George Horrell , Sam Dodge , Lindsay Hislop , Sumeet Singh , Alex Dombrowski , Aiswarya Raghavan , Sasha Sirovica , Mandana Saebi , Faye Lao , Max Lam , TJ Lu , Zhaoyang Xu , Karanjeet Singh , Marc Kirchner , David Mizrahi , Rajat Arora , Haotian Zhang , Henry Mason , Lawrence Zhou , Yi Hua , Ankur Jain , Felix Bai , Joseph Astrauskas , Floris Weers , Josh Gardner , Mira Chiang , Yi Zhang , Pulkit Agrawal , Tony Sun , Quentin Keunebroek , Matthew Hopkins , Bugu Wu , Tao Jia , Chen Chen , Xingyu Zhou , Nanzhu Wang , Peng Liu , Ruixuan Hou , Rene Rauch , Yuan Gao , Afshin Dehghan , Jonathan Janke , Zirui Wang , Cha Chen , Xiaoyi Ren , Feng Nan , Josh Elman , Dong Yin , Yusuf Goren , Jeff Lai , Yiran Fei , Syd Evans , Muyang Yu , Guoli Yin , Yi Qin , Erin Feldman , Isha Garg , Aparna Rajamani , Karla Vega , Walker Cheng , TJ Collins , Hans Han , Raul Rea Menacho , Simon Yeung , Sophy Lee , Phani Mutyala , Ying-Chang Cheng , Zhe Gan , Sprite Chu , Justin Lazarow , Alessandro Pappalardo , Federico Scozzafava , Jing Lu , Erik Daxberger , Laurent Duchesne , Jen Liu , David Güera , Stefano Ligas , Mary Beth Kery , Brent Ramerth , Ciro Sannino , Marcin Eichner , Haoshuo Huang , Rui Qian 
, Moritz Schwarzer-Becker , David Riazati , Mingfei Gao , Bailin Wang , Jack Cackler , Yang Lu , Ransen Niu , John Dennison , Guillaume Klein , Jeffrey Bigham , Deepak Gopinath , Navid Shiee , Darren Botten , Guillaume Tartavel , Alex Guillen Garcia , Sam Xu , Victoria MönchJuan Haladjian , Zi-Yi Dou , Matthias Paulik , Adolfo Lopez Mendez , Zhen Li , Hong-You Chen , Chao Jia , Dhaval Doshi , Zhengdong Zhang , Raunak Manjani , Aaron Franklin , Zhile Ren , David Chen , Artsiom Peshko , Nandhitha Raghuram , Hans Hao , Jiulong Shan , Kavya Nerella , Ramsey Tantawi , Vivek Kumar , Saiwen Wang , Brycen Wershing , Bhuwan Dhingra , Dhruti Shah , Ob Adaranijo , Xin Zheng , Tait Madsen , Hadas Kotek , Chang Liu , Yin Xia , Hanli Li , Suma Jayaram , Yanchao Sun , Ahmed Fakhry , Vasileios Saveris , Dustin Withers , Yanghao Li , Alp Aygar , Andres Romero Mier Y Teran , Kaiwei Huang , Mark Lee , Xiujun Li , Yuhong Li , Tyler Johnson , Jay Tang , Joseph Yitan Cheng , Futang Peng , Andrew Walkingshaw , Lucas Guibert , Abhishek Sharma , Cheng Shen , Piotr Maj , Yasutaka Tanaka , You-Cyuan Jhang , Vivian Ma , Tommi Vehvilainen , Kelvin Zou , Jeff Nichols , Matthew Lei , David Qiu , Yihao Qian , Gokul Santhanam , Wentao Wu , Yena Han , Dominik Moritz , Haijing Fu , Mingze Xu , Vivek Rathod , Jian Liu , Louis D'hauwe , Qin Ba , Haitian Sun , Haoran Yan , Philipp Dufter , Anh Nguyen , Yihao Feng , Emma Wang , Keyu He , Rahul Nair , Sanskruti Shah , Jiarui Lu , Patrick Sonnenberg , Jeremy Warner , Yuanzhi Li , Bowen Pan , Ziyi Zhong , Joe Zhou , Sam Davarnia , Olli Saarikivi , Irina Belousova , Rachel Burger , Shang-Chen Wu , Di Feng , Bas Straathof , James Chou , Yuanyang Zhang , Marco Zuliani , Eduardo Jimenez , Abhishek Sundararajan , Xianzhi Du , Chang Lan , Nilesh Shahdadpuri , Peter Grasch , Sergiu Sima , Josh Newnham , Varsha Paidi , Jianyu Wang , Kaelen Haag , Alex Braunstein , Daniele Molinari , Richard Wei , Brenda Yang , Nicholas Lusskin , Joanna Arreaza-Taylor , Meng Cao , 
Nicholas Seidl , Simon Wang , Jiaming Hu , Yiping Ma , Mengyu Li , Kieran Liu , Hang Su , Sachin Ravi , Chong Wang , Xin Wang , Kevin Smith , Haoxuan You , Binazir Karimzadeh , Rui Li , Jinhao Lei , Wei Fang , Alec Doane , Sam Wiseman , Ismael Fernandez , Jane Li , Andrew Hansen , Javier Movellan , Christopher Neubauer , Hanzhi Zhou , Chris Chaney , Nazir Kamaldin , Valentin Wolf , Fernando Bermúdez-Medina , Joris Pelemans , Peter Fu , Howard Xing , Xiang Kong , Wayne Shan , Gabriel Jacoby-Cooper , Dongcai Shen , Tom Gunter , Guillaume Seguin , Fangping Shi , Shiyu Li , Yang Xu , Areeba Kamal , Dan Masi , Saptarshi Guha , Qi Zhu , Jenna Thibodeau , Changyuan Zhang , Rebecca Callahan , Charles Maalouf , Wilson Tsao , Boyue Li , Qingqing Cao , Naomy Sabo , Cheng Leong , Yi Wang , Anupama Mann Anupama , Colorado Reed , Kenneth Jung , Zhifeng Chen , Mohana Prasad Sathya Moorthy , Yifei He , Erik Hornberger , Devi Krishna , Senyu Tong , Michael , Lee , David Haldimann , Yang Zhao , Bowen Zhang , Chang Gao , Chris Bartels , Sushma Rao , Nathalie Tran , Simon Lehnerer , Co Giang , Patrick Dong , Junting Pan , Biyao Wang , Dongxu Li , Mehrdad Farajtabar , Dongseong Hwang , Grace Duanmu , Eshan Verma , Sujeeth Reddy , Qi Shan , Hongbin Gao , Nan Du , Pragnya Sridhar , Forrest Huang , Yingbo Wang , Nikhil Bhendawade , Diane Zhu , Sai Aitharaju , Fred Hohman , Lauren Gardiner , Chung-Cheng Chiu , Yinfei Yang , Alper Kokmen , Frank Chu , Ke Ye , Kaan Elgin , Oron Levy , John Park , Donald Zhang , Eldon Schoop , Nina Wenzel , Michael Booker , Hyunjik Kim , Chinguun Erdenebileg , Nan Dun , Eric Liang Yang , Priyal Chhatrapati , Vishaal Mahtani , Haiming Gang , Kohen Chia , Deepa Seshadri , Donghan Yu , Yan Meng , Kelsey Peterson , Zhen Yang , Yongqiang Wang , Carina Peng , Doug Kang , Anuva Agarwal , Albert Antony , Juan Lao Tebar , Albin Madappally Jose , Regan Poston , Andy De Wang , Gerard Casamayor , Elmira Amirloo , Violet Yao , Wojciech Kryscinski , Kun Duan , Lezhi L

OpenELM: An Efficient Language Model Family with Open Training and Inference Framework

The reproducibility and transparency of large language models are crucial for advancing open research, ensuring the trustworthiness of results, and enabling investigations into data and model biases, as well as potential risks. To this end,…

Computation and Language · Computer Science 2024-05-03 Sachin Mehta , Mohammad Hossein Sekhavat , Qingqing Cao , Maxwell Horton , Yanzi Jin , Chenfan Sun , Iman Mirzadeh , Mahyar Najibi , Dmitry Belenko , Peter Zatloukal , Mohammad Rastegari

A Performance Evaluation of a Quantized Large Language Model on Various Smartphones

This paper explores the feasibility and performance of on-device large language model (LLM) inference on various Apple iPhone models. Amidst the rapid evolution of generative AI, on-device LLMs offer solutions to privacy, security, and…

Machine Learning · Computer Science 2024-02-02 Tolga Çöplü , Marc Loedi , Arto Bendiken , Mykhailo Makohin , Joshua J. Bouw , Stephen Cobb

A Case for Business Process-Specific Foundation Models

The inception of large language models has helped advance state-of-the-art performance on numerous natural language tasks. This has also opened the door for the development of foundation models for other domains and data modalities such as…

Artificial Intelligence · Computer Science 2022-12-02 Yara Rizk , Praveen Venkateswaran , Vatche Isahagian , Vinod Muthusamy

Profiling Large Language Model Inference on Apple Silicon: A Quantization Perspective

A systematic understanding of Apple Silicon is lacking in the current landscape of hardware efficiency; research focus is largely centered on accelerating GPUs for large-scale training or inference on CUDA devices. This paper investigates…

Performance · Computer Science 2025-08-13 Afsara Benazir , Felix Xiaozhu Lin

Towards Responsible Generative AI: A Reference Architecture for Designing Foundation Model based Agents

Foundation models, such as large language models (LLMs), have been widely recognised as transformative AI technologies due to their capabilities to understand and generate content, including plans with reasoning capabilities. Foundation…

Artificial Intelligence · Computer Science 2024-04-04 Qinghua Lu , Liming Zhu , Xiwei Xu , Zhenchang Xing , Stefan Harrer , Jon Whittle

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based…

Computation and Language · Computer Science 2022-01-24 Jack W. Rae , Sebastian Borgeaud , Trevor Cai , Katie Millican , Jordan Hoffmann , Francis Song , John Aslanides , Sarah Henderson , Roman Ring , Susannah Young , Eliza Rutherford , Tom Hennigan , Jacob Menick , Albin Cassirer , Richard Powell , George van den Driessche , Lisa Anne Hendricks , Maribeth Rauh , Po-Sen Huang , Amelia Glaese , Johannes Welbl , Sumanth Dathathri , Saffron Huang , Jonathan Uesato , John Mellor , Irina Higgins , Antonia Creswell , Nat McAleese , Amy Wu , Erich Elsen , Siddhant Jayakumar , Elena Buchatskaya , David Budden , Esme Sutherland , Karen Simonyan , Michela Paganini , Laurent Sifre , Lena Martens , Xiang Lorraine Li , Adhiguna Kuncoro , Aida Nematzadeh , Elena Gribovskaya , Domenic Donato , Angeliki Lazaridou , Arthur Mensch , Jean-Baptiste Lespiau , Maria Tsimpoukelli , Nikolai Grigorev , Doug Fritz , Thibault Sottiaux , Mantas Pajarskas , Toby Pohlen , Zhitao Gong , Daniel Toyama , Cyprien de Masson d'Autume , Yujia Li , Tayfun Terzi , Vladimir Mikulik , Igor Babuschkin , Aidan Clark , Diego de Las Casas , Aurelia Guy , Chris Jones , James Bradbury , Matthew Johnson , Blake Hechtman , Laura Weidinger , Iason Gabriel , William Isaac , Ed Lockhart , Simon Osindero , Laura Rimell , Chris Dyer , Oriol Vinyals , Kareem Ayoub , Jeff Stanway , Lorrayne Bennett , Demis Hassabis , Koray Kavukcuoglu , Geoffrey Irving

Large Language Models over Networks: Collaborative Intelligence under Resource Constraints

Large language models (LLMs) are transforming society, powering applications from smartphone assistants to autonomous driving. Yet cloud-based LLM services alone cannot serve a growing class of applications, including those operating under…

Signal Processing · Electrical Eng. & Systems 2026-05-12 Liangqi Yuan , Wenzhi Fang , Shiqiang Wang , H. Vincent Poor , Christopher G. Brinton

Yi: Open Foundation Models by 01.AI

We introduce the Yi model family, a series of language and multimodal models that demonstrate strong multi-dimensional capabilities. The Yi model family is based on 6B and 34B pretrained language models, then we extend them to chat models,…

Computation and Language · Computer Science 2025-01-22 01.AI , Alex Young , Bei Chen , Chao Li , Chengen Huang , Ge Zhang , Guanwei Zhang , Guoyin Wang , Heng Li , Jiangcheng Zhu , Jianqun Chen , Jing Chang , Kaidong Yu , Peng Liu , Qiang Liu , Shawn Yue , Senbin Yang , Shiming Yang , Wen Xie , Wenhao Huang , Xiaohui Hu , Xiaoyi Ren , Xinyao Niu , Pengcheng Nie , Yanpeng Li , Yuchi Xu , Yudong Liu , Yue Wang , Yuxuan Cai , Zhenyu Gu , Zhiyuan Liu , Zonghong Dai

Foundation Models for Natural Language Processing -- Pre-trained Language Models Integrating Media

This open access book provides a comprehensive overview of the state of the art in research and applications of Foundation Models and is intended for readers familiar with basic Natural Language Processing (NLP) concepts. Over the recent…

Computation and Language · Computer Science 2023-02-20 Gerhard Paaß , Sven Giesselbach

A Survey of Reasoning with Foundation Models

Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-world settings such as negotiation, medical diagnosis, and criminal investigation. It serves as a fundamental methodology in the field of…

Artificial Intelligence · Computer Science 2024-01-26 Jiankai Sun , Chuanyang Zheng , Enze Xie , Zhengying Liu , Ruihang Chu , Jianing Qiu , Jiaqi Xu , Mingyu Ding , Hongyang Li , Mengzhe Geng , Yue Wu , Wenhai Wang , Junsong Chen , Zhangyue Yin , Xiaozhe Ren , Jie Fu , Junxian He , Wu Yuan , Qi Liu , Xihui Liu , Yu Li , Hao Dong , Yu Cheng , Ming Zhang , Pheng Ann Heng , Jifeng Dai , Ping Luo , Jingdong Wang , Ji-Rong Wen , Xipeng Qiu , Yike Guo , Hui Xiong , Qun Liu , Zhenguo Li

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Foundation model development attracts a rapidly expanding body of contributors, scientists, and applications. To help shape responsible development practices, we introduce the Foundation Model Development Cheatsheet: a growing collection of…

Machine Learning · Computer Science 2025-02-18 Shayne Longpre , Stella Biderman , Alon Albalak , Hailey Schoelkopf , Daniel McDuff , Sayash Kapoor , Kevin Klyman , Kyle Lo , Gabriel Ilharco , Nay San , Maribeth Rauh , Aviya Skowron , Bertie Vidgen , Laura Weidinger , Arvind Narayanan , Victor Sanh , David Adelani , Percy Liang , Rishi Bommasani , Peter Henderson , Sasha Luccioni , Yacine Jernite , Luca Soldaini

Foundation models in brief: A historical, socio-technical focus

Foundation models can be disruptive for future AI development by scaling up deep learning in terms of model size and training data's breadth and size. These models achieve state-of-the-art performance (often through further adaptation) on a…

Artificial Intelligence · Computer Science 2022-12-20 Johannes Schneider

Foundations of Large Language Models

This is a book about large language models. As indicated by the title, it primarily focuses on foundational concepts rather than comprehensive coverage of all cutting-edge technologies. The book is structured into five main chapters, each…

Computation and Language · Computer Science 2025-06-17 Tong Xiao , Jingbo Zhu

MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications

Large Language Models (LLMs) have demonstrated remarkable performance across various natural language tasks, marking significant strides towards general artificial intelligence. While general artificial intelligence is leveraged by…

Computation and Language · Computer Science 2023-10-31 Yizhe Yang , Huashan Sun , Jiawei Li , Runheng Liu , Yinghao Li , Yuhang Liu , Heyan Huang , Yang Gao

CHORUS: Foundation Models for Unified Data Discovery and Exploration

We apply foundation models to data discovery and exploration tasks. Foundation models include large language models (LLMs) that show promising performance on a range of diverse tasks unrelated to their training. We show that these models…

Databases · Computer Science 2024-04-09 Moe Kayali , Anton Lykov , Ilias Fountalis , Nikolaos Vasiloglou , Dan Olteanu , Dan Suciu

TinyLLM: A Framework for Training and Deploying Language Models at the Edge Computers

Language models have gained significant interest due to their general-purpose capabilities, which appear to emerge as models are scaled to increasingly larger parameter sizes. However, these large models impose stringent requirements on…

Machine Learning · Computer Science 2024-12-23 Savitha Viswanadh Kandala , Pramuka Medaranga , Ambuj Varshney

Towards a Foundation Model for Communication Systems

Artificial Intelligence (AI) has demonstrated unprecedented performance across various domains, and its application to communication systems is an active area of research. While current methods focus on task-specific solutions, the broader…

Artificial Intelligence · Computer Science 2025-05-21 Davide Buffelli , Sowmen Das , Yu-Wei Lin , Sattar Vakili , Chien-Yi Wang , Masoud Attarifar , Pritthijit Nath , Da-shan Shiu

Language Model Behavior: A Comprehensive Survey

Transformer language models have received widespread public attention, yet their generated text is often surprising even to NLP researchers. In this survey, we discuss over 250 recent studies of English language model behavior before…

Computation and Language · Computer Science 2023-08-29 Tyler A. Chang , Benjamin K. Bergen

Lost in the Pipeline: How Well Do Large Language Models Handle Data Preparation?

Large language models have recently demonstrated their exceptional capabilities in supporting and automating various tasks. Among the tasks worth exploring for testing large language model capabilities, we considered data preparation, a…

Computation and Language · Computer Science 2025-12-01 Matteo Spreafico , Ludovica Tassini , Camilla Sancricca , Cinzia Cappiello