Related papers: Graph World Model

Graph World Models: Concepts, Taxonomy, and Future Directions

As one of the mainstream models of artificial intelligence, world models allow agents to learn the representation of the environment for efficient prediction and planning. However, classical world models based on flat tensors face several…

Artificial Intelligence · Computer Science 2026-05-01 Jiawei Liu, Senqiao Yang, Mingjun Wang, Yu Wang, Bei Yu

A Wireless World Model for AI-Native 6G Networks

Integrating AI into the physical layer is a cornerstone of 6G networks. However, current data-driven approaches struggle to generalize across dynamic environments because they lack an intrinsic understanding of electromagnetic wave…

Networking and Internet Architecture · Computer Science 2026-03-27 Ziqi Chen, Yi Ren, Yixuan Huang, Qi Sun, Nan Li, Yuhong Huang, Chih-Lin I, Yifan Li, Liang Xia

Graph Foundation Models: A Comprehensive Survey

Graph-structured data pervades domains such as social networks, biological systems, knowledge graphs, and recommender systems. While foundation models have transformed natural language processing, vision, and multimodal learning through…

Machine Learning · Computer Science 2025-05-22 Zehong Wang, Zheyuan Liu, Tianyi Ma, Jiazheng Li, Zheyuan Zhang, Xingbo Fu, Yiyang Li, Zhengqing Yuan, Wei Song, Yijun Ma, Qingkai Zeng, Xiusi Chen, Jianan Zhao, Jundong Li, Meng Jiang, Pietro Lio, Nitesh Chawla, Chuxu Zhang, Yanfang Ye

World-in-World: World Models in a Closed-Loop World

Generative world models (WMs) can now simulate worlds with striking visual realism, which naturally raises the question of whether they can endow embodied agents with predictive perception for decision making. Progress on this question has…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Jiahan Zhang, Muqing Jiang, Nanru Dai, Taiming Lu, Arda Uzunoglu, Shunchi Zhang, Yana Wei, Jiahao Wang, Vishal M. Patel, Paul Pu Liang, Daniel Khashabi, Cheng Peng, Rama Chellappa, Tianmin Shu, Alan Yuille, Yilun Du, Jieneng Chen

GWM: Towards Scalable Gaussian World Models for Robotic Manipulation

Training robot policies within a learned world model is trending due to the inefficiency of real-world interactions. The established image-based world models and policies have shown prior success, but lack robust geometric information that…

Robotics · Computer Science 2025-09-18 Guanxing Lu, Baoxiong Jia, Puhao Li, Yixin Chen, Ziwei Wang, Yansong Tang, Siyuan Huang

Position: Graph Foundation Models are Already Here

Graph Foundation Models (GFMs) are emerging as a significant research topic in the graph domain, aiming to develop graph models trained on extensive and diverse data to enhance their applicability across various tasks and domains.…

Machine Learning · Computer Science 2024-06-03 Haitao Mao, Zhikai Chen, Wenzhuo Tang, Jianan Zhao, Yao Ma, Tong Zhao, Neil Shah, Mikhail Galkin, Jiliang Tang

MoWM: Mixture-of-World-Models for Embodied Planning via Latent-to-Pixel Feature Modulation

Embodied action planning is a core challenge in robotics, requiring models to generate precise actions from visual observations and language instructions. While video generation world models are promising, their reliance on pixel-level…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Yangcheng Yu, Xin Jin, Yu Shang, Xin Zhang, Haisheng Su, Wei Wu, Yong Li

Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets

Imitation learning has emerged as a promising approach towards building generalist robots. However, scaling imitation learning for large robot foundation models remains challenging due to its reliance on high-quality expert demonstrations.…

Robotics · Computer Science 2025-05-26 Chuning Zhu, Raymond Yu, Siyuan Feng, Benjamin Burchfiel, Paarth Shah, Abhishek Gupta

Graph Foundation Models: Concepts, Opportunities and Challenges

Foundation models have emerged as critical components in a variety of artificial intelligence applications, and showcase significant success in natural language processing and several other domains. Meanwhile, the field of graph machine…

Machine Learning · Computer Science 2025-03-11 Jiawei Liu, Cheng Yang, Zhiyuan Lu, Junze Chen, Yibo Li, Mengmei Zhang, Ting Bai, Yuan Fang, Lichao Sun, Philip S. Yu, Chuan Shi

Generative Visual Code Mobile World Models

Mobile Graphical User Interface (GUI) World Models (WMs) offer a promising path for improving mobile GUI agent performance at train- and inference-time. However, current approaches face a critical trade-off: text-based WMs sacrifice visual…

Machine Learning · Computer Science 2026-05-26 Woosung Koh, Sungjun Han, Segyu Lee, Se-Young Yun, Jamin Shin

OpenGraph: Towards Open Graph Foundation Models

Graph learning has become essential in various domains, including recommendation systems and social network analysis. Graph Neural Networks (GNNs) have emerged as promising techniques for encoding structural information and improving…

Machine Learning · Computer Science 2024-10-10 Lianghao Xia, Ben Kao, Chao Huang

GOFA: A Generative One-For-All Model for Joint Graph Language Modeling

Foundation models, such as Large Language Models (LLMs) or Large Vision Models (LVMs), have emerged as one of the most powerful tools in the respective fields. However, unlike text and image data, graph data do not have a definitive…

Machine Learning · Computer Science 2025-04-28 Lecheng Kong, Jiarui Feng, Hao Liu, Chengsong Huang, Jiaxin Huang, Yixin Chen, Muhan Zhang

Evaluating Progress in Graph Foundation Models: A Comprehensive Benchmark and New Insights

Graph foundation models (GFM) aim to acquire transferable knowledge by pre-training on diverse graphs, which can be adapted to various downstream tasks. However, domain shift in graphs is inherently two-dimensional: graphs differ not only…

Computation and Language · Computer Science 2026-03-12 Xingtong Yu, Shenghua Ye, Ruijuan Liang, Chang Zhou, Hong Cheng, Xinming Zhang, Yuan Fang

GAWM: Global-Aware World Model for Multi-Agent Reinforcement Learning

In recent years, Model-based Multi-Agent Reinforcement Learning (MARL) has demonstrated significant advantages over model-free methods in terms of sample efficiency by using independent environment dynamics world models for data sample…

Multiagent Systems · Computer Science 2025-01-20 Zifeng Shi, Meiqin Liu, Senlin Zhang, Ronghao Zheng, Shanling Dong, Ping Wei

H-WM: Robotic Task and Motion Planning Guided by Hierarchical World Model

World models are becoming central to robotic planning and control as they enable prediction of future state transitions. Existing approaches often emphasize video generation or natural-language prediction, which are difficult to ground in…

Robotics · Computer Science 2026-03-05 Jinbang Huang, Wenyuan Chen, Zhiyuan Li, Oscar Pang, Xiao Hu, Lingfeng Zhang, Yuanzhao Hu, Zhanguang Zhang, Mark Coates, Tongtong Cao, Xingyue Quan, Yingxue Zhang

stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation

World Models have emerged as a powerful paradigm for learning compact, predictive representations of environment dynamics, enabling agents to reason, plan, and generalize beyond direct experience. Despite recent interest in World Models,…

Artificial Intelligence · Computer Science 2026-02-18 Lucas Maes, Quentin Le Lidec, Dan Haramati, Nassim Massaudi, Damien Scieur, Yann LeCun, Randall Balestriero

Humanoid World Models: Open World Foundation Models for Humanoid Robotics

Humanoid robots, with their human-like form, are uniquely suited for interacting in environments built for people. However, enabling humanoids to reason, plan, and act in complex open-world settings remains a challenge. World models, models…

Robotics · Computer Science 2025-07-10 Muhammad Qasim Ali, Aditya Sridhar, Shahbuland Matiana, Alex Wong, Mohammad Al-Sharman

Bridging the Gap Between Multimodal Foundation Models and World Models

Humans understand the world through the integration of multiple sensory modalities, enabling them to perceive, reason about, and imagine dynamic physical processes. Inspired by this capability, multimodal foundation models (MFMs) have…

Artificial Intelligence · Computer Science 2025-10-07 Xuehai He

MWM: Mobile World Models for Action-Conditioned Consistent Prediction

World models enable planning in imagined future predicted space, offering a promising framework for embodied navigation. However, existing navigation world models often lack action-conditioned consistency, so visually plausible predictions…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Han Yan, Zishang Xiang, Zeyu Zhang, Hao Tang

Unified 3D Scene Understanding Through Physical World Modeling

Understanding 3D scenes requires flexible combinations of visual reasoning tasks, including depth estimation, novel view synthesis, and object manipulation, all of which are essential for perception and interaction. Existing approaches have…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Wanhee Lee, Klemen Kotar, Rahul Mysore Venkatesh, Jared Watrous, Honglin Chen, Khai Loong Aw, Daniel L. K. Yamins