English
Related papers

Related papers: Graph World Model

200 papers

As one of the mainstream models of artificial intelligence, world models allow agents to learn the representation of the environment for efficient prediction and planning. However, classical world models based on flat tensors face several…

Artificial Intelligence · Computer Science 2026-05-01 Jiawei Liu , Senqiao Yang , Mingjun Wang , Yu Wang , Bei Yu

Integrating AI into the physical layer is a cornerstone of 6G networks. However, current data-driven approaches struggle to generalize across dynamic environments because they lack an intrinsic understanding of electromagnetic wave…

Networking and Internet Architecture · Computer Science 2026-03-27 Ziqi Chen , Yi Ren , Yixuan Huang , Qi Sun , Nan Li , Yuhong Huang , Chih-Lin I , Yifan Li , Liang Xia

Graph-structured data pervades domains such as social networks, biological systems, knowledge graphs, and recommender systems. While foundation models have transformed natural language processing, vision, and multimodal learning through…

Generative world models (WMs) can now simulate worlds with striking visual realism, which naturally raises the question of whether they can endow embodied agents with predictive perception for decision making. Progress on this question has…

Training robot policies within a learned world model is trending due to the inefficiency of real-world interactions. The established image-based world models and policies have shown prior success, but lack robust geometric information that…

Robotics · Computer Science 2025-09-18 Guanxing Lu , Baoxiong Jia , Puhao Li , Yixin Chen , Ziwei Wang , Yansong Tang , Siyuan Huang

Graph Foundation Models (GFMs) are emerging as a significant research topic in the graph domain, aiming to develop graph models trained on extensive and diverse data to enhance their applicability across various tasks and domains.…

Machine Learning · Computer Science 2024-06-03 Haitao Mao , Zhikai Chen , Wenzhuo Tang , Jianan Zhao , Yao Ma , Tong Zhao , Neil Shah , Mikhail Galkin , Jiliang Tang

Embodied action planning is a core challenge in robotics, requiring models to generate precise actions from visual observations and language instructions. While video generation world models are promising, their reliance on pixel-level…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Yangcheng Yu , Xin Jin , Yu Shang , Xin Zhang , Haisheng Su , Wei Wu , Yong Li

Imitation learning has emerged as a promising approach towards building generalist robots. However, scaling imitation learning for large robot foundation models remains challenging due to its reliance on high-quality expert demonstrations.…

Robotics · Computer Science 2025-05-26 Chuning Zhu , Raymond Yu , Siyuan Feng , Benjamin Burchfiel , Paarth Shah , Abhishek Gupta

Foundation models have emerged as critical components in a variety of artificial intelligence applications, and showcase significant success in natural language processing and several other domains. Meanwhile, the field of graph machine…

Machine Learning · Computer Science 2025-03-11 Jiawei Liu , Cheng Yang , Zhiyuan Lu , Junze Chen , Yibo Li , Mengmei Zhang , Ting Bai , Yuan Fang , Lichao Sun , Philip S. Yu , Chuan Shi

Mobile Graphical User Interface (GUI) World Models (WMs) offer a promising path for improving mobile GUI agent performance at train- and inference-time. However, current approaches face a critical trade-off: text-based WMs sacrifice visual…

Machine Learning · Computer Science 2026-05-26 Woosung Koh , Sungjun Han , Segyu Lee , Se-Young Yun , Jamin Shin

Graph learning has become essential in various domains, including recommendation systems and social network analysis. Graph Neural Networks (GNNs) have emerged as promising techniques for encoding structural information and improving…

Machine Learning · Computer Science 2024-10-10 Lianghao Xia , Ben Kao , Chao Huang

Foundation models, such as Large Language Models (LLMs) or Large Vision Models (LVMs), have emerged as one of the most powerful tools in the respective fields. However, unlike text and image data, graph data do not have a definitive…

Machine Learning · Computer Science 2025-04-28 Lecheng Kong , Jiarui Feng , Hao Liu , Chengsong Huang , Jiaxin Huang , Yixin Chen , Muhan Zhang

Graph foundation models (GFM) aim to acquire transferable knowledge by pre-training on diverse graphs, which can be adapted to various downstream tasks. However, domain shift in graphs is inherently two-dimensional: graphs differ not only…

Computation and Language · Computer Science 2026-03-12 Xingtong Yu , Shenghua Ye , Ruijuan Liang , Chang Zhou , Hong Cheng , Xinming Zhang , Yuan Fang

In recent years, Model-based Multi-Agent Reinforcement Learning (MARL) has demonstrated significant advantages over model-free methods in terms of sample efficiency by using independent environment dynamics world models for data sample…

Multiagent Systems · Computer Science 2025-01-20 Zifeng Shi , Meiqin Liu , Senlin Zhang , Ronghao Zheng , Shanling Dong , Ping Wei

World models are becoming central to robotic planning and control as they enable prediction of future state transitions. Existing approaches often emphasize video generation or natural-language prediction, which are difficult to ground in…

World Models have emerged as a powerful paradigm for learning compact, predictive representations of environment dynamics, enabling agents to reason, plan, and generalize beyond direct experience. Despite recent interest in World Models,…

Artificial Intelligence · Computer Science 2026-02-18 Lucas Maes , Quentin Le Lidec , Dan Haramati , Nassim Massaudi , Damien Scieur , Yann LeCun , Randall Balestriero

Humanoid robots, with their human-like form, are uniquely suited for interacting in environments built for people. However, enabling humanoids to reason, plan, and act in complex open-world settings remains a challenge. World models, models…

Robotics · Computer Science 2025-07-10 Muhammad Qasim Ali , Aditya Sridhar , Shahbuland Matiana , Alex Wong , Mohammad Al-Sharman

Humans understand the world through the integration of multiple sensory modalities, enabling them to perceive, reason about, and imagine dynamic physical processes. Inspired by this capability, multimodal foundation models (MFMs) have…

Artificial Intelligence · Computer Science 2025-10-07 Xuehai He

World models enable planning in imagined future predicted space, offering a promising framework for embodied navigation. However, existing navigation world models often lack action-conditioned consistency, so visually plausible predictions…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Han Yan , Zishang Xiang , Zeyu Zhang , Hao Tang

Understanding 3D scenes requires flexible combinations of visual reasoning tasks, including depth estimation, novel view synthesis, and object manipulation, all of which are essential for perception and interaction. Existing approaches have…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Wanhee Lee , Klemen Kotar , Rahul Mysore Venkatesh , Jared Watrous , Honglin Chen , Khai Loong Aw , Daniel L. K. Yamins
‹ Prev 1 2 3 10 Next ›