English
Related papers

Related papers: SimWorld Studio: Automatic Environment Generation …

200 papers

While LLM/VLM-powered AI agents have advanced rapidly in math, coding, and computer use, their applications in complex physical and social environments remain challenging. Building agents that can survive and thrive in the real world (for…

As large language models (LLMs) continue to improve in reasoning and decision-making, there is a growing need for realistic and interactive environments where their abilities can be rigorously evaluated. We present VirtualEnv, a…

Artificial Intelligence · Computer Science 2026-02-10 Kabir Swain , Sijie Han , Ayush Raina , Jin Zhang , Shuang Li , Michael Stopa , Antonio Torralba

The field of Embodied AI is witnessing a rapid evolution toward general-purpose robotic systems, fueled by high-fidelity simulation and large-scale data collection. However, this scaling capability remains severely bottlenecked by a…

Artificial Intelligence · Computer Science 2026-01-30 Zixing Lei , Genjia Liu , Yuanshuo Zhang , Qipeng Liu , Chuan Wen , Shanghang Zhang , Wenzhao Lian , Siheng Chen

Recent SOTA approaches for embodied learning via interaction directly employ large language models (LLMs) as agents to determine the next steps in an environment. Due to their world knowledge and reasoning capabilities, LLM agents achieve…

Computation and Language · Computer Science 2024-07-15 Abhay Zala , Jaemin Cho , Han Lin , Jaehong Yoon , Mohit Bansal

Despite advancements in Large Language Models (LLMs) and Large Multimodal Models (LMMs), their integration into language-grounded, human-like embodied agents remains incomplete, hindering complex real-life task performance in physical…

Computation and Language · Computer Science 2024-08-20 Zhili Cheng , Zhitong Wang , Jinyi Hu , Shengding Hu , An Liu , Yuge Tu , Pengkai Li , Lei Shi , Zhiyuan Liu , Maosong Sun

Achieving spatial intelligence requires moving beyond visual plausibility to build world simulators grounded in physical laws. While coding LLMs have advanced static 3D scene generation, extending this paradigm to 4D dynamics remains a…

Computer Vision and Pattern Recognition · Computer Science 2026-02-13 Yi Zhang , Yunshuang Wang , Zeyu Zhang , Hao Tang

As embodied intelligence emerges as a core frontier in artificial intelligence research, simulation platforms must evolve beyond low-level physical interactions to capture complex, human-centered social behaviors. We introduce FreeAskWorld,…

Artificial Intelligence · Computer Science 2025-12-23 Yuhang Peng , Yizhou Pan , Xinning He , Jihaoyu Yang , Xinyu Yin , Han Wang , Xiaoji Zheng , Chao Gao , Jiangtao Gong

Embodied decision-making enables agents to translate high-level goals into executable actions through continuous interactions within the physical world, forming a cornerstone of general-purpose embodied intelligence. Large language models…

Artificial Intelligence · Computer Science 2025-10-15 Zixing Lei , Sheng Yin , Yichen Xiong , Yuanzhuo Ding , Wenhao Huang , Yuxi Wei , Qingyao Xu , Yiming Li , Weixin Li , Yunhong Wang , Siheng Chen

Recent advances in vision language models (VLMs) have shown strong potential for spatial reasoning and 3D scene layout generation from open-ended language instructions. However, generating layouts that are not only semantically coherent but…

Artificial Intelligence · Computer Science 2026-03-10 Ali Shamsaddinlou

Scalable AI agents training relies on interactive environments that faithfully simulate the consequences of agent actions. Manually crafted environments are expensive to build, brittle to extend, and fundamentally limited in diversity. A…

Artificial Intelligence · Computer Science 2026-05-11 Yi Liu , TingFeng Hui , Wei Zhang , Li Sun , Ningxin Su , Jian Wang , Sen Su

Although LLMs demonstrate proficiency in several text-based reasoning and planning tasks, their implementation in robotics control is constrained by significant deficiencies: (1) LLM agents are designed to work mainly with textual inputs…

Artificial Intelligence · Computer Science 2025-10-17 Shuang Ao , Flora D. Salim , Simon Khan

Training capable Large Language Model (LLM) agents is critically bottlenecked by the high cost and static nature of real-world interaction data. We address this by introducing GenEnv, a framework that establishes a difficulty-aligned…

Computation and Language · Computer Science 2025-12-24 Jiacheng Guo , Ling Yang , Peter Chen , Qixin Xiao , Yinjie Wang , Xinzhe Juan , Jiahao Qiu , Ke Shen , Mengdi Wang

While large language models (LLMs) excel in a simulated world of texts, they struggle to interact with the more realistic world without perceptions of other modalities such as visual or audio signals. Although vision-language models (VLMs)…

Computer Vision and Pattern Recognition · Computer Science 2024-04-01 Yijun Yang , Tianyi Zhou , Kanxue Li , Dapeng Tao , Lusong Li , Li Shen , Xiaodong He , Jing Jiang , Yuhui Shi

Text Worlds are virtual environments for embodied agents that, unlike 2D or 3D environments, are rendered exclusively using textual descriptions. These environments offer an alternative to higher-fidelity 3D environments due to their low…

Computation and Language · Computer Science 2021-07-12 Peter A Jansen

Large-scale video generative models can synthesize diverse and realistic visual content for dynamic world creation, but they often lack element-wise controllability, hindering their use in editing scenes and training embodied AI agents. We…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Sicheng Mo , Ziyang Leng , Leon Liu , Weizhen Wang , Honglin He , Bolei Zhou

Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions,…

Robotics · Computer Science 2024-10-14 SIMA Team , Maria Abi Raad , Arun Ahuja , Catarina Barros , Frederic Besse , Andrew Bolt , Adrian Bolton , Bethanie Brownfield , Gavin Buttimore , Max Cant , Sarah Chakera , Stephanie C. Y. Chan , Jeff Clune , Adrian Collister , Vikki Copeman , Alex Cullum , Ishita Dasgupta , Dario de Cesare , Julia Di Trapani , Yani Donchev , Emma Dunleavy , Martin Engelcke , Ryan Faulkner , Frankie Garcia , Charles Gbadamosi , Zhitao Gong , Lucy Gonzales , Kshitij Gupta , Karol Gregor , Arne Olav Hallingstad , Tim Harley , Sam Haves , Felix Hill , Ed Hirst , Drew A. Hudson , Jony Hudson , Steph Hughes-Fitt , Danilo J. Rezende , Mimi Jasarevic , Laura Kampis , Rosemary Ke , Thomas Keck , Junkyung Kim , Oscar Knagg , Kavya Kopparapu , Rory Lawton , Andrew Lampinen , Shane Legg , Alexander Lerchner , Marjorie Limont , Yulan Liu , Maria Loks-Thompson , Joseph Marino , Kathryn Martin Cussons , Loic Matthey , Siobhan Mcloughlin , Piermaria Mendolicchio , Hamza Merzic , Anna Mitenkova , Alexandre Moufarek , Valeria Oliveira , Yanko Oliveira , Hannah Openshaw , Renke Pan , Aneesh Pappu , Alex Platonov , Ollie Purkiss , David Reichert , John Reid , Pierre Harvey Richemond , Tyson Roberts , Giles Ruscoe , Jaume Sanchez Elias , Tasha Sandars , Daniel P. Sawyer , Tim Scholtes , Guy Simmons , Daniel Slater , Hubert Soyer , Heiko Strathmann , Peter Stys , Allison C. Tam , Denis Teplyashin , Tayfun Terzi , Davide Vercelli , Bojan Vujatovic , Marcus Wainwright , Jane X. Wang , Zhengdong Wang , Daan Wierstra , Duncan Williams , Nathaniel Wong , Sarah York , Nick Young

Agent self-improvement, where the backbone Large Language Model (LLM) of the agent are trained on trajectories sampled autonomously based on their own policies, has emerged as a promising approach for enhancing performance. Recent…

Computation and Language · Computer Science 2025-08-22 Tianqing Fang , Hongming Zhang , Zhisong Zhang , Kaixin Ma , Wenhao Yu , Haitao Mi , Dong Yu

The research field of Embodied AI has witnessed substantial progress in visual navigation and exploration thanks to powerful simulating platforms and the availability of 3D data of indoor and photorealistic environments. These two factors…

Computer Vision and Pattern Recognition · Computer Science 2021-11-03 Roberto Bigazzi , Federico Landi , Marcella Cornia , Silvia Cascianelli , Lorenzo Baraldi , Rita Cucchiara

Embodied AI and robotic systems increasingly depend on scalable, diverse, and physically grounded 3D content for simulation-based training and real-world deployment. While 3D generative modeling has advanced rapidly, embodied applications…

Robotics · Computer Science 2026-05-11 Tianwei Ye , Yifan Mao , Minwen Liao , Jian Liu , Chunchao Guo , Dazhao Du , Quanxin Shou , Fangqi Zhu , Song Guo

Recent advances in large language model (LLM) have empowered autonomous agents to perform multi-turn interactions with tools and environments. However, scaling such agent training is limited by the lack of diverse and reliable environments.…

Artificial Intelligence · Computer Science 2026-05-26 Zhaoyang Wang , Canwen Xu , Boyi Liu , Yite Wang , Siwei Han , Zhewei Yao , Huaxiu Yao , Yuxiong He
‹ Prev 1 2 3 10 Next ›