English
Related papers

Related papers: GENIUS: Generative Fluid Intelligence Evaluation S…

200 papers

Predictive atomistic simulations have propelled materials discovery, yet routine setup and debugging still demand computer specialists. This know-how gap limits Integrated Computational Materials Engineering (ICME), where state-of-the-art…

Generative retrieval is an emerging approach in information retrieval that generates identifiers (IDs) of target data based on a query, providing an efficient alternative to traditional embedding-based retrieval methods. However, existing…

Information Retrieval · Computer Science 2025-06-09 Sungyeon Kim , Xinliang Zhu , Xiaofan Lin , Muhammet Bastan , Douglas Gray , Suha Kwak

In this report we describe the implementation and approach developed during the GENIUS Project. The GENIUS project is about the generation of usable user interfaces. It tries to cope with issues related to automatic generation where,…

Human-Computer Interaction · Computer Science 2013-10-08 Jean-Sebastien Sottet , Alain Vagner

Unified multimodal models have recently demonstrated strong generative capabilities, yet whether and when generation improves understanding remains unclear. Existing benchmarks lack a systematic exploration of the specific tasks where…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Zimo Wen , Boxiu Li , Wanbo Zhang , Junxiang Lei , Xiaoyu Chen , Yijia Fan , Qi Zhang , Yujiang Wang , Lili Qiu , Bo Li , Ziwei Liu , Caihua Shan , Yifan Yang , Yifei Shen

The long-standing goal of multimodal AI is to build unified models in which visual understanding and visual generation mutually enhance one another. Despite recent works such as BAGEL, BLIP3o achieves remarkable progress; In practice,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-18 Yujun Tong , Dongliang Chang , Zijin Yin , Xintong Liu , Yuanchen Fang , Zhanyu Ma

We introduce GENIUS: a conditional text generation model using sketches as input, which can fill in the missing contexts for a given sketch (key information consisting of textual spans, phrases, or words, concatenated by mask tokens).…

Computation and Language · Computer Science 2022-11-21 Biyang Guo , Yeyun Gong , Yelong Shen , Songqiao Han , Hailiang Huang , Nan Duan , Weizhu Chen

Fluid intelligence (Gf) has been defined as the ability to reason and solve previously unseen problems. Links to Gf have been found in magnetic resonance imaging (MRI) sequences such as functional MRI and diffusion tensor imaging. As part…

We present JanusFlow, a powerful framework that unifies image understanding and generation in a single model. JanusFlow introduces a minimalist architecture that integrates autoregressive language models with rectified flow, a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Yiyang Ma , Xingchao Liu , Xiaokang Chen , Wen Liu , Chengyue Wu , Zhiyu Wu , Zizheng Pan , Zhenda Xie , Haowei Zhang , Xingkai yu , Liang Zhao , Yisong Wang , Jiaying Liu , Chong Ruan

Unified multimodal models integrate the reasoning capacity of large language models with both image understanding and generation, showing great promise for advanced multimodal intelligence. However, the community still lacks a rigorous…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Hongxiang Li , Yaowei Li , Bin Lin , Yuwei Niu , Yuhang Yang , Xiaoshuang Huang , Jiayin Cai , Xiaolong Jiang , Yao Hu , Long Chen

Unified multimodal models aim to jointly enable visual understanding and generation, yet current benchmarks rarely examine their true integration. Existing evaluations either treat the two abilities in isolation or overlook tasks that…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Kai Zou , Ziqi Huang , Yuhao Dong , Shulin Tian , Dian Zheng , Hongbo Liu , Jingwen He , Bin Liu , Yu Qiao , Ziwei Liu

Generative artificial intelligence (GenAI) models are increasingly used for scientific data generation, yet their alignment with empirical knowledge in urban science remains unclear. Here, we introduce AI4US (Artificial Intelligence for…

Physics and Society · Physics 2025-10-02 Yecheng Zhang , Rong Zhao , Zimu Huang , Xinyu Wang , Yue Ma , Ying Long

Recently, unified multimodal models (UMMs) have made remarkable progress in integrating visual understanding and generation, demonstrating strong potential for complex text-to-image (T2I) tasks. Despite their theoretical promise, a…

Computer Vision and Pattern Recognition · Computer Science 2026-03-09 Jiadong Pan , Liang Li , Yuxin Peng , Yu-Ming Tang , Shuohuan Wang , Yu Sun , Hua Wu , Qingming Huang , Haifeng Wang

We introduce Generative Universal Verifier, a novel concept and plugin designed for next-generation multimodal reasoning in vision-language models and unified multimodal models, providing the fundamental capability of reflection and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-16 Xinchen Zhang , Xiaoying Zhang , Youbin Wu , Yanbin Cao , Renrui Zhang , Ruihang Chu , Ling Yang , Yujiu Yang

We present UniFluid, a unified autoregressive framework for joint visual generation and understanding leveraging continuous visual tokens. Our unified autoregressive architecture processes multimodal image and text inputs, generating…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Lijie Fan , Luming Tang , Siyang Qin , Tianhong Li , Xuan Yang , Siyuan Qiao , Andreas Steiner , Chen Sun , Yuanzhen Li , Tao Zhu , Michael Rubinstein , Michalis Raptis , Deqing Sun , Radu Soricut

The advent of Unified Multimodal Models (UMMs) signals a paradigm shift in artificial intelligence, moving from passive perception to active, cross-modal generation. Despite their unprecedented ability to synthesize information, a critical…

Artificial Intelligence · Computer Science 2026-01-15 Jingxuan Wei , Caijun Jia , Xi Bai , Xinglong Xu , Siyuan Li , Linzhuang Sun , Bihui Yu , Conghui He , Lijun Wu , Cheng Tan

We present Thinking with Generated Images, a novel paradigm that fundamentally transforms how large multimodal models (LMMs) engage with visual reasoning by enabling them to natively think across text and vision modalities through…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Ethan Chern , Zhulin Hu , Steffi Chern , Siqi Kou , Jiadi Su , Yan Ma , Zhijie Deng , Pengfei Liu

Generative Artificial Intelligence (GenAI) presents a governance challenge for STEM assessment. Unrestricted GenAI access enables task outsourcing that undermines the validity of traditional assessments; blanket prohibitions are difficult…

Computers and Society · Computer Science 2026-05-26 Yizhu Gao , Zhongzhou Chen , Min Li , Xiaoming Zhai

With the increasing emphasis on data privacy, the significance of machine unlearning has grown substantially. Class unlearning, which involves enabling a trained model to forget data belonging to a specific class learned before, is…

Machine Learning · Computer Science 2024-06-13 Chenhao Zhang , Shaofei Shen , Yawen Zhao , Weitong Tony Chen , Miao Xu

Generative AI has achieved remarkable empirical success, but from the perspective of statistics it often remains opaque: its predictions may be accurate, yet the underlying mechanism is difficult to interpret, analyze, and trust. This book…

Machine Learning · Statistics 2026-03-11 Shinto Eguchi

Urban design is a multifaceted process that demands careful consideration of site-specific constraints and collaboration among diverse professionals and stakeholders. The advent of generative artificial intelligence (GenAI) offers…

Artificial Intelligence · Computer Science 2025-06-02 Mingyi He , Yuebing Liang , Shenhao Wang , Yunhan Zheng , Qingyi Wang , Dingyi Zhuang , Li Tian , Jinhua Zhao
‹ Prev 1 2 3 10 Next ›