Related papers: GENIUS: Generative Fluid Intelligence Evaluation S…

GENIUS: An Agentic AI Framework for Autonomous Design and Execution of Simulation Protocols

Predictive atomistic simulations have propelled materials discovery, yet routine setup and debugging still demand computer specialists. This know-how gap limits Integrated Computational Materials Engineering (ICME), where state-of-the-art…

Artificial Intelligence · Computer Science 2026-05-25 Mohammad Soleymanibrojeni , Roland Aydin , Diego Guedes-Sobrinho , Alexandre C. Dias , Maurício J. Piotrowski , Wolfgang Wenzel , Celso Ricardo Caldeira Rêgo

GENIUS: A Generative Framework for Universal Multimodal Search

Generative retrieval is an emerging approach in information retrieval that generates identifiers (IDs) of target data based on a query, providing an efficient alternative to traditional embedding-based retrieval methods. However, existing…

Information Retrieval · Computer Science 2025-06-09 Sungyeon Kim , Xinliang Zhu , Xiaofan Lin , Muhammet Bastan , Douglas Gray , Suha Kwak

GENIUS: Generating Usable User Interfaces

In this report we describe the implementation and approach developed during the GENIUS Project. The GENIUS project is about the generation of usable user interfaces. It tries to cope with issues related to automatic generation where,…

Human-Computer Interaction · Computer Science 2013-10-08 Jean-Sebastien Sottet , Alain Vagner

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Unified multimodal models have recently demonstrated strong generative capabilities, yet whether and when generation improves understanding remains unclear. Existing benchmarks lack a systematic exploration of the specific tasks where…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Zimo Wen , Boxiu Li , Wanbo Zhang , Junxiang Lei , Xiaoyu Chen , Yijia Fan , Qi Zhang , Yujiang Wang , Lili Qiu , Bo Li , Ziwei Liu , Caihua Shan , Yifan Yang , Yifei Shen

Reversing the Flow: Generation-to-Understanding Synergy in Large Multimodal Models

The long-standing goal of multimodal AI is to build unified models in which visual understanding and visual generation mutually enhance one another. Despite recent works such as BAGEL, BLIP3o achieves remarkable progress; In practice,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-18 Yujun Tong , Dongliang Chang , Zijin Yin , Xintong Liu , Yuanchen Fang , Zhanyu Ma

GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation

We introduce GENIUS: a conditional text generation model using sketches as input, which can fill in the missing contexts for a given sketch (key information consisting of textual spans, phrases, or words, concatenated by mask tokens).…

Computation and Language · Computer Science 2022-11-21 Biyang Guo , Yeyun Gong , Yelong Shen , Songqiao Han , Hailiang Huang , Nan Duan , Weizhu Chen

Sex differences in predicting fluid intelligence of adolescent brain from T1-weighted MRIs

Fluid intelligence (Gf) has been defined as the ability to reason and solve previously unseen problems. Links to Gf have been found in magnetic resonance imaging (MRI) sequences such as functional MRI and diffusion tensor imaging. As part…

Neurons and Cognition · Quantitative Biology 2019-08-08 Sara Ranjbar , Kyle W. Singleton , Lee Curtin , Susan Christine Massey , Andrea Hawkins-Daarud , Pamela R. Jackson , Kristin R. Swanson

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

We present JanusFlow, a powerful framework that unifies image understanding and generation in a single model. JanusFlow introduces a minimalist architecture that integrates autoregressive language models with rectified flow, a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Yiyang Ma , Xingchao Liu , Xiaokang Chen , Wen Liu , Chengyue Wu , Zhiyu Wu , Zizheng Pan , Zhenda Xie , Haowei Zhang , Xingkai yu , Liang Zhao , Yisong Wang , Jiaying Liu , Chong Ruan

GIR-Bench: Versatile Benchmark for Generating Images with Reasoning

Unified multimodal models integrate the reasoning capacity of large language models with both image understanding and generation, showing great promise for advanced multimodal intelligence. However, the community still lacks a rigorous…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Hongxiang Li , Yaowei Li , Bin Lin , Yuwei Niu , Yuhang Yang , Xiaoshuang Huang , Jiayin Cai , Xiaolong Jiang , Yao Hu , Long Chen

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Unified multimodal models aim to jointly enable visual understanding and generation, yet current benchmarks rarely examine their true integration. Existing evaluations either treat the two abilities in isolation or overlook tasks that…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Kai Zou , Ziqi Huang , Yuhao Dong , Shulin Tian , Dian Zheng , Hongbo Liu , Jingwen He , Bin Liu , Yu Qiao , Ziwei Liu

GenAI Models Capture Urban Science but Oversimplify Complexity

Generative artificial intelligence (GenAI) models are increasingly used for scientific data generation, yet their alignment with empirical knowledge in urban science remains unclear. Here, we introduce AI4US (Artificial Intelligence for…

Physics and Society · Physics 2025-10-02 Yecheng Zhang , Rong Zhao , Zimu Huang , Xinyu Wang , Yue Ma , Ying Long

Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models

Recently, unified multimodal models (UMMs) have made remarkable progress in integrating visual understanding and generation, demonstrating strong potential for complex text-to-image (T2I) tasks. Despite their theoretical promise, a…

Computer Vision and Pattern Recognition · Computer Science 2026-03-09 Jiadong Pan , Liang Li , Yuxin Peng , Yu-Ming Tang , Shuohuan Wang , Yu Sun , Hua Wu , Qingming Huang , Haifeng Wang

Generative Universal Verifier as Multimodal Meta-Reasoner

We introduce Generative Universal Verifier, a novel concept and plugin designed for next-generation multimodal reasoning in vision-language models and unified multimodal models, providing the fundamental capability of reflection and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-16 Xinchen Zhang , Xiaoying Zhang , Youbin Wu , Yanbin Cao , Renrui Zhang , Ruihang Chu , Ling Yang , Yujiu Yang

Unified Autoregressive Visual Generation and Understanding with Continuous Tokens

We present UniFluid, a unified autoregressive framework for joint visual generation and understanding leveraging continuous visual tokens. Our unified autoregressive architecture processes multimodal image and text inputs, generating…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Lijie Fan , Luming Tang , Siyang Qin , Tianhong Li , Xuan Yang , Siyuan Qiao , Andreas Steiner , Chen Sun , Yuanzhen Li , Tao Zhu , Michael Rubinstein , Michalis Raptis , Deqing Sun , Radu Soricut

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

The advent of Unified Multimodal Models (UMMs) signals a paradigm shift in artificial intelligence, moving from passive perception to active, cross-modal generation. Despite their unprecedented ability to synthesize information, a critical…

Artificial Intelligence · Computer Science 2026-01-15 Jingxuan Wei , Caijun Jia , Xi Bai , Xinglong Xu , Siyuan Li , Linzhuang Sun , Bihui Yu , Conghui He , Lijun Wu , Cheng Tan

Thinking with Generated Images

We present Thinking with Generated Images, a novel paradigm that fundamentally transforms how large multimodal models (LMMs) engage with visual reasoning by enabling them to natively think across text and vision modalities through…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Ethan Chern , Zhulin Hu , Steffi Chern , Siqi Kou , Jiadi Su , Yan Ma , Zhijie Deng , Pengfei Liu

Generative AI as a Design Variable: An Evidence-Centered Framework for Principled Governance in STEM Assessment

Generative Artificial Intelligence (GenAI) presents a governance challenge for STEM assessment. Unrestricted GenAI access enables task outsourcing that undermines the validity of traditional assessments; blanket prohibitions are difficult…

Computers and Society · Computer Science 2026-05-26 Yizhu Gao , Zhongzhou Chen , Min Li , Xiaoming Zhai

GENIU: A Restricted Data Access Unlearning for Imbalanced Data

With the increasing emphasis on data privacy, the significance of machine unlearning has grown substantially. Class unlearning, which involves enabling a trained model to forget data belonging to a specific class learned before, is…

Machine Learning · Computer Science 2024-06-13 Chenhao Zhang , Shaofei Shen , Yawen Zhao , Weitong Tony Chen , Miao Xu

Statistical Inference via Generative Models: Flow Matching and Causal Inference

Generative AI has achieved remarkable empirical success, but from the perspective of statistics it often remains opaque: its predictions may be accurate, yet the underlying mechanism is difficult to interpret, analyze, and trust. This book…

Machine Learning · Statistics 2026-03-11 Shinto Eguchi

Generative AI for Urban Design: A Stepwise Approach Integrating Human Expertise with Multimodal Diffusion Models

Urban design is a multifaceted process that demands careful consideration of site-specific constraints and collaboration among diverse professionals and stakeholders. The advent of generative artificial intelligence (GenAI) offers…

Artificial Intelligence · Computer Science 2025-06-02 Mingyi He , Yuebing Liang , Shenhao Wang , Yunhan Zheng , Qingyi Wang , Dingyi Zhuang , Li Tian , Jinhua Zhao