Related papers: WebRPG: Automatic Web Rendering Parameters Generat…

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Existing web-generation benchmarks rely on text prompts or static screenshots as input. However, videos naturally convey richer signals such as interaction flow, transition timing, and motion continuity, which are essential for faithful…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Yuhong Dai , Yanlin Lai , Mitt Huang , Hangyu Guo , Dingming Li , Hongbo Peng , Haodong Li , Yingxiu Zhao , Haoran Lyu , Zheng Ge , Xiangyu Zhang , Daxin Jiang

GraphRCG: Self-Conditioned Graph Generation

Graph generation generally aims to create new graphs that closely align with a specific graph distribution. Existing works often implicitly capture this distribution through the optimization of generators, potentially overlooking the…

Machine Learning · Computer Science 2024-07-19 Song Wang , Zhen Tan , Xinyu Zhao , Tianlong Chen , Huan Liu , Jundong Li

SceneRAG: Scene-level Retrieval-Augmented Generation for Video Understanding

Despite recent advances in retrieval-augmented generation (RAG) for video understanding, effectively understanding long-form video content remains underexplored due to the vast scale and high complexity of video data. Current RAG approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Nianbo Zeng , Haowen Hou , Fei Richard Yu , Si Shi , Ying Tiffany He

StreamingRAG: Real-time Contextual Retrieval and Generation Framework

Extracting real-time insights from multi-modal data streams from various domains such as healthcare, intelligent transportation, and satellite remote sensing remains a challenge. High computational demands and limited knowledge scope…

Computer Vision and Pattern Recognition · Computer Science 2025-01-27 Murugan Sankaradas , Ravi K. Rajendran , Srimat T. Chakradhar

HedraRAG: Coordinating LLM Generation and Database Retrieval in Heterogeneous RAG Serving

This paper addresses emerging system-level challenges in heterogeneous retrieval-augmented generation (RAG) serving, where complex multi-stage workflows and diverse request patterns complicate efficient execution. We present HedraRAG, a…

Databases · Computer Science 2025-07-15 Zhengding Hu , Vibha Murthy , Zaifeng Pan , Wanlu Li , Xiaoyi Fang , Yufei Ding , Yuke Wang

State of the Art on Neural Rendering

Efficient rendering of photo-realistic virtual worlds is a long standing effort of computer graphics. Modern graphics techniques have succeeded in synthesizing photo-realistic images from hand-crafted scene representations. However, the…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Ayush Tewari , Ohad Fried , Justus Thies , Vincent Sitzmann , Stephen Lombardi , Kalyan Sunkavalli , Ricardo Martin-Brualla , Tomas Simon , Jason Saragih , Matthias Nießner , Rohit Pandey , Sean Fanello , Gordon Wetzstein , Jun-Yan Zhu , Christian Theobalt , Maneesh Agrawala , Eli Shechtman , Dan B Goldman , Michael Zollhöfer

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

The growing prevalence of visually rich documents, such as webpages and scanned/digital-born documents (images, PDFs, etc.), has led to increased interest in automatic document understanding and information extraction across academia and…

Computation and Language · Computer Science 2024-02-29 Hongshen Xu , Lu Chen , Zihan Zhao , Da Ma , Ruisheng Cao , Zichen Zhu , Kai Yu

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Retrieval-Augmented Generation (RAG) is a powerful strategy for improving the factual accuracy of models by retrieving external knowledge relevant to queries and incorporating it into the generation process. However, existing approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Soyeong Jeong , Kangsan Kim , Jinheon Baek , Sung Ju Hwang

PosterReward: Unlocking Accurate Evaluation for High-Quality Graphic Design Generation

Recent advancements in the text-rendering capabilities of image generation models have made the end-to-end creation of graphic design content, such as posters, increasingly feasible. However, existing reward models fall short of accurately…

Graphics · Computer Science 2026-04-01 Jianyu Lai , Sixiang Chen , Jialin Gao , Hengyu Shi , Zhongying Liu , Fuxiang Zhai , Junfeng Luo , Xiaoming Wei , Lujia Wang , Lei Zhu

AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation

Web scraping is a powerful technique that extracts data from websites, enabling automated data collection, enhancing data analysis capabilities, and minimizing manual data entry efforts. Existing methods, wrappers-based methods suffer from…

Computation and Language · Computer Science 2024-09-27 Wenhao Huang , Zhouhong Gu , Chenghao Peng , Zhixu Li , Jiaqing Liang , Yanghua Xiao , Liqian Wen , Zulong Chen

BlenderRAG: High-Fidelity 3D Object Generation via Retrieval-Augmented Code Synthesis

Automatic generation of executable Blender code from natural language remains challenging, with state-of-the-art LLMs producing frequent syntactic errors and geometrically inconsistent objects. We present BlenderRAG, a retrieval-augmented…

Computer Vision and Pattern Recognition · Computer Science 2026-05-04 Massimo Rondelli , Francesco Pivi , Maurizio Gabbrielli

Visual definition of procedures for automatic virtual scene generation

With more and more digital media, especially in the field of virtual reality where detailed and convincing scenes are much required, procedural scene generation is a big helping tool for artists. A problem is that defining scene…

Graphics · Computer Science 2012-02-15 Drazen Lucanin

WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning

While Large Language Models (LLMs) excel at function-level code generation, project-level tasks such as generating functional and visually aesthetic multi-page websites remain highly challenging. Existing works are often limited to…

Computation and Language · Computer Science 2026-04-23 Juyong Jiang , Chenglin Cai , Chansung Park , Jiasi Shen , Sunghun Kim , Jianguo Li , Yue Wang

WebGen-V Bench: Structured Representation for Enhancing Visual Design in LLM-based Web Generation and Evaluation

Witnessed by the recent advancements on leveraging LLM for coding and multimodal understanding, we present WebGen-V, a new benchmark and framework for instruction-to-HTML generation that enhances both data quality and evaluation…

Artificial Intelligence · Computer Science 2025-10-20 Kuang-Da Wang , Zhao Wang , Yotaro Shimose , Wei-Yao Wang , Shingo Takamatsu

Graph Retrieval-Augmented Generation: A Survey

Recently, Retrieval-Augmented Generation (RAG) has achieved remarkable success in addressing the challenges of Large Language Models (LLMs) without necessitating retraining. By referencing an external knowledge base, RAG refines LLM…

Artificial Intelligence · Computer Science 2024-09-11 Boci Peng , Yun Zhu , Yongchao Liu , Xiaohe Bo , Haizhou Shi , Chuntao Hong , Yan Zhang , Siliang Tang

Retrieval-Augmented Generation with Graphs (GraphRAG)

Retrieval-augmented generation (RAG) is a powerful technique that enhances downstream task execution by retrieving additional information, such as knowledge, skills, and tools from external sources. Graph, by its intrinsic "nodes connected…

Information Retrieval · Computer Science 2025-01-09 Haoyu Han , Yu Wang , Harry Shomer , Kai Guo , Jiayuan Ding , Yongjia Lei , Mahantesh Halappanavar , Ryan A. Rossi , Subhabrata Mukherjee , Xianfeng Tang , Qi He , Zhigang Hua , Bo Long , Tong Zhao , Neil Shah , Amin Javari , Yinglong Xia , Jiliang Tang

Efficient Dynamic Attributed Graph Generation

Data generation is a fundamental research problem in data management due to its diverse use cases, ranging from testing database engines to data-specific applications. However, real-world entities often involve complex interactions that…

Databases · Computer Science 2024-12-13 Fan Li , Xiaoyang Wang , Dawei Cheng , Cong Chen , Ying Zhang , Xuemin Lin

Generative Colorization of Structured Mobile Web Pages

Color is a critical design factor for web pages, affecting important factors such as viewer emotions and the overall trust and satisfaction of a website. Effective coloring requires design knowledge and expertise, but if this process could…

Computer Vision and Pattern Recognition · Computer Science 2023-01-24 Kotaro Kikuchi , Naoto Inoue , Mayu Otani , Edgar Simo-Serra , Kota Yamaguchi

GROWN+UP: A Graph Representation Of a Webpage Network Utilizing Pre-training

Large pre-trained neural networks are ubiquitous and critical to the success of many downstream tasks in natural language processing and computer vision. However, within the field of web information retrieval, there is a stark contrast in…

Machine Learning · Computer Science 2022-10-28 Benedict Yeoh , Huijuan Wang

ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding

Understanding visual art requires reasoning across multiple perspectives -- cultural, historical, and stylistic -- beyond mere object recognition. While recent multimodal large language models (MLLMs) perform well on general image…

Artificial Intelligence · Computer Science 2025-09-08 Shuai Wang , Ivona Najdenkoska , Hongyi Zhu , Stevan Rudinac , Monika Kackovic , Nachoem Wijnberg , Marcel Worring