Related papers: WebRPG: Automatic Web Rendering Parameters Generat…

200 papers

Existing web-generation benchmarks rely on text prompts or static screenshots as input. However, videos naturally convey richer signals such as interaction flow, transition timing, and motion continuity, which are essential for faithful…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Yuhong Dai, Yanlin Lai, Mitt Huang, Hangyu Guo, Dingming Li, Hongbo Peng, Haodong Li, Yingxiu Zhao, Haoran Lyu, Zheng Ge, Xiangyu Zhang, Daxin Jiang

Graph generation generally aims to create new graphs that closely align with a specific graph distribution. Existing works often implicitly capture this distribution through the optimization of generators, potentially overlooking the…

Machine Learning · Computer Science 2024-07-19 Song Wang, Zhen Tan, Xinyu Zhao, Tianlong Chen, Huan Liu, Jundong Li

Despite recent advances in retrieval-augmented generation (RAG) for video understanding, effectively understanding long-form video content remains underexplored due to the vast scale and high complexity of video data. Current RAG approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Nianbo Zeng, Haowen Hou, Fei Richard Yu, Si Shi, Ying Tiffany He

Extracting real-time insights from multi-modal data streams from various domains such as healthcare, intelligent transportation, and satellite remote sensing remains a challenge. High computational demands and limited knowledge scope…

Computer Vision and Pattern Recognition · Computer Science 2025-01-27 Murugan Sankaradas, Ravi K. Rajendran, Srimat T. Chakradhar

This paper addresses emerging system-level challenges in heterogeneous retrieval-augmented generation (RAG) serving, where complex multi-stage workflows and diverse request patterns complicate efficient execution. We present HedraRAG, a…

Databases · Computer Science 2025-07-15 Zhengding Hu, Vibha Murthy, Zaifeng Pan, Wanlu Li, Xiaoyi Fang, Yufei Ding, Yuke Wang

Efficient rendering of photo-realistic virtual worlds is a long standing effort of computer graphics. Modern graphics techniques have succeeded in synthesizing photo-realistic images from hand-crafted scene representations. However, the…

The growing prevalence of visually rich documents, such as webpages and scanned/digital-born documents (images, PDFs, etc.), has led to increased interest in automatic document understanding and information extraction across academia and…

Computation and Language · Computer Science 2024-02-29 Hongshen Xu, Lu Chen, Zihan Zhao, Da Ma, Ruisheng Cao, Zichen Zhu, Kai Yu

Retrieval-Augmented Generation (RAG) is a powerful strategy for improving the factual accuracy of models by retrieving external knowledge relevant to queries and incorporating it into the generation process. However, existing approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Soyeong Jeong, Kangsan Kim, Jinheon Baek, Sung Ju Hwang

Recent advancements in the text-rendering capabilities of image generation models have made the end-to-end creation of graphic design content, such as posters, increasingly feasible. However, existing reward models fall short of accurately…

Web scraping is a powerful technique that extracts data from websites, enabling automated data collection, enhancing data analysis capabilities, and minimizing manual data entry efforts. Existing methods, wrappers-based methods suffer from…

Computation and Language · Computer Science 2024-09-27 Wenhao Huang, Zhouhong Gu, Chenghao Peng, Zhixu Li, Jiaqing Liang, Yanghua Xiao, Liqian Wen, Zulong Chen

Automatic generation of executable Blender code from natural language remains challenging, with state-of-the-art LLMs producing frequent syntactic errors and geometrically inconsistent objects. We present BlenderRAG, a retrieval-augmented…

Computer Vision and Pattern Recognition · Computer Science 2026-05-04 Massimo Rondelli, Francesco Pivi, Maurizio Gabbrielli

With more and more digital media, especially in the field of virtual reality where detailed and convincing scenes are much required, procedural scene generation is a big helping tool for artists. A problem is that defining scene…

Graphics · Computer Science 2012-02-15 Drazen Lucanin

While Large Language Models (LLMs) excel at function-level code generation, project-level tasks such as generating functional and visually aesthetic multi-page websites remain highly challenging. Existing works are often limited to…

Computation and Language · Computer Science 2026-04-23 Juyong Jiang, Chenglin Cai, Chansung Park, Jiasi Shen, Sunghun Kim, Jianguo Li, Yue Wang

Witnessed by the recent advancements on leveraging LLM for coding and multimodal understanding, we present WebGen-V, a new benchmark and framework for instruction-to-HTML generation that enhances both data quality and evaluation…

Artificial Intelligence · Computer Science 2025-10-20 Kuang-Da Wang, Zhao Wang, Yotaro Shimose, Wei-Yao Wang, Shingo Takamatsu

Recently, Retrieval-Augmented Generation (RAG) has achieved remarkable success in addressing the challenges of Large Language Models (LLMs) without necessitating retraining. By referencing an external knowledge base, RAG refines LLM…

Artificial Intelligence · Computer Science 2024-09-11 Boci Peng, Yun Zhu, Yongchao Liu, Xiaohe Bo, Haizhou Shi, Chuntao Hong, Yan Zhang, Siliang Tang

Retrieval-augmented generation (RAG) is a powerful technique that enhances downstream task execution by retrieving additional information, such as knowledge, skills, and tools from external sources. Graph, by its intrinsic "nodes connected…

Data generation is a fundamental research problem in data management due to its diverse use cases, ranging from testing database engines to data-specific applications. However, real-world entities often involve complex interactions that…

Databases · Computer Science 2024-12-13 Fan Li, Xiaoyang Wang, Dawei Cheng, Cong Chen, Ying Zhang, Xuemin Lin

Color is a critical design factor for web pages, affecting important factors such as viewer emotions and the overall trust and satisfaction of a website. Effective coloring requires design knowledge and expertise, but if this process could…

Computer Vision and Pattern Recognition · Computer Science 2023-01-24 Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi

Large pre-trained neural networks are ubiquitous and critical to the success of many downstream tasks in natural language processing and computer vision. However, within the field of web information retrieval, there is a stark contrast in…

Machine Learning · Computer Science 2022-10-28 Benedict Yeoh, Huijuan Wang

Understanding visual art requires reasoning across multiple perspectives -- cultural, historical, and stylistic -- beyond mere object recognition. While recent multimodal large language models (MLLMs) perform well on general image…

Artificial Intelligence · Computer Science 2025-09-08 Shuai Wang, Ivona Najdenkoska, Hongyi Zhu, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring