Related papers: Geo-FuB: A Method for Constructing an Operator-Fun…

GEE-OPs: An Operator Knowledge Base for Geospatial Code Generation on the Google Earth Engine Platform Powered by Large Language Models

As the scale and complexity of spatiotemporal data continue to grow rapidly, the use of geospatial modeling on the Google Earth Engine (GEE) platform presents dual challenges: improving the coding efficiency of domain experts and enhancing…

Software Engineering · Computer Science 2024-12-12 Shuyang Hou , Jianyuan Liang , Anqi Zhao , Huayi Wu

Spatial-RAG: Spatial Retrieval Augmented Generation for Real-World Geospatial Reasoning Questions

Answering real-world geospatial questions--such as finding restaurants along a travel route or amenities near a landmark--requires reasoning over both geographic relationships and semantic user intent. However, existing large language…

Information Retrieval · Computer Science 2025-06-12 Dazhou Yu , Riyang Bao , Ruiyu Ning , Jinghong Peng , Gengchen Mai , Liang Zhao

GeoCode-GPT: A Large Language Model for Geospatial Code Generation Tasks

The increasing demand for spatiotemporal data and modeling tasks in geosciences has made geospatial code generation technology a critical factor in enhancing productivity. Although large language models (LLMs) have demonstrated potential in…

Software Engineering · Computer Science 2025-03-11 Shuyang Hou , Zhangxiao Shen , Anqi Zhao , Jianyuan Liang , Zhipeng Gui , Xuefeng Guan , Rui Li , Huayi Wu

Can Large Language Models Generate Geospatial Code?

With the growing demand for spatiotemporal data processing and geospatial modeling, automating geospatial code generation has become essential for productivity. Large language models (LLMs) show promise in code generation but face…

Software Engineering · Computer Science 2024-10-21 Shuyang Hou , Zhangxiao Shen , Jianyuan Liang , Anqi Zhao , Zhipeng Gui , Rui Li , Huayi Wu

AutoGEEval: A Multimodal and Automated Framework for Geospatial Code Generation on GEE with Large Language Models

Geospatial code generation is emerging as a key direction in the integration of artificial intelligence and geoscientific analysis. However, there remains a lack of standardized tools for automatic evaluation in this domain. To address this…

Software Engineering · Computer Science 2025-05-20 Shuyang Hou , Zhangxiao Shen , Huayi Wu , Jianyuan Liang , Haoyue Jiao , Yaxian Qing , Xiaopu Zhang , Xu Li , Zhipeng Gui , Xuefeng Guan , Longgang Xiang

AutoGEEval++: A Multi-Level and Multi-Geospatial-Modality Automated Evaluation Framework for Large Language Models in Geospatial Code Generation on Google Earth Engine

Geospatial code generation is becoming a key frontier in integrating artificial intelligence with geo-scientific analysis, yet standardised automated evaluation tools for this task remain absent. This study presents AutoGEEval++, an…

Software Engineering · Computer Science 2025-06-13 Shuyang Hou , Zhangxiao Shen , Huayi Wu , Haoyue Jiao , Ziqi Liu , Lutong Xie , Chang Liu , Jianyuan Liang , Yaxian Qing , Xiaopu Zhang , Dehua Peng , Zhipeng Gui , Xuefeng Guan

GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation

Retrieval-augmented generation (RAG) has proven effective in integrating knowledge into large language models (LLMs). However, conventional RAGs struggle to capture complex relationships between pieces of knowledge, limiting their…

Information Retrieval · Computer Science 2025-12-12 Linhao Luo , Zicheng Zhao , Gholamreza Haffari , Dinh Phung , Chen Gong , Shirui Pan

An LLM Agent for Automatic Geospatial Data Analysis

Large language models (LLMs) are being used in data science code generation tasks, but they often struggle with complex sequential tasks, leading to logical errors. Their application to geospatial data processing is particularly challenging…

Computers and Society · Computer Science 2024-10-28 Yuxing Chen , Weijie Wang , Sylvain Lobry , Camille Kurtz

GeoJSEval: An Automated Evaluation Framework for Large Language Models on JavaScript-Based Geospatial Computation and Visualization Code Generation

With the widespread adoption of large language models (LLMs) in code generation tasks, geospatial code generation has emerged as a critical frontier in the integration of artificial intelligence and geoscientific analysis. This trend…

Software Engineering · Computer Science 2025-10-09 Guanyu Chen , Haoyue Jiao , Shuyang Hou , Ziqi Liu , Lutong Xie , Shaowen Wu , Huayi Wu , Xuefeng Guan , Zhipeng Gui

Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systems

The development of generative large language models (G-LLM) opened up new opportunities for the development of new types of knowledge-based systems similar to ChatGPT, Bing, or Gemini. Fine-tuning (FN) and Retrieval-Augmented Generation…

Computation and Language · Computer Science 2025-05-13 Robert Lakatos , Peter Pollner , Andras Hajdu , Tamas Joo

Evaluation of Code LLMs on Geospatial Code Generation

Software development support tools have been studied for a long time, with recent approaches using Large Language Models (LLMs) for code generation. These models can generate Python code for data science and machine learning applications.…

Computation and Language · Computer Science 2024-12-16 Piotr Gramacki , Bruno Martins , Piotr Szymański

RAG for Geoscience: What We Expect, Gaps and Opportunities

Retrieval-Augmented Generation (RAG) enhances language models by combining retrieval with generation. However, its current workflow remains largely text-centric, limiting its applicability in geoscience. Many geoscientific tasks are…

Emerging Technologies · Computer Science 2025-08-18 Runlong Yu , Shiyuan Luo , Rahul Ghosh , Lingyao Li , Yiqun Xie , Xiaowei Jia

GeoLLM: Extracting Geospatial Knowledge from Large Language Models

The application of machine learning (ML) in a range of geospatial tasks is increasingly common but often relies on globally available covariates such as satellite imagery that can either be expensive or lack predictive power. Here we…

Computation and Language · Computer Science 2024-02-27 Rohin Manvi , Samar Khanna , Gengchen Mai , Marshall Burke , David Lobell , Stefano Ermon

Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches

Recent advances in large language models (LLMs) have significantly improved automated code generation. While existing approaches have achieved strong performance at the function and file levels, real-world software engineering requires…

Software Engineering · Computer Science 2026-05-21 Yicheng Tao , Yuante Li , Yao Qin , Yepang Liu

GeoGPT-RAG Technical Report

GeoGPT is an open large language model system built to advance research in the geosciences. To enhance its domain-specific capabilities, we integrated Retrieval Augmented Generation(RAG), which augments model outputs with relevant…

Information Retrieval · Computer Science 2025-09-16 Fei Huang , Fan Wu , Zeqing Zhang , Qihao Wang , Long Zhang , Grant Michael Boquet , Hongyang Chen

GeoJSON Agents:A Multi-Agent LLM Architecture for Geospatial Analysis-Function Calling vs Code Generation

Large Language Models (LLMs) have demonstrated substantial progress in task automation and natural language understanding. However, without domain expertise in geographic information science (GIS), they continue to encounter limitations…

Software Engineering · Computer Science 2025-12-04 Qianqian Luo , Qingming Lin , Liuchang Xu , Sensen Wu , Ruichen Mao , Chao Wang , Hailin Feng , Bo Huang , Zhenhong Du

Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely

Large language models (LLMs) augmented with external data have demonstrated remarkable capabilities in completing real-world tasks. Techniques for integrating external data into LLMs, such as Retrieval-Augmented Generation (RAG) and…

Computation and Language · Computer Science 2024-09-24 Siyun Zhao , Yuqing Yang , Zilong Wang , Zhiyuan He , Luna K. Qiu , Lili Qiu

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate…

Computation and Language · Computer Science 2021-04-13 Patrick Lewis , Ethan Perez , Aleksandra Piktus , Fabio Petroni , Vladimir Karpukhin , Naman Goyal , Heinrich Küttler , Mike Lewis , Wen-tau Yih , Tim Rocktäschel , Sebastian Riedel , Douwe Kiela

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Implementing Retrieval-Augmented Generation (RAG) systems is inherently complex, requiring deep understanding of data, use cases, and intricate design decisions. Additionally, evaluating these systems presents significant challenges,…

Computation and Language · Computer Science 2024-08-06 Daniel Fleischer , Moshe Berchansky , Moshe Wasserblat , Peter Izsak

Retrieval Feedback Memory Enhancement Large Model Retrieval Generation Method

Large Language Models (LLMs) have shown remarkable capabilities across diverse tasks, yet they face inherent limitations such as constrained parametric knowledge and high retraining costs. Retrieval-Augmented Generation (RAG) augments the…

Information Retrieval · Computer Science 2025-08-26 Leqian Li , Dianxi Shi , Jialu Zhou , Xinyu Wei , Mingyue Yang , Songchang Jin , Shaowu Yang