English
Related papers

Related papers: Pitfalls and Guidelines for Using Time-Based Git D…

200 papers

Many software engineering research papers rely on time-based data (e.g., commit timestamps, issue report creation/update/close dates, release dates). Like most real-world data however, time-based data is often dirty. To date, there are no…

Software Engineering · Computer Science 2021-03-23 Samuel W. Flint , Jigyasa Chauhan , Robert Dyer

Data from software repositories have become an important foundation for the empirical study of software engineering processes. A recurring theme in the repository mining literature is the inference of developer networks capturing e.g.…

Software Engineering · Computer Science 2019-11-22 Christoph Gote , Ingo Scholtes , Frank Schweitzer

In open-source software development environments; textual, numerical and relationship-based data generated are of interest to researchers. Various data sets are available for this data, which is frequently used in areas such as software…

Software Engineering · Computer Science 2020-10-01 Abdulkadir Şeker , Banu Diri , Halil Arslan

With the advent of open source software, a veritable treasure trove of previously proprietary software development data was made available. This opened the field of empirical software engineering research to anyone in academia. Data that is…

Software Engineering · Computer Science 2022-04-19 Adam Tutko , Austin Z. Henley , Audris Mockus

GitHub's issue reports provide developers with valuable information that is essential to the evolution of a software development project. Contributors can use these reports to perform software engineering tasks like submitting bugs,…

Software Engineering · Computer Science 2023-03-22 Nafiseh Nikeghbal , Amir Hossein Kargaran , Abbas Heydarnoori , Hinrich Schütze

Data from software repositories have become an important foundation for the empirical study of software engineering processes. A recurring theme in the repository mining literature is the inference of developer networks capturing e.g.…

Software Engineering · Computer Science 2019-11-22 Christoph Gote , Ingo Scholtes , Frank Schweitzer

Large project overruns and overtime work have been reported in the software industry, resulting in additional expense for companies and personal issues for developers. The present work aims to provide an overview of studies related to time…

Software Engineering · Computer Science 2020-04-23 Miikka Kuutila , Mika Mäntylä , Umar Farooq , Maëlick Claes

Git is used as the distributed version control system for many open-source software projects. One Git-based service, GitHub, is the most common code hosting and repository service for open-source software projects. For researchers that…

Software Engineering · Computer Science 2021-01-22 Abdulkadir Şeker , Banu Diri , Halil Arslan , Mehmet Fatih Amasyalı

Context: The establishment of the Mining Software Repositories (MSR) data showcase conference track has encouraged researchers to provide data sets as a basis for further empirical studies. Objective: Examine the usage of data papers…

Software Engineering · Computer Science 2021-11-30 Zoe Kotti , Konstantinos Kravvaritis , Konstantina Dritsa , Diomidis Spinellis

Background: Data mining and analyzing of public Git software repositories is a growing research field. The tools used for studies that investigate a single project or a group of projects have been refined, but it is not clear whether the…

Software Engineering · Computer Science 2020-08-18 Adam Tutko , Austin Henley , Audris Mockus

Researchers often delve into the connections between different factors derived from the historical data of software projects. For example, scholars have devoted their endeavors to the exploration of associations among these factors.…

Software Engineering · Computer Science 2023-11-14 Mikel Robredo , Nyyti Saarimaki , Rafael Penaloza , Valentina Lenarduzzi

Under the data-driven research paradigm, research software has come to play crucial roles in nearly every stage of scientific inquiry. Scholars are advocating for the formal citation of software in academic publications, treating it on par…

Digital Libraries · Computer Science 2023-07-19 Yuzhuo Wang , Kai Li

Without sufficient information about research data practices occurring in a particular research organisation, there is a risk of mismatching research data service efforts with the needs of its researchers. This study describes how data…

Digital Libraries · Computer Science 2022-05-12 Antti Mikael Rousi

Energy efficiency has become a growing concern in software development, leading to the need for tools designed to measure energy consumption. While several energy measurement tools are available as open-source projects, their…

Software Engineering · Computer Science 2026-03-24 Manuela Bechara Cannizza , Michel Albonico

Modern programming languages like Java require runtime systems to support the implementation and deployment of software applications in diverse computing platforms and operating systems. These runtime systems are normally developed in…

Software Engineering · Computer Science 2023-10-25 Salma Begum Tamanna , Gias Uddin , Lan Xia , Longyu Zhang

Millions of developers share their code on open-source platforms like GitHub, which offer social coding opportunities such as distributed collaboration and popularity-based ranking. Software engineering researchers have joined in as well,…

Software Engineering · Computer Science 2024-04-08 Kamel Alrashedy , Ahmed Binjahlan

Almost every Mining Software Repositories (MSR) study requires, as first step, the selection of the subject software repositories. These repositories are usually collected from hosting services like GitHub using specific selection criteria…

Software Engineering · Computer Science 2021-03-09 Ozren Dabic , Emad Aghajani , Gabriele Bavota

Recent claims about the impressive abilities of large language models (LLMs) are often supported by evaluating publicly available benchmarks. Since LLMs train on wide swaths of the internet, this practice raises concerns of data…

Computation and Language · Computer Science 2023-10-17 Manley Roberts , Himanshu Thakur , Christine Herlihy , Colin White , Samuel Dooley

Logs are widely used to record runtime information of software systems, such as the timestamp and the importance of an event, the unique ID of the source of the log, and a part of the state of a task's execution. The rich information of…

Software Engineering · Computer Science 2022-01-04 Sina Gholamian , Paul A. S. Ward

Data is a cornerstone of empirical software engineering (ESE) research and practice. Data underpin numerous process and project management activities, including the estimation of development effort and the prediction of the likely location…

Software Engineering · Computer Science 2020-12-22 Michael F. Bosu , Stephen G. MacDonell
‹ Prev 1 2 3 10 Next ›