English
Related papers

Related papers: EasyNData: A simple tool to extract numerical valu…

200 papers

Automated data extraction from research texts has been steadily improving, with the emergence of large language models (LLMs) accelerating progress even further. Extracting data from plots in research papers, however, has been such a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Maciej P. Polak , Dane Morgan

The table analysis application TOPCAT uses a custom Java plotting library for highly configurable high-performance interactive or exported visualisations in two and three dimensions. We present here a variety of ways for end users or…

Instrumentation and Methods for Astrophysics · Physics 2014-10-31 M. B. Taylor

Charts are an excellent way to convey patterns and trends in data, but they do not facilitate further modeling of the data or close inspection of individual data points. We present a fully automated system for extracting the numerical…

Computer Vision and Pattern Recognition · Computer Science 2018-10-10 Mathieu Cliche , David Rosenberg , Dhruv Madeka , Connie Yee

Program understanding is an important aspect in Software Maintenance and Reengineering. Understanding the program is related to execution behaviour and relationship of variable involved in the program. The task of finding all statements in…

Software Engineering · Computer Science 2011-08-08 N. Sasirekha , A. Edwin Robert , Dr. M. Hemalatha

There are plenty of excellent plotting libraries. Each excels at a different use case: one is good for printed 2D publication figures, the other at interactive 3D graphics, a third has excellent L A TEX integration or is good for creating…

It is common for authors to communicate their results in graphical figures, but those data are frequently unavailable for reanalysis. Reconstructing data points from a figure manually requires the author to measure the coordinates either on…

Digital Libraries · Computer Science 2018-03-13 Chris Hartgerink , Peter Murray-Rust

Nonparametric statistical tests are useful procedures that can be applied in a wide range of situations, such as testing randomness or goodness of fit, one-sample, two-sample and multiple-sample analysis, association between bivariate…

Computation · Statistics 2015-01-20 J. Derrac , S. García , F. Herrera

Presented here are algorithms for converting between (decimal) scientific-notation and (binary) IEEE-754 double-precision floating-point numbers. By employing a rounding integer quotient operation these algorithms are much simpler than…

Numerical Analysis · Computer Science 2018-08-08 Aubrey Jaffer

We present an application, Superplot, for calculating and plotting statistical quantities relevant to parameter inference from a "chain" of samples drawn from a parameter space, produced by e.g. MultiNest. A simple graphical interface…

Data Analysis, Statistics and Probability · Physics 2016-12-06 Andrew Fowlie , Michael Hugh Bardsley

This paper describes a new modelling language for the effective design of Java annotations. Since their inclusion in the 5th edition of Java, annotations have grown from a useful tool for the addition of meta-data to play a central role in…

Programming Languages · Computer Science 2019-10-02 Irene Córdoba , Juan de Lara

This paper presents an open tool for standardizing the evaluation process of the layout analysis task of document images at pixel level. We introduce a new evaluation tool that is both available as a standalone Java application and as a…

Computer Vision and Pattern Recognition · Computer Science 2018-04-25 Michele Alberti , Manuel Bouillon , Rolf Ingold , Marcus Liwicki

Java implementations of algorithms used by spreadsheets to automatically recompute the set of cells dependent on a changed cell are described using a mathematical model for spreadsheets based on graph theory. These solutions comprise part…

Data Structures and Algorithms · Computer Science 2007-05-23 Joe Francoeur

tabulapdf is an R package that utilizes the Tabula Java library to import tables from PDF files directly into R. This tool can reduce time and effort in data extraction processes in fields like investigative journalism. It allows for…

Information Retrieval · Computer Science 2024-09-24 Mauricio Vargas Sepúlveda , Thomas J. Leeper , Tom Paskhalis , Manuel Aristarán , Jeremy B. Merrill , Mike Tigas

Dependencies between types in object-oriented software can be viewed as directed graphs, with types as nodes and dependencies as edges. The in-degree and out-degree distributions of such graphs have quite different forms, with the former…

Software Engineering · Computer Science 2008-02-19 G. J. Baxter , M. R. Frean

Proponents of software verification have argued that simpler code is easier to verify: that is, that verification tools issue fewer false positives and require less human intervention when analyzing simpler code. We empirically validate…

Software Engineering · Computer Science 2023-11-01 Kobi Feldman , Martin Kellogg , Oscar Chaparro

Most search engines index the textual content of documents in digital libraries. However, scholarly articles frequently report important findings in figures for visual impact and the contents of these figures are not indexed. These contents…

Computer Vision and Pattern Recognition · Computer Science 2008-09-11 William Brouwer , Saurabh Kataria , Sujatha Das , Prasenjit Mitra , C. L. Giles

Numerical data processing is a key task across different fields of computer technology use. However, even simple summation of values is not precise due to the floating point representation use. This paper presents a practical algorithm for…

Data Structures and Algorithms · Computer Science 2022-11-09 Vaclav Skala

The paper advocates the use of a statistical tool dedicated to the exploration of data samples populated by several sources of events. This new technique, called sPlot, is able to unfold the contributions of the different sources to the…

Data Analysis, Statistics and Probability · Physics 2017-08-23 Muriel Pivk

Scientific software is one of the key elements for reproducible research. However, classic publications and related scientific software are typically not (sufficiently) linked, and it lacks tools to jointly explore these artefacts. In this…

Digital Libraries · Computer Science 2018-06-21 Anett Hoppe , Jascha Hagen , Helge Holzmann , Günter Kniesel , Ralph Ewerth

One major challenge in science is to make all results potentially reproducible. Thus, along with the raw data, every step from basic processing of the data, evaluation, to the generation of the figures, has to be documented as clearly as…

Graphics · Computer Science 2020-07-31 Richard Gerum
‹ Prev 1 2 3 10 Next ›