Digital Libraries - Scifaro

Quantifying Lifetime Productivity Changes: A Longitudinal Study of 320,000 Late-Career Scientists

The present study focuses on persistence in research productivity over the course of an individual's entire scientific career. We track 'late-career' scientists - scientists with at least 25 years of publishing experience (N=320,564) - in…

Digital Libraries · Computer Science 2026-05-11 Marek Kwiek, Lukasz Szymula

The Rise and Fall of the Initial Era

Bibliographic data is a rich source of information that goes beyond the use cases of location and citation -- it also encodes both cultural and technological context. For most of its existence, the scholarly record has changed slowly and…

Digital Libraries · Computer Science 2026-05-11 Simon J Porter, Daniel W Hook

Young Male and Female Scientists: A Quantitative Exploratory Study of the Changing Demographics of the Global Scientific Workforce

In this study, the global scientific workforce is explored through large-scale, generational, cross-sectional, and longitudinal approaches. We examine 4.3 million nonoccasional scientists from 38 OECD countries publishing in 1990-2021. Our…

Digital Libraries · Computer Science 2026-05-11 Marek Kwiek, Lukasz Szymula

Mapping the Landscape of Open Access Dashboards -- A Dataset for Research and Infrastructure Development

As Open Access continues to gain importance in science policy, understanding the proportion of Open Access publications relative to the total research output of research-performing organizations, individual countries, or even globally has…

Digital Libraries · Computer Science 2026-05-08 Johannes Schneider, Heinz Pampel

Science discussions of retracted articles on Bluesky: public scrutiny or misinformation spreading?

Post-publication peer review (PPPR) has emerged as an important supplement to traditional peer review, with social media playing a growing role in publicising potential problems in published research. However, it remains unclear whether…

Digital Libraries · Computer Science 2026-05-07 Er-Te Zheng, Hui-Zhen Fu, Xiaorui Jiang, Zhichao Fang, Mike Thelwall

A Skill-Based AI Agentic Pipeline for Library of Congress Subject Indexing

This paper presents a modular AI agentic skill pipeline for automating subject indexing with Library of Congress Subject Headings (LCSH). Subject indexing - the process of analyzing a work's aboutness, selecting controlled vocabulary terms,…

Digital Libraries · Computer Science 2026-05-06 Eric H. C. Chow

Intelligent Knowledge Mining Framework: Bridging AI Analysis and Trustworthy Preservation

The unprecedented proliferation of digital data presents significant challenges in access, integration, and value creation across all data-intensive sectors. Valuable information is frequently encapsulated within disparate systems,…

Digital Libraries · Computer Science 2026-05-06 Binh Vu

Liberata -- Graph Scientometrics for a Share Based System of Academic Publishing

Contemporary scientometric indicators remain anchored in paradigms and axioms from when academic research was conducted in small scholarly communities. With the global proliferation of scientific research, academia is now organized in large…

Digital Libraries · Computer Science 2026-05-05 Han Zhang, Anshuman Sabath, Timothy W. Dunn, L. Catherine Brinson

HERITRACE: a domain-agnostic framework for SHACL-driven RDF curation with provenance and change tracking

HERITRACE is an open-source web application that enables users without Semantic Web expertise to curate RDF data through form-based interfaces with automatic provenance documentation and change tracking in RDF. It uses SHACL for data model…

Digital Libraries · Computer Science 2026-05-05 Arcangelo Massari, Silvio Peroni

Comparison of OpenAlex and Scopus coverage of German institutions' publications in top-tier journals

OpenAlex has recently emerged as a leading alternative to proprietary bibliometric sources. However, concerns remain regarding the quality of its metadata, especially the institutional profiles which are crucial for evaluating…

Digital Libraries · Computer Science 2026-05-05 Andrey Lovakov, Ivan Sterligov

Measuring research data reuse in scholarly publications using generative artificial intelligence: Open Science Indicator development and preliminary results

Numerous metascience studies and other initiatives have begun to monitor the prevalence of open science practices when it is more important to understand the 'downstream' effects or impacts of open science. PLOS and DataSeer have developed…

Digital Libraries · Computer Science 2026-05-01 Lauren Cadwallader, Iain Hrynaszkiewicz, parth sarin, Tim Vines

Cross-lingual Comparison of Research Funding Projects with Multilingual Sentence-BERT: Evidence from KAKENHI, NIH, NSF, and UKRI

Cross-national comparison of research funding projects is increasingly important for science policy and strategic planning, but language differences remain a major obstacle. In particular, KAKENHI project descriptions are written primarily…

Digital Libraries · Computer Science 2026-05-01 Miki Kimura-Ida

Goals and Strategies for the Indexing of Publication Types and Study Designs

Objectives. Major research and implementation efforts have been devoted to indexing articles according to the major topics discussed, but much less effort to indexing their publication types and study designs (collectively, PTs). In this…

Digital Libraries · Computer Science 2026-05-01 Neil R. Smalheiser, Joe D. Menke, Arthur W. Holt, Halil Kilicoglu, Jodi Schneider

Influential scientists shape knowledge flows between science and IGO policy

Intergovernmental organizations (IGOs) increasingly rely on scientific evidence, yet the pathways through which scientific research enters policy remain opaque. By linking 230,737 scientific papers cited in IGO policy documents (2015-2023)…

Digital Libraries · Computer Science 2026-05-01 Kimitaka Asatani, Yurie Iwata, Yuta Tomokiyo, Basil Mahfouz, Masaru Yarime, Ichiro Sakata

Do E-Scooter Speed Governance Policies Reduce Harsh Acceleration and Deceleration? Evidence from 19.5 Million Trips Around a Regulatory Ban

Do e-scooter speed governance policies yield behavioral safety gains beyond the mechanical cap they impose? A firmware ceiling mechanically prevents speeding, but whether the same riders also generate fewer harsh accelerations and harsh…

Digital Libraries · Computer Science 2026-04-30 Seongjin Choi, Sunbin Yoo, Sugie Lee

News Harvesting from Google News combining Web Scraping, LLM Metadata Extraction and SCImago Media Rankings enrichment: a case study of IFMIF-DONES

This study develops and evaluates a systematic methodology for constructing news datasets from Google News, combining automated web scraping, large language model (LLM)-based metadata extraction, and SCImago Media Rankings enrichment. Using…

Digital Libraries · Computer Science 2026-04-30 Victor Herrero-Solana

A contemporary science map through the lens of IEEE and ACM periodicals

ACM and IEEE are the two premier associations on computing and electrical/electronics engineering which publish and organize the great majority of periodicals and conferences, respectively, serving these disciplines. Science is a constantly…

Digital Libraries · Computer Science 2026-04-29 George Margaritis, Dionysios Kritsas, Dimitrios Katsaros, Yannis Manolopoulos

AI-Augmented Bibliometric Framework: A Paradigm Shift with Agentic AI for Dynamic, Snippet-Based Research Analysis

Our paper introduces a generative, multiagent AI framework designed to overcome the rigidity, limited flexibility and technical barriers of current bibliometric tools. The objective is to enable researchers to perform fully dynamic,…

Digital Libraries · Computer Science 2026-04-29 Adela Bara, Simona-Vasilica Oprea

Named Entity Recognition of Historical Texts via Large Language Model

Large language models (LLMs) have demonstrated remarkable versatility across a wide range of natural language processing tasks and domains. One such task is Named Entity Recognition (NER), which involves identifying and classifying proper…

Digital Libraries · Computer Science 2026-04-29 Shibingfeng Zhang, Giovanni Colavizza

The publication activity and migration trends of Ukrainian scientists in the social sciences and humanities during the first two years of the Russo-Ukrainian war

This study analyses the publication activity and migration patterns of Ukrainian scholars in the social sciences and humanities (SSH) during the initial two years of the Russo-Ukrainian war. Focusing on scholars who published at least three…

Digital Libraries · Computer Science 2026-04-29 Serhii Nazarovets