Related papers: Developing a comprehensive framework for multimoda…

200 papers

Information extraction (IE) is fundamental to numerous NLP applications, yet existing solutions often require specialized models for different tasks or rely on computationally expensive large language models. We present GLiNER2, a unified…

Computation and Language · Computer Science 2025-07-25 Urchade Zaratiana, Gil Pasternak, Oliver Boyd, George Hurn-Maloney, Ash Lewis

psifx is a plug-and-play multi-modal feature extraction toolkit, aiming to facilitate and democratize the use of state-of-the-art machine learning techniques for human sciences research. It is motivated by a need (a) to automate and…

Computation and Language · Computer Science 2026-05-06 Guillaume Rochette, Mathieu Rochat, Nizar Michaud, Matthew J. Vowels

Dynamic Facial Expression Recognition (DFER) is crucial for understanding human behavior. However, current methods exhibit limited performance mainly due to the scarcity of high-quality data, the insufficient utilization of facial dynamics,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Haodong Chen, Haojian Huang, Junhao Dong, Mingzhe Zheng, Dian Shao

In the digital era, the widespread use of APIs is evident. However, scalable utilization of APIs poses a challenge due to structure divergence observed in online API documentation. This underscores the need for automatic tools to facilitate…

Computation and Language · Computer Science 2024-02-20 Koren Lazar, Matan Vetzler, Guy Uziel, David Boaz, Esther Goldbraich, David Amid, Ateret Anaby-Tavor

Aggregate programming is a field-based coordination paradigm with over a decade of exploration and successful applications across domains including sensor networks, robotics, and IoT, with implementations in various programming languages,…

Software Engineering · Computer Science 2026-04-01 Gianluca Aguzzi, Davide Domini, Nicolas Farabegoli, Mirko Viroli

We introduce a Python framework designed to automate the most common tasks associated with the extraction and upscaling of the statistics of single-impact crater functions to inform coefficients of continuum equations describing surface…

Computational Physics · Physics 2014-10-31 Scott A. Norris

Multimodal documents contain diverse elements, such as tables, figures, and layouts, which can complicate retrieval tasks. While current approaches typically combine dense visual embedding models with supervised rerankers to achieve…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Ruofan Hu, Menghui Zhu, Jieming Zhu, Bo Chen, Shengyang Xu, Minjie Hong, Xiaoda Yang, Sashuai Zhou, Li Tang, Tao Jin, Zhou Zhao

Large Language Models (LLMs) have become powerful tools for annotating unstructured data. However, most existing workflows rely on ad hoc scripts, making reproducibility, robustness, and systematic evaluation difficult. To address these…

Information Retrieval · Computer Science 2025-09-26 Eric Fithian, Kirill Skobelev

Diverse presentation formats play a pivotal role in effectively conveying code and analytical processes during data analysis. One increasingly popular format is tutorial videos, particularly those based on Jupyter notebooks, which offer an…

Human-Computer Interaction · Computer Science 2024-08-05 Yang Ouyang, Leixian Shen, Yun Wang, Quan Li

A lot of research relies on data analysis scripts to process, clean, and visualize data. However, recent studies show that these scripts are often hard to comprehend and maintain, hindering reproducibility and reuse, accompanied by a lack…

Software Engineering · Computer Science 2026-04-20 Florian Sihler, Oliver Gerstl, Lars Pfrenger, Julian Schubert, Matthias Tichy

Data annotation is an essential component of the machine learning pipeline; it is also a costly and time-consuming process. With the introduction of transformer-based models, annotation at the document level is increasingly popular;…

Computation and Language · Computer Science 2025-06-04 Owen Cook, Jake Vasilakes, Ian Roberts, Xingyi Song

We introduce an advanced information extraction pipeline to automatically process very large collections of unstructured textual data for the purpose of investigative journalism. The pipeline serves as a new input processor for the upcoming…

Computation and Language · Computer Science 2018-09-17 Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann

Computational analysis of performed music is a key component of music information research, as performance shapes much of the music we hear. Music performance analysis studies the acoustic variations introduced by performers and how these…

Sound · Computer Science 2026-05-06 Corentin Guichaoua, Daniel Bedoya, Elaine Chew

Natural language processing is used for solving a wide variety of problems. Some scholars and interest groups working with language resources are not well versed in programming, so there is a need for a good graphical framework that allows…

Computation and Language · Computer Science 2022-06-17 Timotej Knez, Marko Bajec, Slavko Žitnik

With the growing success of the social Web, most Web developers have to interact with at least one social Web platform, which implies studying the related API specifications. These are often only informally described, may contain errors,…

Software Engineering · Computer Science 2013-01-11 Georges Gouriten, Pierre Senellart

Modern imaging instruments can produce terabytes to petabytes of data for a single experiment. The biggest barrier to processing big image datasets has been computational, where image analysis algorithms often lack the efficiency needed to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-13 Nicholas Schaub, Andriy Kharchenko, Hamdah Abbasi, Sameeul Samee, Hythem Sidky, Nathan Hotaling

Large-scale datasets have been pivotal to the advancements of deep learning models in recent years, but training on such large datasets invariably incurs substantial storage and computational overhead. Meanwhile, real-world datasets often…

Computer Vision and Pattern Recognition · Computer Science 2025-06-23 Suorong Yang, Peng Ye, Wanli Ouyang, Dongzhan Zhou, Furao Shen

Entity alignment has always had significant uses within a multitude of diverse scientific fields. In particular, the concept of matching entities across networks has grown in significance in the world of social science as communicative…

Social and Information Networks · Computer Science 2020-04-21 James Flamino, Christopher Abriola, Ben Zimmerman, Zhongheng Li, Joel Douglas

Linguistic style is pivotal for understanding how texts convey meaning and fulfill communicative purposes, yet extracting detailed stylistic features at scale remains challenging. We present Neurobiber, a transformer-based system for fast,…

Computation and Language · Computer Science 2025-02-27 Kenan Alkiek, Anna Wegmann, Jian Zhu, David Jurgens

Document content extraction is a critical task in computer vision, underpinning the data needs of large language models (LLMs) and retrieval-augmented generation (RAG) systems. Despite recent progress, current document parsing methods have…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Linke Ouyang, Yuan Qu, Hongbin Zhou, Jiawei Zhu, Rui Zhang, Qunshu Lin, Bin Wang, Zhiyuan Zhao, Man Jiang, Xiaomeng Zhao, Jin Shi, Fan Wu, Pei Chu, Minghao Liu, Zhenxiang Li, Chao Xu, Bo Zhang, Botian Shi, Zhongying Tu, Conghui He