English
Related papers

Related papers: Stopwords in Technical Language Processing

200 papers

Stop words, which are considered non-predictive, are often eliminated in natural language processing tasks. However, the definition of uninformative vocabulary is vague, so most algorithms use general knowledge-based stop lists to remove…

Software Engineering · Computer Science 2023-06-13 Yaohou Fan , Chetan Arora , Christoph Treude

Stopwords are fundamental in Natural Language Processing (NLP) techniques for information retrieval. One of the common tasks in preprocessing of text data is the removal of stopwords. Currently, while high-resource languages like English…

Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of…

cmp-lg · Computer Science 2008-02-03 David A. Evans , Chengxiang Zhai

Many words in documents recur very frequently but are essentially meaningless as they are used to join words together in a sentence. It is commonly understood that stop words do not contribute to the context or content of textual documents.…

Information Retrieval · Computer Science 2012-05-30 Murphy Choy

Conversational search systems enable information retrieval via natural language interactions, with the goal of maximizing users' information gain over multiple dialogue turns. The increasing prevalence of conversational interfaces adopting…

Computation and Language · Computer Science 2024-07-02 Phillip Schneider , Wessel Poelman , Michael Rovatsos , Florian Matthes

The design of complex engineering systems is an often long and articulated process that highly relies on engineers' expertise and professional judgment. As such, the typical pitfalls of activities involving the human factor often manifest…

Computation and Language · Computer Science 2022-11-22 Shaohong Zhong , Andrea Scarinci , Alice Cicirello

A broad class of software engineering problems can be generalized as the "total recall problem". This short paper claims that identifying and exploring total recall language processing problems in software engineering is an important task…

Software Engineering · Computer Science 2018-11-13 Zhe Yu , Tim Menzies

The automatic extraction of structure from text can be difficult for machines. Yet, the elicitation of this information can provide many benefits and opportunities for various applications. Benefits have also been identified for the area of…

Computation and Language · Computer Science 2022-02-11 Maximilian Vierlboeck , Carlo Lipizzi , Roshanak Nilchiani

The requirements engineering process is a crucial stage of the software development life cycle. It involves various stakeholders from different professional backgrounds, particularly in the requirements elicitation phase. Each stakeholder…

Computation and Language · Computer Science 2020-03-31 Vaibhav Jain , Ruchika Malhotra , Sanskar Jain , Nishant Tanwar

The growing developments in general semantic networks, knowledge graphs and ontology databases have motivated us to build a large-scale comprehensive semantic network of technology-related data for engineering knowledge discovery,…

Information Retrieval · Computer Science 2019-10-07 Serhad Sarica , Jianxi Luo , Kristin L. Wood

Emergent language is unique among fields within the discipline of machine learning for its open-endedness, not obviously presenting well-defined problems to be solved. As a result, the current research in the field has largely been…

Multiagent Systems · Computer Science 2022-06-24 Brendon Boldt , David Mortensen

Stopwords carry little semantic information and are often removed from text data to reduce dataset size and improve machine learning model performance. Consequently, researchers have sought to develop techniques for generating effective…

Computation and Language · Computer Science 2022-09-07 Daniel M. DiPietro

Software developers often rely on natural language text that appears in software engineering artifacts to access critical information as they build and work on software systems. For example, developers access requirements documents to…

Software Engineering · Computer Science 2021-05-14 Arthur Marques , Giovanni Viviani , Gail C. Murphy

This paper investigates the role of text categorization in streamlining stopword extraction in natural language processing (NLP), specifically focusing on nine African languages alongside French. By leveraging the MasakhaNEWS, African…

Prompt engineering is a critical technique in the field of natural language processing that involves designing and optimizing the prompts used to input information into models, aiming to enhance their performance on specific tasks. With the…

Today's probabilistic language generators fall short when it comes to producing coherent and fluent text despite the fact that the underlying models perform well under standard metrics, e.g., perplexity. This discrepancy has puzzled the…

Computation and Language · Computer Science 2025-06-06 Clara Meister , Tiago Pimentel , Gian Wiher , Ryan Cotterell

Stopword removal is a critical stage in many Machine Learning methods but often receives little consideration, it interferes with the model visualizations and disrupts user confidence. Inappropriately chosen or hastily omitted stopwords not…

Human-Computer Interaction · Computer Science 2025-01-20 Shuangjiang Xue , Pierre Le Bras , David A. Robb , Mike J. Chantler , Stefano Padilla

The explosive rate of information growth and availability often makes it increasingly difficult to locate information pertinent to your needs. These problems are often compounded when keyword based search methodologies are not adequate for…

Information Retrieval · Computer Science 2008-10-10 Christopher M. Frenz

"Keyword Extraction" refers to the task of automatically identifying the most relevant and informative phrases in natural language text. As we are deluged with large amounts of text data in many different forms and content - emails, blogs,…

Computation and Language · Computer Science 2019-08-22 Shibamouli Lahiri

The main contribution of this paper, is to propose a novel semantic approach based on a Natural Language Processing technique in order to ensure a semantic unification of unstructured process patterns which are expressed not only in…

Artificial Intelligence · Computer Science 2012-10-03 Nahla Jlaiel , Khouloud Madhbouh , Mohamed Ben Ahmed
‹ Prev 1 2 3 10 Next ›