Related papers: Automated Single-Label Patent Classification using…

Classifying Patent Applications with Ensemble Methods

We present methods for the automatic classification of patent applications using an annotated dataset provided by the organizers of the ALTA 2018 shared task - Classifying Patent Applications. The goal of the task is to use computational…

Computation and Language · Computer Science 2018-11-13 Fernando Benites , Shervin Malmasi , Marcos Zampieri

Applying an Ensemble Learning Method for Improving Multi-label Classification Performance

In recent years, multi-label classification problem has become a controversial issue. In this kind of classification, each sample is associated with a set of class labels. Ensemble approaches are supervised learning algorithms in which an…

Machine Learning · Computer Science 2018-01-09 Amirreza Mahdavi-Shahri , Mahboobeh Houshmand , Mahdi Yaghoobi , Mehrdad Jalali

Adaptive Taxonomy Learning and Historical Patterns Modelling for Patent Classification

Patent classification aims to assign multiple International Patent Classification (IPC) codes to a given patent. Recent methods for automatically classifying patents mainly focus on analyzing the text descriptions of patents. However, apart…

Artificial Intelligence · Computer Science 2024-06-21 Tao Zou , Le Yu , Junchen Ye , Leilei Sun , Bowen Du , Deqing Wang

A Machine Learning Based Ensemble Method for Automatic Multiclass Classification of Decisions

Stakeholders make various types of decisions with respect to requirements, design, management, and so on during the software development life cycle. Nevertheless, these decisions are typically not well documented and classified due to…

Software Engineering · Computer Science 2021-05-05 Liming Fu , Peng Liang , Xueying Li , Chen Yang

An Instance-based Plus Ensemble Learning Method for Classification of Scientific Papers

The exponential growth of scientific publications in recent years has posed a significant challenge in effective and efficient categorization. This paper introduces a novel approach that combines instance-based learning and ensemble…

Digital Libraries · Computer Science 2024-09-24 Fang Zhang , Shengli Wu

PatentMatch: A Dataset for Matching Patent Claims & Prior Art

Patent examiners need to solve a complex information retrieval task when they assess the novelty and inventive step of claims made in a patent application. Given a claim, they search for prior art, which comprises all relevant publicly…

Information Retrieval · Computer Science 2020-12-29 Julian Risch , Nicolas Alder , Christoph Hewel , Ralf Krestel

Ensemble Learning Based Classification Algorithm Recommendation

Recommending appropriate algorithms to a classification problem is one of the most challenging issues in the field of data mining. The existing algorithm recommendation models are generally constructed on only one kind of meta-features by…

Information Retrieval · Computer Science 2021-06-08 Guangtao Wang , Qinbao Song , Xiaoyan Zhu

Anomaly Detection using Ensemble Classification and Evidence Theory

Multi-class ensemble classification remains a popular focus of investigation within the research community. The popularization of cloud services has sped up their adoption due to the ease of deploying large-scale machine-learning models. It…

Machine Learning · Computer Science 2024-04-17 Fernando Arévalo , Tahasanul Ibrahim , Christian Alison M. Piolo , Andreas Schwung

Benchmarking Patent Embeddings: A Multi-Task Evaluation of 22 Models Across Retrieval, Classification, and Clustering

Two questions regarding practitioners' use of patent embeddings arise: (i) Does one fine-tuning recipe suffice for all downstream applications? (ii) Is fine-tuning on one patent landscape sufficient for downstream application on other…

Information Retrieval · Computer Science 2026-05-27 Amirhossein Yousefiramandi , Ciaran Cooney

Patent Sentiment Analysis to Highlight Patent Paragraphs

Given a patent document, identifying distinct semantic annotations is an interesting research aspect. Text annotation helps the patent practitioners such as examiners and patent attorneys to quickly identify the key arguments of any…

Machine Learning · Computer Science 2021-11-19 Renukswamy Chikkamath , Vishvapalsinhji Ramsinh Parmar , Christoph Hewel , Markus Endres

In the realm of patent document analysis, assessing semantic similarity between phrases presents a significant challenge, notably amplifying the inherent complexities of Cooperative Patent Classification (CPC) research. Firstly, this study…

Computation and Language · Computer Science 2024-01-17 Liqiang Yu , Bo Liu , Qunwei Lin , Xinyu Zhao , Chang Che

A Survey on Sentence Embedding Models Performance for Patent Analysis

Patent data is an important source of knowledge for innovation research, while the technological similarity between pairs of patents is a key enabling indicator for patent analysis. Recently researchers have been using patent vector space…

Computation and Language · Computer Science 2022-08-08 Hamid Bekamiri , Daniel S. Hain , Roman Jurowetzki

Multi label classification of Artificial Intelligence related patents using Modified D2SBERT and Sentence Attention mechanism

Patent classification is an essential task in patent information management and patent knowledge mining. It is very important to classify patents related to artificial intelligence, which is the biggest topic these days. However, artificial…

Computation and Language · Computer Science 2023-03-07 Yongmin Yoo , Tak-Sung Heo , Dongjin Lim , Deaho Seo

Ensemble Classifiers and Their Applications: A Review

Ensemble classifier refers to a group of individual classifiers that are cooperatively trained on data set in a supervised classification problem. In this paper we present a review of commonly used ensemble classifiers in the literature.…

Machine Learning · Computer Science 2014-04-17 Akhlaqur Rahman , Sumaira Tasnim

Verification-Aided Deep Ensemble Selection

Deep neural networks (DNNs) have become the technology of choice for realizing a variety of complex tasks. However, as highlighted by many recent studies, even an imperceptible perturbation to a correctly classified input can lead to…

Machine Learning · Computer Science 2022-07-27 Guy Amir , Tom Zelazny , Guy Katz , Michael Schapira

CinPatent: Datasets for Patent Classification

Patent classification is the task that assigns each input patent into several codes (classes). Due to its high demand, several datasets and methods have been introduced. However, the lack of both systematic performance comparison of…

Computation and Language · Computer Science 2024-03-18 Minh-Tien Nguyen , Nhung Bui , Manh Tran-Tien , Linh Le , Huy-The Vu

Deep Super Learner: A Deep Ensemble for Classification Problems

Deep learning has become very popular for tasks such as predictive modeling and pattern recognition in handling big data. Deep learning is a powerful machine learning method that extracts lower level features and feeds them forward for the…

Machine Learning · Computer Science 2018-03-07 Steven Young , Tamer Abdou , Ayse Bener

Multi-output Headed Ensembles for Product Item Classification

In this paper, we revisit the problem of product item classification for large-scale e-commerce catalogs. The taxonomy of e-commerce catalogs consists of thousands of genres to which are assigned items that are uploaded by merchants on a…

Machine Learning · Computer Science 2023-08-01 Hotaka Shiokawa , Pradipto Das , Arthur Toth , Justin Chiu

The Multiplex Classification Framework: optimizing multi-label classifiers through problem transformation, ontology engineering, and model ensembling

Classification is a fundamental task in machine learning. While conventional methods-such as binary, multiclass, and multi-label classification-are effective for simpler problems, they may not adequately address the complexities of some…

Machine Learning · Computer Science 2024-12-20 Mauro Nievas Offidani , Facundo Roffet , Claudio Augusto Delrieux , Maria Carolina Gonzalez Galtier , Marcos Zarate

A Meta-embedding-based Ensemble Approach for ICD Coding Prediction

International Classification of Diseases (ICD) are the de facto codes used globally for clinical coding. These codes enable healthcare providers to claim reimbursement and facilitate efficient storage and retrieval of diagnostic…

Computation and Language · Computer Science 2022-02-22 Pavithra Rajendran , Alexandros Zenonos , Josh Spear , Rebecca Pope