Related papers: Accelerating System Log Processing by Semi-supervi…

LogSD: Detecting Anomalies from System Logs through Self-supervised Learning and Frequency-based Masking

Log analysis is one of the main techniques that engineers use for troubleshooting large-scale software systems. Over the years, many supervised, semi-supervised, and unsupervised log analysis methods have been proposed to detect system…

Software Engineering · Computer Science 2024-04-22 Yongzheng Xie , Hongyu Zhang , Muhammad Ali Babar

A Survey on Semi-Supervised Learning Techniques

Semisupervised learning is a learning standard which deals with the study of how computers and natural systems such as human beings acquire knowledge in the presence of both labeled and unlabeled data. Semisupervised learning based methods…

Machine Learning · Computer Science 2014-02-20 V. Jothi Prakash , Dr. L. M. Nithya

Semi-Supervised Classification for oil reservoir

This paper addresses the general problem of accurate identification of oil reservoirs. Recent improvements in well or borehole logging technology have resulted in an explosive amount of data available for processing. The traditional methods…

Machine Learning · Computer Science 2018-04-06 Yanan Li , Haixiang Guo , Andrew P Paplinski

Self-Supervised Log Parsing

Logs are extensively used during the development and maintenance of software systems. They collect runtime events and allow tracking of code execution, which enables a variety of critical tasks such as troubleshooting and fault detection.…

Machine Learning · Computer Science 2020-03-20 Sasho Nedelkoski , Jasmin Bogatinovski , Alexander Acker , Jorge Cardoso , Odej Kao

ReviewRanker: A Semi-Supervised Learning Based Approach for Code Review Quality Estimation

Code review is considered a key process in the software industry for minimizing bugs and improving code quality. Inspection of review process effectiveness and continuous improvement can boost development productivity. Such inspection is a…

Software Engineering · Computer Science 2023-07-11 Saifullah Mahbub , Md. Easin Arafat , Chowdhury Rafeed Rahman , Zannatul Ferdows , Masum Hasan

LogStamp: Automatic Online Log Parsing Based on Sequence Labelling

Logs are one of the most critical data for service management. It contains rich runtime information for both services and users. Since size of logs are often enormous in size and have free handwritten constructions, a typical log-based…

Software Engineering · Computer Science 2022-08-23 Shimin Tao , Weibin Meng , Yimeng Chen , Yichen Zhu , Ying Liu Chunning Du , Tao Han , Yongpeng Zhao , Xiangguang Wang , Hao Yang

Log severity level classification: an approach for systems in production

Context: Logs are often the primary source of information for system developers and operations engineers to understand and diagnose the behavior of a software system in production. In many cases, logs are the only evidence available for…

Software Engineering · Computer Science 2021-12-23 Eduardo Mendes , Fabio Petrillo

SemParser: A Semantic Parser for Log Analysis

Logs, being run-time information automatically generated by software, record system events and activities with their timestamps. Before obtaining more insights into the run-time status of the software, a fundamental step of log analysis,…

Software Engineering · Computer Science 2023-02-07 Yintong Huo , Yuxin Su , Cheryl Lee , Michael R. Lyu

LLM-as-classifier: Semi-Supervised, Iterative Framework for Hierarchical Text Classification using Large Language Models

The advent of Large Language Models (LLMs) has provided unprecedented capabilities for analyzing unstructured text data. However, deploying these models as reliable, robust, and scalable classifiers in production environments presents…

Computation and Language · Computer Science 2025-08-25 Doohee You , Andy Parisi , Zach Vander Velden , Lara Dantas Inojosa

LUK: Empowering Log Understanding with Expert Knowledge from Large Language Models

Logs play a critical role in providing essential information for system monitoring and troubleshooting. Recently, with the success of pre-trained language models (PLMs) and large language models (LLMs) in natural language processing (NLP),…

Software Engineering · Computer Science 2025-02-03 Lipeng Ma , Weidong Yang , Sihang Jiang , Ben Fei , Mingjie Zhou , Shuhao Li , Mingyu Zhao , Bo Xu , Yanghua Xiao

Interpretable Online Log Analysis Using Large Language Models with Prompt Strategies

Automated log analysis is crucial in modern software-intensive systems for facilitating program comprehension throughout software maintenance and engineering life cycles. Existing methods perform tasks such as log parsing and log anomaly…

Software Engineering · Computer Science 2024-01-29 Yilun Liu , Shimin Tao , Weibin Meng , Jingyu Wang , Wenbing Ma , Yanqing Zhao , Yuhang Chen , Hao Yang , Yanfei Jiang , Xun Chen

Understand customer reviews with less data and in short time: pretrained language representation and active learning

In this paper, we address customer review understanding problems by using supervised machine learning approaches, in order to achieve a fully automatic review aspects categorisation and sentiment analysis. In general, such supervised…

Computation and Language · Computer Science 2019-11-05 Yanwei Cui , Xavier Illy

Semi-supervised and Unsupervised Methods for Categorizing Posts in Web Discussion Forums

Web discussion forums are used by millions of people worldwide to share information belonging to a variety of domains such as automotive vehicles, pets, sports, etc. They typically contain posts that fall into different categories such as…

Computation and Language · Computer Science 2016-04-26 Krish Perumal

System Log Parsing with Large Language Models: A Review

Log data provides crucial insights for tasks like monitoring, root cause analysis, and anomaly detection. Due to the vast volume of logs, automated log parsing is essential to transform semi-structured log messages into structured…

Machine Learning · Computer Science 2025-05-16 Viktor Beck , Max Landauer , Markus Wurzenberger , Florian Skopik , Andreas Rauber

Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification

This paper presents a novel approach for multi-lingual sentiment classification in short texts. This is a challenging task as the amount of training data in languages other than English is very limited. Previously proposed multi-lingual…

Computation and Language · Computer Science 2017-03-08 Jan Deriu , Aurelien Lucchi , Valeria De Luca , Aliaksei Severyn , Simon Müller , Mark Cieliebak , Thomas Hofmann , Martin Jaggi

Semi-supervised Classification for Natural Language Processing

Semi-supervised classification is an interesting idea where classification models are learned from both labeled and unlabeled data. It has several advantages over supervised classification in natural language processing domain. For…

Computation and Language · Computer Science 2014-09-29 Rushdi Shams

How to Achieve High Classification Accuracy with Just a Few Labels: A Semi-supervised Approach Using Sampled Packets

Network traffic classification, which has numerous applications from security to billing and network provisioning, has become a cornerstone of today's computer networks. Previous studies have developed traffic classification techniques…

Networking and Internet Architecture · Computer Science 2020-05-19 Shahbaz Rezaei , Xin Liu

Semi-Supervised Learning with Scarce Annotations

While semi-supervised learning (SSL) algorithms provide an efficient way to make use of both labelled and unlabelled data, they generally struggle when the number of annotated samples is very small. In this work, we consider the problem of…

Computer Vision and Pattern Recognition · Computer Science 2020-04-23 Sylvestre-Alvise Rebuffi , Sebastien Ehrhardt , Kai Han , Andrea Vedaldi , Andrew Zisserman

Semi-supervised Batch Learning From Logged Data

Off-policy learning methods are intended to learn a policy from logged data, which includes context, action, and feedback (cost or reward) for each sample point. In this work, we build on the counterfactual risk minimization framework,…

Machine Learning · Computer Science 2024-02-20 Gholamali Aminian , Armin Behnamnia , Roberto Vega , Laura Toni , Chengchun Shi , Hamid R. Rabiee , Omar Rivasplata , Miguel R. D. Rodrigues

Automatic Bug Triage using Semi-Supervised Text Classification

In this paper, we propose a semi-supervised text classification approach for bug triage to avoid the deficiency of labeled bug reports in existing supervised approaches. This new approach combines naive Bayes classifier and…

Software Engineering · Computer Science 2017-04-18 Jifeng Xuan , He Jiang , Zhilei Ren , Jun Yan , Zhongxuan Luo