Related papers: Randomized LU decomposition: An Algorithm for Dict…

Pass-Efficient Randomized LU Algorithms for Computing Low-Rank Matrix Approximation

Low-rank matrix approximation is extremely useful in the analysis of data that arises in scientific computing, engineering applications, and data science. However, as data sizes grow, traditional low-rank matrix approximation methods, such…

Numerical Analysis · Mathematics 2020-02-26 Bolong Zhang , Michael Mascagni

File fragment recognition based on content and statistical features

Nowadays, the speed up development and use of digital devices such as smartphones have put people at risk of internet crimes. The evidence of present crimes in a computer file can be easily unreachable by changing the prefix of a file or…

Cryptography and Security · Computer Science 2021-02-26 Marzieh Masoumi , Ahmad Keshavarz , Reza Fotohi

Randomized LU Decomposition

We present a fast randomized algorithm that computes a low rank LU decomposition. Our algorithm uses random projections type techniques to efficiently compute a low rank approximation of large matrices. The randomized LU algorithm can be…

Numerical Analysis · Mathematics 2016-02-02 Gil Shabat , Yaniv Shmueli , Yariv Aizenbud , Amir Averbuch

Malware Classification using Deep Learning based Feature Extraction and Wrapper based Feature Selection Technique

In the case of malware analysis, categorization of malicious files is an essential part after malware detection. Numerous static and dynamic techniques have been reported so far for categorizing malware. This research presents a deep…

Cryptography and Security · Computer Science 2020-12-29 Muhammad Furqan Rafique , Muhammad Ali , Aqsa Saeed Qureshi , Asifullah Khan , Anwar Majid Mirza

A Decompilation-Driven Framework for Malware Detection with Large Language Models

The parallel evolution of Large Language Models (LLMs) with advanced code-understanding capabilities and the increasing sophistication of malware presents a new frontier for cybersecurity research. This paper evaluates the efficacy of…

Cryptography and Security · Computer Science 2026-01-15 Aniesh Chawla , Udbhav Prasad

Separable Dictionary Learning

Many techniques in computer vision, machine learning, and statistics rely on the fact that a signal of interest admits a sparse representation over some dictionary. Dictionaries are either available analytically, or can be learned from a…

Computer Vision and Pattern Recognition · Computer Science 2013-03-22 Simon Hawe , Matthias Seibert , Martin Kleinsteuber

Extension of Dictionary-Based Compression Algorithms for the Quantitative Visualization of Patterns from Log Files

Many services today massively and continuously produce log files of different and varying formats. These logs are important since they contain information about the application activities, which is necessary for improvements by analyzing…

Information Retrieval · Computer Science 2023-04-11 Igor Cherepanov , Jonathan Geraldi Joewono , Arjan Kuijper , Jörn Kohlhammer

A new approach to content-based file type detection

File type identification and file type clustering may be difficult tasks that have an increasingly importance in the field of computer and network security. Classical methods of file type detection including considering file extensions and…

Machine Learning · Computer Science 2012-03-20 M. C. Amirani , M. Toorani , A. A. Beheshti

Structured Analysis Dictionary Learning for Image Classification

We propose a computationally efficient and high-performance classification algorithm by incorporating class structural information in analysis dictionary learning. To achieve more consistent classification, we associate a class…

Computer Vision and Pattern Recognition · Computer Science 2018-05-03 Wen Tang , Ashkan Panahi , Hamid Krim , Liyi Dai

Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions

Low-rank matrix approximations, such as the truncated singular value decomposition and the rank-revealing QR decomposition, play a central role in data analysis and scientific computing. This work surveys and extends recent research which…

Numerical Analysis · Mathematics 2014-04-29 Nathan Halko , Per-Gunnar Martinsson , Joel A. Tropp

Unsupervised clustering of file dialects according to monotonic decompositions of mixtures

This paper proposes an unsupervised classification method that partitions a set of files into non-overlapping dialects based upon their behaviors, determined by messages produced by a collection of programs that consume them. The pattern of…

Programming Languages · Computer Science 2023-04-19 Michael Robinson , Tate Altman , Denley Lam , Letitia W. Li

A hybrid learning algorithm for text classification

Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify text need sufficient documents to learn accurately. This paper…

Neural and Evolutionary Computing · Computer Science 2010-09-27 S. M. Kamruzzaman , Farhana Haider

Analysis Dictionary Learning: An Efficient and Discriminative Solution

Discriminative Dictionary Learning (DL) methods have been widely advocated for image classification problems. To further sharpen their discriminative capabilities, most state-of-the-art DL methods have additional constraints included in the…

Machine Learning · Computer Science 2019-03-08 Wen Tang , Ashkan Panahi , Hamid Krim , Liyi Dai

Randomized Matrix Decompositions using R

Matrix decompositions are fundamental tools in the area of applied mathematics, statistical computing, and machine learning. In particular, low-rank matrix decompositions are vital, and widely used for data analysis, dimensionality…

Computation · Statistics 2019-11-28 N. Benjamin Erichson , Sergey Voronin , Steven L. Brunton , J. Nathan Kutz

Many Field Packet Classification with Decomposition and Reinforcement Learning

Scalable packet classification is a key requirement to support scalable network applications like firewalls, intrusion detection, and differentiated services. With ever increasing in the line-rate in core networks, it becomes a great…

Networking and Internet Architecture · Computer Science 2022-05-18 Hasibul Jamil , Ning Yang , Ning Weng

Content-Based Textual File Type Detection at Scale

Programming language detection is a common need in the analysis of large source code bases. It is supported by a number of existing tools that rely on several features, and most notably file extensions, to determine file types. We consider…

Software Engineering · Computer Science 2021-03-02 Francesca Del Bonifro , Maurizio Gabbrielli , Stefano Zacchiroli

Automatic Generation of Language-Independent Features for Cross-Lingual Classification

Many applications require categorization of text documents using predefined categories. The main approach to performing text categorization is learning from labeled examples. For many tasks, it may be difficult to find examples in one…

Computation and Language · Computer Science 2018-02-13 Sarai Duek , Shaul Markovitch

EnCoD: Distinguishing Compressed and Encrypted File Fragments

Reliable identification of encrypted file fragments is a requirement for several security applications, including ransomware detection, digital forensics, and traffic analysis. A popular approach consists of estimating high entropy as a…

Cryptography and Security · Computer Science 2020-10-16 Fabio De Gaspari , Dorjan Hitaj , Giulio Pagnotta , Lorenzo De Carli , Luigi V. Mancini

Text Classification: A Perspective of Deep Learning Methods

In recent years, with the rapid development of information on the Internet, the number of complex texts and documents has increased exponentially, which requires a deeper understanding of deep learning methods in order to accurately…

Computation and Language · Computer Science 2023-09-26 Zhongwei Wan

Explaining Vulnerabilities of Deep Learning to Adversarial Malware Binaries

Recent work has shown that deep-learning algorithms for malware detection are also susceptible to adversarial examples, i.e., carefully-crafted perturbations to input malware that enable misleading classification. Although this has…

Cryptography and Security · Computer Science 2019-01-25 Luca Demetrio , Battista Biggio , Giovanni Lagorio , Fabio Roli , Alessandro Armando