Related papers: Automatic Classification of Object Code Using Mach…

Machine Learning Based Source Code Classification Using Syntax Oriented Features

As of today the programming language of the vast majority of the published source code is manually specified or programmatically assigned based on the sole file extension. In this paper we show that the source code programming language…

Machine Learning · Computer Science 2017-03-23 Shaul Zevin , Catherine Holzem

Classification of Documents Extracted from Images with Optical Character Recognition Methods

Over the past decade, machine learning methods have given us driverless cars, voice recognition, effective web search, and a much better understanding of the human genome. Machine learning is so common today that it is used dozens of times…

Computer Vision and Pattern Recognition · Computer Science 2021-06-22 Omer Aydin

Automating Program Structure Classification

When students write programs, their program structure provides insight into their learning process. However, analyzing program structure by hand is time-consuming, and teachers need better tools for computer-assisted exploration of student…

Computers and Society · Computer Science 2021-01-26 Will Crichton , Georgia Gabriela Sampaio , Pat Hanrahan

Towards usable automated detection of CPU architecture and endianness for arbitrary binary files and object code sequences

Static and dynamic binary analysis techniques are actively used to reverse engineer software's behavior and to detect its vulnerabilities, even when only the binary code is available for analysis. To avoid analysis errors due to misreading…

Cryptography and Security · Computer Science 2021-08-24 Sami Kairajärvi , Andrei Costin , Timo Hämäläinen

Neural Architectures for Fine-grained Entity Type Classification

In this work, we investigate several neural network architectures for fine-grained entity type classification. Particularly, we consider extensions to a recently proposed attentive neural architecture and make three key contributions.…

Computation and Language · Computer Science 2017-02-22 Sonse Shimaoka , Pontus Stenetorp , Kentaro Inui , Sebastian Riedel

Building high-level features using large scale unsupervised learning

We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a 9-layered locally…

Machine Learning · Computer Science 2017-04-17 Quoc V. Le , Marc'Aurelio Ranzato , Rajat Monga , Matthieu Devin , Kai Chen , Greg S. Corrado , Jeff Dean , Andrew Y. Ng

To Automatically Map Source Code Entities to Architectural Modules with Naive Bayes

Background: The process of mapping a source code entity onto an architectural module is to a large degree a manual task. Automating this process could increase the use of static architecture conformance checking methods, such as reflexion…

Software Engineering · Computer Science 2021-09-21 Tobias Olsson , Morgan Ericsson , Anna Wingkvist

Automated detection and classification of cryptographic algorithms in binary programs through machine learning

Threats from the internet, particularly malicious software (i.e., malware) often use cryptographic algorithms to disguise their actions and even to take control of a victim's system (as in the case of ransomware). Malware and other threats…

Cryptography and Security · Computer Science 2015-03-05 Diane Duros Hosfelt

Making Early Predictions of the Accuracy of Machine Learning Applications

The accuracy of machine learning systems is a widely studied research topic. Established techniques such as cross-validation predict the accuracy on unseen data of the classifier produced by applying a given learning method to a given…

Machine Learning · Computer Science 2012-12-06 J. E. Smith , P. Caleb-Solly , M. A. Tahir , D. Sannen , H. van-Brussel

On Applying Machine Learning/Object Detection Models for Analysing Digitally Captured Physical Prototypes from Engineering Design Projects

While computer vision has received increasing attention in computer science over the last decade, there are few efforts in applying this to leverage engineering design research. Existing datasets and technologies allow researchers to…

Computer Vision and Pattern Recognition · Computer Science 2019-05-16 Jorgen F. Erichsen , Sampsa Kohtala , Martin Steinert , Torgeir Welo

Optimizing Automata Learning via Monads

Automata learning has been successfully applied in the verification of hardware and software. The size of the automaton model learned is a bottleneck for scalability, and hence optimizations that enable learning of compact representations…

Formal Languages and Automata Theory · Computer Science 2019-11-04 Gerco van Heerdt , Matteo Sammartino , Alexandra Silva

Automatic Malware Description via Attribute Tagging and Similarity Embedding

With the rapid proliferation and increased sophistication of malicious software (malware), detection methods no longer rely only on manually generated signatures but have also incorporated more general approaches like machine learning…

Machine Learning · Computer Science 2020-01-24 Felipe N. Ducau , Ethan M. Rudd , Tad M. Heppner , Alex Long , Konstantin Berlin

Unsupervised machine learning via transfer learning and k-means clustering to classify materials image data

Unsupervised machine learning offers significant opportunities for extracting knowledge from unlabeled data sets and for achieving maximum machine learning performance. This paper demonstrates how to construct, use, and evaluate a high…

Materials Science · Physics 2021-04-13 Ryan Cohn , Elizabeth Holm

Malware Classification with Word Embedding Features

Malware classification is an important and challenging problem in information security. Modern malware classification techniques rely on machine learning models that can be trained on features such as opcode sequences, API calls, and byte…

Cryptography and Security · Computer Science 2021-03-05 Aparna Sunil Kale , Fabio Di Troia , Mark Stamp

Activation Analysis of a Byte-Based Deep Neural Network for Malware Classification

Feature engineering is one of the most costly aspects of developing effective machine learning models, and that cost is even greater in specialized problem domains, like malware classification, where expert skills are necessary to identify…

Machine Learning · Computer Science 2019-08-02 Scott E. Coull , Christopher Gardner

A Continual Learning Framework for Adaptive Defect Classification and Inspection

Machine-vision-based defect classification techniques have been widely adopted for automatic quality inspection in manufacturing processes. This article describes a general framework for classifying defects from high volume data batches…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Wenbo Sun , Raed Al Kontar , Judy Jin , Tzyy-Shuh Chang

Using Convolutional Neural Networks to Detect Compression Algorithms

Machine learning is penetrating various domains virtually, thereby proliferating excellent results. It has also found an outlet in digital forensics, wherein it is becoming the prime driver of computational efficiency. A prominent feature…

Computer Vision and Pattern Recognition · Computer Science 2022-01-13 Shubham Bharadwaj

Synthetic Data for Object Classification in Industrial Applications

One of the biggest challenges in machine learning is data collection. Training data is an important part since it determines how the model will behave. In object classification, capturing a large number of images per object and in different…

Computer Vision and Pattern Recognition · Computer Science 2022-12-12 August Baaz , Yonan Yonan , Kevin Hernandez-Diaz , Fernando Alonso-Fernandez , Felix Nilsson

Label Efficient Learning by Exploiting Multi-class Output Codes

We present a new perspective on the popular multi-class algorithmic techniques of one-vs-all and error correcting output codes. Rather than studying the behavior of these techniques for supervised learning, we establish a connection between…

Machine Learning · Computer Science 2016-11-28 Maria Florina Balcan , Travis Dick , Yishay Mansour

Frustratingly Simple but Effective Zero-shot Detection and Segmentation: Analysis and a Strong Baseline

Methods for object detection and segmentation often require abundant instance-level annotations for training, which are time-consuming and expensive to collect. To address this, the task of zero-shot object detection (or segmentation) aims…

Computer Vision and Pattern Recognition · Computer Science 2023-02-16 Siddhesh Khandelwal , Anirudth Nambirajan , Behjat Siddiquie , Jayan Eledath , Leonid Sigal