Related papers: Mining CFD Rules on Big Data

A Way to Understand Various Patterns of Data Mining Techniques for Selected Domains

This has much in common with traditional work in statistics and machine learning. However, there are important new issues which arise because of the sheer size of the data. One of the important problem in data mining is the…

Databases · Computer Science 2009-11-05 Kanak Saxena , D. S Rajpoot

Discovery of Paradigm Dependencies

Missing and incorrect values often cause serious consequences. To deal with these data quality problems, a class of common employed tools are dependency rules, such as Functional Dependencies (FDs), Conditional Functional Dependencies…

Databases · Computer Science 2017-10-10 Jizhou Sun , Jianzhong Li , Hong Gao

Efficient learning of large sets of locally optimal classification rules

Conventional rule learning algorithms aim at finding a set of simple rules, where each rule covers as many examples as possible. In this paper, we argue that the rules found in this way may not be the optimal explanations for each of the…

Machine Learning · Computer Science 2023-01-27 Van Quoc Phuong Huynh , Johannes Fürnkranz , Florian Beck

Bayesian Data Cleaning for Web Data

Data Cleaning is a long standing problem, which is growing in importance with the mass of uncurated web data. State of the art approaches for handling inconsistent data are systems that learn and use conditional functional dependencies…

Databases · Computer Science 2012-04-18 Yuheng Hu , Sushovan De , Yi Chen , Subbarao Kambhampati

Credit card fraud detection - Classifier selection strategy

Machine learning has opened up new tools for financial fraud detection. Using a sample of annotated transactions, a machine learning classification algorithm learns to detect frauds. With growing credit card transaction volumes and rising…

Machine Learning · Computer Science 2022-08-26 Gayan K. Kulatilleke

Conditional Linear Regression

Work in machine learning and statistics commonly focuses on building models that capture the vast majority of data, possibly ignoring a segment of the population as outliers. However, there does not often exist a good model on the whole…

Machine Learning · Computer Science 2019-07-11 Diego Calderon , Brendan Juba , Sirui Li , Zongyi Li , Lisa Ruan

Context-Aware Rule Mining Using a Dynamic Transformer-Based Framework

This study proposes a dynamic rule data mining algorithm based on an improved Transformer architecture, aiming to improve the accuracy and efficiency of rule mining in a dynamic data environment. With the increase in data volume and…

Machine Learning · Computer Science 2025-03-17 Jie Liu , Yiwei Zhang , Yuan Sheng , Yujia Lou , Haige Wang , Bohuan Yang

A Review on Algorithms for Constraint-based Causal Discovery

Causal discovery studies the problem of mining causal relationships between variables from data, which is of primary interest in science. During the past decades, significant amount of progresses have been made toward this fundamental data…

Artificial Intelligence · Computer Science 2016-11-28 Kui Yu , Jiuyong Li , Lin Liu

The Bearable Lightness of Big Data: Towards Massive Public Datasets in Scientific Machine Learning

In general, large datasets enable deep learning models to perform with good accuracy and generalizability. However, massive high-fidelity simulation datasets (from molecular chemistry, astrophysics, computational fluid dynamics (CFD), etc.…

Machine Learning · Computer Science 2022-07-27 Wai Tong Chung , Ki Sung Jung , Jacqueline H. Chen , Matthias Ihme

DCNNs on a Diet: Sampling Strategies for Reducing the Training Set Size

Large-scale supervised classification algorithms, especially those based on deep convolutional neural networks (DCNNs), require vast amounts of training data to achieve state-of-the-art performance. Decreasing this data requirement would…

Computer Vision and Pattern Recognition · Computer Science 2016-06-15 Maya Kabkab , Azadeh Alavi , Rama Chellappa

$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model

Different classes of safe reinforcement learning algorithms have shown satisfactory performance in various types of safety requirement scenarios. However, the existing methods mainly address one or several classes of specific safety…

Machine Learning · Computer Science 2024-07-09 Zepeng Wang , Chao Ma , Linjiang Zhou , Libing Wu , Lei Yang , Xiaochuan Shi , Guojun Peng

Impact of Training Dataset Size on Neural Answer Selection Models

It is held as a truism that deep neural networks require large datasets to train effective models. However, large datasets, especially with high-quality labels, can be expensive to obtain. This study sets out to investigate (i) how large a…

Information Retrieval · Computer Science 2019-01-31 Trond Linjordet , Krisztian Balog

A Massive Local Rules Search Approach to the Classification Problem

An approach to the classification problem of machine learning, based on building local classification rules, is developed. The local rules are considered as projections of the global classification rules to the event we want to classify. A…

Machine Learning · Computer Science 2007-05-23 Vladislav Malyshkin , Ray Bakhramov , Andrey Gorodetsky

A Stochastic Penalty Model for Convex and Nonconvex Optimization with Big Constraints

The last decade witnessed a rise in the importance of supervised learning applications involving {\em big data} and {\em big models}. Big data refers to situations where the amounts of training data available and needed causes difficulties…

Optimization and Control · Mathematics 2018-11-01 Konstantin Mishchenko , Peter Richtárik

Rule Mining for Correcting Classification Models

Machine learning models need to be continually updated or corrected to ensure that the prediction accuracy remains consistently high. In this study, we consider scenarios where developers should be careful to change the prediction results…

Software Engineering · Computer Science 2023-10-17 Hirofumi Suzuki , Hiroaki Iwashita , Takuya Takagi , Yuta Fujishige , Satoshi Hara

Data-driven Algorithm Design

Data driven algorithm design is an important aspect of modern data science and algorithm design. Rather than using off the shelf algorithms that only have worst case performance guarantees, practitioners often optimize over large families…

Data Structures and Algorithms · Computer Science 2020-11-17 Maria-Florina Balcan

Pattern-Driven Data Cleaning

Data is inherently dirty and there has been a sustained effort to come up with different approaches to clean it. A large class of data repair algorithms rely on data-quality rules and integrity constraints to detect and repair the data. A…

Databases · Computer Science 2017-12-29 El Kindi Rezig , Mourad Ouzzani , Walid G. Aref , Ahmed K. Elmagarmid , Ahmed R. Mahmood

Discovering Matching Dependencies

The concept of matching dependencies (mds) is recently pro- posed for specifying matching rules for object identification. Similar to the functional dependencies (with conditions), mds can also be applied to various data quality…

Databases · Computer Science 2009-06-13 Shaoxu Song , Lei Chen

Machine Learning-Based Optimal Mesh Generation in Computational Fluid Dynamics

Computational Fluid Dynamics (CFD) is a major sub-field of engineering. Corresponding flow simulations are typically characterized by heavy computational resource requirements. Often, very fine and complex meshes are required to resolve…

Machine Learning · Computer Science 2021-02-26 Keefe Huang , Moritz Krügener , Alistair Brown , Friedrich Menhorn , Hans-Joachim Bungartz , Dirk Hartmann

Large Language Models for Constrained-Based Causal Discovery

Causality is essential for understanding complex systems, such as the economy, the brain, and the climate. Constructing causal graphs often relies on either data-driven or expert-driven approaches, both fraught with challenges. The former…

Artificial Intelligence · Computer Science 2024-06-12 Kai-Hendrik Cohrs , Gherardo Varando , Emiliano Diaz , Vasileios Sitokonstantinou , Gustau Camps-Valls