Related papers: Data Augmentation for Sparse Multidimensional Lear…

3DG: A Framework for Using Generative AI for Handling Sparse Learner Performance Data From Intelligent Tutoring Systems

Learning performance data (e.g., quiz scores and attempts) is significant for understanding learner engagement and knowledge mastery level. However, the learning performance data collected from Intelligent Tutoring Systems (ITSs) often…

Computers and Society · Computer Science 2024-02-06 Liang Zhang , Jionghao Lin , Conrad Borchers , Meng Cao , Xiangen Hu

Generative Adversarial Networks for Imputing Sparse Learning Performance

Learning performance data, such as correct or incorrect responses to questions in Intelligent Tutoring Systems (ITSs) is crucial for tracking and assessing the learners' progress and mastery of knowledge. However, the issue of data…

Machine Learning · Computer Science 2024-09-23 Liang Zhang , Mohammed Yeasin , Jionghao Lin , Felix Havugimana , Xiangen Hu

Generative Data Imputation for Sparse Learner Performance Data Using Generative Adversarial Imputation Networks

Learner performance data collected by Intelligent Tutoring Systems (ITSs), such as responses to questions, is essential for modeling and predicting learners' knowledge states. However, missing responses due to skips or incomplete attempts…

Machine Learning · Computer Science 2025-04-15 Liang Zhang , Jionghao Lin , John Sabatini , Diego Zapata-Rivera , Carol Forsyth , Yang Jiang , John Hollander , Xiangen Hu , Arthur C. Graesser

Toward Understanding Generative Data Augmentation

Generative data augmentation, which scales datasets by obtaining fake labeled examples from a trained conditional generative model, boosts classification performance in various learning tasks including (semi-)supervised learning, few-shot…

Machine Learning · Computer Science 2023-05-30 Chenyu Zheng , Guoqiang Wu , Chongxuan Li

Improving Automated Feedback Systems for Tutor Training in Low-Resource Scenarios through Data Augmentation

Tutoring is an effective instructional method for enhancing student learning, yet its success relies on the skill and experience of the tutors. This reliance presents challenges for the widespread implementation of tutoring, particularly in…

Human-Computer Interaction · Computer Science 2025-10-21 Chentianye Xu , Jionghao Lin , Tongshuang Wu , Vincent Aleven , Kenneth R. Koedinger

Improve Learning from Crowds via Generative Augmentation

Crowdsourcing provides an efficient label collection schema for supervised machine learning. However, to control annotation cost, each instance in the crowdsourced data is typically annotated by a small number of annotators. This creates a…

Machine Learning · Computer Science 2021-07-23 Zhendong Chu , Hongning Wang

Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLU

Data sparsity is one of the key challenges associated with model development in Natural Language Understanding (NLU) for conversational agents. The challenge is made more complex by the demand for high quality annotated utterances commonly…

Computation and Language · Computer Science 2020-12-11 Olga Golovneva , Charith Peris

Knowledge Tracing for Complex Problem Solving: Granular Rank-Based Tensor Factorization

Knowledge Tracing (KT), which aims to model student knowledge level and predict their performance, is one of the most important applications of user modeling. Modern KT approaches model and maintain an up-to-date state of student knowledge…

Computers and Society · Computer Science 2022-10-18 Chunpai Wang , Shaghayegh Sahebi , Siqian Zhao , Peter Brusilovsky , Laura O. Moraes

Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization

Producing large complex simulation datasets can often be a time and resource consuming task. Especially when these experiments are very expensive, it is becoming more reasonable to generate synthetic data for downstream tasks. Recently,…

Machine Learning · Computer Science 2025-08-28 Paimon Goulart , Shaan Pakala , Evangelos Papalexakis

Dynamic Tensor Clustering

Dynamic tensor data are becoming prevalent in numerous applications. Existing tensor clustering methods either fail to account for the dynamic nature of the data, or are inapplicable to a general-order tensor. Also there is often a gap…

Machine Learning · Statistics 2018-09-17 Will Wei Sun , Lexin Li

Discovering Hidden Structure in High Dimensional Human Behavioral Data via Tensor Factorization

In recent years, the rapid growth in technology has increased the opportunity for longitudinal human behavioral studies. Rich multimodal data, from wearables like Fitbit, online social networks, mobile phones etc. can be collected in…

Machine Learning · Computer Science 2019-05-23 Homa Hosseinmardi , Hsien-Te Kao , Kristina Lerman , Emilio Ferrara

Generative Learning of Continuous Data by Tensor Networks

Beyond their origin in modeling many-body quantum systems, tensor networks have emerged as a promising class of models for solving machine learning problems, notably in unsupervised generative learning. While possessing many desirable…

Machine Learning · Computer Science 2024-07-26 Alex Meiburg , Jing Chen , Jacob Miller , Raphaëlle Tihon , Guillaume Rabusseau , Alejandro Perdomo-Ortiz

Influence-guided Data Augmentation for Neural Tensor Completion

How can we predict missing values in multi-dimensional data (or tensors) more accurately? The task of tensor completion is crucial in many applications such as personalized recommendation, image and video restoration, and link prediction in…

Machine Learning · Computer Science 2021-08-24 Sejoon Oh , Sungchul Kim , Ryan A. Rossi , Srijan Kumar

Tensor Train Factorization and Completion under Noisy Data with Prior Analysis and Rank Estimation

Tensor train (TT) decomposition, a powerful tool for analyzing multidimensional data, exhibits superior performance in many machine learning tasks. However, existing methods for TT decomposition either suffer from noise overfitting, or…

Signal Processing · Electrical Eng. & Systems 2023-06-27 Le Xu , Lei Cheng , Ngai Wong , Yik-Chung Wu

Guiding Generative Language Models for Data Augmentation in Few-Shot Text Classification

Data augmentation techniques are widely used for enhancing the performance of machine learning models by tackling class imbalance issues and data sparsity. State-of-the-art generative language models have been shown to provide significant…

Computation and Language · Computer Science 2023-01-10 Aleksandra Edwards , Asahi Ushio , Jose Camacho-Collados , Hélène de Ribaupierre , Alun Preece

Data augmentation using generative networks to identify dementia

Data limitation is one of the most common issues in training machine learning classifiers for medical applications. Due to ethical concerns and data privacy, the number of people that can be recruited to such experiments is generally…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-14 Bahman Mirheidari , Yilin Pan , Daniel Blackburn , Ronan O'Malley , Traci Walker , Annalena Venneri , Markus Reuber , Heidi Christensen

Bayesian Sparse Tucker Models for Dimension Reduction and Tensor Completion

Tucker decomposition is the cornerstone of modern machine learning on tensorial data analysis, which have attracted considerable attention for multiway feature extraction, compressive sensing, and tensor completion. The most challenging…

Machine Learning · Computer Science 2015-05-12 Qibin Zhao , Liqing Zhang , Andrzej Cichocki

Generative Forests

We focus on generative AI for a type of data that still represent one of the most prevalent form of data: tabular data. Our paper introduces two key contributions: a new powerful class of forest-based models fit for such tasks and a simple…

Machine Learning · Computer Science 2024-11-15 Richard Nock , Mathieu Guillame-Bert

Graphical model for factorization and completion of relatively high rank tensors by sparse sampling

We consider tensor factorizations based on sparse measurements of the components of relatively high rank tensors. The measurements are designed in a way that the underlying graph of interactions is a random graph. The setup will be useful…

Machine Learning · Statistics 2026-04-15 Angelo Giorgio Cavaliere , Riki Nagasawa , Shuta Yokoi , Tomoyuki Obuchi , Hajime Yoshino

Automated Training of Learned Database Components with Generative AI

The use of deep learning for database optimization has gained significant traction, offering improvements in indexing, cardinality estimation, and query optimization. However, acquiring high-quality training data remains a significant…

Databases · Computer Science 2025-12-24 Angjela Davitkova , Sebastian Michel