Related papers: Maximum Entropy Regularization and Chinese Text Re…

Classes for Fast Maximum Entropy Training

Maximum entropy models are considered by many to be one of the most promising avenues of language modeling research. Unfortunately, long training times make maximum entropy research difficult. We present a novel speedup technique: we change…

Computation and Language · Computer Science 2007-05-23 Joshua Goodman

Alignment Entropy Regularization

Existing training criteria in automatic speech recognition(ASR) permit the model to freely explore more than one time alignments between the feature and label sequences. In this paper, we use entropy to measure a model's uncertainty, i.e.…

Computation and Language · Computer Science 2022-12-26 Ehsan Variani , Ke Wu , David Rybach , Cyril Allauzen , Michael Riley

Handwritten Chinese Character Recognition by Convolutional Neural Network and Similarity Ranking

Convolution Neural Networks (CNN) have recently achieved state-of-the art performance on handwritten Chinese character recognition (HCCR). However, most of CNN models employ the SoftMax activation function and minimize cross entropy loss,…

Machine Learning · Computer Science 2019-09-02 Junyi Zou , Jinliang Zhang , Ludi Wang

Enhancing Topic Extraction in Recommender Systems with Entropy Regularization

In recent years, many recommender systems have utilized textual data for topic extraction to enhance interpretability. However, our findings reveal a noticeable deficiency in the coherence of keywords within topics, resulting in low…

Computation and Language · Computer Science 2023-06-14 Xuefei Jiang , Dairui Liu , Ruihai Dong

Efficient Neural Task Adaptation by Maximum Entropy Initialization

Transferring knowledge from one neural network to another has been shown to be helpful for learning tasks with few training examples. Prevailing fine-tuning methods could potentially contaminate pre-trained features by comparably high…

Machine Learning · Computer Science 2019-07-15 Farshid Varno , Behrouz Haji Soleimani , Marzie Saghayi , Lisa Di Jorio , Stan Matwin

Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

The flourishing blossom of deep learning has witnessed the rapid development of text recognition in recent years. However, the existing text recognition methods are mainly proposed for English texts. As another widely-spoken language,…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Haiyang Yu , Jingye Chen , Bin Li , Jianqi Ma , Mengnan Guan , Xixi Xu , Xiaocong Wang , Shaobo Qu , Xiangyang Xue

Self-Supervised Learning via Maximum Entropy Coding

A mainstream type of current self-supervised learning methods pursues a general-purpose representation that can be well transferred to downstream tasks, typically by optimizing on a given pretext task such as instance discrimination. In…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Xin Liu , Zhongdao Wang , Yali Li , Shengjin Wang

Preserving Fine-Grain Feature Information in Classification via Entropic Regularization

Labeling a classification dataset implies to define classes and associated coarse labels, that may approximate a smoother and more complicated ground truth. For example, natural images may contain multiple objects, only one of which is…

Computer Vision and Pattern Recognition · Computer Science 2022-08-09 Raphael Baena , Lucas Drumetz , Vincent Gripon

Mixing between the Cross Entropy and the Expectation Loss Terms

The cross entropy loss is widely used due to its effectiveness and solid theoretical grounding. However, as training progresses, the loss tends to focus on hard to classify samples, which may prevent the network from obtaining gains in…

Machine Learning · Computer Science 2021-09-14 Barak Battash , Lior Wolf , Tamir Hazan

Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning

Scene text recognition has been studied for decades due to its broad applications. However, despite Chinese characters possessing different characteristics from Latin characters, such as complex inner structures and large categories, few…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Haiyang Yu , Xiaocong Wang , Bin Li , Xiangyang Xue

Generalization in Transfer Learning

Agents trained with deep reinforcement learning algorithms are capable of performing highly complex tasks including locomotion in continuous environments. We investigate transferring the learning acquired in one task to a set of previously…

Machine Learning · Computer Science 2024-03-06 Suzan Ece Ada , Emre Ugur , H. Levent Akin

Statistical theory for image classification using deep convolutional neural networks with cross-entropy loss under the hierarchical max-pooling model

Convolutional neural networks (CNNs) trained with cross-entropy loss have proven to be extremely successful in classifying images. In recent years, much work has been done to also improve the theoretical understanding of neural networks.…

Statistics Theory · Mathematics 2024-04-30 Michael Kohler , Sophie Langer

Minimum Entropy Aproach to Word Segmentation Problems

Given a sequence composed of a limit number of characters, we try to "read" it as a "text". This involves to segment the sequence into "words". The difficulty is to distinguish good segmentation from enormous number of random ones.Aiming at…

Biological Physics · Physics 2009-11-06 Bin Wang

Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency

Abstractive text summarization is a highly difficult problem, and the sequence-to-sequence model has shown success in improving the performance on the task. However, the generated summaries are often inconsistent with the source content in…

Computation and Language · Computer Science 2018-05-11 Bingzhen Wei , Xuancheng Ren , Xu Sun , Yi Zhang , Xiaoyan Cai , Qi Su

Incremental Learning with Maximum Entropy Regularization: Rethinking Forgetting and Intransigence

Incremental learning suffers from two challenging problems; forgetting of old knowledge and intransigence on learning new knowledge. Prediction by the model incrementally learned with a subset of the dataset are thus uncertain and the…

Machine Learning · Computer Science 2019-02-05 Dahyun Kim , Jihwan Bae , Yeonsik Jo , Jonghyun Choi

An Adversarial Multi-Task Learning Method for Chinese Text Correction with Semantic Detection

Text correction, especially the semantic correction of more widely used scenes, is strongly required to improve, for the fluency and writing efficiency of the text. An adversarial multi-task learning method is proposed to enhance the…

Computation and Language · Computer Science 2023-06-29 Fanyu Wang , Zhenping Xie

Maximum Entropy, Word-Frequency, Chinese Characters, and Multiple Meanings

The word-frequency distribution of a text written by an author is well accounted for by a maximum entropy distribution, the RGF (random group formation)-prediction. The RGF-distribution is completely determined by the a priori values of the…

Physics and Society · Physics 2017-10-03 Xiao-Yong Yan , Petter Minnhagen

Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning

Large Language Models (LLMs) have demonstrated impressive performance across various tasks. However, current training approaches combine standard cross-entropy loss with extensive data, human feedback, or ad hoc methods to enhance…

Computation and Language · Computer Science 2024-12-16 Daniele Rege Cambrin , Giuseppe Gallipoli , Irene Benedetto , Luca Cagliero , Paolo Garza

Maximum mutual information regularized classification

In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned…

Machine Learning · Computer Science 2014-09-30 Jim Jing-Yan Wang , Yi Wang , Shiguang Zhao , Xin Gao

MCTS: A Multi-Reference Chinese Text Simplification Dataset

Text simplification aims to make the text easier to understand by applying rewriting transformations. There has been very little research on Chinese text simplification for a long time. The lack of generic evaluation data is an essential…

Computation and Language · Computer Science 2024-06-06 Ruining Chong , Luming Lu , Liner Yang , Jinran Nie , Zhenghao Liu , Shuo Wang , Shuhan Zhou , Yaoxin Li , Erhong Yang