On data analysis and variable selection: the minimum entropy analysis

Data Analysis, Statistics and Probability 2007-05-23 v1 Geophysics

Abstract

In this work, we present a minimum entropy analysis scheme for variable selection and preliminary data analysis. Variable selection can be achieved by ranking variables in order of increasing preference. We show that this preference has a unique form, given by the entropy of the models associated with the variables. Evaluating the entropy therefore provides a complete ranking scheme for the variables. This scheme not only indicates the preferred variables but may also reveal the nature and properties of the system. We illustrate the proposed scheme by analyzing a set of geological data for three carbonate rock units in Texas and Oklahoma, and compare the results with discriminant function analysis. The results suggest that the scheme provides a quick and robust analysis and that its use in data analysis is promising.

Citation

@article{arxiv.physics/0609250,
  title  = {On data analysis and variable selection: the minimum entropy analysis},
  author = {Chih-Yuan Tseng and Chien-Chih Chen},
  journal= {arXiv preprint arXiv:physics/0609250},
  year   = {2007}
}

Comments

9 pages and 2 tables