Chapter of Book
Dimensionality Reduction for Classification: Comparison of Techniques and Dimension Choice
Author/s | Plastria, Frank
Bruyne, Steven de Carrizosa Priego, Emilio José |
Editor | Tang, Changjie
Ling, Charles X. Zhou, Xiaofang Cercone, Nick J. Li, Xue |
Department | Universidad de Sevilla. Departamento de Estadística e Investigación Operativa |
Publication Date | 2008 |
Deposit Date | 2016-10-26 |
Published in |
|
ISBN/ISSN | 3540881913 9783540881919 9783540881926 0302-9743 |
Abstract | We investigate the effects of dimensionality reduction using different techniques and different dimensions on six two-class data sets with numerical attributes as pre-processing for two classification algorithms.
Besides ... We investigate the effects of dimensionality reduction using different techniques and different dimensions on six two-class data sets with numerical attributes as pre-processing for two classification algorithms. Besides reducing the dimensionality with the use of principal components and linear discriminants, we also introduce four new techniques. After this dimensionality reduction two algorithms are applied. The first algorithm takes advantage of the reduced dimensionality itself while the second one directly exploits the dimensional ranking. We observe that neither a single superior dimensionality reduction technique nor a straightforward way to select the optimal dimension can be identified. On the other hand we show that a good choice of technique and dimension can have a major impact on the classification power, generating classifiers that can rival industry standards. We conclude that dimensionality reduction should not only be used for visualisation or as pre-processing on very high dimensional data, but also as a general preprocessing technique on numerical data to raise the classification power. The difficult choice of both the dimensionality reduction technique and the reduced dimension however, should be directly based on the effects on the classification power. |
Project ID. | OZR1372 |
Citation | Plastria, F., Bruyne, S.d., y Carrizosa Priego, E.J. (2008). Dimensionality Reduction for Classification: Comparison of Techniques and Dimension Choice. En X. Zhou, N.J. Cercone, C. Tang, C.X. Ling, X. Li (Ed.), Advanced Data Mining and Applications: 4th International Conference, ADMA 2008. Chengdu, China, October 8-10, 2008. Proceedings (pp. 411-418). Berlin: Springer |
Files | Size | Format | View | Description |
---|---|---|---|---|
Dimensionality Reduction for ... | 274.0Kb | [PDF] | View/ | |