Год выпуска: 2014 Автор: Md. Muzammel Hosen and Mohammad Shahidul Islam Издательство: LAP Lambert Academic Publishing Страниц: 112 ISBN: 9783659556401 Описание In the context of microarray data, a common characteristic is that the number of parameter is greater than the number of samples (n?p). Because of this feature, many existing methods, derived for the usual “small p and large n” problem, either cannot be applied or may not perform well. For the purpose of classification of tumor types in real and simulated microarray data using regularized and classification approaches, we have studied three regression methods, namely Least Absolute Shrinkage and Selection Operator (LASSO), ridge regression, elastic net and four classification methods namely KNN, SVM, RDA and DLDA. In order to evaluation, we have used four readily available real microarray data sets which are Colon, Brain, SRBCT and Spira. The lasso imposes an L1 penalty and ridge regression imposes an L2 penalty; whereas, the elastic net is a balance between these two. Real data and simulation study show that the elastic net outperforms the lasso, although they both are derived... |