Bioinformatics with Park-Kleis 기본 콘텐츠로 건너뛰기

라벨이 Radiomics인 게시물 표시

Radiomics: Feature selection 3

  Demircioğlu, Aydin PhD  Benchmarking Feature Selection Methods in Radiomics, Investigative Radiology: January 18, 2022 - Volume - Issue - doi: 10.1097/RLI.0000000000000855 High dimensionality of the datasets and small sample sizes are critical problems in radiomics. Therefore, removing redundant features and irrelevant features is needed. Overall, per dataset,  30 different feature selection methods + 10 classifiers + 70 hyperparameter settings After each feature selection method, 1, 2, ..., 64 features were selected. Altogether, 14,700=30✕70 ✕7 models were fitted, each with a 10-fold cross-validation . More complex methods are more unstable than simpler feature selection methods. LASSO performed best when analysing the predictive performance , though it showed only average feature stability . Longer training times and higher computational complexity of the feature selection method do not mean for high predictive performance necessarily. Obtaining a more stable mode...

Radiomics: Feature selection 2

 앞서 Radiomics에서 많이 사용되고 있는 Feature selection 방법에 대해서 이야기 하였다. 이번에는 조금 더 세분화하여 설명해보도록 하겠다. 14 feature selection methods &  12 classification methods  in terms of predictive performance and stability. Methods

Radiomics: Feature selection

Radiomics에서 Feature를 선택하는 것은 핵심 중의 핵심이다.  열심히 영상을 다듬고 영상에 대한 여러 value를 뽑아 놓아도 feature selection을 잘못하면 그동안의 노력이 물거품이 되기 때문이다. Feature selection에는 여러 가지 방안들이 제시되어 왔는데 가장 많이 사용되는 방법들을 정리해보고자 한다. In omics experiments, one of the ultimate goals is the identification of features(biomarkers) that are different between treatment groups. One of the very common problems in omics data is that the sample size is small but huge number of features which can lead to over-fitting. What can be alternative methods to overcome this problem? The first paradigm  - LASSO : based on classification approaches and compares the least absolute shrinkage and selection operator.  - Ridge regression  - Elastic Net feature selection methods The second paradigm  - using a linear models framework : individual features are modeled separately ignoring the correlation structure among features.   Omics data analysing 순서      ⇨ original feature subsets ⇨ classification approach...