Application of Discriminant Analysis and Cross-Validation on Proteomics Data
- Resource Type
- Authors
- Julia, Kuligowski; David, Pérez-Guaita; Guillermo, Quintás
- Source
- Methods in molecular biology (Clifton, N.J.). 1362
- Subject
- Proteomics
Discriminant Analysis
Reproducibility of Results
- Language
- ISSN
- 1940-6029
High-throughput proteomic experiments have raised the importance and complexity of bioinformatic analysis to extract useful information from raw data. Discriminant analysis is frequently used to identify differences among test groups of individuals or to describe combinations of discriminant variables. However, even in relatively large studies, the number of detected variables typically largely exceeds the number of samples and the classifiers should be thoroughly validated to assess their performance for new samples. Cross-validation is a widely approach when an external validation set is not available. In this chapter, different approaches for cross-validation are presented including relevant aspects that should be taken into account to avoid overly optimistic results and the assessment of the statistical significance of cross-validated figures of merit.