Integrative analysis of multiple cancer prognosis studies with gene expression measurements

Shuangge Ma, Jian Huang, Fengrong Wei, Yang Xie, Kuangnan Fang

Research output: Contribution to journal › Article › peer-review

29 Scopus citations

Abstract

Although microarray gene profiling studies in cancer research have successfully identified genetic variants predisposing to the development and progression of cancer, markers identified from the analysis of single datasets often suffer from low reproducibility. Among the multiple possible causes, the most important is the small sample size, and hence the lack of power, of single studies. Integrative analysis jointly considers multiple heterogeneous studies, has a significantly larger sample size, and can improve reproducibility. In this article, we focus on cancer prognosis studies, where the response variables are progression-free, overall, or other types of survival. We propose a group minimax concave penalty (GMCP) penalized integrative analysis approach for analyzing multiple heterogeneous cancer prognosis studies with microarray gene expression measurements, and develop an efficient group coordinate descent algorithm to compute it. The GMCP automatically accommodates the heterogeneity across multiple datasets, and the identified markers have consistent effects across multiple studies. Simulation studies show that the GMCP provides significantly improved selection results compared with existing meta-analysis approaches, intensity approaches, and group Lasso penalized integrative analysis. We apply the GMCP to four microarray studies and identify genes associated with the prognosis of breast cancer.
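
The abstract names the group minimax concave penalty and a group coordinate descent algorithm without giving formulas. As a rough illustration only, the Python sketch below uses the standard textbook form of the MCP, applied to the Euclidean norm of one gene's coefficients across studies, together with the closed-form group update that holds when each group's design is orthonormalized. The function names, the tuning parameters `lam` and `gamma`, and the absence of any group-size weighting are assumptions for this sketch; the article's survival model and exact algorithm are not reproduced here.

```python
import math


def mcp_penalty(t, lam, gamma):
    """Standard MCP value at |t| for tuning parameters lam > 0, gamma > 1.

    Assumed textbook form: lam*|t| - t^2/(2*gamma) up to |t| = gamma*lam,
    then constant at gamma*lam^2/2.
    """
    t = abs(t)
    if t <= gamma * lam:
        return lam * t - t * t / (2.0 * gamma)
    return 0.5 * gamma * lam * lam


def group_mcp_penalty(beta_by_gene, lam, gamma):
    """Sum of MCP applied to each gene's coefficient norm across studies.

    beta_by_gene: list of lists; entry j holds gene j's coefficients,
    one per study (hypothetical layout for this sketch).
    """
    total = 0.0
    for beta in beta_by_gene:
        norm = math.sqrt(sum(b * b for b in beta))
        total += mcp_penalty(norm, lam, gamma)
    return total


def group_firm_threshold(z, lam, gamma):
    """One group update, assuming an orthonormalized group design.

    z is the unpenalized solution for the group. The whole group is set
    to zero, shrunk, or left untouched depending on its norm.
    """
    if gamma <= 1.0:
        raise ValueError("gamma must exceed 1 for this closed form")
    norm = math.sqrt(sum(v * v for v in z))
    if norm <= lam:
        return [0.0] * len(z)
    if norm <= gamma * lam:
        scale = gamma / (gamma - 1.0) * (1.0 - lam / norm)
        return [scale * v for v in z]
    return list(z)
```

Because the update acts on a gene's whole coefficient vector at once, a gene is either dropped from all studies or kept in all of them, while the magnitudes of its effects remain free to differ between studies.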

Original language: English (US)
Pages (from-to): 3361-3371
Number of pages: 11
Journal: Statistics in Medicine
Volume: 30
Issue number: 28
State: Published - Dec 10 2011

Keywords

  • Cancer prognosis
  • Integrative analysis
  • Microarray
  • Penalized selection

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability
