434 articles – 313 references  [version française]
Short view
Identifying common prognostic factors in genomic cancer studies: a novel index for censored outcomes.
Rouam S., Moreau T., Broët P.
BMC Bioinformatics 11, 1 (2010) 150 - http://www.hal.inserm.fr/inserm-00663764
 (20334636) 
Identifying common prognostic factors in genomic cancer studies: a novel index for censored outcomes.
Sigrid Rouam () 1, 2, Thierry Moreau3, Philippe Broët1, 2
1:  Computational and Mathematical Biology
Genome Institute of Singapore
Singapore 138672
Singapore
2:  Méthodologie biostatistique de la génomique fonctionnelle en épidémiologie clinique
INSERM : JE2492 – IFR69
France
3:  Recherche en épidémiologie et biostatistique
INSERM : IFR69 – Université Paris XI - Paris Sud
16, Avenue Paul Vaillant-Couturier 94807 VILLEJUIF CEDEX
France
BACKGROUND: With the growing number of public repositories for high-throughput genomic data, it is of great interest to combine the results produced by independent research groups. Such a combination allows the identification of common genomic factors across multiple cancer types and provides new insights into the disease process. In the framework of the proportional hazards model, classical procedures, which consist of ranking genes according to the estimated hazard ratio or the p-value obtained from a test statistic of no association between survival and gene expression level, are not suitable for gene selection across multiple genomic datasets with different sample sizes. We propose a novel index for identifying genes with a common effect across heterogeneous genomic studies designed to remain stable whatever the sample size and which has a straightforward interpretation in terms of the percentage of separability between patients according to their survival times and gene expression measurements. RESULTS: The simulations results show that the proposed index is not substantially affected by the sample size of the study and the censoring. They also show that its separability performance is higher than indices of predictive accuracy relying on the likelihood function. A simulated example illustrates the good operating characteristics of our index. In addition, we demonstrate that it is linked to the score statistic and possesses a biologically relevant interpretation.The practical use of the index is illustrated for identifying genes with common effects across eight independent genomic cancer studies of different sample sizes. The meta-selection allows the identification of four genes (ESPL1, KIF4A, HJURP, LRIG1) that are biologically relevant to the carcinogenesis process and have a prognostic impact on survival outcome across various solid tumors. CONCLUSION: The proposed index is a promising tool for identifying factors having a prognostic impact across a collection of heterogeneous genomic datasets of various sizes.
Life Sciences/Biochemistry, Molecular Biology/Genomics, Transcriptomics and Proteomics
Life Sciences/Bioinformatics and Systemic Biology
Computer Science/Bioinformatics
English
1471-2105

Article in peer-reviewed journal
10.1186/1471-2105-11-150
BMC Bioinformatics (BMC Bioinformatics)
Publisher BioMed Central
ISSN 1471-2105 
international
2010
2010-03-24
11
1
150

Computer Simulation – Databases – Genetic – Gene Expression Profiling – Genome – Human – Humans – Likelihood Functions – Neoplasms – Oligonucleotide Array Sequence Analysis – Prognosis – Sample Size
We also thank the following institutions for general funding: the Genome Institute of Singapore (Singapore) and the French Ministry of Higher Education and Research (France).
Attached file list to this document: 
PDF
1471-2105-11-150.pdf(1008.2 KB)
ANNEX
1471-2105-11-150.xml(99.9 KB)
1471-2105-11-150-S1.PDF(34.5 KB)
1471-2105-11-150-S2.PDF(30.6 KB)
1471-2105-11-150-S4.PDF(24.1 KB)
1471-2105-11-150-S5.PDF(33.6 KB)
1471-2105-11-150-S3.PDF(30.2 KB)