Model selection via the lasso in conditional logistic regression

Abstract : We propose a model selection procedure in the context of matched case-control studies and, more specifically, for the conditional logistic regression. The method is based on penalized conditional likelihood with an L1-type penalty of the regression coefficients, the so-called lasso. This penalty, which shrinks coefficients to improve the accuracy of prediction, is particularly adapted when the number of covariates is large (with respect to the number of events) or in case of collinearity between them. An attractive property of the lasso is that it performs parameter estimation and variable selection simultaneously. The implementation of the method is based on a simple modification of the R-package "penalized" for fitting generalized linear and Cox proportional hazard models with L1 or L2 penalties, developed by J. Goeman. The effectiveness of the algorithms allows the use of resampling methods for choosing the regularization term and for evaluating the stability of the selected model. Thus, K-fold likelihood cross-validation and bootstrap are applied, both taking into account the dependent nature of data. The methodology is illustrated with some examples.
Type de document :
Communication dans un congrès
Second International Biometric Society Channel Network Conference, Apr 2009, Ghent, Belgium. pp.Model selection and clustering session, 2009
Liste complète des métadonnées

http://www.hal.inserm.fr/inserm-00402434
Contributeur : Marta Avalos <>
Soumis le : mardi 7 juillet 2009 - 13:06:22
Dernière modification le : mercredi 29 novembre 2017 - 14:53:55
Document(s) archivé(s) le : mardi 15 juin 2010 - 17:52:19

Fichier

AvalosMarta20090214174632.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inserm-00402434, version 1

Collections

Citation

Marta Avalos. Model selection via the lasso in conditional logistic regression. Second International Biometric Society Channel Network Conference, Apr 2009, Ghent, Belgium. pp.Model selection and clustering session, 2009. 〈inserm-00402434〉

Partager

Métriques

Consultations de la notice

209

Téléchargements de fichiers

225