Parsimonious additive logistic models - Archive ouverte HAL Access content directly
Conference Papers Year : 2008

Parsimonious additive logistic models

(1)
1

Abstract

Logistic regression is a standard tool in statistics for binary classification. The logistic model relates the logarithm of the odds-ratio to the predictors via a linear regression model. A generalization is the additive logistic model, which replaces each linear term by an unspecified smooth function, allowing for more flexibility while preserving interpretability. Another variant is penalized logistic regression, which shrinks coefficients to improve the accuracy of prediction. Ridge regression (L2-penalization) and lasso (L1-penalization) are the main penalization procedures. An attractive property of the later is that it performs parameter estimation and variable selection simultaneously. New theoretical results, efficient algorithms, and available software play a major role in the recent popularization of lasso. In this study, L1-penalization is adapted to additive logistic regression fitted by smoothing splines. Coefficients associated to predictors. Coefficients associated to predictors with little effect on the response may be shrunk (some of them to zero). This approach gives parsimonious models, removes irrelevant variables, and identifies non linear trends. The estimates are computed via the usual Newton-Raphson update, combined with the lars-lasso algorithm, to resolve the penalization problem, and the backfitting algorithm to fit additive models. Different criteria based on the effective degrees of freedom are proposed to choose the penalization parameters. Performance is illustrated with some examples.
Fichier principal
Vignette du fichier
Parsimonious_additive_log.pdf (14.91 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

inserm-00402444 , version 1 (07-07-2009)

Identifiers

  • HAL Id : inserm-00402444 , version 1

Cite

Marta Avalos. Parsimonious additive logistic models. First Workshop of the ERCIM Working Group on Computing & Statistics, Jun 2008, Neuchâtel, Switzerland. page 56. ⟨inserm-00402444⟩

Collections

INSERM
87 View
424 Download

Share

Gmail Facebook Twitter LinkedIn More