Model selection and error estimation

dc.contributor.authorBartlett, Peterca
dc.contributor.authorBoucheron, Stéphaneca
dc.contributor.authorLugosi, Gáborca
dc.contributor.otherUniversitat Pompeu Fabra. Departament d'Economia i Empresa
dc.date.accessioned2017-07-26T12:07:55Z
dc.date.available2017-07-26T12:07:55Z
dc.date.issued2000-10-01
dc.date.modified2017-07-23T02:05:46Z
dc.description.abstractWe study model selection strategies based on penalized empirical loss minimization. We point out a tight relationship between error estimation and data-based complexity penalization: any good error estimate may be converted into a data-based penalty function and the performance of the estimate is governed by the quality of the error estimate. We consider several penalty functions, involving error estimates on independent test data, empirical {\sc vc} dimension, empirical {\sc vc} entropy, and margin-based quantities. We also consider the maximal difference between the error on the first half of the training data and the second half, and the expected maximal discrepancy, a closely related capacity estimate that can be calculated by Monte Carlo integration. Maximal discrepancy penalty functions are appealing for pattern classification problems, since their computation is equivalent to empirical risk minimization over the training data with some labels flipped.
dc.format.mimetypeapplication/pdfca
dc.identifierhttps://econ-papers.upf.edu/ca/paper.php?id=508
dc.identifier.citationMachine Learning. vol.48, pp. 85-113, 2002
dc.identifier.urihttp://hdl.handle.net/10230/1148
dc.language.isoeng
dc.relation.ispartofseriesEconomics and Business Working Papers Series; 508
dc.rightsL'accés als continguts d'aquest document queda condicionat a l'acceptació de les condicions d'ús establertes per la següent llicència Creative Commons
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject.keywordcomplexity regularization
dc.subject.keywordmodel selection
dc.subject.keyworderror estimation
dc.subject.keywordconcentration of measure
dc.subject.keywordStatistics, Econometrics and Quantitative Methods
dc.titleModel selection and error estimationca
dc.typeinfo:eu-repo/semantics/workingPaper

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
508.pdf
Size:
483.76 KB
Format:
Adobe Portable Document Format

License

Rights