Redescending M-estimators and Deterministic Annealing, with Applications to Robust Regression and Tail Index Estimation
DOI:
https://doi.org/10.17713/ajs.v37i3&4.310Abstract
A new type of redescending M-estimators is constructed, based on data augmentation with an unspecified outlier model. Necessary and sufficient conditions for the convergence of the resulting estimators to the Hubertype skipped mean are derived. By introducing a temperature parameter the concept of deterministic annealing can be applied, making the estimator insensitive to the starting point of the iteration. The properties of the annealingM-estimator as a function of the temperature are explored. Finally, two applications
are presented. The first one is the robust estimation of interaction vertices in experimental particle physics, including outlier detection. The second one is the estimation of the tail index of a distribution from a sample using robust regression diagnostics.
References
Atkinson, A., and Riani, M. (2000). Robust Diagnostic Regression Analysis. New York: Springer.
Beirlant, J., Vynckier, P., and Teugels, J. L. (1996). Tail index estimation, Pareto quantile plots, and regression diagnostics. Journal of the American Statistical Asssociation, 91, 1659.
Bickel, D. R., and Frühwirth, R. (2006). On a fast, robust estimator of the mode: Comparisons to other robust estimators with applications.
CMS collaboration. (1994). CMS Technical Proposal (Tech. Rep.). (Technical Report CERN/LHCC 94-38, CERN, Geneva)
CMS Collaboration. (2007). CMS Detector Information. (url:
http://cmsinfo.cern.ch/outreach/CMSdetectorInfo/CMSdetectorInfo.html)
Coreless, R. M., Gonnet, G. H., Hare, D. E. G., Jerrey, D. J., and Knuth, D. E. (1996). On the Lambert W Function.
Dempster, A. P., Laird, N. M., and Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39, 1.
Frühwirth, R., and Strandlie, A. (1999). Track fitting with ambiguities and noise: a study of elastic tracking and nonlinear filters. Computer Physics Communications, 120, 197.
Garlipp, T., andMüller, C. (2005). Regression clustering with redescending M-estimators. In D. Baier and K. Wernecke (Eds.), Innovations in Classification, Data Science, and Information Systems. Berlin, Heidelberg, New York: Springer.
Hampel, F. R., Ronchetti, E. M., Rousseeuw, P. J., and Stahel, W. A. (1986). Robust Statistics: The Approach Based on Influence Functions. New York: John Wiley & Sons.
Hill, B. M. (1975). A simple general approach to inference about the tail of a distribution. The Annals of Statistics, 3, 1163.
Huber, P. J. (2004). Robust Statistics: Theory and Methods. New York: John Wiley & Sons.
Li, S. Z. (1996). Robustizing robust M-estimation using deterministic annealing. Pattern recognition, 29, 159.
Müller, C. (2004). Redescending M-estimators in regression analysis, cluster analysis and image analysis. Discussiones Mathematicae — Probability and Statistics, 24, 59.
Rose, K. (1998). Deterministic annealing for clustering, compression, classification, regression, and related optimization problems.
Rousseeuw, P. J., and Leroy, A. M. (1987). Robust Regression and Outlier Detection.
Seneta, E. (1976). Regularly Varying Functions. Berlin, Heidelberg, New York: Springer.
Waltenberger, W., Frühwirth, R., and Vanlaer, P. (2007). Adaptive vertex fitting. Journal of Physics G: Nuclear and Particle Physics, 34, 343.
Downloads
Published
How to Cite
Issue
Section
License
The Austrian Journal of Statistics publish open access articles under the terms of the Creative Commons Attribution (CC BY) License.
The Creative Commons Attribution License (CC-BY) allows users to copy, distribute and transmit an article, adapt the article and make commercial use of the article. The CC BY license permits commercial and non-commercial re-use of an open access article, as long as the author is properly attributed.
Copyright on any research article published by the Austrian Journal of Statistics is retained by the author(s). Authors grant the Austrian Journal of Statistics a license to publish the article and identify itself as the original publisher. Authors also grant any third party the right to use the article freely as long as its original authors, citation details and publisher are identified.
Manuscripts should be unpublished and not be under consideration for publication elsewhere. By submitting an article, the author(s) certify that the article is their original work, that they have the right to submit the article for publication, and that they can grant the above license.