Data Matching for the Maintenance of the Business Register of Statistics Austria
DOI: https://doi.org/10.17713/ajs.v33i1&2.430
Abstract
The Business Register of Statistics Austria is the basic instrument for all surveys conducted in economic statistics. It is maintained mainly from four different administrative sources. Unfortunately, the units of the different registers do not agree exactly, and the business register and the administrative registers share no unique numerical key: each register uses its own. The units of an administrative register that belong to a certain unit of the business register therefore have to be found by comparing alphanumerical items such as name and address. For that purpose we use the method of N-grams after some parsing and standardising of the texts. With that method, more than 90% of the profit-oriented units of the business register could be linked with a corresponding unit of the tax register (these linked units account for 99% of total turnover). 80% of the links were found fully automatically; the rest were checked manually.
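To make the idea of N-gram comparison concrete, the following is a minimal sketch in Python, not the software used at Statistics Austria. It assumes character trigrams, a simple standardisation step (case folding, removal of accents and punctuation) and the Dice coefficient as the similarity measure; the function names and the two thresholds are hypothetical and chosen only for illustration.

```python
import re
import unicodedata


def standardise(text: str) -> str:
    """Case-fold, strip accents and punctuation, collapse whitespace."""
    text = unicodedata.normalize("NFKD", text.casefold())
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^a-z0-9 ]+", " ", text)
    return " ".join(text.split())


def ngrams(text: str, n: int = 3) -> set[str]:
    """Return the set of character n-grams of text, padded with blanks."""
    if not text:
        return set()
    padded = " " * (n - 1) + text + " " * (n - 1)
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def dice_similarity(a: str, b: str, n: int = 3) -> float:
    """Dice coefficient of the n-gram sets of two standardised strings."""
    grams_a = ngrams(standardise(a), n)
    grams_b = ngrams(standardise(b), n)
    if not grams_a or not grams_b:
        return 0.0
    return 2 * len(grams_a & grams_b) / (len(grams_a) + len(grams_b))


def classify(score: float, accept: float = 0.9, review: float = 0.6) -> str:
    """Hypothetical decision rule: the thresholds are illustrative assumptions."""
    if score >= accept:
        return "automatic link"
    if score >= review:
        return "manual check"
    return "no link"


if __name__ == "__main__":
    score = dice_similarity("Müller GmbH", "MULLER GMBH")
    print(score, classify(score))
```

A decision rule with two thresholds mirrors the split described above between links found automatically and links checked manually, but the actual criteria used for the business register are not given here.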
References
Council regulation (EEC) No 2186/93 of 22 July 1993 on Community coordination in drawing up business registers for statistical purposes, 1993.
Bundesstatistikgesetz 2000. Federal Statistics Act of 2000, BGBL I Nr.163/1999, idF BGBL I Nr.136/2001, Vienna, 2001.
A. Haslinger. Automatic Coding and Text Processing using N-grams. In Conference of European Statisticians. Statistical Standards and Studies – No. 48. Statistical Data Editing, Volume No. 2, Methods and Techniques, pages 199-209. UNO, New York and Geneva, 1997.
R. Müllauer. TST (MVS) 2.5X. Unpublished documentation about a collection of loadmodules and program-skeletons for the purpose of matching two text fields used in Statistics Austria, Vienna, 2003.
Office for National Statistics. Methods for Automatic Record Matching and Linkage and their Use in National Statistics, National Statistics Methodological Series No. 25, London, 2001.
License
The Austrian Journal of Statistics publishes open access articles under the terms of the Creative Commons Attribution (CC BY) License.
The Creative Commons Attribution License (CC BY) allows users to copy, distribute and transmit an article, adapt the article and make commercial use of the article. The CC BY license permits commercial and non-commercial re-use of an open access article, as long as the author is properly attributed.
Copyright on any research article published by the Austrian Journal of Statistics is retained by the author(s). Authors grant the Austrian Journal of Statistics a license to publish the article and identify itself as the original publisher. Authors also grant any third party the right to use the article freely as long as its original authors, citation details and publisher are identified.
Manuscripts should be unpublished and not be under consideration for publication elsewhere. By submitting an article, the author(s) certify that the article is their original work, that they have the right to submit the article for publication, and that they can grant the above license.