- Angelegt von Taeke Gjaltema, zuletzt geändert am 05 Jan, 2017
The HLG and representatives of expert groups review the progress so far on the 2016 Data Integration project, and decide to approve the project in 2017. Project updates and outputs will be made available on these pages. | |
The 2016 Data Integration project was commissioned to gain experience in data integration by pooling resources in joint practical activities and to translate experiences into general recommendations for data integration and provide initial guidance for a quality framework. It was recognised that it would not be possible to cover all types of data integration in a single year. After a slow beginning, the 2016 project gained momentum with increasing involvement from member countries over the year. A project proposal was therefore submitted to continue and refocus the project for a second year. For 2017 the project proposes to: | |
| |
2016 Project Proposal
| UNECE High-level Group for the Modernisation of Official Statistics | ![]() | |||
Project Proposal: Develop a practical online Guide to Data Integration for Official Statistics and undertake specific experiments on priority areas | ||||
| This project proposal was prepared by the Data Integration Project and is submitted to the HLG-MOS for approval. | ||||
1 Purpose | ||||
Data integration provides the potential to produce more timely, more disaggregated statistics at higher frequencies than traditional approaches alone. In 2015, HLG MOS recognised that official statistics organisations were challenged by the capacities needed to incorporate new data sources in their statistical production processes. The 2016 Data Integration project was commissioned to
It was recognised that it would not be possible to cover all types of data integration in a single year. After a slow beginning, the 2016 project gained momentum with increasing involvement from member countries over the year. The project:
For 2016, the project was structured into 7 work packages: WP0: Data sets for common approaches WP1: Integrating survey and administrative sources WP2: New data sources (such as big data) and traditional sources WP3: Integrating geospatial and statistical information WP4: Micro-macro integration (inactive in 2016) WP5: Validating official statistics WPA: Synthesize lessons learnt from new working methods. 11 experiments were designed, based on priority work occurring in participating countries. The project has found significant common interest in progressing the experiments further and in developing a practical guide for future projects. For 2017, the project proposes to:
| ||||
2 Project description | ||||
WPA: Develop an online, adaptive, practical guide to Data Integration for Official Statistics (WPA) Using the lessons learnt from the project experiments to date and from other initiatives, there is sufficient material to develop a practical guide for undertaking data integration projects. The main challenge is to organise this knowledge into simple, easy to consume and adaptable guidance to support successful projects and common approaches. Initial guidance for a framework has been created with these aspects identified as important for achieving success in data integration projects:
The guide should:
WP0-WP5 Further work on joint experiments in priority interest areas Participants in the project in 2016 are enthusiastic about further developing common approaches in particular areas of statistics to accelerate the use of multiple data sources and provide material for the guide. The following proposals have been developed within the project and by others interested in participating in 2017. Additional proposals (and participants) may. Priority should be given to those proposals which have clear deliverables, provide practical experience and provide practical material for the guide. Proposed Activity: Align approaches for applying new data sources to integrated price measurement (WP0 and WP2) Many countries have similar challenges improving the quality of the CPI in terms of coverage and real-time quantity. For example, ABS is using scanner data in production and planning to go to use of full-coverage soon; CBS Netherlands is actively researching use of online data in production and has been using scanner data in production; Statistics Canada is considering using Price Stats data in production, including use of an API; and Statistics NZ is using online/scanner data in production. Several data providers operate in many countries and there is an opportunity to develop a common approach that can be used in multiple countries. For 2017 the proposed activities are to:
Proposed Activity: Create synthetic datasets for sandbox experiments (WP0) The creation and documentation of a set of synthetic datasets for the sandbox will allow countries to collaborate on developing common methods, removing issues of confidentiality and encouraging use of the same data formats. This activity will incorporate the use of outputs from the Big Data Project already lodged in the Sandbox. It may also provide a practical base/examples of the proposed CSPA Data Architecture. Proposed Activity: Develop practical guidance for integrating survey, administrative data and big data (including case studies) (WP1 and WP2) During 2016, participants in these work packages investigated current work in this area (within their own organisations and in other groups (eg ESSNET and the HLG Big Data project), with specific interest focussed on job vacancies statistics, employment registers and labour force, and geographic location of schools. Lessons learnt, methods used, challenges and results were gathered and the group is preparing an overview of work and developing initial draft guidelines. For 2017 the proposed activities are:
Proposed Activity: Develop practical guidance on integrating geospatial and statistical information (WP3) The geospatial and statistical data integration landscape is very complex, with many players globally. The Global Statistical Geospatial Framework (GSGF - UN GGIM), the Statistical Spatial Framework (SSF) and initiatives such as GEOSTAT2 (Eurostat) are vital for developing a consistent and systematic approach to linking geospatial and statistical data. This is likely to take some time and a considered and organised approach is being pursued. However implementation of “The 10 level model” into the lower layers of GSGF will support practical activities of integrating statistics with geospatial data. The following activities are proposed:
Because of the complexity of this area, it would be useful to conduct a face to face or virtual sprint of key players (including Australia, Poland, Colombia, Mexico, Finland, Sweden and others to be determined) possibly back to back, or incorporated into, a proposed workshop on geospatial and statistical standards (Sweden) Proposed Activity: Develop practical guidance on using additional sources to validate official statistics (WP5) Work in 2016 focused on identifying different applications and methods for validating official statistics. Issues identified with use of administrative data to validate official statistics and lessons learnt from experiments as well as other validation projects carried out within organisation of contributing members are documented to provide initial guidelines in the use of administrative data to validate official statistics as well as recommend approaches and modelling techniques to resolve issues identified. We propose, for 2017, to work towards:
Proposed activity: Urban statistics as an example of “quick response” data integration capability The HLG workshop discussed the opportunity to incorporate the experience of CBS Netherlands in developing a quick response capability for Urban statistics Proposed activity: Spines of Integration The HLG workshop discussed the opportunity to incorporate experience of NSOs with register based or other database based ‘spines of integration’ in support of the HLG priority to “provide whole of government(s) data ecosystems based on international standards for better estimates in key policy areas” | ||||
3 Alternatives considered | ||||
| ||||
4 Expected Benefits | ||||
☒ | Reduced costs | |||
☒ | Increased efficiency | |||
☒ | Reduced risks | |||
☒ | New capabilities to meet user needs | |||
Justification:
| ||||
5 Which key priorities in the HLG-MOS Strategic Framework does the proposed project relate to? | ||||
☒ | Take cost out of our organisations to reinvest in more value added areas | |||
☒ | Explore new areas collectively and leverage each other’s' research investments in specific areas | |||
☒ | Provide whole of government data ecosystems based on international standards, for better estimates in key policy areas | |||
☒ | Renew our governance and operating processes | |||
Justification:
| ||||
6 How does the proposed project relate to other activities under the HLG-MOS? | ||||
This proposal offers an opportunity to guide organisations undertaking data integration projects towards concrete, real world use of the ModernStats standards. As the project is based on practical activities, it provides an opportunity to demonstrate the use of the standards in real world activities and to feedback suggestions for future versions of the ModernStats standards. | ||||
7 Proposed timetable | ||||
| Start: January 2017 End: December 2017 | |||
8 Expected resources and costs | ||||
WPA Develop and promote an online practical Guide to Data Integration for Official Statistics Volunteers from NSOs to work together on common areas, to contribute case studies and content and to assess and critique the usefulness of the guide. Volunteers from modernisation groups (Blue Skies thinking, Supporting Standards, Processes and Skills, Sharing Tools) to contribute to the work packages and the guide in their areas of expertise 6 person months for Project Manager/Editor for Guide Costs associated with 12-16 participants to attend Integrated Price Measurement and Integrating Geospatial and Statistical Data Sprints (if possible back to back with other meetings) Costs associated with 10-12 participants to attend Face to Face project sprint in 2017. | ||||
Fehler beim Ausführen des Makros 'viewdoc'
com.atlassian.confluence.macro.MacroExecutionException: com.atlassian.confluence.macro.MacroExecutionException: The viewfile macro is unable to locate the attachment "HLG Project Proposal 2017 - Data Integration - Final.docx" on this page
- Keine Stichwörter

