Seitenhierarchie
Zum Ende der Metadaten springen
Zum Anfang der Metadaten

Synthetic Data Guide

Progress

Membership

The membership of the synthetic data project is now at 38.

Scope

The goal of the Synthetic Data Project is to develop a hands-on guide for creating and using synthetic data primarily geared towards data protection and disclosure control. The target audience of this guide includes both managers and methodologist at NSOs to provide a starting point for using and creating synthetic data. 

After the HLG-MOS Executive Board recommendations, the guide will emphasize creating synthetic data from non-survey data sources. In addition, the use cases will highlight the circumstances in which to synthetic data is the appropriate solution to their output privacy problems. 

Work Packages

Work package 2 - methodology - has been under with presentations of new method from Statistics New Zealand and Australian Bureau of Statistics. 

Patnerships

The HLG-MOS Executive Board recommended partnerships with the ML 2021 group and the Input Privacy Preserving Techniques project. I met with both projects and synthetic data can be a suitable option for testing data. 

Next Steps

Work Packages

Work Package 1 - Use Cases: the BSTN Synthetic Data Working Group in 2020 had created a foundation of use cases for synthetic data. After the EB's recommendations, the use case categories are being reassessed. The use case work package has two main outputs:

  1. A description of different use cases and highlighting their analytical and disclosure risk needs
  2. Creating a gradient of different use cases based on their analytical and disclosure risk needs

The use cases will form the basis for the methodology and quality measure recommendations. 

The target deadline for work package 1 is end of May 2021. 

Work Package 2 - methodology: The following methodology presentations have been scheduled:

  • March 4, 2021: Joerg Drechsler presenting on method used to synthesize US business data
  • April 1, 2021: Kenza Sallier presenting Statistics Canada’ experience creating public synthetic datasets using the FCS and the Synthpop package and Gillian Raab and Beata Nowok will be presenting on new additions to the Synthpop package. 
  • May 6, 2021: Tentative Christine Task and Rolando Rodriguez


Patnerships

The ML 2021 group will get back to me on their requirements for testing data and we will then determined if synthetic data is the right solution. 

Risks and Issues


IssueMitigation



Input Privacy-preserving Techniques 

Progress

The use cases submitted were analyzed by the project team and provided with advice on how they can be used for other NSOs (generalization). Subsequently, the project members have drawn up 5 generalized use cases. We are now in the midst of discussing and assessing the generalized use cases. The team has received an update from the UN global task team on the status of the new version of the PPT handbook, the work on legal matters and the option to submit the generalized use cases to the UN global task team has been discussed. Finally, we received a presentation from Fabio Ricciato (Eurostat) on: "Towards a reference architecture for Trusted Smart Surveys". This is for inspiration what it takes to make PPT a success. 

Next Steps

In the next meeting, the assessment will be completed, and a joint decision will be made which use cases will be selected for the next phase (reference scenarios can be used for multiple NSOs. This means that WP1 has been concluded and the next WP is scheduled.

Risks and Issues

IssueMitigation



Image result for input privacy-preserving techniques


News from the Groups

Blue-skies Thinking

Identifying Topics/Opportunities


IN PROGRESS

Network Data

IN PROGRESS

Covid-19 Hotspot Joint Biosecurity Centre Platform

IN PROGRESS

User Research for Official Statistics

IN PROGRESS

Rapid survey systems

IN PROGRESS

From experimentation to

implementation in official statistics

IN PROGRESS

Microdata for understanding declining response rates

IN PROGRESS

Other

IN PROGRESS

Capabilities and Communication







Future of work, future workplace

 and future skills

IN PROGRESS

Task Team met on 9th February 2021. We had a brain storm on the scope of work.  It was proposed to divide work into parts: 1) organisational matters: switch to work from home instead of office, technical requirements for work from home (Webex, Zoom, Lync, etc) and 2) what kind of skills are necessary for employees for remote working? It was also suggested, as a starting point, to review presentations from the September 2020 virtual workshop to get some input for the work of the team, and decide which presenters we would like to invite to join this team. Another proposal is to conduct the survey among NSOs to address two main points: 1) what have we learned from the pandemic? 2) what we would like to have in the future? As a following steps it was also suggested that we can have two sprint sessions soon 1) exploratory sprint, to discuss what we want to do as a Team and 2) in-depth sprint to produce some concrete output.

Next call will be on Monday 8 March at 12.00 Geneva time.

Ethical leadership

 as part of culture evolution 

IN PROGRESS

Task Team met on 10th February 2021. The first results of the preliminary poll were presented. We received 36 responses, and 29 organisations work on Ethics Policies, and 24 of them work in this area for more than 3 years. Total of 12 organisation were interested to be involved in the drafting of the survey results.

It was suggested to invite countries that expressed interest to participate in the analysis of the survey results to participate in the next call of this Task Team, so they can contribute to the design of the survey and not just analysis. For the next meeting first proposal for the survey should be prepared and the plan is to complete survey design by April/May and to collect responses by July. First results of the survey should be ready in September.

It was also discussed proposal to have a virtual meeting focusing on the topics we are working on. We can organise Webinar on Ethical Leadership (Ethics Camp), that will present results of the survey.

Next call of this group will be on 10 March at 14.00 Geneva time.

Role of market research,

digital marketing & communication strategies

and tools in managing a crisis communication situation

and in promoting public engagement in surveys

IN PROGRESS

It was agreed to review all presentations of the 2020 different virtual meeting and select information that will be relevant to the work of this team, and to contact authors of presentations to ask them to join this Task Team. It will be also reviewed past activities and documents of UNECE.

It was agreed that next call of this group will be on 3 March at 14.00 (CET).

Strategic Communication Framework Publication

IN PROGRESS

The various documents from the 2018 and 2019 SCF project have been combined (and common parts have been rewritten/edited). UNECE is now working on producing high-quality versions of all figures in the document.

HRMT Workshop 2022

NOT STARTED

Topic 6


placeholder

Other

Supporting Standards

Linking GSBPM and GSIM

IN PROGRESS

The Task Team had a sprint session in January with the aim to harmonise the work done so far and move forward with the description of the remaining sub-processes in a more harmonised manner. The current work is focusing on the harmonisation of sub-processes at the specification level (Phases 1, 2, 4, 5, 6) after which, authors of each sub-process example can reflect the changes at the implementation level. Divison of work for the remaining sub-processes is also currently ongoing.
Core Ontology for Official Statistics

IN PROGRESS

The Task Team started its work in January. Currently a roadmap is being developed for our activities in 2021. Current plans for the Task Team:

    • January: Roadmap for 2021 activities
    • February-April: Articulation with work on GSIM and Metadata Glossary
    • May-June: Work on governance (can start in Feb in parallel with other works)
    • July: Expert review
    • August: Final version 1 of the ontology
    • September-October: Document on governance
    • November: Submission to HLG-MOS for adoption


Updating GSIM

IN PROGRESS

Task Team work is ongoing as planned. Currently issues around the Business Group of GSIM are being discussed.
Application of GSBPM for Geospatial Information

IN PROGRESS

With the last meeting, the Task Team has covered all phases and overarching/corporate-level activities aimed, Currently the integration of comments into the report is ongoing and the draft report is expected to be shared around mid-March for review. After that, there will be a dedicated meeting to finalise the report.
GSBPM Task

NOT STARTED

Task Team is expected to start its work in the second half of the year, after other Task Teams have completed their work.
CSPA

NOT STARTED

Planned to start in 2nd half of 2021
ModernStats World Workshop 2022

NOT STARTED

Other

The SSG updated the previous questionnaire (Survey on the use of ModernStats models (GSBPM, GAMSO, GSIM and CSPA)) on the use of ModernStats models and the collection of information from countries is currently ongoing. Deadline for submission: 10th March.

Ideas. proposals from the SSG members on communicating the ModernStats models better and to increase visibility are currently being collected until mid-March. The exact actions on these topics will be agreed upon after.

Machine Learning 2021




Poster.jpg

  

WS1 – Pilot studies: from Idea to Valid solutions

IN PROGRESS

WS2 – From Valid Solution to Production 

IN PROGRESS

.

WS3 – Data Ethics and Governance

IN PROGRESS

WS4 – On The  Quality of Training Data

IN PROGRESS

WS5 – On The Quality Framework for Statistical Algorithms

IN PROGRESS

Other

ONS-UNECE ML 2021 Group  

Led by the Data Science Campus in partnership with the UNECE HLG-MOS, the ML 2021 Group aims to demonstrate the added value of ML in the modernisation of Official Statistics. We will achieve this by facilitating the development and implementation of ML research projects and capacity building activities that meet the global statistical community needsWith nearly 200 members from 36 countries, the Group is delivering 17 research activities grouped into workstreamspilot studies, integration into production, ethics, quality of training data and quality of statistical algorithms. The ML 2021 outputs for 2021 include:  

  1. Monthly meetings, quarterly workstreams reports and open webinar in December 
  2. Engagement of the ML group under the UN Global Network of Data Officers and Statisticians  
  3. Co-ordination and delivery of skills building activities, such as short courses, masterclasses, hackathons, coffee and coding, virtual/ physical sprint, etc 
  4. Publicity of ML 2021 via blogs and articles on key channels and platforms 

The inaugural ML 2021 meeting was hosted on  29th January 2021 The second meeting will be hosted on 23rd February and will include update from all 5 workstreams leads and three presentations. More information below. 

23 February 2021 Meeting Documents



  • Keine Stichwörter
Report inappropriate content