Seitenhierarchie
Zum Ende der Metadaten springen
Zum Anfang der Metadaten

Synthetic Data Guide

Progress

Membership

The membership of the synthetic data project is now at 38.

Scope

The goal of the Synthetic Data Project is to develop a hands-on guide for creating and using synthetic data primarily geared towards data protection and disclosure control. The target audience of this guide includes both managers and methodologist at NSOs to provide a starting point for using and creating synthetic data. 

After the HLG-MOS Executive Board recommendations, the guide will emphasize creating synthetic data from non-survey data sources. In addition, the use cases will highlight the circumstances in which to synthetic data is the appropriate solution to their output privacy problems. 

Work Packages

Work package 2 - methodology - has been under with presentations of new method from Statistics New Zealand and Australian Bureau of Statistics. 

Patnerships

The HLG-MOS Executive Board recommended partnerships with the ML 2021 group and the Input Privacy Preserving Techniques project. I met with both projects and synthetic data can be a suitable option for testing data. 

Next Steps

Work Packages

Work Package 1 - Use Cases: the BSTN Synthetic Data Working Group in 2020 had created a foundation of use cases for synthetic data. After the EB's recommendations, the use case categories are being reassessed. The use case work package has two main outputs:

  1. A description of different use cases and highlighting their analytical and disclosure risk needs
  2. Creating a gradient of different use cases based on their analytical and disclosure risk needs

The use cases will form the basis for the methodology and quality measure recommendations. 

The target deadline for work package 1 is end of May 2021. 

Work Package 2 - methodology: The following methodology presentations have been scheduled:

  • March 4, 2021: Joerg Drechsler presenting on method used to synthesize US business data
  • April 1, 2021: Kenza Sallier presenting Statistics Canada’ experience creating public synthetic datasets using the FCS and the Synthpop package and Gillian Raab and Beata Nowok will be presenting on new additions to the Synthpop package. 
  • May 6, 2021: Tentative Christine Task and Rolando Rodriguez


Patnerships

The ML 2021 group will get back to me on their requirements for testing data and we will then determined if synthetic data is the right solution. 

Risks and Issues


IssueMitigation



Input Privacy-preserving Techniques 

Progress

Next Steps

Risks and Issues

IssueMitigation



Image result for input privacy-preserving techniques


News from the Groups

Blue-skies Thinking

Identifying Topics/Opportunities


IN PROGRESS

Network Data

IN PROGRESS

Covid-19 Hotspot Joint Biosecurity Centre Platform

IN PROGRESS

User Research for Official Statistics

IN PROGRESS

Rapid survey systems

IN PROGRESS

From experimentation to

implementation in official statistics

IN PROGRESS

Microdata for understanding declining response rates

IN PROGRESS

Other

IN PROGRESS

Capabilities and Communication







Future of work, future workplace

 and future skills

IN PROGRESS

Ethical leadership

 as part of culture evolution 

IN PROGRESS

Role of market research,

digital marketing & communication strategies

and tools in managing a crisis communication situation

and in promoting public engagement in surveys

IN PROGRESS

Strategic Communication Framework Publication

IN PROGRESS

HRMT Workshop 2022

NOT STARTED

Topic 6


placeholder

Other

Supporting Standards

Linking GSBPM and GSIM

IN PROGRESS

The Task Team had a sprint session in January with the aim to harmonise the work done so far and move forward with the descrption of the remaining sub-processes in a more harmonised manner. The current work is focusing on the harmonisation of sub-processes at the specification level (Phases 1, 2, 4, 5, 6) after which, authors of each sub-process example can reflect the changes at the implementation level. Divison of work for the remaining sub-processes is also currently ongoing.


Core Ontology for Official Statistics

IN PROGRESS

Updating GSIM

IN PROGRESS

Application of GSBPM for Geospatial Information

IN PROGRESS

GSBPM Task

IN PROGRESS

CSPA

NOT STARTED

ModernStats World Workshop 2022

NOT STARTED

Other

Machine Learning 2021




Poster.jpg

  

WS1 – Pilot studies: from Idea to Valid solutions

IN PROGRESS

WS2 – From Valid Solution to Production 

IN PROGRESS

.

WS3 – Data Ethics and Governance

IN PROGRESS

WS4 – On The  Quality of Training Data

IN PROGRESS

WS5 – On The Quality Framework for Statistical Algorithms

IN PROGRESS

Other
.



  • Keine Stichwörter
Report inappropriate content