Seitenhierarchie
Zum Ende der Metadaten springen
Zum Anfang der Metadaten

Synthetic Data Guide

Progress

Membership

The membership of the synthetic data project is now at 38.

Scope

The goal of the Synthetic Data Project is to develop a hands-on guide for creating and using synthetic data primarily geared towards data protection and disclosure control. The target audience of this guide includes both managers and methodologist at NSOs to provide a starting point for using and creating synthetic data. 

After the HLG-MOS Executive Board recommendations, the guide will emphasize creating synthetic data from non-survey data sources. In addition, the use cases will highlight the circumstances in which to synthetic data is the appropriate solution to their output privacy problems. 

Work Packages

Work package 2 - methodology - has been under with presentations of new method from Statistics New Zealand and Australian Bureau of Statistics. 

Patnerships

The HLG-MOS Executive Board recommended partnerships with the ML 2021 group and the Input Privacy Preserving Techniques project. I met with both projects and synthetic data can be a suitable option for testing data. 

Next Steps

Work Packages

Work Package 1 - Use Cases: the BSTN Synthetic Data Working Group in 2020 had created a foundation of use cases for synthetic data. After the EB's recommendations, the use case categories are being reassessed. The use case work package has two main outputs:

  1. A description of different use cases and highlighting their analytical and disclosure risk needs
  2. Creating a gradient of different use cases based on their analytical and disclosure risk needs

The use cases will form the basis for the methodology and quality measure recommendations. 

The target deadline for work package 1 is end of May 2021. 

Work Package 2 - methodology: The following methodology presentations have been scheduled:

  • March 4, 2021: Joerg Drechsler presenting on method used to synthesize US business data
  • April 1, 2021: Kenza Sallier presenting Statistics Canada’ experience creating public synthetic datasets using the FCS and the Synthpop package and Gillian Raab and Beata Nowok will be presenting on new additions to the Synthpop package. 
  • May 6, 2021: Tentative Christine Task and Rolando Rodriguez


Patnerships

The ML 2021 group will get back to me on their requirements for testing data and we will then determined if synthetic data is the right solution. 

Risks and Issues


IssueMitigation



Input Privacy-preserving Techniques 

Progress

Next Steps

Risks and Issues

IssueMitigation



Image result for input privacy-preserving techniques


News from the Groups

Blue-skies Thinking

Identifying Topics/Opportunities


IN PROGRESS

Network Data

IN PROGRESS

Covid-19 Hotspot Joint Biosecurity Centre Platform

IN PROGRESS

User Research for Official Statistics

IN PROGRESS

Rapid survey systems

IN PROGRESS

From experimentation to

implementation in official statistics

IN PROGRESS

Microdata for understanding declining response rates

IN PROGRESS

Other

IN PROGRESS

Capabilities and Communication







Future of work, future workplace

 and future skills

IN PROGRESS

Ethical leadership

 as part of culture evolution 

IN PROGRESS

Role of market research,

digital marketing & communication strategies

and tools in managing a crisis communication situation

and in promoting public engagement in surveys

IN PROGRESS

Strategic Communication Framework Publication

IN PROGRESS

HRMT Workshop 2022

NOT STARTED

Topic 6


placeholder

Other

Supporting Standards

Linking GSBPM and GSIM

IN PROGRESS

The Task Team had a sprint session in January with the aim to harmonise the work done so far and move forward with the description of the remaining sub-processes in a more harmonised manner. The current work is focusing on the harmonisation of sub-processes at the specification level (Phases 1, 2, 4, 5, 6) after which, authors of each sub-process example can reflect the changes at the implementation level. Divison of work for the remaining sub-processes is also currently ongoing.
Core Ontology for Official Statistics

IN PROGRESS

The Task Team started its work in January. Currently a roadmap is being developed for our activities in 2021. Current plans for the Task Team:

    • January: Roadmap for 2021 activities
    • February-April: Articulation with work on GSIM and Metadata Glossary
    • May-June: Work on governance (can start in Feb in parallel with other works)
    • July: Expert review
    • August: Final version 1 of the ontology
    • September-October: Document on governance
    • November: Submission to HLG-MOS for adoption


Updating GSIM

IN PROGRESS

Task Team work is ongoing as planned. Currently issues around the Business Group of GSIM are being discussed.
Application of GSBPM for Geospatial Information

IN PROGRESS

With the last meeting, the Task Team has covered all phases and overarching/corporate-level activities aimed, Currently the integration of comments into the report is ongoing and the draft report is expected to be shared around mid-March for review. After that, there will be a dedicated meeting to finalise the report.
GSBPM Task

IN PROGRESS

Task Team is expected to start its work in the second half of the year, after other Task Teams have completed their work.
CSPA

NOT STARTED

ModernStats World Workshop 2022

NOT STARTED

Other

The SSG updated the previous questionnaire (Survey on the use of ModernStats models (GSBPM, GAMSO, GSIM and CSPA)) on the use of ModernStats models and the collection of information from countries is currently ongoing. Deadline for submission: 10th March.

Ideas. proposals from the SSG members on communicating the ModernStats models better and to increase visibility are currently being collected until mid-March. The exact actions on these topics will be agreed upon after.

Machine Learning 2021




Poster.jpg

  

WS1 – Pilot studies: from Idea to Valid solutions

IN PROGRESS

WS2 – From Valid Solution to Production 

IN PROGRESS

.

WS3 – Data Ethics and Governance

IN PROGRESS

WS4 – On The  Quality of Training Data

IN PROGRESS

WS5 – On The Quality Framework for Statistical Algorithms

IN PROGRESS

Other
.



  • Keine Stichwörter
Report inappropriate content