Page tree

2022 Modernisation Projects

(For project updates, please see Modernisation updates)

Data Governance for Interoperability Framework 2022

CONTEXT

The main goal of the Statistical Data Governance Framework project is to produce a document describing a reference framework containing the main elements needed to implement a governance program focused on achieving data interoperability. This framework will provide the ability to create, exchange, and use data while preserving its meaning and context independently from a given system or a set of systems.

PROJECT OBJECTIVES

The objective is to increase the value of the statistical information by establishing connections between the data from different domains. The project will aim at reducing costs by creating a way to effectively reuse information and tools as well as improve the information products and services adding the capacity to create a new generation platform of systems and tools that will enhance the analysis and dissemination of statistics. In this way, the framework can meet the emerging and more complex needs of our users while at the same time improving data and metadata quality by making it more transparent, manageable and comparable.

  • WP1: Establishing a data governance body
  • WP2: Structuring and using the existing models and standards
  • WP3: Identifying core aspects to be covered during GSBPM phases and sub-processes
  • WP4: Guide to implement transversal platforms for data interoperability and concept-driven integrated information systems

Participation is open to staff from statistical organisations and others interested in Official Statistics. Please contact the UNECE secretariat if you wish to participate in the project.



Input Privacy-preserving Techniques 2022

CONTEXT

The 2021 project on input privacy-preserving techniques (IPPT), proved that such techniques can play an important role in making external data sources accessible when there are confidentiality concerns. This allows for analysing or integrating external data sources and producing statistics without revealing the microdata to the external partner. It was concluded that a continued collaboration was needed to further develop the performed experiments and to better understand the environment that is required for IPPT as well as to get a better understanding of the methodological challenges.

PROJECT OBJECTIVES

The objective is to expand and continue the existing collaboration between the involved participating organizations and to further explore and broaden the applicability of input privacy preservation techniques. This will allow NSOs to become part of or leading in data ecosystems by allowing the use of private data between NSOs and, more generally, between organizations.

  • WP1. Deepening practical experiments
  • WP2. Document use cases and provide guidelines for implementation
  • WP3. Create user community

Participation is open to staff from statistical organisations and others interested in Official Statistics. Please contact the UNECE secretariat if you wish to participate in the project.

Meta-Academy for the Modernization of Official Statistics 2022

CONTEXT

Moving from innovation to implementation keeps being a major challenge. The purpose of the Meta-Academy for the Modernisation of Official Statistics is to remove barriers to co-creation of training and reuse of content at an international level, which will ultimately unleash the creation and use, at scale, of open digital assets to boost the National Statistical Office (NSO) upskilling necessary for modernization.

PROJECT OBJECTIVES

This project intends to raise the standards of virtual learning on topics necessary for the modernization of statistics but are missing or inconsistent from academic, commercial or in-house offerings. The meta-academy project sets out to create a benchmark to better map existing initiatives and offerings in order to better coordinate efforts, reduce duplication and fill in training gaps. This project will facilitate sharing of skills strategies, as well as catalogues of contents and pedagogical artefacts, and more generally good practices and standards in that space, so that scopes for reuse or co-creation in learning capabilities can be more easily and more systematically spotted and leveraged by all NSOs.

  • WP1: Benchmarking
  • WP2: Co-create capacity building content
  • WP3: Finalizing the framework for virtual learning, co-creating and reusing content

Participation is open to staff from statistical organisations and others interested in Official Statistics. Please contact the UNECE secretariat if you wish to participate in the project.

Previous Modernisation Projects

Input Privacy-preserving Techniques 2021
Input Privacy-preserving Techniques 2021

CONTEXT

Statistical organizations are more and more investing on becoming part of a data ecosystem where they acquire and integrate data from multiple sources and provide richer statistical products.In this scenario, the issue of privacy preservation is particularly relevant: the more sources are acquired and integrated, the higher are the risks of disclosing information violating individual privacy rights. Hence, from a legislative perspective there are indications to take privacy into account throughout the whole data treatment process, through the ‘privacy by design’ concept. National Statistical Organizations (NSOs) are used to apply techniques for enforcing privacy by design on the output side, however, NSOs have still to invest on dealing with privacy protection on the input side, in a complementary but distinct way with respect to output privacy preservation investments.

PROJECT OBJECTIVES

The first objective of the project is to scope the goals and work packages and to prevent duplication by identifying the state-of-the art and current activities in the area (WP0.) Initially, the project proposal was divided into four work packages. The approach is iterative and modular in a way that more mature techniques can be tested with PoCs to speed up their adoption and additional techniques could be added as new work packages and strengthen each other if we do them jointly.

  • WP1. Documenting statistical use-cases relevant forapplication of privacy-preserving techniques
  • WP2. Secure Multiparty Computation (SMC) methods
  • WP3. Homomorphic Encryption (HE) methods
  • WP4. Identify opportunities for operationalization of methods and sharing of solutions

During the initial stage (WP0), these might be further scoped.

Input Privacy-Preserving Techniques 2020
Input Privacy-preserving Techniques 2020

Due to staffing shortages at UNECE and the Covid19 pandemic, this project was on hold until 1 August. 

CONTEXT

Statistical organizations are more and more investing on becoming part of a data ecosystem where they acquire and integrate data from multiple sources and provide richer statistical products.In this scenario, the issue of privacy preservation is particularly relevant: the more sources are acquired and integrated, the higher are the risks of disclosing information violating individual privacy rights. Hence, from a legislative perspective there are indications to take privacy into account throughout the whole data treatment process, through the ‘privacy by design’ concept. National Statistical Organizations (NSOs) are used to apply techniques for enforcing privacy by design on the output side, however, NSOs have still to invest on dealing with privacy protection on the input side, in a complementary but distinct way with respect to output privacy preservation investments.

PROJECT OBJECTIVES

The first objective of the project is to scope the goals and work packages and to prevent duplication by identifying the state-of-the art and current activities in the area (WP0.) Initially, the project proposal was divided into four work packages. The approach is iterative and modular in a way that more mature techniques can be tested with PoCs to speed up their adoption and additional techniques could be added as new work packages and strengthen each other if we do them jointly.

  • WP1. Documenting statistical use-cases relevant forapplication of privacy-preserving techniques
  • WP2. Secure Multiparty Computation (SMC) methods
  • WP3. Homomorphic Encryption (HE) methods
  • WP4. Identify opportunities for operationalization of methods and sharing of solutions
Strategic Communication Project 2019 (Phase II)
STRATEGIC COMMUNICATION PROJECT Phase 2

CONTEXT

Within the context of today’s ever-changing data environment, many statistical organizations are in the process of developing or reviewing their strategic objectives and their business models – leading to the articulation or a review of their mission and/or vision statements.   More and more statistical organizations are involved in government-wide data strategy formulation.  For statistical organizations to become strategic partners in the development of a national data strategy and for the successful development of a solid business model or the transition to a new business model, the vision must resonate with staff at all levels.  For mission and vision statements to resonate with employees, staff need to be engaged.


PROJECT OBJECTIVES

The objective of the Strategic Communication Framework Project is to guide statistical offices in the development of a strategic approach to protect, enhance and promote the organization’s reputation and brand. Phase 2 of the Project will build on the experience and momentum gained in Phase 1 and will focus on developing a strategic approach to internal communications and stakeholder management/analysis in support of two priority topics for 2019 identified by HLG-MOS - Communicating our value and Setting the vision.  It will also explore the experience of national statistical organizations in the development of government-wide data strategies in support of a third HLG priority – National Data Strategies.

The project will focus on:

  • Developing organizational vision and strategic staff engagement strategies
  • Developing effective stakeholder engagement management strategies
  • Statistical organizations engagement in Government-wide data strategies
Strategic Communication Project 2018

STRATEGIC COMMUNICATION PROJECT OVERVIEW 

CONTEXT

Official statistics are operating in a competitive and challenging environment – one that has changed significantly over the last twenty years.  For traditional users of official statistics their values and importance is undisputed.  Yet for the average citizen the digital and social media revolutions have meant that more and more people have instantaneous access to various data sources, outside official statistics.  The 24/7 news cycle is reality, trust in government is decreasing and the fake news phenomenon is growing.

Now more than ever, timely and relevant data and stories produced by statistical organizations are essential to healthy democratic societies as they remain the only independent, impartial, trusted and reliable source of official statistics.  For official statistics to be beneficial to society, policy debate, and decision-making they must be known, understood, communicated and used.


PROJECT OBJECTIVES

The objectives for the project are to provide statistical offices with:

  • support in the development a strategic approach to communication and increase their capacity to review and renew their communication approach, methods and processes;
  • with tools to increase their visibility, relevance and brand recognition; and
  • tools to take a proactive approach to managing issues and reputation.

The outputs of the project will focus on enabling statistical offices to modernize their communications at the strategic level and help organizations look at communications strategies in a broader risk management and business continuity context. They include: 

  •  Defining skillsets of a professional communication programme and organizational options for the strategic communication function within the statistical organization;
  •  Developing a Communication Maturity Model, including metrics and a description of how to use the model and examples of how the model can be used;
  •  Developing guidelines to create a communications strategy and its implementation plan (including examples);
  •  Developing the branding options that are most relevant for statistical organizations; and
  • Establishing an issues management process including guidance and tools to support statistical organizations in times of issues or crisis management.  

TO BE ADDED:



Synthetic Data Guide 2021
Synthetic Data Guide 2021

CONTEXT

Data has become a valuable commodity, providing information for statisticians, economists, and data scientists to generate more timely and granular insights. National statistical offices (NSOs) are striving to provide greater transparency and openness and so are looking to expand safely sharing of data, expertise and best practices both internally as well as with external partners. In addition, different types of users are increasingly searching for quality data sets to support testing, evaluation, education and development purposes. These aspects provide more value to users and bring the need to uphold data integrity and confidentiality to the forefront. 

The demands for timely, integrated data compiled from ever-growing sources of increased complexity, along with the unequivocal commitment to trusted data protection call for a modernized, interoperable approach to mobilizing these large and complex data sources. Synthetic data can be a solution to providing rich data while respecting integrity and confidentiality imperatives.

PROJECT OBJECTIVES

The 'practical guide to Synthetic Data’ project sets out to develop a hands-on guide for creating and using synthetic data primarily geared towards data protection and disclosure control. The target audience of this guide includes NSOs as well as their clients such as academia, the private sector and the general public. The guide will focus on how to use synthetic data in practical applications, considerations for implementation, and important aspects to share with users. This guide can serve as the foundation for future standards as synthetic data is more broadly adopted within NSOs and by their users.

The project is divided into four work packages, with the scoping work already completed through the Working Group on Synthetic Data.

Objectives

  • WP1: Use cases for synthetic data
    WP2: Recommended methods for creating synthetic data
    WP3: Utility and Disclosure Risk Measures
    WP4: Experimenting with the recommendations
Machine Learning Project 2020
MACHINE LEARNING PROJECT 2020

CONTEXT

The interest in the use of Machine Learning (ML) for official statistics is rapidly growing. For the processing of some secondary data sources (including administrative sources, big data and Internet of Things) it seems essential to look into opportunities offered by modern ML techniques, while also for primary data ML techniques might offer added value, as illustrated in the ML position paper mentioned above. Although ML seems promising there is only limited experience with concrete applications in the UNECE statistical community, and some issues relating to e.g. quality and transparency of results obtained from ML still have to be solved. The second year of the Machine Learning Project


PROJECT OBJECTIVES

Based on mutual interest and building on existing national developments, the objective of the project is to advance the research, development and application of machine learning techniques (ML) to add value (relevance, timeliness, quality, efficiency) to the production of official statistics. To achieve this objective the Machine Learning (ML) will aim in year two, to:

  • Report on the various Pilot Studies to demonstrate the value-added of ML.
  • Identify and share best practices in the implementation of ML techniques.
  • Share knowledge, tools and best practices on implementing the ML techniques, and how National Statistical Organisations (NSOs) are organized to move them quickly to the production processes.

  • Propose a quality framework components for evaluating ML processes and statistics produced using them, as well as to bridge the gap between these components and those in existing frameworks.

Machine Learning Project 2019
MACHINE LEARNING PROJECT

CONTEXT

The interest in the use of Machine Learning (ML) for official statistics is rapidly growing. For the processing of some secondary data sources (including administrative sources, big data and Internet of Things) it seems essential to look into opportunities offered by modern ML techniques, while also for primary data ML techniques might offer added value, as illustrated in the ML position paper mentioned above. Although ML seems promising there is only limited experience with concrete applications in the UNECE statistical community, and some issues relating to e.g. quality and transparency of results obtained from ML still have to be solved.


PROJECT OBJECTIVES

Based on mutual interest and building on existing national developments, the objective of the project is to advance the research, development and application of machine learning techniques to add value to the production of official statistics. To achieve this objective the Machine Learning (ML) will aim to:

  • Investigate and demonstrate the value added of ML in the production of official statistics, where "value added" is increase in relevance, better overall quality or reduction in costs.

  • Advance the capability of ML to add value to the production of official statistics.

  • Advance the capability of national statistical organisations to use ML in the production of official statistics.

  • Enhance collaboration between statistical organisations in the development and application of ML.

The objectives will be attained by:

  • Conducting pilot studies in ML solutions in: (a) common statistical processes (classification and coding; edit and imputation); and (b) the use of alternate data sources (imagery or big data; sentiment and web).

  • Researching and experimenting approaches to inform users on the quality of ML solutions, notably on accuracy

  • Identifying best practices in the development and application of ML solutions, including organisational aspects

  • Conducting the activities in groups of national organisations

Data Architecture Project II 2018

DATA ARCHITECTURE PHASE 2 PROJECT OVERVIEW 

CONTEXT

Statistical organisations deal with many different data sources – each with their own set of characteristics. Statistical organisations need to find, acquire and integrate data from both traditional and new types of data sources in an ever increasing pace and under ever stricter budget constraints, while taking care of security and data ownership.

The 2017 HLG-MOS Data Architecture project developed the first version of the Common Statistical Data Architecture (CSDA). This Reference Architecture is a template for NSOs in the development of their own Enterprise Data Architectures. 

The project will focus on providing a more robust version of the Common Statistical Data Architecture as a result of validation against a number of use-cases and integration with the outcomes from other related groups. It will also provide guidance on implementing the architecture.


PROJECT OBJECTIVES

The objectives of this project are:

  •  To complete the development of the Common Statistical Data Architecture, testing the reference architecture defined in 2017 against other use-cases
  • To apply and validate the Data Architecture against the outcomes from other groups like UN-GWG, Data Integration project and groups working on statistical ontologies.
  • To provide guidelines to support statistical organisations in using the Common Statistical Data Architecture.

ALL OUTPUT PRODUCED BY THIS PROJECT IS AVAILABLE FROM HERE: https://statswiki.unece.org/x/4BazBw

TO BE ADDED:

Implementing ModernStats Standards 2016

Linked Open Metadata

Implementing ModernStats Standards

To be added




  • No labels
Report inappropriate content