Seitenhierarchie

Versionen im Vergleich

Schlüssel

  • Diese Zeile wurde hinzugefügt.
  • Diese Zeile wurde entfernt.
  • Formatierung wurde geändert.

Machine Learning (ML) holds a great potential for statistical organisations. It can make the production of statistics more efficient by automating certain processes or assisting humans to carry out the process. It also allows statistical organisations to use new data types of data such as social media data and imagery. Many national statistical offices (NSOs) are investigating how ML can be used in the context of increasing the relevance and quality of official statistics in an environment of growing demands for trusted information, rapidly developing and accessible technologies, and numerous competitors able to occupy more of the statistical information space. While specific business environment may vary depending on country, NSOs face similar type of challenges which can benefit from sharing knowledge and experiences, and collaborating on developing solutions within the broad official statistical community.

To address this need, UNECE High-Level Group for the Modernisation of Official Statistics (HLG-MOS) launched a Machine Learning Project in 2019. The project aims to demonstrate the added value of ML, i.e. whether its enables to production of more relevant, timely, accurate and trusted data in an efficient manner. The project also aimed at increasing NSOs’ capability to use ML by identifying and addressing some common challenges encountered when incorporating ML in organizations and their production processes.

The project started in April 2019 with 23 participants from 13 organisations and has grown to over 120 members from 23 countries, 31 national and 4 international organisations. The members either lead, assist or follow the conduct of numerous studies and other developments. The work of the project is divided into three packages: pilot studies, developments in quality and integration challenges.

The project now is immensely pleased to start sharing its numerous outputs with the official statistics community!!!

The outputs include:

  • Reports and other documents on 19 pilot studies; early developments on the use of ML for data editing; and a generic pipeline for production of official statistics using satellite data and machine learning. - available at WP1 - Pilot Studies

The pilot studies are conducted to assess the added value of ML in three areas: two statistical business processes (coding and classification, edit and imputation); and the use of imagery data. They are conducted on a wide variety of data sources (survey respondents, administrative, web-scraped, published official statistics, twitter, satellite images, aerial images) and contexts (survey, census, registers, proof of concepts, production).

  • Theme reports that analyze summarize and analyze the approaches and results from the pilot studies on each of the three areas - available at WP1 - Theme 1 Coding and Classification ReportWP1 - Theme 2 Edit and Imputation ReportWP1 - Theme 3 Imagery Analysis Report
  • A summary report on the pilot studies (yet to be released)
  • A Quality Framework for Statistical Algorithms (QF4SA) provides guidance on the choice of algorithms for the production process. It purposely uses the terminology statistical algorithm as it covers both traditional and modern methods. It proposes five quality dimensions; accuracy, timeliness, explainability and reproducibility. (yet to be released)
  • (I need to write something) (yet to be released)

These reports are accompanied with by other material to assist users in getting into or pursuing the development of ML in their respective contexts:

(I will add something on the fact that many outputs are shared, a few others will be added and some of the outputs may be updated before the end of the year)

(I thought also about writing something inviting them to either comment of this first release or share their ML experiences/developments with us; what do you think? I don’t want to know their needs, but would be interested in finding out their developments or plans.)

Purpose

  • To introduce ML project - most text copied from the current landing page.
  • To show structure of the HLG-MOS ML project - as pointer to make it easy to navigate child pages 


Image Modified

Report inappropriate content