Machine Learning (ML) holds great potential for statistical organisations. It can make the production of statistics more efficient by automating certain processes or by assisting the people who carry them out. It also allows statistical organisations to use new types of data such as social media data and imagery.

Many national statistical offices (NSOs) are investigating how ML can be used to increase the relevance and quality of official statistics in an environment of growing demands for trusted information, rapidly developing and accessible technologies, and numerous competitors. While the specific business environment may vary from country to country, NSOs face similar types of challenges and can benefit from sharing knowledge and experiences and from collaborating on common solutions within the broad official statistical community.

To address this need, the UNECE High-Level Group for the Modernisation of Official Statistics (HLG-MOS) launched a Machine Learning Project in 2019. The project aimed to demonstrate the added value of ML, i.e. whether it enables the production of more relevant, timely, accurate and trusted data in an efficient manner. The project also aimed to increase the capability of NSOs to use ML by identifying and addressing some common challenges encountered when incorporating ML in organisations and their production processes.

The project started in April 2019 with 23 participants from 13 organisations and has grown to over 120 members from 23 countries, 31 national and 4 international organisations. The members either lead, assist or follow numerous studies and other developments. The work of the project is divided into three work packages:

  • Work Package (WP) 1. Pilot studies
  • Work Package (WP) 2. Quality
  • Work Package (WP) 3. Integration challenges

The project is pleased to share its outputs with the official statistics community:

Machine Learning Project Report: a summary of the project and recommendations on how to advance the use of ML in statistical organisations, based on lessons learned and concrete experiences from the three work packages (WPs).

WP1 Output

WP2 Output: A Quality Framework for Statistical Algorithms (QF4SA) provides guidance on the choice of algorithms for the production process. It purposely uses the term "statistical algorithm" because this covers both traditional and modern methods. It proposes five quality dimensions: accuracy, timeliness, cost-effectiveness, explainability and reproducibility - available at WP2 - Quality

WP3 Output: The identification of challenges in moving machine learning solutions from a proof of concept to production, as well as a review of some current practices to address some of the challenges - available at WP3 - Integration

These reports are accompanied by other material to assist users in starting or pursuing the development of ML in their respective contexts:

Overall structure of UNECE HLG-MOS Machine Learning Project 

The Machine Learning Community continues on its journey!

