As it progressed, the Machine Learning project was informed about other developments of ML to produce official statistics. In particular, during a series of virtual sessions held in October 2020, several speakers were invited to provide an introduction on ML developments conducted in their statistical organisations, It is important to note that they were not carried out within the ML project. The presentations are shared to further highlight the interest in advancing the use of ML.
| Main statistical process | Development | Data source |
|---|---|---|
| Text classification | Belgium Flanders - A better statistic on innovative companies in Flanders using web scraping and machine learning | Web scraped |
| OECD - SDG Financing Lab (to be shared) | Administrative | |
| UK - Automated classification of web scraped clothing data in consumer price statistics | Web scraped | |
| Survey write-in responses | ||
USA USCB - Shared AI Services Hosting Application | Survey write-in responses | |
| Record linkage or matching | Canada - Machine Learning for Record Linkage at Statistics Canada | Any type |
| USA BLS - Matching fatal injury records with supervised machine learning | Survey and administrative | |
| Edit and Imputation | Any type | |
Australia - Census Occupancy Imputation for Census 2021 | Census | |
Australia - Repairing Big Data sets using KNN | Combination of several | |
| Estimation and Analysis | Combination of several | |
OECD - Nowcasting Services Trade | Aggregates |