| Title | Theme | Statistics Area | Country/Organisation | Reports | ML methods | Data Source | Data Type | Programming Language | Code Availability |
|---|---|---|---|---|---|---|---|---|---|
| Address Register Automated Image Recognition (AIR) model | Imagery Analysis | | Australia | To be uploaded | CONVOLUTIONAL NEURAL NETWORK | Aerial Imagery | Imagery data | R | Ask for availability |
| Learning statistical information from images: a proof of concept | Imagery Analysis | | Netherlands | To be uploaded | CONVOLUTIONAL NEURAL NETWORK | | | | ?? - Gitlab link (Joep: not public yet?) |
| Arealstatistik Deep Learning (ADELE) | Imagery Analysis | | Switzerland | To be uploaded | CONVOLUTIONAL NEURAL NETWORK, RANDOM FOREST | | | | |
| Use of Landsat satellite data for the mapping of urban areas in non-census years | Imagery Analysis | | Mexico | To be uploaded | EXTRA TREE, CONVOLUTIONAL NEURAL NETWORK | | | | |
| Generic Pipeline for Production of Official Statistics Using Satellite Data and Machine Learning | Imagery Analysis | Not applicable | UNECE | To be uploaded | GENERAL DOCUMENT | Not applicable | Not applicable | Not applicable | Not applicable |
| Imputation of the variable “Attained Level of Education” in Base Register of Individuals | Edit & Imputation | Education statistics | Italy | To be uploaded | MULTI-LAYER PERCEPTRON, LOG-LINEAR | Administrative data, Survey data, Census data | Multivariate data | Python | Github link |
| Imputation in the sample survey on participation of Polish residents in trips | Edit & Imputation | Tourism statistics | Poland | To be uploaded | CART, RANDOM FOREST, OPTIMAL WEIGHTED NEAREST NEIGHBOUR, SUPPORT VECTOR MACHINE | Survey data | Multivariate data | R | Local, not public |
| Machine learning methods for imputation | Edit & Imputation | ? | Germany | To be uploaded | K-NEAREST-NEIGHBOURS, BAYESIAN NETWORKS, RANDOM FOREST, SUPPORT VECTOR MACHINE | Survey data | Multivariate data | R | Not available |
| Early estimates of energy balance statistics using machine learning | Edit & Imputation | Energy statistics, Economic and Financial statistics, Weather statistics | Belgium VITO | To be uploaded | LASSO REGRESSION, LINEAR REGRESSION, NEURAL NETWORK, RANDOM FOREST, RIDGE REGRESSION | ? | Multivariate data | Python | Github link |
| | Edit & Imputation | | UK | To be uploaded | | | | | |
| Editing in the Italian Register of the Public Administration | Edit & Imputation | Economic and Financial statistics | Italy | To be uploaded | DECISION TREE, RANDOM FOREST | Administrative data | Multivariate data | R | |
| Occupation and Economic activity coding using natural language processing | Coding & Classification | Demographic and Social statistics, Economic and Financial statistics, Labor statistics | Mexico | To be uploaded | EXTRA TREE, NAIVE BAYES, XGBOOST, SUPPORT VECTOR MACHINE, MULTI-LAYER PERCEPTRON, DECISION TREE, RANDOM FOREST, K-NEAREST-NEIGHBOURS, LOGISTIC REGRESSION | Survey data | Text data, Multivariate data | Python | |
| Industry and Occupation Coding | Coding & Classification | Labor statistics, Business statistics | Canada | To be uploaded | FASTTEXT | Survey data | Text data | Python | Github link |
| Sentiment Analysis of Twitter data | Coding & Classification | Life statistics | Belgium Flanders | To be uploaded | WORD EMBEDDING, LOGISTIC REGRESSION, XGBOOST, RANDOM FOREST | Social Media data | Text data | Python | Github link |
| | Coding & Classification | | Serbia | To be uploaded | | | | | Not available |
| Coding Workplace Injury and Illness | Coding & Classification | Labor statistics | USA | To be uploaded | NEURAL NETWORK | Survey data | Text data | Python | Github link |
| Product Description to ECOICOP | Coding & Classification | ? | Poland | To be uploaded | NAIVE BAYES, LOGISTIC REGRESSION, RANDOM FOREST, SUPPORT VECTOR MACHINE, NEURAL NETWORK | Web Scraping data | Text data | Python | (to check with Marta and Krystyna) |
| | Coding & Classification | | Australia | To be uploaded | | | | | |
| Automated Coding of IMF's Catalog of Time Series | Coding & Classification | | IMF | To be uploaded | | | | | |
| | Coding & Classification | | Iceland | To be uploaded | | | | | |
| Standard Industrial Code Classification by Using Machine Learning | Coding & Classification | Business Registration Statistics? | Norway | To be uploaded | LOGISTIC REGRESSION, RANDOM FOREST, NAIVE BAYES, SUPPORT VECTOR MACHINE, FASTTEXT, NEURAL NETWORK | Business Registration data? | Text data | | |