| Title | Theme | Statistics Area | Country/Organisation | Reports | ML methods | Data Source | Data Type | Programming Language | Code Availability |
|---|---|---|---|---|---|---|---|---|---|
| Address Register Automated Image Recognition (AIR) model | Imagery Analysis | Geospatial statistics | Australia | To be uploaded | CONVOLUTIONAL NEURAL NETWORK | Aerial Imagery | Imagery data | R | Ask for availability |
| Learning statistical information from images: a proof of concept | Imagery Analysis | Geospatial statistics, Income-based Poverty statistics | Netherlands | To be uploaded | CONVOLUTIONAL NEURAL NETWORK | Aerial Imagery, Satellite Imagery | Imagery data | Python | GitLab link (not yet public?) |
| Arealstatistik Deep Learning (ADELE) | Imagery Analysis | Geospatial statistics | Switzerland | To be uploaded | CONVOLUTIONAL NEURAL NETWORK, RANDOM FOREST | Satellite Imagery | Imagery data, Administrative data | Python | Ask for availability |
| Use of Landsat satellite data for the mapping of urban areas in non-census years | Imagery Analysis | Geospatial statistics, Urban statistics | Mexico | To be uploaded | EXTRA TREE, CONVOLUTIONAL NEURAL NETWORK | Satellite Imagery | Imagery data | Python | Ask for availability |
| Generic Pipeline for Production of Official Statistics Using Satellite Data and Machine Learning | Imagery Analysis | Not applicable | UNECE | To be uploaded | GENERAL DOCUMENT | Not applicable | Not applicable | Not applicable | Not applicable |
| Imputation of the variable “Attained Level of Education” in Base Register of Individuals | Edit & Imputation | Education statistics | Italy | To be uploaded | MULTI-LAYER PERCEPTRON, LOG-LINEAR | Administrative data, Survey data, Census data | | Python | GitHub link |
| Imputation in the sample survey on participation of Polish residents in trips | Edit & Imputation | Tourism statistics | Poland | To be uploaded | CART, RANDOM FOREST, OPTIMAL WEIGHTED NEAREST NEIGHBOUR, SUPPORT VECTOR MACHINE | Survey data | | R | Local, not public |
| Machine learning methods for imputation | Edit & Imputation | ? | Germany | To be uploaded | K-NEAREST-NEIGHBOURS, BAYESIAN NETWORKS, RANDOM FOREST, SUPPORT VECTOR MACHINE | Survey data | | R | Not available |
| Early estimates of energy balance statistics using machine learning | Edit & Imputation | Energy statistics, Economic and Financial statistics, Weather statistics | Belgium (VITO) | To be uploaded | LASSO REGRESSION, LINEAR REGRESSION, NEURAL NETWORK, RANDOM FOREST, RIDGE REGRESSION | ? | | Python | GitHub link |
| | Edit & Imputation | | UK | To be uploaded | | | | | |
| Editing in the Italian Register of the Public Administration | Edit & Imputation | Economic and Financial statistics | Italy | To be uploaded | DECISION TREE, RANDOM FOREST | Administrative data | | R | |
| Occupation and Economic activity coding using natural language processing | Coding & Classification | Demographic and Social statistics, Economic and Financial statistics, Labor statistics | Mexico | To be uploaded | EXTRA TREE, NAIVE BAYES, XGBOOST, SUPPORT VECTOR MACHINE, MULTI-LAYER PERCEPTRON, DECISION TREE, RANDOM FOREST, K-NEAREST-NEIGHBOURS, LOGISTIC REGRESSION | Survey data | Text data | Python | |
| Industry and Occupation Coding | Coding & Classification | Labor statistics, Business statistics | Canada | To be uploaded | FASTTEXT | Survey data | Text data | Python | GitHub link |
| Sentiment Analysis of Twitter data | Coding & Classification | Life statistics | Belgium (Flanders) | To be uploaded | WORD EMBEDDING, LOGISTIC REGRESSION, XGBOOST, RANDOM FOREST | Social Media data | Text data | Python | GitHub link |
| | Coding & Classification | | Serbia | To be uploaded | | | | | Not available |
| Coding Workplace Injury and Illness | Coding & Classification | Labor statistics | USA | To be uploaded | NEURAL NETWORK | Survey data | Text data | Python | GitHub link |
| Product Description to ECOICOP | Coding & Classification | ? | Poland | To be uploaded | NAIVE BAYES, LOGISTIC REGRESSION, RANDOM FOREST, SUPPORT VECTOR MACHINE, NEURAL NETWORK | Web Scraping data | Text data | Python | (to check with Marta and Krystyna) |
| | Coding & Classification | | Australia | To be uploaded | | | | | |
| Automated Coding of IMF's Catalog of Time Series | Coding & Classification | | IMF | To be uploaded | | | | | |
| | Coding & Classification | | Iceland | To be uploaded | | | | | |
| Standard Industrial Code Classification by Using Machine Learning | Coding & Classification | Business Registration Statistics? | Norway | To be uploaded | LOGISTIC REGRESSION, RANDOM FOREST, NAIVE BAYES, SUPPORT VECTOR MACHINE, FASTTEXT, NEURAL NETWORK | Business Registration data? | Text data | Python | GitHub link (Ask for availability) |