Comment

TitleThemeStatistics AreaCountry/OrganisationReportsML methods

Data Source

Data TypeProgramming LanguageCode AvailabilityNote
Address Register Automated Image Recognition (AIR) modelImagery AnalysisGeospatial statisticsAustraliaTo be uploaded

Convolutional neural network 

Aerial ImageryImagery dataRAsk for availability
Learning statistical information from images: a proof of conceptImagery Analysis

Geospatial statistics, Income-based Poverty statistics

NetherlandsTo be uploaded

Convolutional neural network 

Aerial Imagery,

Satellite Imagery

Imagery dataPython?? - GitLab link (Joep: not public, yet? )
Arealstatistik Deep Learning (ADELE)Imagery AnalysisGeospatial statisticsSwitzerlandTo be uploaded

Convolutional neural network, Random forest

Satellite Imagery, Administrative dataImagery dataPythonAsk for availability
Use of Landsat satellite data for the mapping of urban areas in non-census yearsImagery AnalysisGeospatial statistics, Urban statisticsMexicoTo be uploaded

Convolutional neural network, Extra tree

Satellite ImageryImagery dataPythonAsk for availability
Generic Pipeline for Production of Official Statistics Using Satellite Data and Machine LearningImagery AnalysisNot applicableUNECETo be uploaded


Not applicableNot applicableNot applicableNot applicable
Imputation of the variable “Attained Level of Education” in Base Register of IndividualsEdit & ImputationEducation statisticsItalyTo be uploaded

Multilayer perceptron, Log linear

Administrative data, Survey data, Census data
Python GitHub link 
Imputation in the sample survey on participation of Polish residents in tripsEdit & Imputation

Tourism statistics 

PolandTo be uploaded

CART, Random forest, Optimal weighted nearest neighbor, Support vector machine

Survey data
RLocal, not public
Machine learning methods for imputationEdit & Imputation?GermanyTo be uploaded

K-nearest neighbors, Bayesian network, Random forest, Support vector machine

Survey data
RNot available
Early estimates of energy balance statistics using machine learningEdit & Imputation

Energy statistics,

Economic and Financial statistics,

Weather statistics

Belgium VITOTo be uploaded

Lasso regression, Linear regression, Neural network, Random forest, Ridge regression


PythonGitHub link

Edit & Imputation
UKTo be uploaded





Editing in the Italian Register of the Public AdministrationEdit & ImputationEconomic and Financial statisticsItalyTo be uploaded

Decision tree, Random forest

Administrative data 
R

Occupation and Economic activity coding using natural language processingCoding & Classification

Demographic and Social statistics,

Economic and Financial statistics, 

Labor statistics

MexicoTo be uploaded

Extra tree, Naive bayes, XGBoost, Support vector machine, Multilayer perceptron, Decision tree, Random forest, K-nearest neighbors, Logistic regression, 

Survey dataText dataPython
Industry and Occupation CodingCoding & ClassificationLabor statistics, Business StatisticsCanadaTo be uploaded

FastText

Survey dataText dataPythonGitHub link
Sentiment Analysis of twitter dataCoding & ClassificationLife statisticsBelgium FlandersTo be uploaded

Word embedding, Logistic regression, XGBoost, Random forest

Social Media  dataText dataPythonGitHub link

Coding & Classification
SerbiaTo be uploaded



Not available
Coding Workplace Injury and IllnessCoding & ClassificationLabor StatisticsUSATo be uploaded

Neural network

Survey dataText dataPythonGitHub link
Product Description to ECOICOPCoding & Classification?PolandTo be uploaded

Naive bayes,, Logistic regression, Random forest, Support vector machine, Neural network

Web Scraping dataText dataPythonGithub link

Coding & Classification
AustraliaTo be uploaded







Automated Coding of IMF's Catalog of Time SeriesCoding & Classification
IMFTo be uploaded






Coding & Classification
IcelandTo be uploaded





Standard Industrial Code Classification by Using Machine Learning


Coding & ClassificationBusiness Registration Statistics?NorwayTo be uploaded

Logistic regression, Random forest, Naive bayes, Support vector machine, FastText, Neural network

Business Registration data?Text dataPythonGitHub Link (Ask for availability)