Comment
- Intended purpose - central list for use cases and available code corresponding to the case (something like GSBPM Resources Repository)
- This kind of place has good potential, but has to be designed in advance so that there is minimum effort needed to maintain.
Data Source Convolutional neural network Geospatial statistics, Income-based poverty statistics Convolutional neural network Aerial imagery, Satellite imagery Convolutional neural network, Random forest Convolutional neural network, Extra tree Multilayer perceptron, Log linear Tourism statistics CART, Random forest, Optimal weighted nearest neighbor, Support vector machine K-nearest neighbors, Bayesian network, Random forest, Support vector machine Energy statistics, Economic and financial statistics, Weather statistics Lasso regression, Linear regression, Neural network, Random forest, Ridge regression Decision tree, Random forest Demographic and social statistics, Economic and financial statistics, Labor statistics Extra tree, Naive bayes, XGBoost, Support vector machine, Multilayer perceptron, Decision tree, Random forest, K-nearest neighbors, Logistic regression, Ensemble FastText Word embedding, Logistic regression, XGBoost, Random forest Neural network Naive bayes, Logistic regression, Random forest, Support vector machine, Neural network Logistic regression, Random forest, Naive bayes, Support vector machine, FastText, Neural networkTheme Title Country/Organisation Statistics Area ML methods Programming Language Programme code Note Imagery Analysis Address Register Automated Image Recognition (AIR) model Australia Geospatial statistics Aerial imagery R Imagery Analysis Learning statistical information from images: a proof of concept (UPDATED)
Netherlands Python Imagery Analysis Arealstatistik Deep Learning (ADELE) (UPDATED)
Switzerland Geospatial statistics Satellite imagery, Administrative data Python Imagery Analysis Use of Landsat satellite data for the mapping of urban areas in non-census years (UPDATED)
Mexico Geospatial statistics, Urban statistics Satellite imagery Python Imagery Analysis Generic Pipeline for Production of Official Statistics Using Satellite Data and Machine Learning (UPDATED)
UNECE Not applicable Not applicable Not applicable Edit & Imputation Imputation of the variable “Attained Level of Education” in Base Register of Individuals (UPDATED)
Italy Education statistics Administrative data, Survey data, Census data Python GitHub link Edit & Imputation Imputation in the sample survey on participation of Polish residents in trips (UPDATED)
Poland Survey data R Edit & Imputation Machine learning for imputation (UPDATED)
Germany ? Survey data R Edit & Imputation Early estimates of energy balance statistics using machine learning (UPDATED)
Belgium VITO Python GitHub link Edit & Imputation Editing of Living Cost and Food Survey Income data (UPDATED)
UK Edit & Imputation Editing in the Italian Register of the Public Administration (UPDATED)
Italy Economic and Financial statistics Administrative data R Edit & Imputation Machine Learning for Data Editing Cleaning in NSI : Some ideas and hints (NEW) Italy Coding & Classification Occupation and Economic activity coding using natural language processing - with comments (UPDATED)
Mexico Survey data Python Coding & Classification Industry and Occupation Coding (UPDATED)
Canada Labor statistics, Business statistics Survey data Python GitHub link Coding & Classification Sentiment Analysis of twitter data (UPDATED)
Belgium Flanders Life statistics Social media data Python GitHub link Coding & Classification Coding textually described data on economic activity collected from Labour Force Survey (UPDATED)
Serbia Coding & Classification Coding Workplace Injury and Illness (UPDATED)
USA Labor statistics Survey data Python GitHub link Coding & Classification Production description to ECOICOP (UPDATED)
Poland Price statistics Web scraping data Python Github link Coding & Classification Australia Coding & Classification Automated Coding using the IMF’s Catalog of Time Series (UPDATED)
IMF Coding & Classification Iceland Coding & Classification Standard Industrial Code Classification by Using Machine Learning (UPDATED)
Norway Business registration statistics? Administrative data Python