Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Table Filter
inversefalse,false,false,false,
ddSeparator,
sparkNameSparkline
columnTheme,ML methods,Data Type,Programming Language,Programme code availability
separatorComma (,)
labels‚‚‚‚
default,,,,
isFirstTimeEnterfalse
cell-width,,,,
datepatterndd M yy
id1604670425518_1900927542
updateSelectOptionstrue
worklog365|5|8|y w d h m|y w d h m
isORAND
order0,1,2,3,4
Advanced Tables - Table Plus
autoNumbertrue
ThemeTitleCountry/Organisation

Data Source

ML methodsProgramme code availabilityProgramming LanguageNote
Coding & Classification

Occupation and Economic activity coding using natural language processing

MexicoSurvey data

Extra tree, Naive bayes, XGBoost, Support vector machine, Multilayer perceptron, Decision tree, Random forest, K-nearest neighbors, Logistic regression, Ensemble

Yes (Click File attachment)

Python
Coding & Classification

Industry and Occupation Coding

CanadaSurvey data

FastText

Yes (Click GitHub link)Python
Coding & Classification

Sentiment Analysis of twitter data

Belgium (Statistics Flanders)Social media data

Word embedding, Logistic regression, XGBoost, Random forest

Yes (Click GitHub link)Python
Coding & Classification

Coding textually described data on economic activity collected from Labour Force Survey

SerbiaSurvey dataRandom forest, Support vector machine, Logistic regression


Coding & Classification

Coding Workplace Injury and Illness

USA BLSSurvey data

Neural network

Yes (Click GitHub link)Python
Coding & Classification

Production description to ECOICOP

PolandWeb scraping data

Naive bayes, Logistic regression, Random forest, Support vector machine, Neural network

Yes (Click Github link)Python
Coding & Classification

Pilot Phase - Automated Coding using the IMF’s Catalog of Time Series

Phase 2 - Automated production tool to code IMF member state time series data using ML algorithms

IMFDescriptions of indicators in data filesLogistic regression, K-nearest neighbors
Python
Coding & Classification

Automatic coding of occupation and industry in social statistical surveys

IcelandSurvey dataDeep learningYes (See section 5 of the report)R
Coding & Classification

Standard Industrial Code Classification by Using Machine Learning

NorwayAdministrative data

Logistic regression, Random forest, Naive bayes, Support vector machine, FastText, Neural network


Python
Edit & Imputation

Imputation of the variable “Attained Level of Education” in Base Register of Individuals

Italy

Administrative data, Survey data, Census data

Multilayer perceptron, Log linear

Yes (Click GitHub link)

Python


Edit & Imputation

Imputation in the sample survey on participation of Polish residents in trips

PolandSurvey data

CART, Random forest, Optimal weighted nearest neighbor, Support vector machine


R
Edit & Imputation

Machine learning for imputation

Germany

Survey data

K-nearest neighbors, Bayesian network, Random forest, Support vector machine


R


Edit & Imputation

Early estimates of energy balance statistics using machine learning

Belgium (VITO)


Lasso regression, Linear regression, Neural network, Random forest, Ridge regression

Yes (Click GitHub link)

Python


Edit & Imputation

Editing of Living Cost and Food Survey Income data

UK

Survey data

Decision tree, Random forest, Neural network




Edit & Imputation

Editing in the Italian Register of the Public Administration

Italy

Administrative data 

Decision tree, Random forest


R


Edit & Imputation

Machine Learning for Data Editing Cleaning in NSI : Some ideas and hints

Italy




Imagery AnalysisAustraliaAerial imagery

Convolutional neural network 


R
Imagery Analysis

Learning statistical information from images: a proof of concept

Netherlands

Aerial imagery,

Satellite imagery

Convolutional neural network 


Python
Imagery Analysis

Arealstatistik Deep Learning (ADELE)

SwitzerlandSatellite imagery, Administrative data

Convolutional neural network, Random forest


PythonLand cover statistics, Land use statistics
Imagery Analysis

Use of Landsat satellite data for the mapping of urban areas in non-census years

MexicoSatellite imagery

Convolutional neural network, Extra tree


Python
Imagery Analysis

Generic Pipeline for Production of Official Statistics Using Satellite Data and Machine Learning

UNECE





Coding & ClassificationAutomated coding of Standard Industrial and Occupational Classifications (SIC/SOC) UKSurvey data, Census data

Logistic regression 

Yes (Click Github link)Python
Coding & ClassificationApply ML techniques to classification and aggregation web scraped price dataBrazilWeb scraping data

Logistic regression, Support vector machine, Naive bayes, Random forest,  XGBoost


Python
Edit & ImputationMultiple imputation through machine learning in a survey of sport clubsPolandSurvey data

Random forest, CART


R
ModelingState level expenditure estimates based on ML techniquesUSSurvey data, Census planning data

Gradient-boosting machine, Lasso regression, K-nearest neighbors




Route OptimisationRoute Optimisation through genetic algorithmChile

Genetic algorithm


R
Coding & ClassificationUsing Big Data Tools and Machine Learning Techniques to Assign Classification of Individual Consumption by Purpose (COICOP) CategoriesTurkeySurvey data, Imagery data, Scanner data

Logistic regression, Support vector machine, Naive bayes, BERT, Convolutional neural network


Python
Imagery AnalysisFeasibility study of Satellite Imagery Analysis for Wealth Index Development in IndonesiaIndonesiaSatellite imagery

Convolutional neural network, Ridge regression, Support vector machine




Coding & Classification

Three projects (Scrape an ICT variable, Gain insights from an open-ended question, Create a framework for government R&D survey)TürkiyeWeb scraping data

Top2Vec

Yes (Github link for Project 1, Project 2, Project 3)

Python

Coding & Classification

Statistics on companies undertaking activities in the field of corporate social responsibility (CSR) using web scraping and machine learningML2022 web scraping theme group reportBelgium, Türkiye, PolandWeb scraping dataYes (from Türkiye - link)





Coding & Classification

Unsupervised ranking and categorisation of companies using web scraping and machine learningBelgium (Statistics Flanders)Web scraping dataPython