Municipality Synthetic Gini Index for Colombia: A Machine Learning Approach

05 February 2025, Version 1
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

This paper presents two synthetic estimations of the Gini coefficient at a municipality level for Colombia in the years 2000-2020. The methodology relies on several machine learning models to select the best model for imputation of the data. This derives in two Random Forest models were the first is characterized by containing Dominant Fixed Effects, while the second contains a set of Dominant Varying Factors. Upon these estimations, the Synthetic Gini Coefficients for both models are inspected, and public links are generated to access them. The Dominant Fixed Effects models is rather ”stiff” in contrast to the Varying Factor model. Hence, for researchers it is recommended to use the Synthetic Gini Coefficient with Varying Factors because it contains greater variability across time than the Dominant Fixed Effects models.

Keywords

Gini
Machine learning
Random forest
estimation
synthetic
economics

Supplementary materials

Title
Description
Actions
Title
Synthetic Gini Coefficient 1st Data set
Description
Municipality datasets generated of income inequality. Check the last letters to define whether if they belong to the Dominant Fixed Effects Model DFEM or to the Varying Factor Model VFM
Actions
Title
Synthetic Gini Coefficient - Estimates from the Varying Factor Model
Description
Municipality datasets generated of income inequality. Check the last letters to define whether if they belong to the Dominant Fixed Effects Model DFEM or to the Varying Factor Model VFM
Actions

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.