Determinants of informal employment in Bolivia: a combined analysis of traditional econometric techniques and machine learning methods

Authors

DOI:

https://doi.org/10.23881/idupbo.025.2-5e

Keywords:

Informal employment, Probit, Machine learning, Adaptive Lasso

Abstract

This study examines the determinants of informal employment in Bolivia by combining traditional econometric techniques, machine learning methods, and hybrid approaches. Using data from the 2022 and 2023 Household Surveys, we identify individual and household-level factors influencing the likelihood of being in informal employment. The results show that variables such as age, education level, household income, and gender are key determinants. Random Forest highlights the central role of labor income, often excluded due to endogeneity concerns. Adaptive Lasso helps identify nonlinear relationships and complex interactions, such as those associated with gender, indigenous group membership, and the presence of young children in the household. We conclude that informal employment is a multidimensional phenomenon requiring integrated analytical approaches for the design of more effective and targeted public policies.

Downloads

Download data is not yet available.

References

Agrawal, A. (2019). The economics of artificial intelligence: An agenda. University of Chicago Press.

Angel-Urdinola, D. F. y Tanabe, K. (2012). Micro-determinants of informal employment. En Striving for better jobs: The challenge of informality in the Middle East and North Africa region

Athey, S. e Imbens, G. W. (2019). Machine learning methods that economists should know about. Annual Review of Economics, 11, 685- 725 https://doi.org/10.1146/annurev-economics-080217-053433

Athey, S. y Wager, S. (2019). Estimating treatment effects with causal forests: An application. Observational Studies, 5(1), 37-51.

Banco Mundial. (2008). Aportes a una nueva visión de la informalidad laboral en la Argentina. Ministerio de Trabajo, Empleo y Seguridad Social.

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. https://doi.org/10.1023/A:1010933404324

Canelas, C. y Niño-Zarazúa, M. (2022). Informality and pension reforms in Bolivia: The case of Renta Dignidad. The Journal of Development Studies, 58(7), 1436-1458. https://doi.org/10.1080/00220388.2022.2032678

Cerquera-Losada, O. H., Arias-Barrera, C. J. y Rojas-Velásquez, L. (2020). Determinantes del subempleo en Colombia: Una aproximación a partir de un modelo PROBIT. El Ágora USB, 20(1), 157-172.https://doi.org/10.21500/16578031.4525

Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W. y Robins, J. (2018). Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1), C1-C68. https://doi.org/10.1111/ectj.12097

Coaquira-Velásquez, M. A. (2021). Determinantes de la informalidad y subempleo en el departamento de Puno: Un modelo Probit bivariado aplicado para el año 2019. Semestre Económico, 10(1), 49-67.

Dell'Anno, R. (2021). Theories and definitions of the informal economy: A survey. Journal of Economic Surveys, 35(5), 1610-1643. https://doi.org/10.1111/joes.12450

Desai, A. (2023). Machine learning for economics research: When, what and how? arXiv. https://arxiv.org/abs/2301.05026

Duval-Hernández, R. (2022). Choices and constraints: The nature of informal employment in urban Mexico. Journal of Development Studies, 58(7), 1349-1362. https://doi.org/10.1080/00220388.2022.2032677

Espinoza, J. R. (2021). Análisis de la informalidad comercial en el distrito de San Juan de Miraflores. Horizonte Empresarial, 20(2), 782-792.

Freije, S. (2002). Informal employment in Latin America and the Caribbean: Causes, consequences and policy recommendations. Banco Interamericano de Desarrollo.

Grusky, D. B. (2008). Social stratification. Westview Press.

Hastie, T., Tibshirani, R. y Friedman, J. (2017). The elements of statistical learning: Data mining, inference, and prediction (2ª ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7

Hussein, A.-O. (2020). How machine learning affects economics/econometrics? A critical review of machine learning and econometrics. Oxford Brookes Business School.

Organización Internacional del Trabajo (OIT). (2015). Policy responses to the informal economy. Oficina Internacional del Trabajo.

Organización Internacional del Trabajo (OIT). (2023). Statistics on the informal economy. https://www.ilo.org/informaleconomy

Instituto Nacional de Estadística (INE). (2022). Bolivia - Encuesta de Hogares 2022 (EH 2022). https://anda.ine.gob.bo/index.php/catalog/106

Instituto Nacional de Estadística (INE). (2023). Bolivia - Encuesta de Hogares 2023 (EH 2023). https://anda.ine.gob.bo/index.php/catalog/108

Liang, Z., Appleton, S. y Song, L. (2016). Informal employment in China: Trends, patterns and determinants of entry. IZA Discussion Paper, N.º 10139

Losby, J. L., Else, J. F., Kingslow, M. E., Edgcomb, E. L., Malm, E. T. y Kao, V. (2002). Informal economy literature review. ISED Consulting and Research.

Muhammad Tanveer, A. K. y Hussain, B. (2021). Measurement and determinants of informal employment: Evidence from Pakistan. Pakistan Social Science Review, 5(2), 309-324.

Mullainathan, S. y Spiess, J. (2017). Machine learning: An applied econometric approach. Journal of Economic Perspectives, 31(2), 87-106. https://doi.org/10.1257/jep.31.2.87

Narváez, A. R., López, J. W. y Ochoa, C. L. (2021). Causas de la informalidad laboral en Montería, Colombia: Un modelo econométrico Probit. Revista de Economía y Finanzas, 15(44), 45-60.

Onofrei, N. (2024). The determinants of informal economy in Eastern European countries. Economy and Sociology, 2, 45-58.

Ortiz, C. H., Uribe, J. I. y García, G. A. (2007). Informalidad y subempleo: Un modelo probit bivariado aplicado al Valle del Cauca. Sociedad y Economía, 13, 104-131.

Pérez-Pons, M. E., Parra-Dominguez, J., Omatu, S., Herrera-Viedma, E. y Corchado, J. M. (2022). Machine learning and traditional econometric models: A systematic mapping study. Journal of Artificial Intelligence and Soft Computing Research, 12(1), 5-20. https://doi.org/10.2478/jaiscr-2022-0001

Steuer, F. (2018). Machine learning for public policy making: How to use data driven predictive modeling for the social good. Student Paper Series. Hertie School.

Thomas, R. y Thomas, H. (1994). The informal economy and local economic development policy. Local Government Studies, 20(3), 486-501 . https://doi.org/10.1080/03003939408433740

Ticona-Carrizales, L., Portillo, H. A., Rodríguez-Limachi, O. M., Incacutipa-Limachi, D. J., Yapuchura-Saico, C. R., Huayta-Flores, L. y Ticona-Campos, V. N. (2024). Socioeconomic determinants of labor informality in the Department of Puno, Peru. Journal of Ecohumanism, 3(8), 7235-7247. https://doi.org/10.33140/JEH.03.08.02

Ulyssea, G. (2020). Informality: Causes and consequences for development. Annual Review of Economics, 12, 525-546. https://doi.org/10.1146/annurev-economics-082119-121914

Ulyssea, G., Bobba, M. y Gadenne, L. (2023). Informality. VoxDevLit. https://voxdev.org

Villalpando, M. N. (2024). Explorando las raíces de la informalidad laboral en Bolivia: Un análisis de sus determinantes. ARU Search, 5(1), 181-207.

Wooldridge, J. M. (2020). Introductory econometrics: A modern approach (7ª ed.). Cengage Learning.

Yenilmez Oğuz, S. (2024). Machine learning integration in econometric models. Next Generation Journal for the Young Researchers, 8(1), 77-80.

Zheng, E., Tan, Y., Goes, P., Chellappa, R., Wu, D., Shaw, M. y Sheng, O. (2017). When econometrics meets machine learning. Data and Information Management, 1(2), 75-83. https://doi.org/10.1515/dim-2017-0009

Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101(476), 1418-1429. https://doi.org/10.1198/016214506000000735

Published

2026-01-31

Issue

Section

Economía, Empresa y Sociedad

How to Cite

Beramendi Illanes, I., & Illanes Fajardo, I. (2026). Determinants of informal employment in Bolivia: a combined analysis of traditional econometric techniques and machine learning methods. Revista Investigación & Desarrollo, 25(2), 192-206. https://doi.org/10.23881/idupbo.025.2-5e