Determinants of informal employment in Bolivia: a combined analysis of traditional econometric techniques and machine learning methods
DOI:
https://doi.org/10.23881/idupbo.025.2-5eKeywords:
Informal employment, Probit, Machine learning, Adaptive LassoAbstract
This study examines the determinants of informal employment in Bolivia by combining traditional econometric techniques, machine learning methods, and hybrid approaches. Using data from the 2022 and 2023 Household Surveys, we identify individual and household-level factors influencing the likelihood of being in informal employment. The results show that variables such as age, education level, household income, and gender are key determinants. Random Forest highlights the central role of labor income, often excluded due to endogeneity concerns. Adaptive Lasso helps identify nonlinear relationships and complex interactions, such as those associated with gender, indigenous group membership, and the presence of young children in the household. We conclude that informal employment is a multidimensional phenomenon requiring integrated analytical approaches for the design of more effective and targeted public policies.Downloads
References
Agrawal, A. (2019). The economics of artificial intelligence: An agenda. University of Chicago Press.
Angel-Urdinola, D. F. y Tanabe, K. (2012). Micro-determinants of informal employment. En Striving for better jobs: The challenge of informality in the Middle East and North Africa region
Athey, S. e Imbens, G. W. (2019). Machine learning methods that economists should know about. Annual Review of Economics, 11, 685- 725 https://doi.org/10.1146/annurev-economics-080217-053433
Athey, S. y Wager, S. (2019). Estimating treatment effects with causal forests: An application. Observational Studies, 5(1), 37-51.
Banco Mundial. (2008). Aportes a una nueva visión de la informalidad laboral en la Argentina. Ministerio de Trabajo, Empleo y Seguridad Social.
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. https://doi.org/10.1023/A:1010933404324
Canelas, C. y Niño-Zarazúa, M. (2022). Informality and pension reforms in Bolivia: The case of Renta Dignidad. The Journal of Development Studies, 58(7), 1436-1458. https://doi.org/10.1080/00220388.2022.2032678
Cerquera-Losada, O. H., Arias-Barrera, C. J. y Rojas-Velásquez, L. (2020). Determinantes del subempleo en Colombia: Una aproximación a partir de un modelo PROBIT. El Ágora USB, 20(1), 157-172.https://doi.org/10.21500/16578031.4525
Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W. y Robins, J. (2018). Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1), C1-C68. https://doi.org/10.1111/ectj.12097
Coaquira-Velásquez, M. A. (2021). Determinantes de la informalidad y subempleo en el departamento de Puno: Un modelo Probit bivariado aplicado para el año 2019. Semestre Económico, 10(1), 49-67.
Dell'Anno, R. (2021). Theories and definitions of the informal economy: A survey. Journal of Economic Surveys, 35(5), 1610-1643. https://doi.org/10.1111/joes.12450
Desai, A. (2023). Machine learning for economics research: When, what and how? arXiv. https://arxiv.org/abs/2301.05026
Duval-Hernández, R. (2022). Choices and constraints: The nature of informal employment in urban Mexico. Journal of Development Studies, 58(7), 1349-1362. https://doi.org/10.1080/00220388.2022.2032677
Espinoza, J. R. (2021). Análisis de la informalidad comercial en el distrito de San Juan de Miraflores. Horizonte Empresarial, 20(2), 782-792.
Freije, S. (2002). Informal employment in Latin America and the Caribbean: Causes, consequences and policy recommendations. Banco Interamericano de Desarrollo.
Grusky, D. B. (2008). Social stratification. Westview Press.
Hastie, T., Tibshirani, R. y Friedman, J. (2017). The elements of statistical learning: Data mining, inference, and prediction (2ª ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7
Hussein, A.-O. (2020). How machine learning affects economics/econometrics? A critical review of machine learning and econometrics. Oxford Brookes Business School.
Organización Internacional del Trabajo (OIT). (2015). Policy responses to the informal economy. Oficina Internacional del Trabajo.
Organización Internacional del Trabajo (OIT). (2023). Statistics on the informal economy. https://www.ilo.org/informaleconomy
Instituto Nacional de Estadística (INE). (2022). Bolivia - Encuesta de Hogares 2022 (EH 2022). https://anda.ine.gob.bo/index.php/catalog/106
Instituto Nacional de Estadística (INE). (2023). Bolivia - Encuesta de Hogares 2023 (EH 2023). https://anda.ine.gob.bo/index.php/catalog/108
Liang, Z., Appleton, S. y Song, L. (2016). Informal employment in China: Trends, patterns and determinants of entry. IZA Discussion Paper, N.º 10139
Losby, J. L., Else, J. F., Kingslow, M. E., Edgcomb, E. L., Malm, E. T. y Kao, V. (2002). Informal economy literature review. ISED Consulting and Research.
Muhammad Tanveer, A. K. y Hussain, B. (2021). Measurement and determinants of informal employment: Evidence from Pakistan. Pakistan Social Science Review, 5(2), 309-324.
Mullainathan, S. y Spiess, J. (2017). Machine learning: An applied econometric approach. Journal of Economic Perspectives, 31(2), 87-106. https://doi.org/10.1257/jep.31.2.87
Narváez, A. R., López, J. W. y Ochoa, C. L. (2021). Causas de la informalidad laboral en Montería, Colombia: Un modelo econométrico Probit. Revista de Economía y Finanzas, 15(44), 45-60.
Onofrei, N. (2024). The determinants of informal economy in Eastern European countries. Economy and Sociology, 2, 45-58.
Ortiz, C. H., Uribe, J. I. y García, G. A. (2007). Informalidad y subempleo: Un modelo probit bivariado aplicado al Valle del Cauca. Sociedad y Economía, 13, 104-131.
Pérez-Pons, M. E., Parra-Dominguez, J., Omatu, S., Herrera-Viedma, E. y Corchado, J. M. (2022). Machine learning and traditional econometric models: A systematic mapping study. Journal of Artificial Intelligence and Soft Computing Research, 12(1), 5-20. https://doi.org/10.2478/jaiscr-2022-0001
Steuer, F. (2018). Machine learning for public policy making: How to use data driven predictive modeling for the social good. Student Paper Series. Hertie School.
Thomas, R. y Thomas, H. (1994). The informal economy and local economic development policy. Local Government Studies, 20(3), 486-501 . https://doi.org/10.1080/03003939408433740
Ticona-Carrizales, L., Portillo, H. A., Rodríguez-Limachi, O. M., Incacutipa-Limachi, D. J., Yapuchura-Saico, C. R., Huayta-Flores, L. y Ticona-Campos, V. N. (2024). Socioeconomic determinants of labor informality in the Department of Puno, Peru. Journal of Ecohumanism, 3(8), 7235-7247. https://doi.org/10.33140/JEH.03.08.02
Ulyssea, G. (2020). Informality: Causes and consequences for development. Annual Review of Economics, 12, 525-546. https://doi.org/10.1146/annurev-economics-082119-121914
Ulyssea, G., Bobba, M. y Gadenne, L. (2023). Informality. VoxDevLit. https://voxdev.org
Villalpando, M. N. (2024). Explorando las raíces de la informalidad laboral en Bolivia: Un análisis de sus determinantes. ARU Search, 5(1), 181-207.
Wooldridge, J. M. (2020). Introductory econometrics: A modern approach (7ª ed.). Cengage Learning.
Yenilmez Oğuz, S. (2024). Machine learning integration in econometric models. Next Generation Journal for the Young Researchers, 8(1), 77-80.
Zheng, E., Tan, Y., Goes, P., Chellappa, R., Wu, D., Shaw, M. y Sheng, O. (2017). When econometrics meets machine learning. Data and Information Management, 1(2), 75-83. https://doi.org/10.1515/dim-2017-0009
Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101(476), 1418-1429. https://doi.org/10.1198/016214506000000735
Downloads
Additional Files
Published
Issue
Section
License
Copyright (c) 2026 Ibhar Beramendi, Ivette Illanes

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Creative Commons Attribution-Noncommercial-Share Alike
CC BY-NC-SA
This license lets others remix, tweak, and build upon your work for non-commercial purposes, as long as they credit the author(s) and license their new creations under the identical terms.
The authors can enter additional separate contract agreements for non-exclusive distribution of the version of the article published in the magazine (for instance, they may publish it in an institutional repository or a book), subject to an acknowledgement of its initial publication in this magazine.