Journal of Applied Economic Research
ISSN 2712-7435
Identifying Patterns of Regional Polarization in Russia: A Machine Learning Approach
Daniel M. Balungu, Anna V. Rozanova, Kristina A. Andreeva, Anastasia V. Solod, Yutong Chen
Ural Federal University named after the First President of Russia B.N. Yeltsin, Yekaterinburg, Russia
Abstract
The study of regional polarization is critically relevant for Russia, as pronounced socio-economic disparities threaten national economic stability, social cohesion, and the strategic goal of technological sovereignty. These imbalances hinder efficient resource allocation and create vulnerabilities amid global realignment. This study analyzes patterns and drivers of regional polarization in Russia by leveraging machine learning to model the complex interplay of economic and non-economic factors. The research is guided by three hypotheses: polarization is driven by multi-dimensional disparities (H1); processes of convergence and divergence coexist (H2); and machine learning can effectively identify latent polarization structures overlooked by traditional methods (H3). K-means clustering identified regional typologies, with optimal cluster count validated by the Elbow Method and Silhouette Score. Predictive power and key drivers were analyzed using ensemble methods, Random Forest and XGBoost, with performance evaluated by Mean Squared Error (MSE). Results confirm a deeply polarized landscape with four distinct socio-economic clusters: resource-rich regions plagued by inequality and outmigration; leading hubs facing over-centralization risks; industrial-agrarian regions with persistent poverty; and lagging republics trapped in subsidy dependence. Analysis validated H2, showing simultaneous catch-up growth in some areas and entrenched divergence in others. Random Forest showed superior predictive accuracy (MSE = 0.132), confirming H3 and identifying key drivers (H1) beyond economic metrics. Theoretical significance lies in its novel, data-driven typology of Russian regions, highlighting the superiority of ML ensemble methods for modeling complex spatial inequality. Practically, findings provide policymakers with an evidence-based roadmap for differentiated regional strategies and a predictive tool for proactive intervention vital to balanced national development.
Keywords
regional polarization; interregional differentiation; machine learning; cluster analysis; time series forecasting.
JEL classification
R12, C61References
1. Кuzin, V.Yu. (2023). An evaluation of the Russian Far East spatial polarization in the post-Soviet period. Vestnik of North-Eastern Federal University Series "Earth Sciences", No. 2, 102–113. (In Russ.). https://doi.org/10.25587/svfu.2023.30.2.009
2. Conover, M., Ratkiewicz, J., Francisco, M., Goncalves, B., Menczer, F., Flammini, A. (2021). Political polarization on Twitter. Proceedings of the International AAAI Conference on Web and Social Media, Vol. 5, No. 1, 89–96. https://doi.org/10.1609/icwsm.v5i1.14126
3. Da Silva, T.P. (2023). Learning beyond the spatial autocorrelation structure: A machine learning- based approach to discovering new patterns and relationships in the context of spatially contextualized modeling of voting behavior. Doctoral Thesis. Sao Carlos. https://doi.org/10.11606/t.55.2023.tde-15012024-174102
4. Levy, R. (2021). Social Media, News Consumption, and Polarization: Evidence from a Field Experiment. American Economic Review, Vol. 111, No. 3, 831–870. https://doi.org/10.1257/aer.20191777
5. Aharon-Gutman, M., Schaap, M., Lederman, I. (2018). Social topography: Studying spatial inequality using a 3D regional model. Journal of Rural Studies, Vol. 62, 40–52. https://doi.org/10.1016/j.jrurstud.2018.06.010
6. Martin, R., Sunley, P. (2020). Regional economic resilience: evolution and evaluation. In: Handbook on Regional Economic Resilience. Edited by G. Bristow, A. Healy. Edward Elgar Publishing, 10–35. https://doi.org/10.4337/9781785360862.00007
7. Aharon-Gutman, M., Burg, D. (2019). How 3D visualization can help us understand spatial inequality: On social distance and crime. Environment and Planning B: Urban Analytics and City Science, Vol. 48, Issue 4, 793–809. https://doi.org/10.1177/2399808319896524
8. Stephens, J.D. (2015). Thomas Piketty (2014), Capital in the Twenty-First Century. Translated by Arthur Goldhammer. Cambridge, Massachusetts: Belknap Press of Harvard University Press, 685 pp., Journal of Social Policy, Vol. 45, Issue 1, 172–173. https://doi.org/10.1017/s0047279415000616
9. Marco, C., Lenka, J., Christa, K.S. (2024). The Future of EU Cohesion: Scenarios and Their Impacts on Regional Inequalities. Belgium: European Parliamentary Research Service (EPRS), 61 p. Available at: https://www.europarl.europa.eu/RegData/etudes/STUD/2024/762854/EPRS_STU(2024)762854_EN.pdf
10. Glaeser, E.L. (2021). Urban resilience. Urban Studies, Vol. 59, Issue 1, 3–35. https://doi.org/10.1177/00420980211052230
11. Tebekin M.V. (2023). Methodological approach to the assessment of spatial polarization of regions. Applied Economic Research, No. 4, 86–94. (In Russ.). https://doi.org/10.47576/2949-1908_2023_4_86
12. Altunina, V., Anuchina, D. (2023). Assessment of the level of spatial polarization of Russian regions. Journal of Economics Entrepreneurship and Law, Vol. 13, No. 5, 1319–1340. (In Russ.). https://doi.org/10.18334/epp.13.5.117516
13. Taymaz, E. (2022). Regional convergence or polarization: the case of the Russian Federation. Regional Research of Russia, Vol. 12, 469–482. https://doi.org/10.1134/s2079970522700198
14. Gluschenko, K.P. (2023). Regional inequality in Russia: Anatomy of convergence. Regional Research of Russia, Vol. 13, Suppl. 1, S1–S12. https://doi.org/10.1134/s207997052360004x
15. Atkinson, A.B. (1970). On the measurement of inequality. Journal of Economic Theory, Vol. 2, Issue 3, 244–263. https://doi.org/10.1016/0022-0531(70)90039-6
16. Chi, S., Grigsby-Toussaint, D.S., Bradford, N., Choi, J. (2013). Can Geographically Weighted Regression improve our contextual understanding of obesity in the US? Findings from the USDA Food Atlas. Applied Geography, Vol. 44, 134–142. https://doi.org/10.1016/j.apgeog.2013.07.017
17. Amiri, M., Pourghasemi, H.R., Ghanbarian, G.A., Afzali, S.F. (2019). Assessment of the importance of gully erosion effective factors using Boruta algorithm and its spatial modeling and mapping using three machine learning algorithms. Geoderma, Vol. 340, 55–69. https://doi.org/10.1016/j.geoderma.2018.12.042
18. Bergal, B. (2022). Audit of the effectiveness of cluster policy implementation in the region. Pskov Region-logical Journal, Vol. 18, No. 2, 154–167. https://doi.org/10.37490/S221979310019289-0
19. Rogot, A. (2023). Dimensionality reduction techniques in Macroeconomic Analysis. CUNY Academic Works. Available at: https://academicworks.cuny.edu/bb_etds/166
20. Chagovets, L., Chahovets, V., Chernova, N. (2020). Machine Learning Methods Applications for Estimating Unevenness Level of Regional Development. In: Data-Centric Business and Applications. Evolvements in Business Information Processing and Management. Vol. 3. Edited by D. Ageyev, T. Radivilova, N. Kryvinska. Springer Cham, 115–139. https://doi.org/10.1007/978-3-030-35649-1_6
21. Veale, M., Binns, R. (2017). Fairer machine learning in the real world: Mitigating discrimination without collecting sensitive data. Big Data & Society, Vol. 4, Issue 2, 205395171774353. https://doi.org/10.1177/2053951717743530
22. Terlizzi, E.P., Cohen, R.A. (2023). Geographic Variation in Health Insurance Coverage: United States, 2022. National Health Statistics Reports. https://doi.org/10.15620/cdc:133320
23. Oberlander, J. (2024). Polarization, partisanship, and health in the United States. Journal of Health Politics Policy and Law, Vol. 49, Issue 3, 329–350. https://doi.org/10.1215/03616878-11075609
24. Giannini, M., Martini, B. (2024). Regional disparities in the European Union. A machine learning approach. Papers of the Regional Science Association, Vol. 103, Issue 4, 100033. https://doi.org/10.1016/j.pirs.2024.100033
25. Lange, T. (2015). Socio-economic and political responses to regional polarisation and socio-spatial peripheralisation in Central and Eastern Europe: A research agenda. Hungarian Geographical Bulletin, Vol. 64, No. 3, 171–185. https://doi.org/10.15201/hungeobull.64.3.2
26. Druckman, J.N., Levendusky, M.S. (2024). Correction to: What Do We Measure When We Measure Affective Polarization? Public Opinion Quarterly, Vol. 88, Issue 3, 1095–1096. https://doi.org/10.1093/poq/nfae051
27. Jo, J. (2019). Effectiveness of normalization Pre-Processing of big data to the machine learning performance. The Journal of the Korea Institute of Electronic Communication Sciences, Vol. 14, Issue 3, 547–552. https://doi.org/10.13067/jkiecs.2019.14.3.547
28. Thorndike, R.L. (1953). Who belongs in the family? Psychometrika, Vol. 18, Issue 4, 267–276. https://doi.org/10.1007/bf02289263
29. Eltibi, MF., Ashour, W.M. (2011). Initializing KMeans Clustering Algorithm using Statistical Information. International Journal of Computer Applications, Vol. 29, No. 7, 51–55. https://doi.org/10.5120/3573-4930
30. Currin, C.B., Vera, S.V., Khaledi-Nasab, A. (2022). Depolarization of echo chambers by random dynamical nudge. Scientific Reports, Vol. 12, 9234. https://doi.org/10.1038/s41598-022-12494-w
31. Behrens, T., Schmidt, K., Rossel, R.V., Gries, P., Scholten, T., MacMillan, R.A. (2018). Spatial modelling with Euclidean distance fields and machine learning. European Journal of Soil Science, Vol. 69, Issue 5, 757–770. https://doi.org/10.1111/ejss.12687
32. Likas, A., Vlassis, N., Verbeek, J.J. (2002). The global k-means clustering algorithm. Pattern Recognition, Vol. 36, Issue 2, 451–461. https://doi.org/10.1016/s0031-3203(02)00060-2
33. Breiman, L. (2001). Random forests. Machine Learning, Vol. 45, 5–32. https://doi.org/10.1023/A:1010933404324
34. Friedman, J.H. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, Vol. 29, No. 5, 1189–1232. https://doi.org/10.1214/aos/1013203451
35. Friedman, J., Hastie, T., Tibshirani, R. (2000). Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the authors). The Annals of Statistics, Vol. 28, No. 2, 337–407. https://doi.org/10.1214/aos/1016218223
36. Biau, G., Scornet, E. (2016). A random forest guided tour. Test, Vol. 25, 197–227. https://doi.org/10.1007/s11749-016-0481-7
37. Wright, M.N., Ziegler, A. (2017). ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R. Journal of Statistical Software, Vol. 77, Issue 1, 1–17. https://doi.org/10.18637/jss.v077.i01
38. Maksubova, D.M., Umargadzhieva, N.M., Aripova, P.G. (2024). Methodological approaches to assessing regional development. In: Computational and Strategic Business Modelling. Edited by D.P. Sakas, D.K. Nasiopoulos, Yu. Taratuhina. Springer Cham, 543–555. https://doi.org/10.1007/978-3-031-41371-1_45
39. Amarasinghe, K., Rodolfa, K., Lamba, H., Ghani, R. (2020). Explainable Machine learning for public Policy: use cases, gaps, and research directions. Data & Policy, Vol. 5, e5. https://doi.org/10.1017/dap.2023.2
40. Kamal, S., Gullic, J., Bagavathi, A. (2022). Modeling Polarization on Social Media posts: A Heuristic approach using media Bias. In: Foundations of Intelligent Systems. Proceedings of 26th International Symposium, ISMIS 2022. Edited by M. Ceci, S. Flesca, E. Masciari, G. Manco, Z.W. Raś. Springer Cham, 35–43. https://doi.org/10.1007/978-3-031-16564-1_4
41. Ahrend, R. (2005). Can Russia Break the “Resource Curse”? Eurasian Geography and Economics, Vol. 46, Issue 8, 584–609. https://doi.org/10.2747/1538-7216.46.8.584
42. Maximova, S.G., Omelchenko, D.A., Noyanzina, O.E. (2022). Human development, satisfaction with human capital and security in the Siberian and Far Eastern border regions. RUDN Journal of Sociology, Vol. 22, No. 3, 646–660. https://doi.org/10.22363/2313-2272-2022-22-3-646-660
43. Sitkevich, D.A. (2023). Economic and sociocultural factors of migration attitudes of residents of the North Caucasus. Regional Research of Russia, Vol. 13, Suppl 1, S78–S88. https://doi.org/10.1134/s2079970523600166
44. Berg, D.B., Balungu, D.M., Shelomentsev, A.G., Goncharova, K.S. (2024). Experimental trajectories of convergence and divergence processes of Russian regions population incomes inequality. Journal of Applied Economic Research, Vol. 23, No. 2, 364–393. (In Russ.). https://doi.org/10.15826/vestnik.2024.23.2.015
45. Balungu, D.M., Kumar, A. (2024). Forecasting the economic growth of Sverdlovsk Region: A comparative analysis of machine learning, linear regression and autoregressive models. Journal of Applied Economic Research, Vol. 23, No. 3, 674–695. https://doi.org/10.15826/vestnik.2024.23.3.027
46. Ketova, K., Kasatkina, E., Vavilova, D. (2021). Clustering Russian Federation Regions According to the Level of Socio-Economic Development with the Use of Machine Learning Methods. Economic and Social Changes: Facts, Trends, Forecast, Vol. 14, No. 6, 70–85. https://doi.org/10.15838/esc.2021.6.78.4
About Authors
Daniel Musafiri Balungu
Post-Graduate Student, Assistant, Department of Big Data Analytics and Video Analysis Methods, Institute of Radio Electronics and Information Technologies, Ural Federal University named after the first President of Russia B.N. Yeltsin, Yekaterinburg, Russia (620002, Yekaterinburg, Mira street, 19); ORCID https://orcid.org/0009-0001-5098-7603 e-mail: danielbal03.db@gmail.com
Anna Vyacheslavovna Rozanova
Master Student, Department of Big Data Analytics and Video Analysis Methods, Institute of Radio Electronics and Information Technologies, Ural Federal University named after the first President of Russia B.N. Yeltsin, Yekaterinburg, Russia (620002, Yekaterinburg, Mira street, 19); ORCID https://orcid.org/0009-0003-9803-0848 e-mail: rozanna221132@icloud.com
Kristina Aleksandrovna Andreeva
Master Student, Department of Big Data Analytics and Video Analysis Methods, Institute of Radio Electronics and Information Technologies, Ural Federal University named after the first President of Russia B.N. Yeltsin, Yekaterinburg, Russia (620002, Yekaterinburg, Mira street, 19); ORCID https://orcid.org/0009-0009-5345-6160 e-mail: kristinalezhnina88@gmail.com
Anastasia Vasil’evna Solod
Master Student, Department of Big Data Analytics and Video Analysis Methods, Institute of Radio Electronics and Information Technologies, Ural Federal University named after the first President of Russia B.N. Yeltsin, Yekaterinburg, Russia (620002, Yekaterinburg, Mira street, 19); ORCID https://orcid.org/0009-0007-8795-1640 e-mail: nsolodv@mail.ru
Yutong Chen
Master Student, Department of Big Data Analytics and Video Analysis Methods, Institute of Radio Electronics and Information Technologies, Ural Federal University named after the first President of Russia B.N. Yeltsin, Yekaterinburg, Russia (620002, Yekaterinburg, Mira street, 19); ORCID https://orcid.org/0009-0001-7684-1530 e-mail: kirito200207@gmail.com
For citation
Balungu, D.M., Rozanova, A.V., Andreeva, K.A., Solod, A.V., Chen, Yu. (2026). Identifying Patterns of Regional Polarization in Russia: A Machine Learning Approach. Journal of Applied Economic Research, Vol. 25, No. 1, 135-162. https://doi.org/10.15826/vestnik.2026.25.1.005
Article info
Received May 9, 2025; Revised October 6, 2025; Accepted November 5, 2025.
DOI: http://dx.doi.org/10.15826/vestnik.2026.25.1.005
Download full text article:
~630 KB, *.pdf
(Uploaded
15.03.2026)
Created / Updated: 2 September 2015 / 20 September 2021
© Federal State Autonomous Educational Institution of Higher Education «Ural Federal University named after the first President of Russia B.N.Yeltsin»
Remarks?
select the text and press:
Ctrl + Enter
Portal design: Artsofte
©Ural Federal University named the first President of Russia B.N.Yeltsin (Website)