Text Mining-based Economic Activity Estimation

Abstract

This paper outlines a methodology for constructing a high-frequency indicator of economic activity in Russia. News stories from internet resources serve as the data source. The news data is analysed using text mining and machine learning methods, which, although developed only relatively recently, have quickly found wide application in scientific research, including economic studies. This is because news is not only a key source of information but also a way to gauge the sentiment of journalists and survey respondents about the current situation and convert it into quantitative data.