Keywords

Introduction

In the socio-economic system of Russia in the spatial aspect there is a bright disproportion: overpopulation of some regions, causing environmental, housing and transport problems in them; and, conversely, outflow of population from others. The goal of the Strategy of the Russian Federation spatial development is to overcome these problems by smoothing socio-economic differences between regions and improving interregional infrastructure, which would improve resettlement of population and centers of economic activity.

Our research is aimed at developing a tool for evaluating alternative actions in tax, monetary and investment policies within aims of the Strategy. The model of the Russian Federation spatial development, which is being developed in our research, would reflect sex-age structure, composition of households and spatial distribution of Russian population; infrastructure, production capacities, educational and administrative institutions in different regions. The structure of the model is presented in [6].

In this article we present initial modeling data structures and sources and algorithms of population synthesis for the model. We have selected sample-based approach for this task, because large arrays of detailed information about population and composition of households in different regions of the Russian Federation are available from results of the All-Russian Population Census 2010 [1]. The population is reconstructed as records of individuals in a household in the model database so that attributes of the resulting synthetic population match as closely as possible the distributions in the census data, as it was proposed by Beckman for U.S. census data structure [3] and further implemented for the UK, Switzerland, Australia, Canada, Belgium [2, 7] and other countries. Since the Strategy of the Russian Federation spatial development considers regions as a geographic scale, there is no need to use sample-free techniques to simulate population of small geographic objects [4, 5] for this task.

Algorithm for Generating Synthetic Population

The algorithm is based on a number of tables from the collections of All-Russian Population Census 2010 [2], which is the most recent source with very detailed information. The main initial data tables used for generating population of the Russian Federation are Age-sex structure of population in different regions (1), Marriage status of residents (2), Number of private households (3), Number of residents in collective and homeless households (4), Age-sex composition of private households (5), Age structure of the married couples (6), Number of children in intact and single-parent families (7), Distribution of private households by number of residents (8).

The first step is to set geographical structure of the Russian Federation (Fig. 17.1). After that the original generation of agents is created, in accordance with the sex-age structure of population in each region (1). One agent in the model corresponds to 100 people of the same sex-age group. The created agents are assigned with status “single”, “married”, “divorced” or “widower” according to the table (2). The required number of private households is created (3), one model household corresponding to 100 real-world households; and also one collective household that represents prisons, shelters, monasteries, nursing homes, and one household of homeless people in each region.

Fig. 17.1
figure 1

Sequence of population synthesis in the model

The most complex task in population synthesis is distribution of agents by households, which we do in the following sequence:

  1. 1.

    The specified number of agents is fixed for the collective household and the homeless household (4). Agents associated with collective or homeless households are excluded from further consideration.

  2. 2.

    Adult agents with status “single”, “divorced” or “widower” are distributed one by one to a given number of single private households, according to sex-age structure of their residents (5).

  3. 3.

    The specified number of married couples (2) is created, choosing male and female agents with status “married” in accordance with the age structure of the married couples (6). Created couples are assigned one by one to private households.

  4. 4.

    The given number of children is attached to intact families (7), respecting difference between mothers and children in the range of 16 to 45 years.

  5. 5.

    Single-parent families are created and the given number of children (7) is attached to them, sticking to the agreed age difference. Single parents are selected from adult agents with status “single”, “divorced” or “widower”. Each single-parent family is distributed to a separate private household.

  6. 6.

    Remaining adult agents are randomly distributed by two into empty private households.

  7. 7.

    Number of agents in private households is brought up in accordance with (8) by settling the remaining agents into created private households.

Generated population is stored in a database for later use in scenario calculations on the further stages.

Simulation Results

The model of the Russian Federation spatial development is being programmed on C# in Microsoft Visual Studio 2015. Results of the generation procedure are stored in the model database, access to them is provided by SQL-queries. Table 17.1 presents variation of characteristics of the generated population of the Russian Federation in 2010 from the original Census data. To cope with computational complexity of generating synthetic population of 142 million of residents we have reduced number of agents (100 to 1), the opposite operation was made to modeling data for calculating variation of results.

Table 17.1 Simulation results

Conclusions

In this article we presented results of the first stage of development of an agent-based computer model of the Russian Federation spatial development. The model is a tool for assessing control actions aimed at eliminating imbalances in population distribution, economic achievements and quality of life in various regions. At this stage we reconstructed population of different regions and their distribution among households in accordance with information from All-Russian Population Census 2010.

The purpose of the generation procedure was to create objects of the model (population and households) and reflect their interconnections in the model database. In the process of program implementation, agents and organizations were aggregated: one agent in the model corresponds to 100 residents. Simulation results show very small variation from Census data due to used sample-based technique: the number of created objects is pre-determined and only their distribution is stochastic. Meanwhile, the obtained results show validity of the proposed population synthesis algorithms and their program implementation.