На русском языке

 

International Frontier Science Letters, ISSN: 2349-4484, Vol. 6, P. 6 – 15 (2015). http://dx.doi.org/10.18052/www.scipress.com/ifsl.6.6

 

Group Function of Income Distribution in Society

Sergey G. Fedosin

Sviazeva Str. 22-79, Perm, 614088, Perm region, Russian Federation

email  intelli@list.ru

Keywords: annual income; economic modeling; distribution of income.

 

Abstract. Based on the similarity of properties of photons and money, and on the formula for the density of distribution of photon gas by energies, the corresponding mathematical formula for distribution of annual income per capita is obtained. Application of this formula for the data analysis reveals several independent groups of population with different average levels of their income. In particular four main groups of population contribute to the distribution of income in the economy of the USA. JEL Codes: C51, E01, E66.

Introduction

Due to variety of reasons, in every society there is inequality in income distribution, which divides people into different groups. It sets a challenge to the economics, which is trying to describe mathematically this inequality, to understand its essence and to give such recommendations, which could bring the society close to the optimal state. Among the achieved results we shall note the Lorenz curve, based on which the dependence is built of the total share of the society’s income  in percentage (the vertical axis) on the share of families  in percentage (the horizontal axis). If each share of families had the same income, we would have a linear dependence:

 

,                                                                                                                                        (1)

 

where  is the constant coefficient of proportionality.

 

However, the income of families in the different groups varies. Usually, the whole society is divided into 10 shares equal by the number of people, and within each share the family incomes do not differ much. These shares are called deciles. Let's plot the Lorenz curve not for deciles but for quintiles, each of which contains 20 % of the population. We shall start with the lowest income quintile and finish with the highest income quintile. For the first quintile (and for all the rest) the share of families is , and since the income  share  of the first quintile is small, the slope  of the segment from the origin to the point A in Figure 1, the coordinates of which on the coordinate plane are , is small too. In order to obtain the second point of the Lorenz curve, it is necessary to add the shares of families in the first two quintiles, which gives . We also need to add the income shares of both quintiles and obtain the number . After that the points  and  are connected on the graph with a straight line. Obviously, due to the increased incomes in the second quintile the slope of the second segment will increase: . Indeed , and .

 

Continuing this procedure, we see that the Lorenz curve is bent upwards. If   is the number of a segment of the curve, then for each segment the slope will equal .

 

The greater is the total deviation of the Lorenz curve from the straight line of the form (1), which connects the origin and the end of the curve, the greater is the difference in income for different groups of population.

 

 

 

 

For example, we shall present the statistical data for Russia in 2006 – 2007 according to [1]. The incomes of 20 percent groups of population in 2007 were as follows: in the first group – 5.1 % (in 2006 it was 5.2 %), in the second group – 9.8 % (9.9 %), in the third – 14.8 % (15 %), in the fourth – 22.5 % (22.6 %), in the fifth group with the highest incomes, they were 47.8 % (47.3 %).

We can make the following specification for the poorest and the richest deciles: in 2007 the 10 % share of the wealthiest population was equal to 31% of the total money income (in 2006 – 30.6 %), and 10 % of the least wealthy population had only 1.9 % (1.9 %). If we divide 31 % by 1.9 % we shall obtain the decile coefficient equal to 16.3. This gap between the rich and the poor in Russia is obscenely large because in all developed countries the decile coefficient ranges from 6 to 9. For comparison in 1991 in the beginning of “perestroika” (restructuring) in Russia, this coefficient was equal to 4.5 [2].

The Lorenz curve is associated with another characteristic, which is called the Gini coefficient (the index of income concentration). This coefficient is determined as the ratio of the area of the figure lying between the segment  and the curve  to the area of the triangle  in Figure 1. Obviously, in case of uniform distribution of income the Gini coefficient tends to zero, and in case of the extreme income inequality it reaches unity.

The Gini coefficient in Russia in 2007 equaled 0.422 as against 0.416 in 2006. The Gini coefficient is the standard tool for comparing countries with each other in the global economy. It turns out that in terms of income inequality Russia is at the level of Latin American countries [3]. If in Brazil the Gini coefficient is 0.57, in Venezuela – 0.486, then in Belarus it is 0.25, in the Czech Republic – 0.27, in Ukraine – 0.28, in Germany – 0.283, in France – 0.327, in Kazakhstan – 0.33, in the UK – 0.368, in India – about 0.37. Even lower values of the Gini coefficient are in the Scandinavian countries (Sweden and Denmark – about 0.25).

Figure 2 shows changing of the density of the distribution function of the per capita annual income in the Soviet Union and the countries of the former Soviet Union from 1970 to 2000 according to [4].

The function is shown with accuracy of the order of 1 %, as the variation range of the income was divided into 100 intervals according to centiles (each centile contains 1 % of the population, which has only slight difference in the income), and the distribution function is calibrated. This allows us by adding the number of people in 100 intervals to find the total number of people that equals the number of population. In addition, the sum of the products of the number of people and the average income in the intervals gives the total income of the society. The vertical line represents the income level of $ 1 a day, according to the prices in 1985, as the level of absolute poverty according to the definition of the World Bank.

From Figure 2 we see rather complicated structure of the distribution function. If we build this function on the ordinary but not on the logarithmic scale, then we could see the long falling tail in the region of high incomes. In 1897 the Italian economist Vilfredo Pareto tried to present in terms of quantity this decline with the help of the power function of the form [5]:

 

,                                                                                                                        (2)

 

where the index  is of the order of and less than 2,

 is the income of the citizens or businesses.

 

Function (2) is called the Pareto distribution for the number of people (companies) depending on their income, and was intended to analyze the nature of the income inequality in the society. Mathematicians and economists have also tried to describe the tail of the income distribution function by exponential functions. Thus according to the estimates in [6], in the UK and USA the income and the property are distributed mainly exponentially, and only a small part of the richest population satisfies the Pareto distribution.

 

 

However, the most important is the analytical description of the entire distribution function and not just a part of it. Some researchers modeled the general function of the income distribution density by non-parametric methods. For example, in [7] the method of nuclear estimates was used, in [8] and [9] – the diagram method, in [10] – the method of Fourier series. In the conditions when the distribution function is theoretically unknown and therefore can not be written by a simple mathematical formula with a small number of parameters, non-parametric methods give a possibility to structure large data arrays, to estimate the economic inequality and the levels of poverty and wealth. In [11] we can find the nuclear estimates of distribution densities of the logarithms of per capita incomes in nominal calculation in Russia in 1994 – 1997. In the obtained dependences there are a number of peaks, which are not seen in the estimates, based on the available for the public average data of the State Committee on Statistics and the standard log-normal distribution. The latter stands for such representation of the per capita income distribution, which is given by:

 

,                                                                         (3)

 

 

where ,

 is the number equal to the ratio of the perimeter of the circle to the diameter of the circle,

 is the weight of the observed income ,

 is the value of the income with respect to which the distribution is centered.

 

Formula (3) is written by analogy with the Gaussian normal distribution of the random variable for which the probability density is given by:

 

,                                                                                               (4)

 

 

where  is the distribution center for the random variable , which is also the point of maximum distribution and the center of symmetry.

 

Expression (4) for the function  describes the bell-shaped curve. The parameter  is the distance from the vertical line of symmetry, specified by the equation , to the inflection points that are on the right or left wings of the curve.

In Soviet times, the spread of the income of the population in Russia was small and was well modeled by the normal distribution law. But in the market economy with a large private sector the incomes of different groups differ significantly, depending on the level of skills in the achieving the results, on various kinds of talents, proficiency, the value of the employee for the employer, the possibility of fair pay in the corresponding sector of the economy taking into account various accepted norms of pay in different sectors, on the geographic and demographic characteristics, health, or the presence of bringing profit property, land, means of production, stocks and other tangible and intangible rights. The income level is also influenced by discrimination in its various forms, as well as by the objective phenomena (natural disasters, unemployment).

Studies show that the actual income distribution is far from the distribution of the form (4) with a single maximum. Using the sum in distribution (3) and the logarithms of the quantities instead of the quantities themselves improves the situation to some extent, allowing us to describe several peaks in the income distribution. However, we find it difficult to agree with the point of view that the incomes can be considered simple random quantities, in some way distributed around the average values. It is well known that the country’s economy is not just the sum of its sub-systems such as firms, organizations and households, but is something qualitatively different. The presence of logarithms in (3) also points to the special type of distribution, different in its essence from random distribution. Apparently, the quasi-normal logarithmic distribution is only one of the possible approximations to describe the situation, without revealing its internal features.

The general function of the income distribution density

Figure 3 shows the normalized density function of the per capita annual income distribution in the USA for the year 2000, taken from [4]. The range of changing of the income is divided into 100 intervals, each interval corresponds to one centile of population. How can we describe the function of income distribution from Figure 3?

The presented function has two clear maximums and a remarkable detail on the right wing in the region of high incomes. Obviously, neither normal nor log-normal distribution are suitable, as well as the Pareto distribution or simple exponential distribution.

In search of a suitable distribution function we shall refer to the Bose-Einstein function of distribution of the particles by energies, which has the following form:

 

,                                                                                                           (5)

 

 

 

where  is the number of particles in the system, the energy of which is in the narrow interval from  to ,

is the number of corresponding various quantum states of the particle,

 is the chemical potential per particle,

 is the Boltzmann constant,

 is the temperature of the system of particles.

 

Distribution (5) is used for particles – bosons, which are, in particular, the neutral atoms and photons. The initial idea in derivation of (5) in quantum mechanics is the quantization of the possible energies of particles on the one hand, and the exponential dependence of the probability of finding particles in the state with the energy : .

 

 

As the consequence of (5) and of zero chemical potential  for the light, for the spectral distribution of the intensity of the electromagnetic emission inside the black box with temperature  we obtain the Planck's law:

 

,                                                                                       (6)

 

 

 

where  is the speed of light,

 is the Dirac constant ( is the Planck constant),

 is the power of emission passing in one way through the surface area  oriented perpendicular to the direction of emission in the interval of the angular frequency of light from  to .

 

At low energies of photons  as compared to the average energy of atoms , we can expand the exponent in (6) into series and confine ourselves with only the first expansion terms. This gives the following:

 

,                      .                                                                         (7)

 

The last relation in (7) is known as the Rayleigh–Jeans law and it approximates (6) in the low frequency region. We shall suppose now that money and working people in a certain sense have the properties of bosons and therefore can be described by distributions found for bosons. In favor of this hypothesis is the fact that money is the embodiment of the quantity, which is called in physics energy. Electromagnetic quanta or photons transfer electromagnetic energy, and money is the universal measure of the economic value. Changing the amount of energy in physics is associated either with doing the work by the system or on the system, or with accumulation or dissipation of energy. Similarly, changing of the quantity of money in the subject of economic activity is associated with performing paid work and services, and with the processes of accumulation and distribution of money. Photons can be absorbed completely in case of coincidence of their frequency with the resonant frequencies in the atom. The emission from atoms is quantized, since the photons’ energies are equal to the difference between the energies of the levels of atoms. In a solid body the electromagnetic energy can be efficiently converted into other forms of energy.

Similarly, in the society the incomes of employees are also discrete and correlated with the known rates of wages and with the level of work performed. The surplus value of goods and services produced by an employee is equal to the difference between the price of their realization and the cost of paid labor, raw materials, tools, third-party services needed on their production. At the moment of realization of goods we can say that the energy of money (spent on the production of goods and their realization) is converted into the internal energy of goods in the form of its cost. The fact that the cost has not only the objective but also the subjective property follows from the cost estimation procedure by means of the market or expert evaluation. But the presence of subjectivity in evaluating the cost of goods by the seller and the buyer does not mean the absence of contained in the product corresponding energy of cost. In respect of money, just as in many things related to the human society, there is a significant subjective component in the evaluation and adoption of equivalent relations between money, cost, price, etc.

The property of atoms and photons as bosons is that a number of particles of the system can be in the same state at a time. For example, in a small spatial volume a large number of photons with the same energy can be concentrated (the example is laser). In contrast to this, the fermions due to the Pauli principle should have obligatory difference in the states of particles. It is obvious that tangible and intangible values circulating in the society, expressed in the money equivalent, are closer in the properties to bosons than to fermions.

Based on the stated above, we shall assume money similarly to photons to be the carriers of corresponding energy and referring by their properties to bosons. This property will be true for incomes, as they can be expressed in money obtained for a certain period. We shall now use the formula of the type (6) to describe the density distribution function of the per capita annual income in the society. In general terms, we can write it according to [12] as follows:

 

,                                                                                                      (8)

 

 

 

where  is the number of the population referring to the range of income change , located in the interval from to ,

 is a constant coefficient,

 is the quantity, similar in its meaning to the chemical potential in (5),

 is the quantity, specifying the characteristic "temperature" of the considered sector of the economy.

 

With low incomes, when  tends to ,  will tend to zero in the case when the exponent is . The constant  in (8) is determined from the condition that:

 

,                                                                                 (9)

 

 

 

where  is the total population of the region.

 

With  corresponding to the cubic degree with  in (6), from (9) we find: . The sum of the products of the average income  in the i-th income interval and the number of people  in this interval should be equal to the total income of the population: . Moving to the integral, taking into account (8), we have:

 

.                                                                           (10)

 

 

 

As it will be shown below, there are several groups in the society at the same time, which differ significantly from each other in their income and their work intensity. In this regard, we can apply the known in physics superposition principle, assuming each group’s contribution into the total density of distribution  of the annual per capita income in the society relatively independent of each other. Then, instead of (8), we should use the sum of contributions of all groups:

 

,                                                     (11)

 

 

 

where the index  denotes the group number and it ranges from 1 to the number , equal to the number of population groups; the coefficients  are proportional to the relative weight of the respective group; and the coefficients  by their meaning specify the minimum income in each group. The coefficients  correspond to the temperature in (5) and (6), and, according to (7), the greater is  in some group, the larger are the cash flows in this group, all other conditions being equal.

 

In case of several population groups, instead of (9) we can write:

 

,                                  (12)

 

 

 

and if , we can estimate the number of people in each individual group: . For the total income of the entire population, instead of (10), we must have the following:

 

.                                 (13)

 

 

 

Income distribution in the USA in 2000

We shall now apply the function (11) to the analysis of the density of the income distribution function in Figure 3. As we can see, the curve in Figure 3 has at least two maximums. Therefore, we must take the sum of several terms in (11), and find for each of them the quantities . Doing this, we arrive at the following result for the income distribution in the USA in 2000:

 

              (14)

 

 

We obtained four terms in (14), all of which have the same degree in the numerator . The total number of unknown coefficients  for the four terms equals 12, so in order to find them it is enough to take 12 different points on the curve in Figure (3). In this case, its own  corresponds to each , and we substituted these  and  in (11), and then solved a system of 12 equations and determined the coefficients  in (14).

The per capita annual income  in (14) is measured in thousands of dollars, and  – in the millions of people per income interval, corresponding to one centile. In Figure 4, we present four curves based on four functions in (14). The algebraic addition of these curves gives exactly the same envelope as in Figure 3.

 

 

From (14) it follows that the US population consists of four major groups, which mainly form the density of the total income distribution function. The first group apparently consists of low-skilled people, or part-time employees. The incomes of this group are small, on the average about $ 7,000 per year (we shall remind that we analyze available for us data for 2000). The second group is more qualified employees who have obtained the minimum general education and have a specialty. The income of this group at the peak of the corresponding curve is about $ 22,000 a year. The third group mainly consists of people with higher education. The income in this group varies around the average of $ 43,000. Finally, the fourth group includes all highly skilled and highly paid jobs, with an average income of $ 76,000 a year.

In all groups, to this or that degree, there are also businessmen and owners of capitals, who get additional income as interest on their business, dividends or rent. A more precise analysis of these income groups is possible in those areas of income, in which the curves in Figure 4 do not cross.

From (14) we can find the coefficients of some distribution functions: , , , . These coefficients should have the dimension of a million people divided by the fourth power of the income, according to the income scale, divided into centiles. Taking the relations ,  etc.,

 

we find that on the average per one rich American there are 588 low-paid, 30 middle-paid and 6 well-paid Americans. However, these figures are not absolutely accurate, since they would be valid only in case if the coefficients  in all the groups would coincide. If the coefficients  are different, the more informative are the ratios , , etc., where  is from (12).

 

The constants , , ,  in (14) define the lower limit of per capita annual income in thousands of dollars of the corresponding group.

The fact that the found groups exist relatively independent on each other is confirmed by different constants , , ,  in (14). These constants are measured as a per capita annual income in thousands of dollars and similar in their meaning to the temperature in (6).

Apparently, the job market offers such jobs and niches, which in different groups are fundamentally different in their ability to provide income to the employee.

Conclusion

Our goal was to determine the mathematical formulas to describe the dependence of the distribution of annual income per capita. This was achieved in equations (8) – (14). These formulas are in good agreement with the statistical data for the distribution of per capita annual income in the United States in 2000, and four large economically independent groups were discovered in the data.

To some regret, the dependence of the density of the income distribution function presented in Figure 3 is not sufficient in general for the purposes of economic research. The fact is that it was built based on the use of centiles. Each centile contains 1 % of population, and the average income in a given centile depends on the position of the centile on the income axis. Therefore, the width of the income interval  corresponding to some centile is different in different parts of the income axis. Now if we want to use the formulas (12) or (13), substituting in them  from (14), the income intervals  should be transformed into differentials  for further integration. However, the information for such transformation is available only in the statistical data, used by the authors of [4], so that the direct integration by the formulas (12) or (13), without taking into account the difference between  and  in Figure 3, is inaccurate, giving too much deviation. In order to fully use the expansion of the form of (14) into the groups for the density of income distribution function, we must initially have other dependencies of the income distribution. They should be built not on the basis of centiles, but by determining the number of people with the same range of income change , while moving along the income axis . This will lead to some change in the parameters in (14), without changing the quality of the general picture. The result is the ability to use quickly and easily the distributions of the form of (14) in any economic research concerning the obtaining, accumulating and spending money in the society, as well as during the monitoring of the economy.

References

[1]    The socio-economic situation in Russia, the State Statistics Committee of Russia, Moscow, 2007.

[2]    M.I. Zelensky, The origins of right, St. Petersburg, 2003.

[3]    The World Factbook, Field Listing, Distribution of family income, Gini index, 2015. https://www.cia.gov/library/publications/the-world-factbook/rankorder/2172rank.html.

[4]    Xavier Sala-I-Martin, The World Distribution of Income: Falling Poverty and . . . Convergence, Period, Quarterly Journal of Economics, CXXI (2006) 351-397.

[5]    V. Pareto, Cours d’Economie Politique, Vol. 2. F. Rouge, Editeur, Lausanne and Paris, 1897.

[6]    A. Dragulesku, V.M. Yakovenko, Exponential and power-law distributions of wealth and income in the United Kingdom and the United States, Physica A, 299 (2001) 213-221.

[7]    E. Parzen. On the Estimation of a Probability Density Function and Mode, Annals of Math. Statistics, 33 (1962) 1065-1076.

[8]    J. Van Ryzin, Classification and clustering. Academic Press, N.Y., C.F., London, 1977.

[9]    J. Van Ryzin, A Histogram Method of Density Estimation, Comm. Statist. 2 (1973) 493-506.

[10]              E. Wegman, Non-parametric Probability Density Estimation: I. A Summary of Available Methods, Technometrics, 14 (1972) 533-546.

[11]              A.Y. Sheviakov, A.J. Kiruta, Economic inequality, poverty and the standard of living of the Russian population: methods of measurement and analysis of causal relationships, EERC, Moscow, 2001.

[12]              Sergey Fedosin. The physical theories and infinite hierarchical nesting of matter, Volume 2. – LAP LAMBERT Academic Publishing, pages: 420. (2015).

 

Source: http://sergf.ru/fren.htm

Scientific site