Housing prices in China follow Zipf’s law

Yalin He; Bailin Zheng; Yue Kai

doi:10.1371/journal.pone.0324239

Abstract

This study verifies that housing prices in China follow Zipf’s law by analyzing the average housing prices in 200 Chinese cities between 2015 and 2023. The Kolmogorov-Smirnov (KS) test further supports this conclusion. Furthermore, the study explores housing price trends in different city tiers and uses rank clocks to reveal price fluctuations, which is not captured by the Zipf’s law. Using nine-year housing price time series data, the ARIMA and ConvLSTM models generate short-term forecasts. The forecasting results are evaluated using various indicators, and KS tests on the predicted prices show a better conformity to Zipf’s law. This paper also builds a dynamic housing price model, offering a new perspective to understand and predict housing price trends. The results of this paper can shed light on the real estate market.

Citation: He Y, Zheng B, Kai Y (2025) Housing prices in China follow Zipf’s law. PLoS One 20(5): e0324239. https://doi.org/10.1371/journal.pone.0324239

Editor: Gábor Vattay, Eötvös Loránd University, HUNGARY

Received: December 29, 2024; Accepted: April 23, 2025; Published: May 30, 2025

Copyright: © 2025 He et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data supporting the findings of this study are available in figshare at https://doi.org/10.6084/m9.figshare.26968507.v1.

Funding: Shanghai Sailing Program.

Competing interests: The authors have declared that no competing interests exist.

Introduction

George Zipf first systematically proposed Zipf’s law in 1949, which aimed to describe the distribution of the frequency of word usage in the natural language [1, 2]. Subsequently, Zipf’s law has been widely used in many fields [3–8]. For example, Auerbach found that city sizes follow Zipf’s law [9]. Huberman observed that the distribution of web links adheres to Zipf’s law [10], and Robert Axtell revealed that firm sizes in economics align with Zipf’s law [11]. The Zipf’s law is described by the following formula

(1)

where x represents the ranking and is a positive number which is the exponent of the distribution.

Housing prices are influenced by multiple factors, such as population and income, many of which have been shown to follow Zipf’s law [12–14]. It is natural to explore whether estate prices also conform to this law. There has been some progress in related research. Kaizoji [15] first analyzed the land market in Japan and found that its price distribution conformed to Zipf’s law. Coad [16] studied housing prices in London and confirmed that they conformed to Zipf’s law to a certain extent. Ohnishi et al. [17] examined the price distribution of the Japanese real estate market during the bubble period and found that it had a power-law tail. Furthermore, their study demonstrated that US states that experienced real estate bubbles exhibited heavier tail characteristics in their housing price distributions. Blackwell [18] investigated the real estate prices in Charleston County, South Carolina, and also found that they aligned with Zipf’s law to some extent. These studies suggest that prices in some real estate markets may follow Zipf’s law. Therefore, this paper further explores the price distribution characteristics of China’s real estate market.

The distribution of housing prices can also be effectively assessed by comparing the distributions of predicted and actual housing prices. Many scholars have dedicated themselves to researching housing prices, developing a variety of analytical tools and forecasting models [19–21]. These tools leverage historical data to analyze key factors that influence price fluctuations and their interrelationships. For example, the ARCH models by Engle effectively capture housing price dynamics [22]. The Leo Breiman Random Forest model elucidates the complex interplay behind price determinants, achieving precise forecasts [23]. Recently, advances in neural networks and deep learning have further enhanced analytical and predictive capabilities [24–26]. These evolving technologies not only deepen our understanding of the real estate market, but also provide valuable insights for market participants’ decision-making.

As mentioned earlier, significant progress has been made in housing price distribution research, but several key areas remain underexplored. China has the largest housing market worldwide, which has seen remarkable growth since 2015 [27]. Furthermore, while there are abundant researches on Zipf’s law, applying the Zipf’s law to housing price analysis are few, focus on only a few countries and cities. In this paper, we will investigate whether China’s housing prices and their rankings are consistent with Zipf’s law, and also verify the correctness of the conclusions using the KS test.

The purpose of this paper is to explore the distribution of housing prices in the top 200 cities using housing price data from China. Through double logarithmic figures, we preliminarily conclude that housing prices in Chinese cities follow Zipf’s law. To statistically verify this conclusion, this paper uses the KS test for validation. Furthermore, we compare different prediction models and select an appropriate model for short-term housing price forecasting. After analyzing the distribution of the prediction results, we find that it also follows Zipf’s law. Finally, we propose a housing price dynamics model, which aims to better explain the underlying mechanism of housing price distribution and provide theoretical support for housing price research.

Zipf’s law and KS test

The data for this paper are sourced from the Anjuke website, which has been collecting housing prices and rankings for more than 200 Chinese cities since 2015, accumulating nine years of data to date. All data are available on FigShare (DOI: 10.6084/m9.figshare.26968507.v1). The data were preprocessed to ensure quality. Outliers with excessively high prices have not been removed, considering the specificity of housing price data. This paper uses housing price and ranking data from 2015 to 2023, where housing price data represent the average annual price in each city. To ensure comparability across years, the top 200 cities by housing price ranking are selected for analysis each year.

From the data collected, it can be observed that annual housing prices in Chinese cities have exhibited some volatility. Although the overall trend has shown a steady increase, there has been a decline in recent years. For example, in the first-tier city of Guangzhou, housing prices and rankings from 2015 to 2023 are as follows: RMB 19,855 (ranked 5th), RMB 22,742 (ranked 7th), RMB 28,349 (ranked 7th), RMB 31,831 (ranked 4th), RMB 31,423 (ranked 5th), RMB 30,726 (ranked 5th), RMB 34,690 (ranked 5th), RMB 33,294 (ranked 6th) and RMB 30,953 (ranked 5th). In the case of the fourth-tier city of Hegang, the housing prices and rankings over the same period are as follows. RMB 4,033 (ranked 177th), RMB 3,716 (ranked 236th), RMB 3,922 (ranked 264th), RMB 3,400 (ranked 296th), RMB 3,100 (ranked 349th), RMB 2,022 (ranked 350th), RMB 2,162 (ranked 356th), RMB 2,077 (ranked 357th) and RMB 1,927 (ranked 364th).

In addition, we find that housing prices in most Chinese cities are concentrated below RMB 10,000, with only a few cities showing extremely high prices. Due to the large number of small and medium cities, their housing prices tend to be more centralized, with smaller differences. In contrast, cities at the top of the housing price rankings exhibit significant differences in housing prices. For example, the difference in housing prices between Beijing and Guangzhou exceeds RMB 25,000 in 2023. The gap between the top-ranked and the bottom-ranked city is even more astonishing, at more than RMB 50,000.

Figs 1–9 present a double logarithmic plot of China’s housing prices and rankings for the period 2015 to 2023. The results suggest that housing prices in China largely conform to Zipf’s law [28]. The corresponding exponent values are shown in Table 1. The exponent values in Table 1 indicate that the differences in housing prices among Chinese cities gradually narrowed between 2015 and 2023. This trend reflects the rapid development of China’s real estate market over the past nine years, during which many cities have experienced significant increases in housing prices, in some cases increasing several times. As prices in different cities converge, the gap in overall average city prices naturally decreases. This phenomenon aligns with real-world observations and suggests that China’s real estate market is transitioning from regional differentiation to more balanced development. The R-squared values for Figs 1–9, each exceeding 0.95, suggest that the model closely matches the observed data, indicating excellent goodness of fit.

Download:

Fig 1. 2015 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g001

Download:

Fig 2. 2016 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g002

Download:

Fig 3. 2017 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g003

Download:

Fig 4. 2018 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g004

Download:

Fig 5. 2019 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g005

Download:

Fig 6. 2020 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g006

Download:

Fig 7. 2021 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g007

Download:

Fig 8. 2022 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g008

Download:

Fig 9. 2023 Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g009

Download:

Table 1. Distributional estimates of housing prices in China.

https://doi.org/10.1371/journal.pone.0324239.t001

The KS test is performed by comparing the maximum difference between the cumulative distribution function (CDF) of the actual data and the theoretically fitted CDF [29–31]. Specifically, it assesses the maximum deviation between these two CDFs. The KS statistic and the p-value are calculated to determine whether the distribution of Chinese housing prices significantly follows Zipf’s law. The KS test statistic is computed using the following formula

(2)

where H(x) represents the empirical distribution function, D(x) represents the hypothetical distribution function, and refers to the dataset we analyze.

The results obtained from the KS test on China’s housing price data from 2015 to 2023 are presented in Table 1. The KS statistic values are all below 0.03, and the p-values are all above 0.9, providing insufficient evidence to reject the hypothesis that China’s housing prices and rankings follow Zipf’s law [32]. Based on these findings, it is reasonable to assume that Zipf’s law accurately describes the data.

Rank clock

Rank clock offer a method for visualizing dynamic data, especially useful for tracking and displaying changes over time [33]. The rank clock complements Figs 10–11, which might not effectively capture subtle dynamic changes. However, the rank clock excels at capturing and clearly displaying these nuances [34]. For example, the rank clock can plot and analyze the trend of housing prices in Chinese cities over time, showing how prices evolve anticlockwise in polar coordinates. By observing the fluctuations and crossovers of the trajectories, we can visually discern the rise, fall, and stability of housing prices in each city. We selected samples from first-tier cities in China (Beijing, Shanghai, Guangzhou, Shenzhen) and representatives of fifth-tier cities (Tonghua, Jixi, Ezhou, Loudi), displayed in Figs 10–11. Different colored lines are used to distinguish cities, with housing prices represented by the radial distance in polar coordinates.

Download:

Fig 10. China’s first-tier cities housing prices rank clock.

https://doi.org/10.1371/journal.pone.0324239.g010

Download:

Fig 11. China’s fifth-tier cities housing prices rank clock.

https://doi.org/10.1371/journal.pone.0324239.g011

As shown in Figs 10–11, at the macro level, housing prices in Chinese cities experienced rapid growth from 2015 to 2016. Compared to fifth-tier cities, China’s super first-tier cities not only have more stable housing prices but also demonstrate greater resilience to unexpected events. From a microperspective, housing prices in China’s first-tier cities have entered a relatively stable state after rapid growth and their rankings remain stable. In contrast, housing prices in fifth-tier cities have fluctuated considerably. Comparison of rank clock plots provides a different perspective on housing prices in Chinese cities.

Short-term forecast of housing prices

In this section, a time series dataset is constructed using data from the top 200 Chinese cities in terms of housing prices between 2015 and 2023, with 70% of the data used for the training set and 30% for the test set. We found that the traditional LSTM model may struggle to capture the underlying trends in the data. To improve the model’s prediction performance, we constructed a ConvLSTM model by incorporating 2D convolution and merging layers into the traditional LSTM model. Furthermore, since our time series data do not exhibit seasonality, we selected the ARIMA model for comparative forecasting [35, 36].

First, we present the results of the ConvLSTM model. After using Keras Tuner and performing several experimental optimizations [37], we finally selected the following parameter configurations: the number of LSTM layers is 3, with 256 kernels in the first layer, 128 kernels in the second layer and 64 kernels in the third layer. All kernels have a size of (1, 1), and the merge window size is (1, 1, 1, 1). The specific configurations are as follows: the number of filters in each ConvLSTM2D layer ranges from 32 to 256, in increments of 32, as determined by Keras Tuner optimization. The final fully connected Dense layer has 1 neuron. The dropout rate for each layer ranges from 0.2 to 0.5, with the exact values determined by Keras Tuner. After testing the model in the test set, the and precision were 0.9851 and 97.64%, respectively. For the ARIMA model, we debugged and arrived at the ARIMA (2, 1, 2) configuration. After testing the ARIMA model on the test set, the and accuracy were 0.9943 and 96.84%, respectively.

From the overall results, the ConvLSTM model outperforms the ARIMA model in many aspects of predicting housing prices in Chinese cities. Therefore, we selected the ConvLSTM model for future predictions of housing prices. Consequently, we used the model to predict housing prices in 200 Chinese cities for the next three years, plotting the double logarithmic figure and performing the KS test. The results shown in Figs 12–14. and Table 2, indicate that the predicted housing prices for the next three years conform well to Zipf’s law, further supporting the hypothesis that housing prices and their rankings in Chinese cities follow Zipf’s law.

Download:

Fig 12. 2024 Predicted Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g012

Download:

Fig 13. 2025 Predicted Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g013

Download:

Fig 14. 2026 Predicted Chinese housing prices and rankings in log–log scale.

https://doi.org/10.1371/journal.pone.0324239.g014

Download:

Table 2. Distributional estimation of forecasted housing prices in China.

https://doi.org/10.1371/journal.pone.0324239.t002

Housing price dynamic model

In economics, it is a common assumption that logarithmic changes in prices satisfy certain laws [38], we assume that the log change in housing prices in Chinese cities associated with housing price rankings satisfies the following conditions

(3)

where i represents the ranking of housing prices in China (the city with the highest housing price i = 1, the city with the second highest housing price i = 2, and so on), u(i) is consistently positive and monotonically decreasing, so we can write it as , where h(i) is consistently positive and monotonically increasing

(4)

we assume i = nl, when n is an extremely large value, l is an extremely small value, we can written as

(5)

We discuss the case where P(i) is a homogeneous function, and we get , which follows

(6)

suppose h(i) is linear function, let h(i) = h₁nl, then Eq. 6 can be reduced into the following equation

(7)

multiply both sides of the equation by n, we can derive

(8)

when n tends to infinity, we can get

(9)

which yields,

(10)

we can get solution

(11)

We can think of l as a quantity related to ranking, randomly assigned an integer value between 1 and 200. The constant h₁ varies between 1.83 and 1.88, specifically taking values such as 1.83, 1.84, 1.85, 1.86, 1.87 and 1.88. The constant c is set to 1,000,00 to calculate housing prices that closely match the actual data. After fitting the data, the results are shown in Figs 15–20, with the corresponding exponent values and R-square values listed in Table 3. The results in Figs 15–20 and Table 3 shown that the housing prices obtained by Eq. 11 follow Zipf’s law.

Download:

Fig 15. When h₁ = 1.83.

https://doi.org/10.1371/journal.pone.0324239.g015

Download:

Fig 16. When h₁ = 1.84.

https://doi.org/10.1371/journal.pone.0324239.g016

Download:

Fig 17. When h₁ = 1.85.

https://doi.org/10.1371/journal.pone.0324239.g017

Download:

Fig 18. When h₁ = 1.86.

https://doi.org/10.1371/journal.pone.0324239.g018

Download:

Fig 19. When h₁ = 1.87.

https://doi.org/10.1371/journal.pone.0324239.g019

Download:

Fig 20. When h₁ = 1.88.

https://doi.org/10.1371/journal.pone.0324239.g020

Download:

Table 3. Fitting of housing price distributions calculated by the dynamic model.

https://doi.org/10.1371/journal.pone.0324239.t003

Conclusion

In this paper, we focus on the housing prices and ranking of Chinese cities from 2015 to 2023. By analyzing the annual housing prices and rankings using double logarithmic figures, we initially determined that they follow Zipf’s law. Further, we use the KS test to calculate the KS statistic and p-value, which verify that the housing prices and rankings of Chinese cities indeed follow Zipf’s law. In addition, we employ the rank clock to explore trends and fluctuations in housing prices in China’s first-tier cities and some fifth-tier cities, revealing characteristics of housing prices that are not captured in the previous figure. To further investigate whether the housing prices and rankings of Chinese cities will continue to follow Zipf’s law in the future, we use various models to make short-term forecasts for housing prices. Upon comparing the prediction metrics, we finally select the ConvLSTM model for short-term forecasting. The KS tests of the predicted results confirm that the foretasted housing prices also closely adhere to Zipf’s law. Finally, we construct a housing price dynamic model to provide new perspectives on housing price forecasting and research.

References

1. Zipf GK. Human behavior and the principle of least effort: an introduction to human ecology. Martino Fine Books; 1949.
2. Chang Y-W. Influence of human behavior and the principle of least effort on library and information science research. Inf Process Manage. 2016;52(4):658–69.
- View Article
- Google Scholar
3. Li Y-B, Du Y, Liu F-Z, Zhang Y-Y, Li M, Wang J, et al. Applicability of Zipf’s Law in traditional Chinese medicine prescriptions. Chin Med Sci J. 2022;37(3):195–200. pmid:36321174
- View Article
- PubMed/NCBI
- Google Scholar
4. Wan G, Zhu D, Wang C, Zhang X. The size distribution of cities in China: evolution of urban system and deviations from Zipf’s Law. Ecol Indic. 2020;111:106003.
- View Article
- Google Scholar
5. Arshad S, Hu S, Ashraf BN. Zipf’s Law, the coherence of the urban system and city size distribution: evidence from Pakistan. Phys A Stat Mech Appl. 2019;513:87–103.
- View Article
- Google Scholar
6. Luckstead J, Devadoss S. Do the world’s largest cities follow Zipf’s and Gibrat’s laws? Econ Lett. 2014;125(2):182–6.
- View Article
- Google Scholar
7. Eftekhari A. Fractal geometry of texts: an initial application to the works of Shakespeare. J Quant Linguist. 2006;13(2–3):177–93.
- View Article
- Google Scholar
8. Sun X, Yuan O, Xu Z, Yin Y, Liu Q, Wu L. Did Zipf’s Law hold for Chinese cities and why? Evidence from multi-source data. Land Use Policy. 2021;106:105460.
- View Article
- Google Scholar
9. Rybski D, Ciccone A. Auerbach, Lotka, and Zipf: pioneers of power-law city-size distributions. Arch Hist Exact Sci. 2023;77(6):601–13.
- View Article
- Google Scholar
10. Adamic LA, Huberman BA. Zipf’s Law and the internet. Glottometrics. 2002;3:143–50.
- View Article
- Google Scholar
11. Axtell RL. Zipf distribution of U.S. firm sizes. Science. 2001;293(5536):1818–20. pmid:11546870
- View Article
- PubMed/NCBI
- Google Scholar
12. Jiang B, Yin J, Liu Q. Zipf’s Law for all the natural cities around the world. Int J Geogr Inf Sci. 2015;29(3):498–522.
- View Article
- Google Scholar
13. González-Val R. Deviations from Zipf’s Law for American cities. Urban Studies. 2010;48(5):1017–35.
- View Article
- Google Scholar
14. Giesen K, Sudekum J. Zipf’s Law for cities in the regions and the country. J Econ Geogr. 2010;11(4):667–86.
- View Article
- Google Scholar
15. Kaizoji T. Scaling behavior in land markets. Phys A Stat Mech Appl. 2003;326(1–2):256–64.
- View Article
- Google Scholar
16. Coad A. On the distribution of product price and quality. J Evol Econ. 2009;19(4):589–604.
- View Article
- Google Scholar
17. Ohnishi T, Mizuno T, Shimizu C, Watanabe T. Power laws in real estate prices during bubble periods. Int J Mod Phys Conf Ser. 2012;16:61–81.
- View Article
- Google Scholar
18. Blackwell C. Power laws in real estate prices? Some evidence. Q Rev Econ Finance. 2018;69:90–8.
- View Article
- Google Scholar
19. Zhan C, Liu Y, Wu Z, Zhao M, Chow TWS. A hybrid machine learning framework for forecasting house price. Expert Syst Appl. 2023;233:120981.
- View Article
- Google Scholar
20. Bork L, Møller SV. Forecasting house prices in the 50 states using dynamic model averaging and dynamic model selection. Int J Forecast. 2015;31(1):63–78.
- View Article
- Google Scholar
21. Hjort A, Scheel I, Sommervoll DE, Pensar J. Locally interpretable tree boosting: an application to house price prediction. Decis Support Syst. 2024;178:114106.
- View Article
- Google Scholar
22. Engle R. GARCH 101: the use of ARCH/GARCH models in applied econometrics. J Econ Perspect. 2001;15(4):157–68.
- View Article
- Google Scholar
23. Breiman L. Random forests. Mach Learn. 2001;45:5–32.
- View Article
- Google Scholar
24. Xu X, Zhang Y. House price forecasting with neural networks. Intell Syst Appl. 2021;12:200052.
- View Article
- Google Scholar
25. Xu X, Zhang Y. Residential housing price index forecasting via neural networks. Neural Comput Appl. 2022;34(17):14763–76.
- View Article
- Google Scholar
26. Wang L, Wang G, Yu H, Wang F. Prediction and analysis of residential house price using a flexible spatiotemporal model. J Appl Econ. 2022;25(1):503–22.
- View Article
- Google Scholar
27. Li Q, Chand S. House prices and market fundamentals in urban China. Habitat Int. 2013;40:148–53.
- View Article
- Google Scholar
28. Matteo Marsili, Yicheng Zhang. Interacting individuals leading to Zipf’s Law. Phys Rev Lett. 1998;80:2741.
- View Article
- Google Scholar
29. Lilliefors HW. On the Kolmogorov-Smirnov test for the exponential distribution with mean unknown. J Am Stat Assoc. 1969;64(325):387–9.
- View Article
- Google Scholar
30. Drezner Z, Turel O, Zerom D. A modified Kolmogorov–Smirnov test for normality. Commun Stat. 2010;39(4):693–704.
- View Article
- Google Scholar
31. Mora-López L, Mora J. An adaptive algorithm for clustering cumulative probability distribution functions using the Kolmogorov–Smirnov two-sample test. Expert Syst Appl. 2015;42(8):4016–21.
- View Article
- Google Scholar
32. Zhang J, Chen Q, Wang Y. Zipf distribution in top Chinese firms and an economic explanation. Phys A Stat Mech Appl. 2009;388(10):2020–4.
- View Article
- Google Scholar
33. Batty M. Rank clocks. Nature. 2006;444(7119):592–6. pmid:17136088
- View Article
- PubMed/NCBI
- Google Scholar
34. Guo J, Xu Q, Chen Q, Wang Y. Firm size distribution and mobility of the top 500 firms in China, the United States and the world. Phys A Stat Mech Appl. 2013;392(13):2903–14.
- View Article
- Google Scholar
35. Tolesh F, Biloshchytska S. Forecasting international migration in Kazakhstan using ARIMA models. Procedia Comput Sci. 2024;231:176–83.
- View Article
- Google Scholar
36. Zhang J, Liu H, Bai W, Li X. A hybrid approach of wavelet transform, ARIMA and LSTM model for the share price index futures forecasting. North Am J Econ Finance. 2024;69:102022.
- View Article
- Google Scholar
37. Wen T, Liu Y, Bai YH, Liu H. Modeling and forecasting CO2 emissions in China and its regions using a novel ARIMA-LSTM model. Heliyon. 2023;9(11):e21241. pmid:37954263
- View Article
- PubMed/NCBI
- Google Scholar
38. Rosenberg B. Statistical analysis of price series obscured by averaging measures. J Financ Quant Anal. 2009;6(4):1083–94.
- View Article
- Google Scholar

[ref1] 1. Zipf GK. Human behavior and the principle of least effort: an introduction to human ecology. Martino Fine Books; 1949.

[ref2] 2. Chang Y-W. Influence of human behavior and the principle of least effort on library and information science research. Inf Process Manage. 2016;52(4):658–69.
View Article
Google Scholar

[3] View Article

[4] Google Scholar

[ref3] 3. Li Y-B, Du Y, Liu F-Z, Zhang Y-Y, Li M, Wang J, et al. Applicability of Zipf’s Law in traditional Chinese medicine prescriptions. Chin Med Sci J. 2022;37(3):195–200. pmid:36321174
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref4] 4. Wan G, Zhu D, Wang C, Zhang X. The size distribution of cities in China: evolution of urban system and deviations from Zipf’s Law. Ecol Indic. 2020;111:106003.
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref5] 5. Arshad S, Hu S, Ashraf BN. Zipf’s Law, the coherence of the urban system and city size distribution: evidence from Pakistan. Phys A Stat Mech Appl. 2019;513:87–103.
View Article
Google Scholar

[13] View Article

[14] Google Scholar

[ref6] 6. Luckstead J, Devadoss S. Do the world’s largest cities follow Zipf’s and Gibrat’s laws? Econ Lett. 2014;125(2):182–6.
View Article
Google Scholar

[16] View Article

[17] Google Scholar

[ref7] 7. Eftekhari A. Fractal geometry of texts: an initial application to the works of Shakespeare. J Quant Linguist. 2006;13(2–3):177–93.
View Article
Google Scholar

[19] View Article

[20] Google Scholar

[ref8] 8. Sun X, Yuan O, Xu Z, Yin Y, Liu Q, Wu L. Did Zipf’s Law hold for Chinese cities and why? Evidence from multi-source data. Land Use Policy. 2021;106:105460.
View Article
Google Scholar

[22] View Article

[23] Google Scholar

[ref9] 9. Rybski D, Ciccone A. Auerbach, Lotka, and Zipf: pioneers of power-law city-size distributions. Arch Hist Exact Sci. 2023;77(6):601–13.
View Article
Google Scholar

[25] View Article

[26] Google Scholar

[ref10] 10. Adamic LA, Huberman BA. Zipf’s Law and the internet. Glottometrics. 2002;3:143–50.
View Article
Google Scholar

[28] View Article

[29] Google Scholar

[ref11] 11. Axtell RL. Zipf distribution of U.S. firm sizes. Science. 2001;293(5536):1818–20. pmid:11546870
View Article
PubMed/NCBI
Google Scholar

[31] View Article

[32] PubMed/NCBI

[33] Google Scholar

[ref12] 12. Jiang B, Yin J, Liu Q. Zipf’s Law for all the natural cities around the world. Int J Geogr Inf Sci. 2015;29(3):498–522.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref13] 13. González-Val R. Deviations from Zipf’s Law for American cities. Urban Studies. 2010;48(5):1017–35.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref14] 14. Giesen K, Sudekum J. Zipf’s Law for cities in the regions and the country. J Econ Geogr. 2010;11(4):667–86.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref15] 15. Kaizoji T. Scaling behavior in land markets. Phys A Stat Mech Appl. 2003;326(1–2):256–64.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref16] 16. Coad A. On the distribution of product price and quality. J Evol Econ. 2009;19(4):589–604.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref17] 17. Ohnishi T, Mizuno T, Shimizu C, Watanabe T. Power laws in real estate prices during bubble periods. Int J Mod Phys Conf Ser. 2012;16:61–81.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref18] 18. Blackwell C. Power laws in real estate prices? Some evidence. Q Rev Econ Finance. 2018;69:90–8.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref19] 19. Zhan C, Liu Y, Wu Z, Zhao M, Chow TWS. A hybrid machine learning framework for forecasting house price. Expert Syst Appl. 2023;233:120981.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref20] 20. Bork L, Møller SV. Forecasting house prices in the 50 states using dynamic model averaging and dynamic model selection. Int J Forecast. 2015;31(1):63–78.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref21] 21. Hjort A, Scheel I, Sommervoll DE, Pensar J. Locally interpretable tree boosting: an application to house price prediction. Decis Support Syst. 2024;178:114106.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref22] 22. Engle R. GARCH 101: the use of ARCH/GARCH models in applied econometrics. J Econ Perspect. 2001;15(4):157–68.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref23] 23. Breiman L. Random forests. Mach Learn. 2001;45:5–32.
View Article
Google Scholar

[68] View Article

[69] Google Scholar

[ref24] 24. Xu X, Zhang Y. House price forecasting with neural networks. Intell Syst Appl. 2021;12:200052.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref25] 25. Xu X, Zhang Y. Residential housing price index forecasting via neural networks. Neural Comput Appl. 2022;34(17):14763–76.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref26] 26. Wang L, Wang G, Yu H, Wang F. Prediction and analysis of residential house price using a flexible spatiotemporal model. J Appl Econ. 2022;25(1):503–22.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref27] 27. Li Q, Chand S. House prices and market fundamentals in urban China. Habitat Int. 2013;40:148–53.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref28] 28. Matteo Marsili, Yicheng Zhang. Interacting individuals leading to Zipf’s Law. Phys Rev Lett. 1998;80:2741.
View Article
Google Scholar

[83] View Article

[84] Google Scholar

[ref29] 29. Lilliefors HW. On the Kolmogorov-Smirnov test for the exponential distribution with mean unknown. J Am Stat Assoc. 1969;64(325):387–9.
View Article
Google Scholar

[86] View Article

[87] Google Scholar

[ref30] 30. Drezner Z, Turel O, Zerom D. A modified Kolmogorov–Smirnov test for normality. Commun Stat. 2010;39(4):693–704.
View Article
Google Scholar

[89] View Article

[90] Google Scholar

[ref31] 31. Mora-López L, Mora J. An adaptive algorithm for clustering cumulative probability distribution functions using the Kolmogorov–Smirnov two-sample test. Expert Syst Appl. 2015;42(8):4016–21.
View Article
Google Scholar

[92] View Article

[93] Google Scholar

[ref32] 32. Zhang J, Chen Q, Wang Y. Zipf distribution in top Chinese firms and an economic explanation. Phys A Stat Mech Appl. 2009;388(10):2020–4.
View Article
Google Scholar

[95] View Article

[96] Google Scholar

[ref33] 33. Batty M. Rank clocks. Nature. 2006;444(7119):592–6. pmid:17136088
View Article
PubMed/NCBI
Google Scholar

[98] View Article

[99] PubMed/NCBI

[100] Google Scholar

[ref34] 34. Guo J, Xu Q, Chen Q, Wang Y. Firm size distribution and mobility of the top 500 firms in China, the United States and the world. Phys A Stat Mech Appl. 2013;392(13):2903–14.
View Article
Google Scholar

[102] View Article

[103] Google Scholar

[ref35] 35. Tolesh F, Biloshchytska S. Forecasting international migration in Kazakhstan using ARIMA models. Procedia Comput Sci. 2024;231:176–83.
View Article
Google Scholar

[105] View Article

[106] Google Scholar

[ref36] 36. Zhang J, Liu H, Bai W, Li X. A hybrid approach of wavelet transform, ARIMA and LSTM model for the share price index futures forecasting. North Am J Econ Finance. 2024;69:102022.
View Article
Google Scholar

[108] View Article

[109] Google Scholar

[ref37] 37. Wen T, Liu Y, Bai YH, Liu H. Modeling and forecasting CO2 emissions in China and its regions using a novel ARIMA-LSTM model. Heliyon. 2023;9(11):e21241. pmid:37954263
View Article
PubMed/NCBI
Google Scholar

[111] View Article

[112] PubMed/NCBI

[113] Google Scholar

[ref38] 38. Rosenberg B. Statistical analysis of price series obscured by averaging measures. J Financ Quant Anal. 2009;6(4):1083–94.
View Article
Google Scholar

[115] View Article

[116] Google Scholar

Figures

Abstract

Introduction

Zipf’s law and KS test

Rank clock

Short-term forecast of housing prices

Housing price dynamic model

Conclusion

References