Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Identification and analysis of urban functional area in Hangzhou based on OSM and POI data

  • Ziyi Wang ,

    Contributed equally to this work with: Ziyi Wang, Debin Ma

    Roles Conceptualization, Methodology, Supervision, Writing – original draft

    Affiliations School of Geography, Geomatics and Planning, Jiangsu Normal University, Xuzhou, China, School of Architecture and Urban Planning, Nanjing University, Nanjing, China

  • Debin Ma ,

    Contributed equally to this work with: Ziyi Wang, Debin Ma

    Roles Conceptualization, Investigation, Methodology, Visualization, Writing – original draft

    Affiliation School of Geography, Geomatics and Planning, Jiangsu Normal University, Xuzhou, China

  • Dongqi Sun ,

    Roles Formal analysis, Supervision, Writing – review & editing

    sundq@igsnrr.ac.cn

    Affiliation Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Science, Beijing, China

  • Jingxiang Zhang

    Roles Conceptualization, Writing – review & editing

    Affiliation School of Architecture and Urban Planning, Nanjing University, Nanjing, China

Abstract

The accurate identification of urban functional areas is of great significance for optimizing urban spatial structure, rationally allocating spatial elements, and promoting the sustainable development of the city. This paper proposes a method to precisely identify urban functional areas by coupling Open Street Map (OSM) and Point of Interest (POI) data. It takes the central urban area of Hangzhou as a case study to analyze the spatial distribution characteristics of the functional areas. The results show that: (1) The central urban areas of Hangzhou are divided into 21 functional areas (6 single functional areas, 14 mixed functional areas and 1 comprehensive functional area). (2) The single functional areas and the mixed functional areas show the geographical distribution characteristics of the looping stratification, which means “Core-periphery” differentiation is obvious, and the comprehensive functional area is relatively scattered. (3) The mixed degree of regional function with ecological function and production function is low while comprehensive functional areas are usually associated with higher potential and vitality. (4) The identification results are in great agreement with the actual situation of Hangzhou central urban area, and the method is feasible. Therefore, this paper can provide a reference for urban development planning and management.

Introduction

The functional area is the basic unit of urban planning, management and resource allocation. Urban functional areas are important geospatial attributes of urban land, in which people carry out various socio-economic activities, which are usually determined by two perspectives of land use type and human activities, including residential land, industrial land, commercial and business facilities land [1, 2, 7]. As the basic spatial unit of urban development, the concept of urban functional area originated from the "functionalism" planning idea established by the Athens Charter, which emphasizes the clear structure of the city, pays attention to the functional areas division and the purification of use [3]. Diversified urban functions bring convenience to the life, work, recreation, and communication of urban residents, which is the foundation and charm of urban sustainable development [4, 5]. In 2019, China’s urbanization rate exceeded 60%. With the accelerating process of urbanization, various elements gather and diffuse in different spaces of the city [6] and result in functional differentiation at different regional scales. In addition, unreasonable urban planning leads to such problems as a single functional urban structure, spatial differentiation, lack of affection and care and so on, which causes the loss of urban vitality and a series of urban environmental and social problems. Therefore, the accurate identification of space and social structure of the urban, the rational division of functional urban areas has become an important subject of current research. It is of great significance for coordinating the relationship between humans and land, optimizing the urban spatial strategy and improving the level of urban planning [7]. At the same time, the reasonable division of urban functional areas has certain guiding value for solving many urban diseases, such as traffic congestion, air pollution, waste of land resources, climate change and so on [810].

The traditional urban function zoning is mainly based on remote sensing images, land use data, panel data and so on [11]. Zhong et al. proposed a semantic allocation level multi-feature fusion strategy for high spatial resolution image scene classification which is based on the probabilistic topic model [12]. Zhang et al. proposed a complete hybrid scene decomposition system to decompose the high-resolution remote sensing images of Beijing and Zhuhai [13]. However, the traditional method of remote sensing can only classify urban functions based on the natural properties of the land. With the enhancement of the data acquisition capability, breaking the traditional idea of functional zoning, fully mining the urban social and cultural information contained in big data, and establishing the methodology system for identifying urban functional areas has become an innovation direction of urban geography, which especially in the context of information [14]. Liu collected 7-day taxi trajectories in Shanghai to study the temporal variation of boarding and alighting and its relationship with different land-use characteristics [15]. Pei et al. collected mobile phone data in Singapore to identify residents’ daily travel activities to reflect the social function of land use [16]. Jia et al. combined remote sensing image (RSI) of a large area with mobile phone positioning data (MPPD) and applied this framework to the center of Beijing [17]. Song et al. explored the use of Singapore’s parks based on the number of photos and visual content of the geographical location of the park (from Instagram and Flickr platforms) [18]. However, there are data acquisition difficulties and strong subjectivity problems in urban functional area identification based on remote sensing or survey data. The big data of travel or pictures can only be roughly divided into urban functional areas, which cannot accurately analyze the spatial structure of urban functions, nor can they provide the precise implementation to solve the problems of land use function [19, 20]. At the same time, these methods are limited by the number of users and the accuracy of the user footprint location, so the accuracy of the classification results is poor [21].

Compared with the above studies, the POI data is a kind of point-shaped geographic space big data of real geographic entities. The results of this research could finely characterize the dynamic and real-time nature of urban land functions, and reflect the diversity and mixed degree of various facilities [22, 23]. Zhai et al. improved the functional area identification framework by constructing the Place2vec model to capture POI geographic information [24]. Han et al. used POI data to identify single urban functional areas in Beijing [25]. Ran et al. conducted an in-depth analysis of the spatial pattern of the life service industry in Changsha based on POI data [26]. However, most of the present researches on the identification of urban functional areas which based on POI data are focused on the identification of the single function, and lack of in-depth division of mixed and comprehensive land-use functions. At the same time, in the division of the basic unit of functional area identification, present researches mainly divided into cells by grids, and lack of multi-functional identification of the actual land use units.

Therefore, this paper takes Open Street Map (OSM) road network data and POI data as the main data source, and proposes a method for accurate identification of urban functional areas based on kernel density estimation, functional area identification, mixed degree calculation and other technical means. Taking the main urban area of Hangzhou as a case, this paper further identifies urban single functional land, mixed and comprehensive functional land, and analyzes the distribution characteristics of the functional structure in the research area. The improvements of this study are shown as follows: 1) Taking the irregular grid formed by the road network as the research unit, the segmentation of urban functional areas is more reasonable. 2) The kernel density method reduces the noise effect of POI data, improves the recognition accuracy of mixed functional areas and weakens the discretization of results. 3) The recognition model considers the mixed functional areas and comprehensive functional areas to make the results more in line with the real situation. 4) The mixed degree calculation and accuracy verification of the identification results of urban functional areas are conducted to make the results more convincing. In terms of literature, it is expected to provide a typical case and research methods for the research of urban spatial structure in China driven by spatial big data. In practice, it provides the reference for urban managers to understand the urban spatial structure, accurately grasp the development status of urban functions, and formulate reasonable urban planning schemes.

Study area and methods

Study area: The central urban area of Hangzhou, China

Hangzhou, the capital city of Zhejiang Province, is located in the northern part of Zhejiang Province, the lower reaches of the Qiantang River, and the southern end of the Beijing-Hangzhou Grand Canal. The central geographical coordinates are E120°19’, N30°26’. The city governs 10 districts, 2 counties and 1 county-level city, with a total area of 16853.57km2. It is and one of the central cities in the Yangtze River Delta. In 2019, the city’s GDP reached 222.85 billion US dollars, with a permanent population of 10.36 million. The central urban areas are located in the northeast of Hangzhou, including Xihu District, Gongshu District, Shangcheng District, Binjiang District, Jianggan District, and Xiacheng District, with a total area of 706.27km2. The central urban areas are concentrated areas of business, population and services in Hangzhou. The urbanization process of Hangzhou is the epitome of China’s rapid development. At the same time, the urban development mode of Hangzhou, which takes into account science and technology, industry, humanities and ecology, represents the development direction of the future city. Moreover, the rapid change of urban functions of such cities brings challenges to urban management and operation. Therefore, this area is selected as the research area. An overview of the research area is shown in Fig 1.

thumbnail
Fig 1. Map of the study area.

The administrative division data are from the 1:250,000 basic geographic database provided the National Geomatics Center of China (http://www.ngcc.cn/ngcc/html/1//391/392/16114.html), these data are available free of charge.

https://doi.org/10.1371/journal.pone.0251988.g001

Data source and process

Before collecting the open source data required for the article, we have carefully examined all relevant platform service terms to ensure that our research is fully compliant with the agreement. The administrative division data is from the 1:250,000 basic geographic database provided the National Geomatics Center of China (http://www.ngcc.cn/ngcc/html/1//391/392/16114.html), the satellite map data comes from Auto Navi Map (https://www.amap.com/), and they all can be obtained free of charge. The urban road network data of Hangzhou comes from the Open Street Map (OSM) geographic data platform (https://www.openstreetmap.org/). OSM aims to provide users with free and easy-to-access digital map resources, which is currently the most popular spontaneous geographic information platform. We preprocessed the collected OSM road network data: select highways, railways, township main roads, urban main roads and roads, delete sidewalks, residential lanes, chaotic road network and other lines which are less important and do not affect the analysis results, delete duplicate lines. Filter to delete other redundant paths and broken paths and paths below 100 meters, and finally, fill the broken paths. POI (Point of Interest) data comes from AutoNavi Maps Open Platform (https://lbs.amap.com/). AutoNavi Maps is China’s leading provider of digital map content, navigation, and location services. This research uses the open API interface provided by AutoNavi Maps to obtain 168607 POI data of six districts in Hangzhou in December 2020, and each POI data contains attribute information such as geographical entity name, address, affiliation type, longitude, and latitude, administrative region and so on. There are many problems in the original POI data, such as various classifications, the repeated crossover between different classifications, classification errors, information missing, and inconsistent coordinate system. Moreover, some types of POI have low public awareness, such as public toilets, newspaper pavilions, and bus stations. These types of POI are not significant in the identification of functional areas and are not convenient for research and discussion. Therefore, it is necessary to unify the coordinate of POI data, clean the data, delete wrong POI points, delete repeated abnormal values, screen and delete low public awareness points, and reclassify them. According to the "Code for classification of urban land use and planning standards of development land" (GB_50137–2011) and combined with the actual conditions of Hangzhou, we divided POI into six types: residential, administration and public services, commercial and business facilities, industrial, green space, science and education [27] (Table 1).

Research methods

The research idea of this paper is to use the OSM road network data to divide the city into different research units. Then, based on the POI data, the frequency density of different research units in the city is calculated by using the methods of kernel density and POI weight assignment. Finally, it is divided into different functional partitions according to the discriminant rules. The technical route is shown in Fig 2.

Research unit division based on OSM data.

OSM refers to an open street map for the crowd. Ordinary users can collect and track a large amount of location or trajectory data through mobile terminals, and publish geographic information to the Internet with the help of an open map platform. Compared with traditional mapping spatial data, this kind of data has the advantages of real-time, fast update speed, and free access [28]. This paper collects road network data covering the research area, including five types: highways, arterial roads, first-class roads, second-class roads, and third-class roads. According to China’s "Code for classification of urban land use and planning standards of development land", the highways and arterial roads are the first level, the first-class roads and second-class roads are the second level, and the third-class roads are the third level, buffer zones of 40m, 20m, and 10m are generated respectively. When the irregular grid is generated, there are suspension points and independent roads are topologically modified to form closed units.

Kernel density estimation.

The kernel density is based on the first law of geography, that is, the closer the objects are, the greater the density expansion value will be. This method is used to calculate the density of spatial points and line elements in their surrounding neighborhoods, and to simulate the density layout continuously. The kernel density value of each grid in the image reflects the layout characteristics of spatial elements. It is less affected by subjective factors, and the result has the advantages of gradual change and revealing local characteristics [29, 30]. In this research, the influence spreading of POI data is realized by using the kernel density method. The formula is defined as: (1)

In the formula, Kj is the weight of the research object j, is the distance between the spatial point i and the research object j, R is the bandwidth of the selected regular area (Dij < R), n is the number of research objects j within the bandwidth R.

The result shows that the choice of bandwidth R has a critical impact on the result of kernel density analysis [31, 32]. Generally, larger bandwidth is used to reflect the spatial variation of the global scale, and a smaller bandwidth reflects the spatial variation of the local scale. In this paper, the bandwidth R is adaptive by the optimal search bandwidth strategy of the ArcGIS10.5 platform, and the calculation formula is: (2) (3) where r is the bandwidth, SD is the standard distance value, Dm is the median of the distance between the average center of the point and all points, n is the number of points, xi and yi are the coordinates of points, respectively, and and are the average center coordinates of POI points, respectively.

The above formula is used for calculation in GIS to obtain the search radius of POI data of various functional areas. Among them, the residential area is 1095 m, the administration and public services area is 1102 m, the commercial and business facilities area is 745 m, the industrial area is 970 m, the green space is 1224 m, the science and education area is 1013 m, and the overall search radius of POI data is 837 m. Hinnerburg et al. researched the relationship between the bandwidth and the number of density-attractor and found that there are a certain number of bandwidth intervals that keep the density-attractor stable, and it is reasonable to choose the bandwidth in these intervals [33]. Based on this, the bandwidth interval is determined as [700m, 1300m]. Considering the unity of data analysis, 1000m bandwidth kernel density analysis can better describe the specific aggregation characteristics of various industries, which can meet the needs of urban spatial structure analysis [34, 35].

POI weight assignment.

It is not only the distribution density of POI but also the size of geographical objects represented by POI and the degree of public recognition to affect the function of the plot. In this paper, the occupied area of POI points is selected as one of the weights (Table 2), and the average building area or occupied area of various POIs is determined by referring to the current format classification standard GB / T18106-2010 and the urban public service facility planning standard GB50442 (draft for comments), and then according to the data and the area value for grading and scoring (Table 3). According to the research of Xue and Zhao et al. [36, 37], public awareness is cited as another impact factor (Table 4). Then the weights of the two influencing factors are set to 1:9, 3:7, 5:5, 7:3 and 9:1, respectively. After the accuracy of each proportion of sampling verification, only the accuracy of the results with the weight of 5:5 reaches more than 80%. Therefore, in order to ensure the accuracy of the research, we set the weight of the two influencing factors as 5:5. After superposing the weights of different POI data in different land-use types, the corresponding weights of different land-use types are given. The full score of the two factors is 100, and the total score after weighting is 100. For example, if the general area score of a POI point is 50 and the public awareness is 0.7 (the percentage is 70), then the weight assignment result of this POI point is 60. Based on these, the corresponding weights of each land type are: science and education 65, administration and public services 50, residential 30, commercial and business facilities 25, industrial 70, and green space 80.

thumbnail
Table 2. Functional classification of POI and reference value of coverage area.

https://doi.org/10.1371/journal.pone.0251988.t002

Urban functional area identification

According to the weight calculation of each POI, the frequency density of each POI in the land use unit is calculated, which can be used as the basis of functional zoning [3]. The calculation formula is as follow: (4) Where Fi, di, and Wi are the frequency density, sum of nuclear density, weight of the POI of class I in the unit, respectively. Due to the complex and diverse urban functional structure, there are both single-use functional areas and areas with two or more mixed functions. Therefore, this paper further compares the frequency density of POI in the functional area: When the frequency density of POI in the unit is more than or equal to 50%, the functional area is a single function area. When the two kinds of values of higher frequency density of POI in the unit are between 20% and 50%, the unit is defined as a mixed functional area with two types of POI. When the frequency density of all kinds of POI in the unit is 0, it is defined as no data area, and the rest of the unit is the comprehensive functional area.

Mixed degree.

According to the mixed index of land use, the mixed degree of functional areas in the research area is further evaluated [38]. The formula is as follow: (5) Where D is the calculation result of the mixing degree, pi is the proportion of class I POI kernel density value to the total value of each class of POI kernel density in the functional area unit, and n is the number of POI classes. If the kernel density of some POI in the unit is 0, it does not participate in the calculation.

Results

Results of urban functional area identification

Through OSM network grading and buffer operation, the central urban area of Hangzhou is divided into 1542 block units. After calculating the weight and frequency density of various POI in each block unit, the identification result of functional areas in the Hangzhou central district is obtained (Fig 3). It can be seen from Fig 3 that there are 21 types of functional areas in downtown Hangzhou, including 6 types of single functional areas, 14 types of mixed functional areas, and 1 comprehensive functional area. Single functional areas have commercial and business facilities land (422 blocks in total, with a total area of 153.032 km2), residential land (76 blocks in total, with a total area of 48.128 km2), administration and public services land (11 blocks in total, with a total area of 8.703 km2), green space land (5 blocks in total, with a total area of 2.369 km2), industrial land (35 blocks in total, with a total area of 7.957 km2), science and education land (4 blocks in total, with a total area of 2.989 km2). The mixed functional areas include the combination of commercial and business facilities land, and administration and public services land (350 blocks in total, with a total area of 144.794km2), the combination of commercial and business facilities land, and residential land (234 blocks in total, with a total area of 83.733km2), the combination of commercial and business facilities land, science and education land (136 blocks in total, with a total area of 34.917km2), the combination of commercial and industrial facilities land (39 blocks in total, with a total area of 10.678km2), the combination of commercial and business facilities land, and green space land(7 blocks in total, with a total area of 5.466km2), the combination of administration and public services land, and residential land (14 blocks in total, with a total area of 15.317km2), the combination of administration and public services and green space land (7 blocks in total, with a total area of 2.714km2), the combination of administration and public services and industrial land (6 blocks in total, with a total area of 6.084km2), the combination of administration and public services land, and science and education land (4 blocks in total, with a total area of 8.618km2), the combination of industrial and residential land (2 blocks in total, with a total area of 0.258km2), the combination of residential and green space land (1 block in total, with a total area of 0.411km2), the combination of science and education, and residential land (4 blocks in total, with a total area of 2.838km2), the combination of science and education, and industrial land (1 block in total, with a total area of 0.961km2), the combination of science and education, and green space land (2 blocks in total, with a total area of 2.938km2). There are 103 comprehensive functional areas, covering an area of 39.801km2. The remaining units are no data areas (79 blocks in total, with a total area of 0.333km2). Because of the small statistics of various POIs in this block, the kernel density value is small too.

thumbnail
Fig 3. Recognition results of functional areas in the central city area of Hangzhou.

https://doi.org/10.1371/journal.pone.0251988.g003

Spatial distribution of different functional areas

Single functional area.

The single functional areas in the central urban area of Hangzhou are mainly commercial and business facilities land and residential land (Fig 4). Among them, the number of commercial and business facilities land plots is the largest, accounting for 76% of the total land plots in the single functional area, and 27% of the total land plots in the research area. And the total area of commercial and business facilities land plots is the largest, accounting for 69% of the total land plots in the single functional area, and 26% of the total land plots in the research area. Commercial and business facilities land is distributed in a wide range, but there are large differences between regions, showing the characteristics of more periphery and less center. The number of residential land accounts for 13.7% and 4.9% of the single functional areas and the total of research areas, and the area accounts for 24% and 9%, respectively. Residential land’s spatial distribution is similar to that of commercial and business facilities land, showing the characteristics of more periphery and less center, residential land mainly distributed in the north and south of Xihu District, the east of Jianggan District, and the southern edge of Binjiang District. Meanwhile, the number of industrial land accounts for 6% of the single functional area and 2% of the total research area, with the area accounting for 1% and 0.4% respectively. Industrial land is mainly distributed along the main traffic arteries. The administration and public services land, science and education land, and green space have small areas and single distribution because there are many other mixed functions and the recognition rate is low.

thumbnail
Fig 4. Distribution map of single functional areas in the central city area of Hangzhou.

https://doi.org/10.1371/journal.pone.0251988.g004

Mixed functional area and comprehensive functional area.

Mixed functional areas in the downtown area of Hangzhou are widely distributed (Fig 5), with an area of 319.767km2, accounting for 55% of the total area of the research area, which indicates that the mixed degree of urban functions in the downtown area of Hangzhou is relatively high. Mixed functional areas are mainly the combination of commercial and business facilities land and administration and public services land, which occupy the largest area, accounting for 25% of the total research area. In terms of spatial distribution, the central part is more and concentrated, while the periphery is less and scattered. Specifically, they are primarily concentrated in Shangcheng District and Xiacheng District, the south of Gongshu District, the east and west of the Xihu District, and the west of Jianggan District. Next is the combination of commercial and business facilities land and residential land, accounting for 14% of the total research area, intersperses in various functional areas. The number of other mixed functional areas is 223, accounting for 14% of the total area of the research area, and the area of other mixed functional areas accounted for 16% of the total area of the research area. The layout of other mixed functional areas is more dispersed. Among them, the number of the combination of residential and green space land and the combination of science and education and the industry land are the least, with 1 each. The area of the combination of industrial and residential land is the smallest, accounting for only 0.04% of the total area of the research area. The overall layout of the comprehensive functional urban areas in the central city area of Hangzhou is scattered (Fig 6), accounting for 6.7% of the total research area, and the area of comprehensive functional urban areas accounted for 6.8% of the total functional areas. So it is clear that the overall layout of integrated functional urban areas in the center is in the state of multi-point flowering, distributed around all kinds of single functional areas and mixed functional areas.

thumbnail
Fig 5. Distribution map of mixed functional areas in the central city area of Hangzhou.

https://doi.org/10.1371/journal.pone.0251988.g005

thumbnail
Fig 6. Distribution map of comprehensive functional areas in the central city area of Hangzhou.

https://doi.org/10.1371/journal.pone.0251988.g006

Distribution characteristics of mixed utilization degree.

According to the distribution map of the mixed degree of functional areas (Fig 7), it can be clearly seen that the mixed-use degree of land use in the central urban area of Hangzhou has obvious regional differentiation. Those with a high mixed degree are mixed functional area and comprehensive functional area, and those with low mixing degree are the single functional area. The high-value areas are mainly distributed in the south of Xihu District and Binjiang District, the middle of Shangcheng District and Xiacheng District, the west of the Gongshu District, and the riverside area of Jianggan District. The low degree mixed areas are mainly distributed in the west of Xihu District, the north of Binjiang District, the south of Shangcheng District and Gongshu District, and the north of Xiacheng District and Jianggan District. In each administrative region, there are high-value areas and low-value areas of mixing degree, and the distribution is more even.

thumbnail
Fig 7. Distribution map of the mixed degree of functional areas in the central urban area of Hangzhou.

https://doi.org/10.1371/journal.pone.0251988.g007

Validation of results

In order to verify the accuracy of the functional area identification results, this paper draws on the research of Ding and Kang [39, 40]. In this paper, 40 units of functional area are selected randomly (Fig 8), and the real attributes of the units of functional area are judged according to AutoNavi map, and the accuracy of the identification results is evaluated by the method of scoring conformity degree. The full score is 3, that is, complete compliance, 0 is complete non-compliance. If the single functional area is identified as a mixed functional area of one type, it is 2, and the rest is 1. If the mixed functional area is identified as a single functional area that contains one function of this mixed functional area or a mixed functional area which contains one function of this mixed functional area, it is 2, and the rest is 1.If the comprehensive functional area is identified as a mixed functional area, it is 2, If the comprehensive functional area is identified as a single functional area, it is 1.

thumbnail
Fig 8. Sampling inspection of urban functional areas in central city area of Hangzhou.

https://doi.org/10.1371/journal.pone.0251988.g008

The accuracy calculation formula is: (6)

In the formula, n is the sampling number and Xi is the sum of all samples with full precision, xi is the actual score of sample accuracy.

The evaluation and calculation of the accuracy of functional zoning is shown in Table 5. It can be seen that the overall accuracy of urban functional area identification in the central urban area of Hangzhou reaches 82%, indicating that this research can effectively identify the urban functional areas and have certain accuracy.

Discussion and suggestions

Method feasibility analysis

In the past, the basic data of urban functional area identification were mainly census data and government planning text. These data have some limitations (high acquisition cost, difficult management, poor timeliness, etc.), making research in this field not detailed enough. With the development and popularization of computer technology, remote sensing technology and big data are applied in the research of urban functional area recognition, but there are still problems such as tedious classification methods and the result precision cannot be guaranteed. Based on the kernel density estimation algorithm, this paper proposes an urban functional area identification method that combines OSM and POI data. It has a high reference value for the analysis of urban overall patterns and spatial optimization. At the same time, the availability of OSM and POI data makes this method can be easily used in other cities.

Firstly, POI data can provide a spatial location and attribute information of various urban facilities. Real-time and efficient data acquisition methods simplify the cost and uncertainty of traditional identification methods such as remote sensing, field research and questionnaire survey. The application of big data breaks through the problems of poor timeliness and a small sample of traditional data, improves the accuracy of urban functional area identification, and is convenient for larger-scale research.

Then, the OSM road network is used to generate road space, and the research area is divided into independent block units so that the segmentation of urban functional areas is more reasonable. The kernel density estimation method is used to realize the influence diffusion of POI in the adjacent location, weaken the discrete phenomenon of POI points, and identify the functional areas through the optimal bandwidth selection. The results are basically consistent with the actual situation. Through the calculation of the mixed index of urban functional areas, it is convenient to express the mixed degree of land use in the central urban area of Hangzhou. Although in the process of urban planning and construction, urban land use layout planning emphasizes functional zoning, mixed land use planning should also be adhered to promote economic development and facilitate the lives of residents.

Finally, compare the identified functional areas with the image of AMap, the verification of the results shows that the method has a high degree of recognition for urban functional areas, which can make up for the disadvantages of the traditional method and has certain feasibility. It can support the spatial layout research of urban functional areas, so that the decision-making organs can find the shortcomings of the existing urban planning, and can coordinate the development direction of the existing planning functional areas, improve the vitality of urban space, and carry more human activities.

However, this method also has some shortcomings. For example, the verification of the results is only carried out through the comparison of typical regions, and there is no true value as a comparison. Therefore, it is hard to determine the accurate identification accuracy of this method. In addition, this method is based on the OSM road network to divide urban units, and the basic road network data have the problems of low density and the lack of data in suburban and rural areas, which leads to the excessive division of functional units in such areas, resulting in poor local recognition results.

Current situation analysis

The urban functional area projects each element of the city in space and forms a closely related organism. The identification of Hangzhou urban functional areas based on big data is not only beneficial to the adjustment of urban spatial layout and the creation of good conditions for the development of urban economy and society but also can improve the efficiency of urban land use [41] through the rationalization of urban functional zoning, so as to ensure the effective implementation of urban sustainable development strategy [42, 43]. According to the “Results” part, we have the following judgments:

(1) The urban functional pattern of "single-mixed-comprehensive" coexistence.

On the whole, the single functional areas are mainly distributed in the periphery of the city. For example, the single residential areas are mainly distributed in the north and south of West Lake District, the east of Jianggan District, and the southern edge of the Binjiang District. The single industrial areas are mainly distributed in the eastern part of the Jianggan District and the southern part of the Xihu District. The single areas for science and education are mainly distributed in the northern part of the main urban area of Hangzhou. In general, the more single functional land, the more it affects the mobility of the city [44]. For example, the time cost of residents leaving the residential area to reach two or more other functional areas is relatively higher. From the perspective of sustainable development, the simplification of functions will not only affect the "mobility" of the city but also directly reduce the attractiveness and vitality of the city [45]. The existence of single-use areas is related to policies, history, and land use cost. For example, the cost of land use in central areas may be more than ten times that in the suburbs. Therefore, large scales of land development are often developed in the suburbs [15]. At the same time, the development area policies with Chinese characteristics will also lead to the emergence of large-scale single functional areas in the suburbs.

The mixed functional areas are distributed in the whole research area and nested with single functional areas, which indicates the rationality of the spatial structure of the main urban area of Hangzhou. For example, residential land forms residential-industrial zones around enterprises, commercial-residential lands around businesses, and public service-residential lands around public services. From the perspective of different administrative regions, each administrative region has an area with a higher degree of mixed, as well as an area with a lower degree of mixed. In line with the law of urban growth and development in urban areas, it revolves around multiple economic activity centers [46, 47]. However, most of the highly mixed areas are not centers of economic activity and do not have highly mixed urban functions around traffic stations. This is mainly due to the rigid system of land use and the consolidation of management models in China [48, 49].

(2) Comprehensive functional areas bring more vitality to the city.

The comprehensive functional area means that there are three or more urban functions in the plot, reflecting the organic integration of different urban functions. For example, the southernmost comprehensive functional area of Binjiang District is the location of China Network Writers’ Village and the core for the development of China’s network literature industry. Meanwhile, the high-quality environment and supporting facilities attract more writers. Similarly, in the easternmost comprehensive functional area of Jianggan District, there are enterprise platforms with functions of scientific research and industrial incubation, such as ABO Biopharmaceuticals (Hangzhou) Co., Ltd., Sanhua Academia Sinica, as well as high-grade residential areas, the supporting facilities are relatively complete. These regions are very typical innovation areas in Hangzhou, and the mixing of functions is also the most prominent feature of high-quality urban innovation space [50, 51].

(3) Low mixed degree of regional function with ecological function and production function.

In the research area, the mixed degree of some areas is deliberately controlled, mainly in the areas that undertake ecological functions, such as the Qiantang River, Xixi Wetland, West Lake Scenic Area, and the areas that undertake production functions, such as Zhejiang Qiosi Farm in Jianggan District. This is because the strategy of ecological civilization and the protection of cultivated land both are China’s basic state policies [52], which require strict restrictions on the development of the ecological area and cultivation area. A high degree of mixed often means diversified and high-intensity land development. Therefore, the existence of such low-mixed land use for urban development strategy is of great significance.

Suggestions

The rapid evolution of cities and the promotion of China’s urbanization policy have brought many new challenges to urban planning and management. The spatial division of urban functional areas and urban zoning management of the city could provide a new idea for planners and managers to a certain extent. The government could plan different functional areas of the city according to the social, economic and natural geographical conditions, and realize the change from the traditional " unitary and extensive " management mode to the modern " diversified and refined " management mode [53, 54].

Based on the above analysis of the comprehensive judgment of the current situation, we believe that the current urban functions of the central city area of Hangzhou are becoming diversified and individualized. However, with the continuous evolution of urban spatial structure, there are many potential problems [55]. First of all, the single functional area is still more, leading to the separation of working and housing, the separation of school and residence, and heavy traffic pressure in the morning rush and evening rush. Secondly, the less comprehensive function area and insufficient space for diversified urban functions, which will bring the problem of insufficient vitality and attraction of the city. Finally, the over-strict control of land function results in the restriction of the diversified allocation of ecological and productive land, which has a great inconvenience to the surrounding residents.

Therefore, in the future, the urban land use function of the central urban area of Hangzhou should be optimized from the following aspects: (1) Developing mixed-use areas based on actual needs, guiding the optimization of urban mixed areas to improve the efficiency of land use. (2) Reducing single-use land types and promoting the transition from single-use land to mixed-use land. (3) Adhering to the construction of the multi-center distribution pattern of urban spatial structure and promote the fair distribution of social-spatial elements [56]. (4) While adhering to the strict system of cultivated land protection and ecological protection, we should also provide diversified infrastructure to serve the neighboring residents.

Conclusions and deficiencies

This article takes the central urban area of Hangzhou as the research area and proposes a method of city functional area identification based on the kernel density estimation algorithm that integrates OSM road network and POI data: Firstly, the OSM network divides the urban area into research units with the same social and economic functions. The influence diffusion of POI facilities is realized based on the kernel density estimation algorithm, which weakens the discretization of POI. At the same time, kernel density bandwidth and POI weight are set to identify the functional urban areas and projected to different plots. Finally, the land-use mixed index is calculated. The results show that: (1) There are three types of urban functions in the central urban area of Hangzhou: single, mixed, and comprehensive. The central urban areas of Hangzhou are divided into 21 functional areas and the commercial function land is the dominant function of the research area. (2) The single functional areas and the mixed functional areas show the geographical distribution characteristics of the looping stratification, which means the “Core-periphery” differentiation is obvious. Mixed functional areas are widely distributed, but the areas with ecological and production functions have a low degree of mixed. Comprehensive functional areas are widely distributed and with higher potential and vitality. (3) The results of identification are in good consistent with the actual situation of the central urban area of Hangzhou. Therefore, the kernel density weighting method which considered public awareness and general area has high accuracy and feasibility, which can provide a reference for urban development planning and management. (4) The real-time and high efficiency of OSM and POI data acquisition methods and the scientific accuracy of big data application make the method of identifying urban functional area which combined OSM and POI data available. This method can be easily used in the identification of other urban regional functional areas to help people to understand the urban spatial structure more intuitively. And this method has high potential in assisting government departments and researchers in the scientific analysis of urban planning and land use.

There are some deficiencies of this paper: Because of the data defect of POI, the identification ability of areas such as small building density, unused land in cities and farmland is weak. At the same time, the integrity of POI data, the rationality of data cleaning and the accuracy of reclassification need further research. In view of the defects of the results test in research methods, a variety of verification methods can be added to future research for a horizontal comparison of results. Finally, due to the lack of historical POI, the evolution characteristics of urban functional areas cannot be identified. So combining mobile signaling data, sharing vehicle (bicycle) data, social media big data and other multi-source data to analyze the evolutionary logic and driving mechanism of urban functional areas will be the direction of future research.

References

  1. 1. Ma S, Long Y. Functional urban area delineations of cities on the Chinese mainland using massive Didi ride-hailing records. Cities. 2020, 97: 102532. https://doi.org/10.1016/j.cities.2019.102532
  2. 2. Zhang X, Du S, Zheng Z. Heuristic sample learning for complex urban scenes: Application to urban functional-zone mapping with VHR images and POI data. Isprs J Photogramm. 2020, 161: 1–12. https://doi.org/10.1016/j.isprsjprs.2020.01.005
  3. 3. Hu Y, Han Y. Identification of Urban Functional Areas Based on POI Data: A Case Study of the Guangzhou Economic and Technological Development Zone. Sustainability. 2019; 1: 385. https://doi.org/10.3390/su11051385
  4. 4. Gu C, Wang Y, Shao Y, Gu J. Research on administrative divisions based on functional areas analysis: A case of Shaoxing metropolitan area. Acta Geogr Sin. 2015; 70: 1187–1201. https://doi.org/10.11821/dlxb201508001
  5. 5. Liu X, Peng W. Urban functional zoning and zoning classification management. Urban Oper Manag. 2010; 4: 20–22.
  6. 6. Yang J, Guo A, Li Y, Zhang Y, Li X. Simulation of landscape spatial layout evolution in rural-urban fringe areas: a case study of Ganjingzi District, Gisci Remote Sens, 2019; 56: 388–405. https://doi.org/10.1080/15481603.2018.1533680.
  7. 7. Hu S, He Z, Wu L, Yin L, Xu Y, Cui H. A framework for extracting urban functional regions based on multiprototype word embeddings using points-of-interest data. Comput. Environ. Urban Syst. 2020; 80: 101442. https://doi.org/10.1016/j.compenvurbsys.2019.101442
  8. 8. Tonne C, Halonen JI, Beevers SD, Dajnak D, Anderson HR. Long-term traffic air and noise pollution in relation to mortality and hospital readmission among myocardial infarction survivors. Int J Hyg Environ Health. 2015; 219: 72–78. pmid:26454658
  9. 9. Zhang P, He J, Hong X. Carbon sources/sinks analysis of land-use changes in China based on data envelopment analysis. J Clean Prod, 2018; 208: 702–711. https://doi.org/10.1016/j.jclepro.2018.08.341
  10. 10. Zhang P, Li Y, Jing W. Comprehensive Assessment of the Effect of Urban Built-Up Land Expansion and Climate Change on Net Primary Productivity. Complexity, 2020. https://doi.org/10.1155/2020/8489025
  11. 11. Cui H, Wu L, Hu S, Lu R, Wang S. Recognition of urban functions and mixed use based on residents’ movement and topic generation model: the case of Wuhan, China. Remote sens-basel. 2020; 12: 2889. https://doi.org/10.3390/rs12182889
  12. 12. Zhong Y, Zhu Q, Zhang L. Scene classification based on the multifeature fusion probabilistic topic model for high spatial resolution remote sensing imagery. IEEE Trans Geosci Electron. 2015; 53: 6207–6222. https://doi.org/10.1109/TGRS.2015.2435801
  13. 13. Zhang X, Du S. A linear dirichlet mixture model for decomposing scenes: application to analyzing urban functional zonings. Remote Sens Environ. 2015; 169: 37–49. https://doi.org/10.1016/j.rse.2015.07.017
  14. 14. Zhang X, Hua Q, Zhang L. Development and application of a planning support system for regional spatial functional zoning based on gis. Sustainability. 2016; 8: 909. https://doi.org/10.3390/su8090909
  15. 15. Liu Y, Wang F, Xiao Y, Gao S. Urban land uses and traffic ’source-sink areas’: Evidence from GPS-enabled taxi data in Shanghai. Landscape Urban Plan. 2012; 106: 73–87. https://doi.org/10.1016/j.landurbplan.2012.02.012
  16. 16. Pei T, Sobolevsky S, Ratti C, Shaw SL, Li T, Zhou C. A new insight into land used classification based on aggregated mobile phone data. Int J Geogr Inf Sci. 2014; 28: 1988–2007. https://doi.org/10.1080/13658816.2014.913794
  17. 17. Jia Y, Ge Y, Ling F, Guo X, Wang J, Wang L. Urban land use mapping by combining remote sensing imagery and mobile phone positioning data. Remote Sens-basel. 2018; 10: 446. https://doi.org/10.3390/rs10030446
  18. 18. Song X, Richards DR, He P, Tan PY. Does geo-located social media reflect the visit frequency of urban parks? a city-wide analysis using the count and content of photographs. Landscape Urban Plan. 2020; 203: 103908. https://doi.org/10.1016/j.landurbplan.2020.103908
  19. 19. Crandall D, Snavely N. Modeling people and places with internet photo collections. Commun ACM. 2011; 4: 18–26. https://doi.org/10.1145/2208917.2212756
  20. 20. Bagchi M, White PR. What role for smart-card data from bus systems? P I Civil Eng-Munic. 2004; 157: 39–46. https://doi.org/10.1680/muen.2004.157.1.39
  21. 21. Wang F, Gao X, Yan B. Research on urban spatial structure in Beijing based on housing prices. Prog Geogr. 2014; 33: 322–1331. https://doi.org/10.11820/dlkxjz.2014.10.004
  22. 22. Qiu M, Du M, Liu Y. Application of voronoi diagrams and multiangle measurable image in the urban poi location and site generation. In Proceedings of the IET Conference Proceedings, Institution of Engineering and Technology, 2012, Glasgow, UK, 22–25 October 2012. https://doi.org/10.1049/cp.2012.2318
  23. 23. Zhu X, Zhou C. POI inquiries and data update based on lbs. In Proceedings of the 2009 International Symposium on Information Engineering and Electronic Commerce, Ternopil, Ukraine, 16–17 May 2009. https://doi.org/10.1109/IEEC.2009.159
  24. 24. Zhai W, Bai X, Shi Y. Beyond Word2vec: An approach for urban functional region extraction and identification by combining Place2vec and POIs. Comput Environ Urban. 2019; 74: 1–12. https://doi.org/10.1016/j.compenvurbsys.2018.11.008
  25. 25. Han H, Yu X, Long Y. Identifying urban functional zones using bus smart card data and points of interest in Beijing. City Plann Rev. 2016; 40: 52–60. https://doi.org/10.1007/978-3-319-19342-7_10
  26. 26. Ran Z, Zhou G. Study on spatial pattern of consumer service industry in Changsha based on POI data. World Regional Studies. 2019; 28: 63–172. https://doi.org/10.3969/j.issn.1004-9479.2019.03.2018225
  27. 27. Kang Y, Wang Y, Xia Z, Chi J, Jiao M, Wei Z. Identification and classification of wuhan urban districts based on poi. J Geomat. 2018; 43: 81–85. https://doi.org/10.14188/j.2095-6045.2016352
  28. 28. Hong Y, Yao Y. Hierarchical community detection and functional area identification with osm roads and complex graph theory. Int J Geogr Inf Sci. 2019; 33: 1569–1587. https://doi.org/10.1080/13658816.2019.1584806
  29. 29. Rosenblatt M. Remarks on Some Nonparametric Estimates of a Density Function. Annals of Mathematical Statistics. 1956; 27: 642–669. https://doi.org/10.1007/978-1-4419-8339-8_13
  30. 30. Sheather SJ, Jones MC. A reliable Data-Based Bandwidth Se-lection Method for Kernel Density Estimation. J R Stat Soc. 1991; 53: 683–690. https://doi.org/10.2307/2345597
  31. 31. Heidenreich NB, Schindler A, Sperlich S. Bandwidth selection for kernel density estimation: a review of fully automatic selectors. Asta-Adv Stat Anal. 2013; 97: 403–433. https://doi.org/10.1007/s10182-013-0216-y
  32. 32. Elgammal A, Daraiswami R, Harwood D. Background and Foreground Modeling Using Nonparametric Kernel Density Estimation for Visual Surveillance. P. IEEE. 2002; 90: 1151–1163. https://doi.org/10.1109/JPROC.2002.801448
  33. 33. Hinneburg A. An efficient approach to clustering in large multime-dia databases with noise. Data Min Knowl Disc. 1998; 98:58–65.
  34. 34. Xue B, Xiao X, Li J, Xie X, Lu C, Ren W. POI-based spatial correlation of the residences and retail industry in Shenyang city. Sci Geogr Sin. 2019; 39: 442–449. https://doi.org/10.13249/j.cnki.sgs.2019.03.010
  35. 35. Liu Z. Geographic information analysis. Prof Geogr. 2005; 57: 624–626. https://doi.org/10.1111/j.1467-9272.2005.00504_11.x
  36. 36. Xue B, Li J, Xiao X. Overview of manland relationship research based on POI data: Theory, method and application. Geo Spat Inf Sci. 2019; 35: 51–60. https://doi.org/10.3969/j.issn.1672-0504.2019.06.009
  37. 37. Zhao W, Li Q, Li B. Extracting hierarchical landmarks from urban POI data. J Remote sens. 2011; 15: 973–988. https://doi.org/10.11834/jrs.20110173
  38. 38. Li M, Ma Y, Sun X, Wang J, Dang A. Application of spatial and temporal entropy based on multi-source data for measuring the mix degree of urban functions. City Plann Rev. 2018; 2: 97–103. https://doi.org/10.11819/cpr20180214a
  39. 39. Kang Y, Wang Y, Xia Z. Division and identification of urban functional areas in Wuhan Based on POI data. J Geoma. 2018; 43: 81–85. https://doi.org/10.14188/j.2095-6045.2016352
  40. 40. Ding Y, Xu H, Wang C. Urban functional area identification based on OSM road network and POI data. Geo Gis. 2020; 36: 57–63. https://doi.org/10.3969/j.issn.1672-0504.2020.04.009
  41. 41. Yang J, Xie P, Xi J, Ge Q, Li X, Ma Z. Land use change simulation based on cellular automata model-Dalian Economic and Technological Development Zone as an example. J Geogr Sci. 2015; 70: 461–475. https://doi.org/10.11821/dlxb201503009
  42. 42. Chi J, Jiao L, Dong T, Gu Y, Ma Y. Quantitative identification and visualization of urban functional area based on poi data. J Geomat. 2016; 41: 68–73. https://doi.org/10.14188/j.2095-6045.2016.02.017
  43. 43. Yuan J, Zheng Y, Xie X. Discovering regions of different functions in a city using human mobility and pois. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012, Beijing, China, 12–16 August 2012. https://doi.org/10.1145/2339530.2339561
  44. 44. Xue B, Xiao X, Li J. POI-based analysis on the affecting factors of prices’ spatial distribution in the traditional industrial area. Hum Geogr. 2019; 34: 106–114. https://doi.org/10.13959/j.issn.1003-2398.2019.04.013
  45. 45. Zhou L, Dang X, Mu H, Wang B, Wang S. Cities are going uphill: Slope gradient analysis of urban expansion and its driving factors in China. Sci Total Environ, 2020; 709:1–16. https://doi.org/10.1016/j.scitotenv.2021.145836
  46. 46. Cranshaw J, Schwartz R, Hong JI. The Livehoods Project: Utilizing Social Media to Understand the Dynamics of a City. 6th International AAAI Conference on Weblogs and Social Media (ICWSM), 2012, Dubin, Ireland, 6 July 2012.
  47. 47. Cheng X, Shao H, Li Y, Shen C, Liang P. Urban land intensive use evaluation study based on nighttime light—A case study of the Yangtze river economic belt. Sustainability. 2019; 11: 675. https://doi.org/10.3390/su11030675
  48. 48. Dowling TJ. Reflections on Urban Sprawl, Smart Growth, and the Fifth Amendment. U. Penn. Law Rev. 2000; 148: 873–887. https://doi.org/10.2307/3312828
  49. 49. Wang F, Hui Q. An Analysis of the Yangtze River Delta Urban Agglomeration Smart City Policies from the Perspective of Policy Tools. J Intel. 2017; 36: 86–92. https://doi.org/10.3969/j.issn.1002-1965.2017.09.014
  50. 50. Esmaeilpoorarabi N, Yigitcanlar T, Guaralda M. Place quality in innovation clusters: an empirical analysis of global best practices from Singapore, Helsinki, New York, and Sydney. Cities, 2018, 74: 156–168. https://doi.org/10.1016/j.cities.2017.11.017
  51. 51. Pancholi S, Yigitcanlar T, Guarald M. Place making for innovation and knowledge-intensive activities: The Australian experience. Technol Forecast Soc Change. 2019, 146: 616–625. https://doi.org/10.1016/j.techfore.2017.09.014
  52. 52. Zhou L, Dang X, Sun Q, Wang S. Multi-scenario simulation of urban land change in Shanghai by random forest and CA-Markov model. Sustain Cities Soc, 2020; 55:1–10. https://doi.org/10.1016/j.scs.2020.102045
  53. 53. Niu X, Liang D, Song X. Understanding Urban Spatial Structure of Shanghai Central City Based on Mobile Phone Data. City Plann Rev. 2015; 24. https://doi.org/CNKI:SUN:CSGY.0.2015-03-004
  54. 54. Zhang P, Yang D, Qin M. Spatial heterogeneity analysis and driving forces exploring of built-up land development intensity in Chinese prefecture-level cities and implications for future Urban Land intensive use. Land Use Policy. 2020; 99: 104958. https://doi.org/10.1016/j.landusepol.2020.104958
  55. 55. Myint SW, Gober P, Brazel A. Per-pixel vs. Object-based Classification of Urban Land Cover Extraction Using High Spatial Resolution Imagery. Remote Sens Environ. 2011; 115:1145–1161. https://doi.org/10.1016/j.rse.2010.12.017
  56. 56. Minner JS, Xiao S. Churn and Change along Commercial Strips: Spatial Analysis of Patterns in Remodeling Activity and Landscapes of Local Business. Urban Stud. 2017; 54:3655–3680. https://doi.org/10.1177/0042098016684274