基于神经网络嵌入从城市兴趣点数据中描述城市功能使用:以大伦敦为例

摘要

Delineating urban functional use plays a key role in understanding urban dynamics, evaluating planning strategies and supporting policymaking. In recent years, Points of Interest (POI) data, with precise geolocation and detailed attributes, have become the primary data source for exploring urban functional use from a bottom-up perspective, using local, highly disaggregated, big datasets. Previous studies using POI data have given insufficient consideration to the relationship among POI classes in the spatial context, and have failed to provide a straightforward means by which to classify urban functional areas. This study proposes an approach for delineating urban functional use at the scale of the Lower Layer Super Output Area (LSOA) in Greater London by integrating the Doc2Vec model, a neural network embedding method commonly used in natural language processing for vectoring words and documents from their context. In this study, the neural network vectorises both POI classes (‘Word’) and urban areas (‘Document’) based on their functional context by learning features from the spatial distribution of POIs in the city. Specifically, we first construct POI sequences based on the distribution of POI classes, and add their LSOA IDs as ‘document’ tags. By utilising these constructed POI–LSOA sequences, the Doc2Vec model trains the vectors of 574 POI classes (word vectors) and 4,836 LSOAs (document vectors). The vectors of POI classes are then used in calculating the functional similarity scores based on their cosine distance, with the vectors of LSOAs grouped into clusters (i.e., functional areas) via the $k$-means clustering algorithm. We also identify latent functions in each cluster of LSOAs by performing topic modelling and enrichment factor. Compared with TF–IDF, LDA and Word2Vec models, the Doc2Vec model obtains the highest accuracy when classifying functional areas. This study proposes a straightforward approach in which the model directly trains vectors for urban areas, subsequently using them to classify urban functional areas. By employing the enhanced neural network model with low-cost and ubiquitous POI datasets, this study provides a potential tool with which to monitor urban dynamics in a timely and adaptive manner, thereby providing enhanced, data-driven support to urban planning, development and management

出版物
Computers, Environment and Urban Systems
点击标题下方 DOI 按钮转到期刊在线发布版本。
  • A neural network embedding model is employed in delineating urban functional use from POI (Points of Interest) data.
  • Doc2Vec model directly trains vector representations for spatial areas while considering the spatial distribution of POIs.
  • This paper explores the functional similarity among 574 POI classes and 4836 LSOAs (Lower Layer Super Output Areas) in Greater London.
  • Doc2Vec model outperforms other semantic models (Word2Vec, LDA and TF-IDF) in urban functional areas identification.
牛海沣
牛海沣
副研究员

英国剑桥大学土地经济系副研究员,欧盟Horizon 2020资助项目Emotional Cities空间分析研究员。主要研究兴趣包括城市大数据挖掘、空间数据科学、地理可视化、城市感知和城市动态模拟,特别是关注如何通过结合机器学习、人工智能和城市大数据来更好地支持城市规划、政策制定和智能管理。