Research – Page 2 – GeoDSLab@UW-Madison

Prof. Song Gao received an AI for Earth Grant from Microsoft

August 31, 2019May 10, 2022Song Gao

[Madison, WI/USA] – [August 8, 2019] – Professor Song Gao as the Principal Investigator (PI) has been awarded an AI for Earth research grant from Microsoft to help further the efforts in the area of Geospatial Artificial Intelligence (GeoAI).

This new grant will provide Dr. Song Gao and his research assistants Yuhao Kang and Jake Kruse at the GeoDS@UW-Madison lab, and Dr. Fan Zhang (Postdoc Researcher at the MIT Senseable city Lab and Peking University) with the Azure cloud computing resources and AI data labelling services to accelerate their work on understanding the playability of cities and metropolitan areas from the human-environment interaction perspective using multi-source geospatial big data (e.g., images, texts, and videos).

The Microsoft AI for Earth is a $50 million, 5-year program that brings the full advantage of Microsoft technology to those working to solve global environmental challenges in the key focus areas of climate, agriculture, water and biodiversity. Through grants that provide access to cloud and AI tools, opportunities for education and training on AI, and investments in innovative, scalable solutions, AI for Earth works to advance sustainability across the globe.

Learn more about the Microsoft AI for Earth program: https://www.microsoft.com/en-us/aiforearth

A theoretical framework of modeling vague areal objects in GIScience

July 18, 2019sgao

Liu, Y., Yuan, Y., & Gao, S. (2019). Modeling the Vagueness of Areal Geographic Objects: A Categorization System. ISPRS International Journal of Geo-Information, 8(7), 306. DOI: https://doi.org/10.3390/ijgi8070306

Abstract: Modeling vague objects with indeterminate boundaries has drawn much attention in geographic information science (GIScience). Because fields and objects are two perspectives in modeling geographic phenomena, this paper investigates the characteristics of vague regions from the perspective of the field/object dichotomy. Based on the assumption that a vague object can be viewed as the conceptualization of a field, we defined five categories of vague objects: (1) direct field-cutting objects, (2) focal operation-based field-cutting objects, (3) element-clustering objects, (4) object-referenced objects, and (5) dynamic boundary objects. We then established a categorization system to formalize the semantic differences between vague objects using the fuzzy set theory. The proposed framework provides valuable input for the conceptualization, interpretation, and modeling of vague geographical objects.

**Figure.** The categorization system of the five categories of fuzzy regions and their relations.

Full Paper about “Trajectory Reconstruction” accepted at Computers, Environment and Urban Systems

May 29, 2019Mingxiao Li

Citation info: Mingxiao Li, Song Gao, Feng Lu, Hengcai Zhang. (2019) Reconstruction of human movement trajectories from large-scale low-frequency mobile phone data. Computers, Environment and Urban Systems, Volume 77, September 2019, 101346. DOI:
10.1016/j.compenvurbsys.2019.101346

Abstract

Understanding human mobility is important in many fields, such as geography, urban planning, transportation, and sociology. Due to the wide spatiotemporal coverage and low operational cost, mobile phone data have been recognized as a major resource for human mobility research. However, due to conflicts between the data sparsity problem of mobile phone data and the requirement of fine-scale solutions, trajectory reconstruction is of considerable importance. Although there have been initial studies on this problem, existing methods rarely consider the effect of similarities among individuals and the spatiotemporal patterns of missing data. To address this issue, we propose a multi-criteria data partitioning trajectory reconstruction (MDP-TR) method for large-scale mobile phone data. In the proposed method, a multi-criteria data partitioning (MDP) technique is used to measure the similarity among individuals in near real-time and investigate the spatiotemporal patterns of missing data. With this technique, the trajectory reconstruction from mobile phone data is then conducted using classic machine learning models. We verified the method using a real mobile phone dataset including 1 million individuals with over 15 million trajectories in a large city. Results indicate that the MDP-TR method outperforms competing methods in both accuracy and robustness. We argue that the MDP-TR method can be effectively utilized for grasping highly dynamic human movement status and improving the spatiotemporal resolution of human mobility research.

Full Paper about “Map Style Transfer” accepted at the International Journal of Cartography

May 4, 2019yuhao

Our paper entitled Transferring Multiscale Map Styles Using Generative Adversarial Networks has been accepted for publishing in the International Journal of Cartography.

DOI: 10.1080/23729333.2019.1615729

Authorship: Yuhao Kang, Song Gao, Robert E. Roth.

This paper proposes a methodology framework to transfer the cartographic style in different kinds of maps. By inputting the raw GIS vector data, the system can automatically render styles to the input data with target map styles but without CartoCSS or Mapbox GL style specification sheets. The Generative Adversarial Networks (GANs) are used in this research. The study explores the potential of implementing artificial intelligence in cartography in the era of GeoAI.

We outline several important directions for the use of AI in cartography moving forward. First, our use of GANs can be extended to other mapping contexts to help cartographers deconstruct the most salient stylistic elements that constitute the unique look and feel of existing designs, using this information to improve design in future iterations. This research also can help nonexperts who lack professional cartographic knowledge and experience to generate reasonable cartographic style sheet templates based on inspiration maps or visual art. Finally, integration of AI with cartographic design may automate part of the generalization process, a particularly promising avenue given the difficult of updating high resolution datasets and rendering new tilesets to support the ’map of everywhere’.

Here is the abstract:

The advancement of the Artificial Intelligence (AI) technologies makes it possible to learn stylistic design criteria from existing maps or other visual arts and transfer these styles to make new digital maps. In this paper, we propose a novel framework using AI for map style transfer applicable across multiple map scales. Specifically, we identify and transfer the stylistic elements from a target group of visual examples, including Google Maps, OpenStreetMap, and artistic paintings, to unstylized GIS vector data through two generative adversarial network (GAN) models. We then train a binary classifier based on a deep convolutional neural network to evaluate whether the transfer styled map images preserve the original map design characteristics. Our experiment results show that GANs have a great potential for multiscale map style transferring, but many challenges remain requiring future research.

Examples of Map Style Transfer using Pix2Pix

Examples of Map Style Transfer using CycleGAN

You can also visit the following links to see some of the trained results:

CycleGAN at zoom level 15: https://geods.geography.wisc.edu/style_transfer/cyclegan15/

CycleGAN at zoom level 18: https://geods.geography.wisc.edu/style_transfer/cyclegan18/

Pix2Pix at zoom level 15: https://geods.geography.wisc.edu/style_transfer/pix2pix15/

Pix2Pix at zoom level 18: https://geods.geography.wisc.edu/style_transfer/pix2pix18/

Dataset available (Only simple styled maps are available, while target styled maps are not available because of the copyright from Google):

Level 15: Training, Test.

Level 18: Training, Test.

Full Paper about “Solar Energy Estimation using Street-view Images” accepted at the Journal of Cleaner Production

April 28, 2019yuhao

Our paper entitled “Towards feasibility of photovoltaic road for urban traffic-solar energy estimation using street view image” has been accepted for publishing in the Journal of Cleaner Production.

Authorship: Ziyu Liu, Anqi Yang, Mengyao Gao, Hong Jiang, Yuhao Kang, Fan Zhang, Teng Fei.

This paper proposes a methodology framework to calculate the solar energy that can be collected by solar panels paved on the road. Estimation of how much energy can be collected help making decision of where these photovoltaic road system should be built. Exemplified by the city of Boston, using street view images and taking light obstacles, traffic conditions, weather conditions and seasonal changes of solar radiation into consideration, the potential of solar energy generated by Boston’s road network is estimated precisely. Our results show that the energy obtained from urban road network can support all private cars in Boston.

Here is the abstract:
A sustainable city relies on renewable energy, which promotes the development of electric vehicles. To support electric vehicles, the concept of charging vehicles while driving has been put forward. Under such circumstances, constructing solar panels on urban roads is an innovative option with great benefits, and the accurate calculation of road photovoltaic power generation is a prerequisite. In this paper, we propose a novel framework for predicting and calculating the solar radiation and electric energy that can be collected from the roads. Google Street View images are collected to measure the sky obstruction of roads which is integrated with the solar radiation model to estimate the irradiation receiving capability. In addition to sky obstruction, we also take the impact of traffic conditions and weather situations into consideration in the calculation. Radiation maps at different times in a year are produced from our work to analyze the roads photovoltaic distribution. In order to test the feasibility of our framework, we take Boston as a case study. Results show that roads in Boston can generate abundant electricity for all future electric vehicles in the city. What’s more, main roads through Boston exhibit better power generation potential, and the effect of the traffic condition is limited. Our calculation framework confirms that utilizing solar panels as road surfaces is a great supplement of city power with the unique ability to charge moving cars.

Funded Project: Geo-mapping antimicrobial resistance in E. coli from humans & animals in Wisconsin

April 25, 2019sgao

Recently, Dr. Laurel Legenza (PI) from the UW School of Pharmacy, Dr. Thomas R. Fritsche (Co-PI) from the Marshfield Medical Center and Professor Song Gao participating as a geospatial analysis scientist along with the State Cartographer’s Office (SCO) and other multidisciplinary collaborators, have been awarded a pilot grant from the UW Institute for Clinical and Translational Research (ICTR) and the Marshfield Clinic Research Institute for a research proposal titled “Geo-mapping antimicrobial resistance in E. coli from humans & animals” in Wisconsin.

The AMR Tracker tool, shown in the screenshot above, provides a map showing an array of antibiotics that might be prescribed to treat an infection (in this case, E.coli), and which one can be expected to work best in a specific geographic location. This could help doctors choose the right drug for their patients.

When a patient arrives at a hospital with an infection, his/her doctor must decide which antibiotic might have the best chance of curing him/her — no easy feat when disease-causing pathogens are increasingly resistant to multiple antibiotics. To make this data more accessible, a team of researchers at the University of Wisconsin–Madison School of Pharmacy and the State Cartographer’s Office have developed a prototype system that maps out trends in antibiotic resistance across the State of Wisconsin, which provides guidance at a glance of the likelihood a pathogen will respond to a particular drug.

More details: [Link]

Research on Multi-Scale Spatio-temporal Analysis of Human Emotions

January 4, 2019yuhao

In our research, state-of-the-art computer vision and AI technologies are utilized to collect, store, handle, manipulate and analyze the human emotions and sentiment at different geographic scales. The research explored what and how people express their emotions at different places, and why and how their emotions would be influenced by environmental factors. Several maps are utilized to visualize where people may be happier than at other locations. In traditional research, we may only use questionnaires to investigate the human emotions and socioeconomic factors. But nowadays, it is possible to collect human emotions using large-scale user generated data online, including tweets, emoji, photos, articles, etc.. As we know, human emotions are innate characteristics of human beings, and with computer technology, it is possible to use objective methods to quantify the subjective human emotion. And it is quite important to build a computational workflow to handle large volumes of user generated data and extract emotion from those data efficiently. Here are several examples which we are working on.

(1) Individual place scale: human emotions at different tourist attractions

In this study, a novel framework for extracting human emotions from large-scale georeferenced photos at different places is proposed. After the construction of places based on spatial clustering of user generated footprints collected in social media websites, online cognitive services are utilized to extract human emotions from facial expressions using state-of-the-art computer vision techniques. And two happiness metrics are defined for measuring the human emotions at different places. To validate the feasibility of the framework, we take 80 tourist attractions around the world as an example and a happiness ranking list of places is generated based on human emotions calculated over 2 million faces detected out from over 6 million photos. Different kinds of geographical contexts are taken into consideration to find out the relationship between human emotions and environmental factors. Results show that much of the emotional variation at different places can be explained by a few factors such as openness. The research may offer insights on integrating human emotions to enrich the understanding of sense of place in geography and in place-based GIS.

The spatial distribution of 80 tourist sites and their associated emotion indices using facial expression.

(2) Urban scale: relationship between human emotion and stock market fluctuation at Manhattan

In this research, we examined whether emotion expressed by users in social media can be influenced by stock market index or can predict the fluctuation of the stock market index. We collected the emotion data in Manhattan, New York City using face detection technology and emotion cognition services for photos uploaded to Flickr. Each face’s emotion was described in 8 dimensions the location was also recorded. An emotion score index was defined based on the combination of all 8 dimensions of emotion calculated by principal component analysis. The correlation coefficients between the stock market values and emotion scores are significant (R>0.59 with p < 0.01). Using Granger Causality analysis for cause and effect detection, we found that users’ emotion is influenced by stock market value change. A multiple linear regression model was established (R-square=0.76) to explore the potential factors that influence the emotion score. Finally, a sensitivity map was created to show sensitive areas where human emotion is easily affected by the stock market changes. We concluded that in Manhattan region: (1) there is a statistically significant relationship between human emotion and stock market fluctuation; (2) emotion change follows the movements of the stock market; (3) the Times Square and Broadway Theatre are the most sensitive regions in terms of public emotional reaction to the economy represented by stock value.

(3) Global scale: global human emotions in different groups of people

In this research, we used a huge global scale image dataset: YFCC100, to extract emotions from photos and to describe the worldwide geographic patterns of human happiness. Two indices of Average Smiling Index (ASI) and Happiness Index (HI) are defined from different perspectives to describe the degree of human happiness in a specific region. We computed the spatio-temporal characteristics of facial expression-based happiness on a global scale and linked them to some demographic variables (ethnicity, gender, age, and nationality). After that, the robust analysis was made to ensure our results are reliable. Results are in accordance with some previous studies in Social Science. For example, White and Black are often better at expressing happiness than Asian, women are more expressive than men, and happiness expressed varies across space and time. Our research provides a novel methodology for emotion measurement and it could be utilized for assessing a region‘s emotion conditions based on geo-crowdsourcing data. Robust analysis results indicate that our approaches are reliable and could be implemented for other research projects on place-based human sentiment analysis.

For more information about this research, you can also visit: http://urbanplayground.cn/Emotion/

References:

Yuhao Kang, Qingyuan Jia, Song Gao, Xiaohuan Zeng, Yueyao Wang, Stephan Angsuesser , Yu Liu, Xinyue Ye, Teng Fei. (2019) Extracting Human Emotions at Different Places Based on Facial Expressions and Spatial Clustering Analysis. Transactions in GIS (in press)

Kang, Y., Wang, J., Wang, Y., Angsuesser, S. and Fei, T. (2017) Mapping the Sensitivity of the Public Emotion to the Movement of Stock Market Value: A Case Study of Manhattan. International Archives of the Photogrammetry, Remote Sensing & Spatial Information Sciences, 42.

Kang, Y., Zeng, X., Zhang, Z., Wang, Y. and Fei, T. (2018, March) Who are happier? Spatio-temporal Analysis of Worldwide Human Emotion Based on Geo-Crowdsourcing Faces. In 2018 Ubiquitous Positioning, Indoor Navigation and Location-Based Services (UPINLBS) (pp. 1-8). IEEE.

Research on Human Mobility and Urban Computing with Big Data

December 29, 2018sgao

In the Mobile Age, with the widespread use of location-awareness devices, it is possible to collect large-scale location-awareness datasets, such as mobile phone data, GPS-enabled taxi trajectories, and location-based social media data, to sense complex human movements and human-environment interactions in urban space. Here are some examples of research that we have been working on.

(1) Discovering Spatial Interaction Communities from Mobile Phone Data

This research attempts to explore and interpret patterns embedded in the network of phone‐call interaction and the network of phone‐users’ movements, by considering the geographical context of mobile phone cells. We adopt an agglomerative clustering algorithm based on a Newman‐Girvan modularity metric and propose an alternative modularity function incorporating a gravity model to discover the clustering structures of spatial‐interaction communities using a mobile phone dataset from one week in a city in China. The results verify the distance decay effect and spatial continuity that control the process of partitioning phone‐call interaction, which indicates that people tend to communicate within a spatial‐proximity community. Furthermore, we discover that a high correlation exists between phone‐users’ movements in physical space and phone‐call interaction in cyberspace. Our approach presents a combined qualitative‐quantitative framework to identify clusters and interaction patterns, and explains how geographical context influences communities of callers and receivers. The findings of this empirical study are valuable for urban structure studies as well as for the detection of communities in spatial networks.

(2) Spatio-Temporal Analytics for Exploring Human Mobility Patterns and Urban Dynamics

In this research, we present a spatio-temporal analytical framework including spatiotemporal visualization (STV), space-time kernel density estimation (STKDE), and spatio-temporal-autocorrelation-analysis (STAA), to explore human mobility patterns and intra-urban communication dynamics. Experiments were conducted using large-scale detailed records of mobile phone calls in a city. The space-time path, time series graphs, vertical Bezier curves, STKDE, STAA, and related techniques in 3D GIS as well as statistical tests have been suggested for different spatio-temporal analysis tasks. We also investigated several statistical measures that extend the classic spatial association indices for spatio-temporal autocorrelation analysis. The spatial order of weighted matrix was found to have more significant effects than the temporal neighbors on influencing the autocorrelation strength of hourly phone calls.

(3) Spatio-Temporal-Network Visualization for Exploring Human Movements and Interactions in Physical and Virtual Spaces

Spatiotemporal visualization techniques are effective in detecting human activity patterns over space and time, while social network analysis is good for exploring graph structures and interactions among individual nodes and groups. Integrating these two types of methods can facilitate the exploration of complex social networks in space and time, and can help discover hidden spatiotemporal and social connections. In this research, we propose a novel conceptual framework for spatiotemporal and social network visualization in a three-dimensional context. Based on this framework, new spatio-temporal-network (STN) quantitative metrics (including STN-impact-extent, STN-impact-center, STN-distance, STN-efficiency, and STN-centrality) are introduced to measure the underlying dynamic interactions among entities. The proposed framework aims to help better understand spatiotemporal patterns of human dynamics and social interactions over both physical and virtual spaces simultaneously, as well as explore how emerging events trigger spatial-temporal-social interactions and information diffusion from a process perspective. As a proof of concept, we demonstrate the proposed framework with a case study using geotagged tweets and associated visualization in the ArcScene software. We hope that this research can stimulate new insights on integrating multidisciplinary knowledge to explore human dynamics in a broader way.

(4) Uncovering the digital divide and the physical divide using mobile phone data

In this research, we first aim at developing data analytics that can derive insights about how people from different regions communicate and connect via mobile phone calls and physical movements. We uncover the digital divide (geographical segregation of phone communication patterns) and the physical divide (geographical limits of human mobility) in a developing country. The research also demonstrates that the chosen spatial unit and temporal resolution can affect the community detection results of spatial interaction graphs when analyzing human mobility patterns and exploring urban dynamics in the mobile age. We find that the daily detection has generated a more stable partition structure than an hourly one, while monthly changes also exist over time. The presented framework can help identify patterns of spatial interaction in both cyberspace and physical space with phone call detailed records in some regions where census data acquisition is difficult, especially in some developing countries.

(5) Identifying Local Spatiotemporal Autocorrelation Patterns of Large-scale Taxi Pick-ups and Drop-offs

Analyzing spatiotemporal autocorrelation would be helpful to understand the underlying dynamic patterns in space and time simultaneously. In this work, we aim to extend the conventional spatial autocorrelation statistics to a more general framework considering both spatial and temporal dimensions. Specifically, we focus on the spatiotemporal version of Getis-Ord’s G*. The new indicator STG* can quantify the local association of adjacent features in space and time. As a proof of concept, the proposed method is applied in a large-scale GPS-enabled taxi dataset to identify local spatiotemporal autocorrelation patterns of taxi pick-ups and drop-offs in New York City.

(6) Extracting urban functional regions from points of interest and human activities on location‐based social networks

Data about points of interest (POI) have been widely used in studying urban land use types and for sensing human behavior. However, it is difficult to quantify the correct mix or the spatial relations among different POI types indicative of specific urban functions. In this research, we develop a statistical framework to help discover semantically meaningful topics and functional regions based on the co‐occurrence patterns of POI types. The framework applies the latent Dirichlet allocation (LDA) topic modeling technique and incorporates user check‐in activities on location‐based social networks. Using a large corpus of about 100,000 Foursquare venues and user check‐in behavior in the 10 most populated urban areas of the US, we demonstrate the effectiveness of our proposed methodology by identifying distinctive types of latent topics and, further, by extracting urban functional regions using K‐means clustering and Delaunay triangulation spatial constraints clustering. We show that a region can support multiple functions but with different probabilities, while the same type of functional region can span multiple geographically non‐adjacent locations. Since each region can be modeled as a vector consisting of multinomial topic distributions, similar regions with regard to their thematic topic signatures can be identified. Compared with remote sensing images which mainly uncover the physical landscape of urban environments, our popularity‐based POI topic modeling approach can be seen as a complementary social sensing view on urban space based on human activities.

(7) Detecting Origin-Destination Mobility Flows From Geotagged Tweets

Human origin-to-destination (OD) trip information is of major importance in urban transportation modelling and infrastructure planning in order to optimize the use of street networks. The increasing use of social media like Twitter offers unprecedented opportunities to study individual activities and movements, to know where users are at which time, and what they are talking about. In this work we study the reliability of detecting regional OD trips from individual geotagged tweets in comparison with survey data in a quantitative manner, and explore the spatiotemporal flow patterns extracted from social media. We investigate the research question of whether OD trips mined from social media yield comparable results to expensive and labour intensive large-scale studies. To do so, we derive large-scale OD trips from geotagged tweets, aggregate them, and compare the results by correlating them to the American Community Survey data.

(8) Exploring the uncertainty of activity zone detection from Geotagged Tweets with multi-scaled DBSCAN (M-DBSCAN)

While exploring human mobility patterns based on digital footprints captured from social media, the density-based spatial clustering of applications with noise (DBSCAN) method is often used to identify activity zones which an individual regularly visits. However, DBSCAN is sensitive to the two parameters, including the search radius of a cluster (eps), and the minimum number of points (minpts). This research first discusses the uncertainty while detecting an individual’s activity zones through digital footprints. An improved density-based clustering algorithm for mobility analysis known as Multi-Scaled DBSCAN (M-DBSCAN), is then presented to mitigate the detection uncertainty of clusters produced by DBSCAN at different scales of density and cluster size. Next, we demonstrate that M-DBSCAN iteratively calibrates suitable local eps and minpts values instead of using one global parameter setting as DBSCAN for detecting clusters of varying densities, and proves to be very effective for detecting potential activity zones (clusters) with the historic geo-tagged tweets of selected users. Besides, M-DBSCAN can significantly reduce the noise ratio (the proportion of trajectory points not included in any cluster) by identifying all points capturing the activities performed in each zone. Using the historic geo-tagged tweets of a large number of users in Madison, Wisconsin and Washington, D.C., the results of M-DBSCAN and DBSCAN with a minpts value of 4 and varying eps values reveal that: 1) M-DBSCAN can capture dispersed clusters with low density of points, and therefore detecting more activity zones for each user and resulting in a lower noise ratio; 2) A value of 40m or higher should be used for eps in order to reduce the possibility of collapsing distinctive activity zones, and ensure a relatively low noise ratio during the clustering process; and 3) A value between 200m to 300m is recommended for eps while using DBSCAN for detecting activity zones from geotagged tweets.

(9) Reconstruction of human movement trajectories from large-scale low-frequency mobile phone data

Understanding human mobility is significant in many fields, such as urban planning, transportation, and sociology. Due to the wide spatiotemporal coverage and low operational cost, mobile phone data have been recognized as a major resource for human mobility research. However, due to conflicts between the data sparsity problem of mobile phone data and the requirement of fine-scale solutions, trajectory reconstruction is of considerable importance. Although there have been initial studies on this problem, existing methods rarely consider the effect of similarities among individuals and the temporal patterns of missing data. To address this issue, we propose a novel multi-criteria data partitioning trajectory reconstruction (MDP-TR) method for large-scale mobile phone data. In this research, a multi-criteria data partitioning (MDP) technique is used to measure the similarity among individuals in near real-time and investigate the spatiotemporal patterns of missing data. With this technique, the trajectory reconstruction from mobile phone data is then conducted with classic machine learning models. We verified the method using a real mobile phone dataset in a large city. Results indicate that the MDP-TR method outperforms competing methods in both accuracy and robustness. We argue that the MDP-TR method can be effectively utilized for grasping highly dynamic human movement status and improving the spatiotemporal resolution of human mobility research.

(10) Understanding neighborhood isolation through spatial interaction network analysis using location big data

Hidden biases of racial and socioeconomic preferences shape residential neighborhoods throughout the USA. Thereby, these preferences shape neighborhoods composed predominantly of a particular race or income class. However, the assessment of spatial extent and the degree of isolation outside the residential neighborhoods at large scale is challenging, which requires further investigation to understand and identify the magnitude and underlying geospatial processes. With the ubiquitous availability of location-based services, large-scale individual-level location data have been widely collected using numerous mobile phone applications and enable the study of neighborhood isolation at large scale. In this research, we analyze large-scale anonymized smartphone users’ mobility data in Milwaukee, Wisconsin, to understand neighborhood-to-neighborhood spatial interaction patterns of different racial classes. Several isolated neighborhoods are successfully identified through the mobility-based spatial interaction network analysis.

a dark background version of the neighborhood isolation map without a cartogram in Milwaukee, Wisconsin

References:

Gao, S., Liu, Y., Wang, Y., & Ma, X. (2013) Discovering Spatial Interaction Communities from Mobile Phone Data. Transactions in GIS. 17(3):463-481.

Gao, S. (2015). Spatio-temporal analytics for exploring human mobility patterns and urban dynamics in the mobile age. Spatial Cognition & Computation, 15(2), 86-114.

Gao, S., Janowicz, K., & Couclelis, H. (2017). Extracting urban functional regions from points of interest and human activities on location‐based social networks. Transactions in GIS, 21(3), 446-467.

Gao, S., Yan, B., Gong, L., Regalia, B., Ju, Y., & Hu, Y. (2017). Uncovering the digital divide and the physical divide in Senegal using mobile phone data. In Advances in Geocomputation (pp. 143-151). Springer, Cham.

Gao, S., Yang, J. A., Yan, B., Hu, Y., Janowicz, K., & McKenzie, G. (2014, September). Detecting origin-destination mobility flows from geotagged Tweets in greater Los Angeles area. In Eighth International Conference on Geographic Information Science (GIScience’14).

Gao, S., & Long, Y. (2015). Finding public transportation community structure based on large-scale smart card records in Beijing. In Long, Y., & Shen, Z. (Eds) Geospatial Analysis to Support Urban Planning in Beijing (pp. 155-167). Springer, Cham.

Gao, S., Zhu, R., & Mai, G. (2016, January). Identifying Local Spatiotemporal Autocorrelation Patterns of Taxi Pick-ups and Drop-offs. In International Conference on GIScience Short Paper Proceedings (Vol. 1, No. 1). (pp.109–113), Montreal, Canada, Sep.27-Sep.30, 2016. DOI:10.21433/B31104b2d8xp.

Gao, S., Chen, H., Luo, W., Hu, Y., & Ye, X. (2018) Spatio-Temporal-Network Visualization for Exploring Human Movements and Interactions in Physical and Virtual Spaces. In Shih-Lung Shaw and Daniel Sui (Eds): Human Dynamics Research in Smart and Connected Communities (Chapter 4). pp. 67-80, Springer.

Lee, J. H., Gao, S., & Goulias, K. G. (2015). Can Twitter data be used to validate travel demand models. In 14th International Conference on Travel Behaviour Research.

Liu, X.Y., Huang, Q.Y., & Gao, S. (2019). Exploring the uncertainty of activity zone detection using digital footprints with multi-scaled DBSCAN. International Journal of Geographical Information Science. DOI: 10.1080/13658816.2018.1563301

Li, M.X.,Gao, S., Lu, F., & Zhang, H.C. (2019). Reconstruction of human movement trajectories from large-scale low-frequency mobile phone data. Computers, Environment and Urban Systems. DOI: 10.1016/j.compenvurbsys.2019.101346

Prestby, T., App, J., Kang, Y., & Gao, S. (2019). Understanding Neighborhood Isolation through Spatial Interaction Network Analysis using Location Big Data. Environment and Planning A: Economy and Space. DOI: 10.1177/0308518X19891911

Street Networks and Transportation Analysis

December 29, 2018sgao

A street network is a system of interconnecting polylines and points (called edges and nodes in network science) that represent street segments/roads and intersections for a given area. It plays a very important role in transportation operation and management and many other fields. Below are some examples of research that we have been working on in this field.

(1) Street Centrality and Traffic Flow

The structural and morphological properties of a street network, represented in topological or geographic metric measurements, are considered to be the key factors that shape dynamic urban traffic flow. Urban traffic flow can be seen as individual trips aggregately distributed in street networks. Each trip is generated from an origin and destination (OD) pair or multiple destinations with a network path connecting them. Generally, urban planners and transportation engineers rely on household questionnaires or transportation surveys on job–housing places for traffic-demand forecasting. With the rapid development of information and communication technology (ICT), the availability of large amounts of GPS (Global Positioning System) data and mobile phone data over time and space has increased the capability for monitoring, visualizing, analyzing, and modeling urban dynamics. In this research we investigate the spatial distribution of urban traffic flow based on taxi trajectories in a city in China and compute the correlation between urban traffic fl ow and street betweenness centrality. The results confirm that purely betweenness centrality is not an ideal measure for predicting urban traffic flow. We analyze the characteristics of betweenness and point out the ‘gap’ between betweenness centrality and actual flow. The gap is filled by a framework taking into account the distance decay and the spatial heterogeneity of human activities.

(2) Ride-sharing and Street Networks

Given different types of constraints on human life, people must make decisions that satisfy social activity needs. Minimizing costs (i.e. distance, time, or money) associated with travel plays an important role in perceived and realized social quality of life. Identifying optimal interaction locations (e.g., for ride-sharing) along road networks when there are multiple moving objects (MMO) with space–time constraints remains a challenge. In this research, we formalize the problem of finding dynamic ideal interaction locations for MMO as a spatial optimization model and introduce a context-based geoprocessing heuristic framework to address this problem.

(3) Spatial Interactions along Street Networks

Researchers have proposed many methods to investigate the spatial interactions derived from human movements, such as the gravity model and the radiation model. However, most studies have mainly focused on the interactions among areal units at an aggregated level, neglecting that in most cases, human movements are carried by vehicles and constrained by the underlying road networks. To fill this gap, we propose a novel approach to identify spatial interaction patterns of vehicle movements along urban road networks.

Good characterization of traffic interactions among urban roads can facilitate traffic-related applications, such as traffic control and short-term forecasting. Most studies measure the traffic interaction between two roads by their topological distance or the correlation between their traffic variables. However, the distance-based methods neglect the spatial heterogeneity of roads’ traffic interactions, while the correlation-based methods cannot capture the non-linear dependency between two roads’ traffic variables. In this paper, we propose a novel approach called Road2Vec to quantify the implicit traffic interactions among roads based on large-scale taxi operating route data using a Word2Vec model from the natural language processing (NLP) domain. A case study on short-term traffic forecasting is conducted with artificial neural network (ANN) and support vector machine (SVM) algorithms to validate the advantages of the presented method. The results show that the forecasting achieves a higher accuracy with the support of the Road2Vec method than with the topological distance and traffic correlation based methods.

(4) Public Transportation Optimization

Public transportation systems, in particular, bus systems, play an essential role in the process of urbanization. Typically more bus stops enable more people to access the bus whereas lower the efficiency of bus system. We utilize spatial optimization techniques to address this issue.

In one of our studies, we use the Spatial Interaction Coverage (SIC) model to identify and remove redundant bus stops while maintain the overall success of the whole bus system. The SIC model aims to model the relationship between demand points and bus stops. It takes factors such as the distance and the attractiveness of each bus stop into consideration. By applying the SIC model to the iXpress 202 route in Kitchener-Waterloo region in Canada, we can effectively identify the number of stops to maintain and remove redundant stops. The bus operation efficiency can be increased by 7.28% after optimization. The relationships between bus ridership and the socioeconomic variables (population, income, and age) in the study area are also analyzed. We are expanding our research to multiple cities with the consideration of multi-route buses and multi-mode transportation.

(5) Regional Economy and Transportation Network Analytics

With the booming economy in China, many researches have pointed out that the improvement of regional transportation infrastructure among other factors had an important effect on economic growth. Utilizing a large-scale dataset which includes 3.5 billion entry and exit records of vehicles along highways generated from toll collection systems, we attempt to establish the relevance of mid-distance land transport patterns to regional economic status through transportation network analyses. We apply standard measurements of complex networks to analyze the highway transportation networks. A set of traffic flow features are computed and correlated to the regional economic development indicator. The multi-linear regression models explain about 89% to 96% of the variation of cities’ GDP across three provinces in China. We then fit gravity models using annual traffic volumes of cars, buses, and freight trucks between pairs of cities for each province separately as well as for the whole dataset. We find the temporal changes of distance-decay effects on spatial interactions between cities in transportation networks, which link to the economic development patterns of each province. We conclude that transportation big data reveal the status of regional economic development and contain valuable information of human mobility, production linkages, and logistics for regional management and planning. Our research offers insights into the investigation of regional economic development status using highway transportation big data.

References:

Gao, S., Wang, Y., Gao, Y., & Liu, Y. (2013). Understanding urban traffic-flow characteristics: a rethinking of betweenness centrality. Environment and Planning B: Planning and Design, 40(1), 135-153.

Lee, J. H., Gao, S., & Goulias, K. G. (2015). Can Twitter data be used to validate travel demand models. In 14th International Conference on Travel Behaviour Research.

Liu, K., Gao, S., Qiu, P., Liu, X., Yan, B., & Lu, F. (2017). Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Travel Routes. ISPRS International Journal of Geo-Information, 6(11), 321.

Wang, S., Gao, S., Feng, X., Murray, A. T., & Zeng, Y. (2018). A context-based geoprocessing framework for optimizing meetup location of multiple moving objects along road networks. International Journal of Geographical Information Science, 32(7), 1368-1390.

Liang, Y., Gao, S., Wu, T., Wang, S., & Wu, Y. (2018, November). Optimizing Bus Stop Spacing Using the Simulated Annealing Algorithm with Spatial Interaction Coverage Model. In Proceedings of the 11th ACM SIGSPATIAL International Workshop on Computational Transportation Science (pp. 53-59). ACM.

Liu, K., Gao, S., Lu, F. (2019) Identifying spatial interaction patterns of vehicle movements on urban road networks by topic modelling. Computers, Environment and Urban Systems, 2019, 74, 50-61.

Yunlei Liang, Song Gao, Mingxiao Li, Yuhao Kang, and Jinmeng Rao (2019) Analyzing the Gap Between Ride-hailing Location and Pick-up Location with Geographical Contexts (Best Poster Award). In Proceedings of 1st ACM SIGSPATIAL International Workshop on Ride-hailing Algorithms, Applications, and Systems (RAAS’19)

Bin Li, Song Gao, Yunlei Liang, Yuhao Kang, Timothy Prestby, Yuqi Gao, and Runmou Xiao. (2020) “Estimation of Regional Economic Development Indicator from Transportation Network Analytics.”Scientific Reports, 10(2647), 1-15. DOI: https://doi.org/10.1038/s41598-020-59505-2

Funded Project: A Data-Synthesis-Driven Approach to Place-Based Social Sensing

December 29, 2018sgao

PI: Prof. Dr. Song Gao

Funded by UW-Madison Office of the Vice Chancellor for Research and Graduate Education (VCRGE) and the Wisconsin Alumni Research Foundation (WARF).

Abstract:

Place names and the semantics of places described in natural languages rather than coordinates (i.e., longitude and latitude) are pervasive in human discourse, documents, and social media while location needs to be specified for mapping or interlinking other information. However, there is still a gap between the informal or vague cognition expressions of place (e.g., downtown, neighborhood, up north Wisconsin) and the formal computational representations of place in computerized information systems. Cognitive regions and places are notoriously difficult to represent in geographic information science and systems. They arise from the complex interactions of individuals, society, and the environment.

The emergence of big data brings new opportunities to better understand our geographic and socioeconomic environments. In this research, we will synthesize multi-source datasets from census, location-based social networks, news media, Wikipedia, travel blogs, and other open data websites, and utilizing the theory of place, natural language processing, machine learning, and visualization techniques for extracting and representing vague cognitive places. The research will also help better understand individuals’ observations, experiences, and exposures to different types of places and ambient social environments.

References:

Gao, S., Janowicz, K., Montello, D. R., Hu, Y., Yang, J. A., McKenzie, G., Ju, Y., Gong, L., Adams, B., & Yan, B. (2017). A data-synthesis-driven method for detecting and extracting vague cognitive regions. International Journal of Geographical Information Science, 31(6), 1245-1271.

Gao, S., Li, L., Li, W., Janowicz, K., & Zhang, Y. (2017). Constructing gazetteers from volunteered big geo-data based on Hadoop. Computers, Environment and Urban Systems, 61, 172-186.

Gao, S., Janowicz, K., McKenzie, G., & Li, L. (2013, November). Towards Platial Joins and Buffers in Place-Based GIS. In Proceedings of The First ACM SIGSPATIAL International Workshop on Computational Models of Place (pp. 42-49).

Papadakis, E., Gao, S., & Baryannis, G. (2019). Combining Design Patterns and Topic Modeling to Discover Regions Supporting Particular Functionality. ISPRS International Journal of Geo-Information. 8(9), 385.

Blaschke, T., Merschdorf, H., Cabrera-Barona, P., Gao, S., Papadakis, E., & Kovacs-Györi, A. (2018). Place versus Space: From Points, Lines and Polygons in GIS to Place-Based Representations Reflecting Language and Culture. ISPRS International Journal of Geo-Information, 7(11), 452.

Yan, B., Janowicz, K., Mai, G., & Gao, S. (2017, November). From ITDL to Place2Vec: Reasoning About Place Type Similarity and Relatedness by Learning Embeddings From Augmented Spatial Contexts. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (p. 35). ACM.

Hu, Y., Gao, S., Janowicz, K., Yu, B., Li, W., & Prasad, S. (2015). Extracting and understanding urban areas of interest using geotagged photos. Computers, Environment and Urban Systems, 54, 240-254.

Liu, Y., Liu, X., Gao, S., Gong, L., Kang, C., Zhi, Y., Chi, G., & Shi, L. (2015). Social sensing: A new approach to understanding our socioeconomic environments. Annals of the Association of American Geographers, 105(3), 512-530.

McKenzie, G., Janowicz, K., Gao, S., & Gong, L. (2015). How where is when? On the regional variability and resolution of geosocial temporal signatures for points of interest. Computers, Environment and Urban Systems, 54, 336-346.

McKenzie, G., Janowicz, K., Gao, S., Yang, J. A., & Hu, Y. (2015). POI pulse: A multi-granular, semantic signature–based information observatory for the interactive visualization of big geosocial data. Cartographica: The International Journal for Geographic Information and Geovisualization, 50(2), 71-85.

Special Issue on “Place-Based Research in GIScience and Geoinformatics”

December 3, 2018sgao

A special issue on “Place-Based Research in GIScience and Geoinformatics” recently published on the ISPRS International Journal of Geo-Information (ISSN 2220-9964). It consists of 12 peer-reviewed articles relevant to the topic, co-edited by Professor Thomas Blaschke and Professor Song Gao.

Place versus Space: From Points, Lines and Polygons in GIS to Place-Based Representations Reflecting Language and Culture

Thomas Blaschke, Helena Merschdorf, Pablo Cabrera-Barona, Song Gao, Emmanuel Papadakis and Anna Kovacs-Györi

ISPRS Int. J. Geo-Inf. 2018, 7(11), 452; doi:10.3390/ijgi7110452

Identifying Urban Neighborhood Names through User-Contributed Online Property Listings

by Grant McKenzie, Zheng Liu, Yingjie Hu, and Myeong Lee

ISPRS Int. J. Geo-Inf. 2018, 7(10), 388; https://doi.org/10.3390/ijgi7100388

Place and City: Toward Urban Intelligence

by Albert Acedo,Marco Painho, Sven Casteleyn, and Stéphane Roche

ISPRS Int. J. Geo-Inf. 2018, 7(9), 346; https://doi.org/10.3390/ijgi7090346

A Graph Database Model for Knowledge Extracted from Place Descriptions

by Hao Chen, Maria Vasardani, Stephan Winter, and Martin Tomko

ISPRS Int. J. Geo-Inf. 2018, 7(6), 221; https://doi.org/10.3390/ijgi7060221

Deep Belief Networks Based Toponym Recognition for Chinese Text

by Shu Wang, Xueying Zhang, Peng Ye, and Mi Du

ISPRS Int. J. Geo-Inf. 2018, 7(6), 217; https://doi.org/10.3390/ijgi7060217

Revisiting the Role of Place in Geographic Information Science

by Helena Merschdorf and Thomas Blaschke

ISPRS Int. J. Geo-Inf. 2018, 7(9), 364; https://doi.org/10.3390/ijgi7090364

More papers: https://www.mdpi.com/journal/ijgi/special_issues/place-based

Two Papers Accepted at ACM SIGSPATIAL 2018

October 5, 2018sgao

Congratulations to Yuhao Kang and Yunlei Liang for their paper acceptance! Check “Publication” for more details.

“Utilizing Reverse Viewshed Analysis in Image Geo-Localization” by Yuhao Kang, Song Gao, and Yunlei Liang, has been accepted at the 2nd ACM SIGSPATIAL International Workshop on Recommendations for Location-based Services and Social Networks (LocalRec 2018).

“Optimizing Bus Stop Spacing Using the Simulated Annealing Algorithm with Spatial Interaction Coverage Model“, by Yunlei Liang, Song Gao, Tianyu Wu, Sujing Wang and Yuhao Wu, has been accepted for presentation at the 11th ACM SIGSPATIAL International Workshop on Computational Transportation Science (IWCTS 2018).