A Bayesian Approach for Spatial Clustering - IEEE CIS Webinar

728 views
Download
  • Share
#IEEE CIS Webinar 2016 #Bayesian #Spacial clustering

When performing analysis of spatial data, there is often the need to aggregate geographical areas into larger regions, a process called regionalization or spatially constrained clustering. These algorithms assume that the items to be clustered are non-stochastic, an assumption not held in many applications. In this webinar, we discuss these current approaches and also a new probabilistic regionalization algorithm that allows spatially varying random variables as features. Hence, an area highly different from its neighbors can still be considered a member of their cluster if it has a large variance.

Our proposal is based on a Bayesian generative spatial product partition model. We build an effective Markov Chain Monte Carlo algorithm to carry out a random walk on the space of all trees and their induced spatial partitions by edges’ deletion. We evaluate our algorithm using synthetic data and with one problem of municipalities regionalization based on cancer incidence rates. We are able to better accommodate the natural variation of the data and to diminish the effect of outliers, producing better results than state-of-art approaches.

Speaker bio: Renato Assunção received his Ph.D. in Statistics from University of Washington in 1994. He is professor in the Department of Computer Science at the Universidade Federal de Minas Gerais(UFMG), Brazil. He is the director of the Laboratory of Spatial Statistics (LESTE), a research unit affiliated to UFMG. The research of Professor Assuncao is focused on the development of algorithms and probabilistic methods for the statistical analysis of spatial data. He is mainly interested in areal data and point processes data and the main applied motivations come from crime mapping and public health problems.

When performing analysis of spatial data, there is often the need to aggregate geographical areas into larger regions, a process called regionalization or spatially constrained clustering. These algorithms assume that the items to be clustered are non-stochastic, an assumption not held in many applications. In this webinar, we discuss these current approaches and also a new probabilistic regionalization algorithm that allows spatially varying random variables as features. Hence, an area highly different from its neighbors can still be considered a member of their cluster if it has a large variance.

Advertisment

Advertisment