[R-sig-Geo] complete linkage Agglomerative hierarchical clustering, nnclust, spatclus or something else?

Hans Ekbrand hans at sociologi.cjb.net
Tue Apr 20 17:16:57 CEST 2010


I have just read about clustering on wikipedia, and learnt that what I
want is:

Agglomerative hierarchical clustering, with complete linkage

I searched for suitable r-packages for this and found nnclust, and
spatclus. Are those the packages that you could recommend for
clustering events data (the events here is urban fires, created by
arsonists)? Or do you want to suggest other packages?

In this first analysis I want to do the clustering should only by
location, and ignore the point in time.

Later on I will also include time data, so if the clustering package
could handle time too, that would be great, but that is not a
requirement at this time.

The aim at this stage is only to group events with the same, or almost
the same, location. In my data-set the coordiante-data is a bit too
precise in some cases.

I guess a crude way of clustering would be to round the
coordinate-data to a lesser number of significant digits, however, a
sound clustering algorithm would be better :-)

I also need to grasp the scale here:

With this proj4string, and datapoints as below:

  ..@ proj4string:Formal class 'CRS' [package "sp"] with 1 slots
  .. .. ..@ projargs: chr " +proj=utm +zone=33 +ellps=GRS80 +units=m +no_defs"

head(fires at coords, 2)
         East   North
[1,] 315359.9 6393110
[2,] 325862.4 6405239

dist(head(fires at coords, 2))
         1
2 16044.10

Is this in meters?

TIA

-- 
Hans Ekbrand (http://sociologi.cjb.net) <hans at sociologi.cjb.net>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: Digital signature
URL: <https://stat.ethz.ch/pipermail/r-sig-geo/attachments/20100420/edb90403/attachment.bin>


More information about the R-sig-Geo mailing list