[R-sig-Geo] merging tables by columns AND row names (coordinates)
Mikkel Grum
mi2kelgrum at yahoo.com
Fri Sep 8 21:13:32 CEST 2006
merge(Table1, Table2,
by = intersect(c("XCOORD", "YCOORD"),
c("XCOORD", "YCOORD")), all = TRUE)
It might not handle the amount of data you have, but,
if your tables are normal dataframes, it would do the
job with a smaller dataset. It doesn't work with
Spatial*DataFrames (yet?).
Mikkel
--- Roger Bivand <Roger.Bivand at nhh.no> wrote:
> On Fri, 8 Sep 2006, Michael Sumner wrote:
>
> > Hello, I can think of a couple of simple-minded
> approaches that would
> > take some time - either relying on direct
> string-matching for the unique
> > coordinates, or by some contrived overlay.
> >
> > However, there's probably far better approaches -
> a couple of questions:
> >
> > Can you predefine the set of all unique
> coordinates without reading all
> > the tables from file?
> > - if so you might simplify the identification of
> each individual
> > coordinate, for matching the records
> >
> > Are the coordinates (intended to be) on a regular
> grid? (This seems
> > unlikely, although it is nearly true given your X
> coordinates).
>
> The key question is what the data are. To me they
> look like a global
> regular grid with some slippage in the print() - the
> underlying diff() of
> the unique x's and y's is almost certainly regular.
> I'm not sure why they
> are in text files either (model output?). But some
> bits of the grid may be
> missing, the question being whether this is regular.
> If as an earlier
> response indicated different data sets have
> different grid cells
> missing, then we need the overall grid to start
> with, then grab the row
> and column indices (and/or grid index), and attach
> these to the data rows.
>
> If the solution needs to be robust, and have a
> longer term utility, I
> would go for using MySQL, Terralib, and aRT. The
> data representation is
> that of the Terralib Cell object, so the question
> would be how to upload
> to the database from the text files.
>
> aRT is at:
>
> http://www.est.ufpr.br/aRT/
>
> By the way, 1M by 100 by 8 bytes is pushing 32-bit R
> - but handing off a
> lot of the data storage to a database relieves this
> greatly.
>
> Roger
>
> >
> > Cheers, Mike.
> >
> >
> > isidora k wrote:
> > > Hi everyone!
> > > I have 100 tables of the form:
> > > XCOORD,YCOORD,OBSERVATION
> > > 27.47500,42.52641,177
> > > 27.48788,42.52641,177
> > > 27.50075,42.52641,179
> > > 27.51362,42.52641,178
> > > 27.52650,42.52641,180
> > > 27.53937,42.52641,178
> > > 27.55225,42.52641,181
> > > 27.56512,42.52641,177
> > > 27.57800,42.52641,181
> > > 27.59087,42.52641,181
> > > 27.60375,42.52641,180
> > > 27.61662,42.52641,181
> > > ..., ..., ...
> > > with approximately 1000000 observations for
> each. All
> > > these tables have the same xcoord and ycoord and
> I
> > > would like to get a table of the form
> > > XCOORD,YCOORD,OBSERVATION1,OBSERVATION2,...
> > > 27.47500,42.52641,177,233,...
> > > 27.48788,42.52641,177,345,...
> > > 27.50075,42.52641,179,233,...
> > > 27.51362,42.52641,178,123,...
> > > 27.52650,42.52641,180,178,...
> > > 27.53937,42.52641,178,...,...
> > > 27.55225,42.52641,181,...
> > > 27.56512,42.52641,177,...
> > > 27.57800,42.52641,181,...
> > > 27.59087,42.52641,181,...
> > > 27.60375,42.52641,180,...
> > > 27.61662,42.52641,181,...
> > > In other words I would like to merge all the
> tables
> > > taking into account the common row names of
> their
> > > xcoords AND ycoords.
> > > Not all tables have the same number of
> observations
> > > which means that not all pairs of x and y coords
> > > match.
> > > Is there a way to do this in R?
> > > I would be grateful for any advice.
> > > Many thanks
> > > Isidora
> > >
> > > _______________________________________________
> > > R-sig-Geo mailing list
> > > R-sig-Geo at stat.math.ethz.ch
> > > https://stat.ethz.ch/mailman/listinfo/r-sig-geo
> > >
> > >
> > >
> >
> > _______________________________________________
> > R-sig-Geo mailing list
> > R-sig-Geo at stat.math.ethz.ch
> > https://stat.ethz.ch/mailman/listinfo/r-sig-geo
> >
>
> --
> Roger Bivand
> Economic Geography Section, Department of Economics,
> Norwegian School of
> Economics and Business Administration, Helleveien
> 30, N-5045 Bergen,
> Norway. voice: +47 55 95 93 55; fax +47 55 95 95 43
> e-mail: Roger.Bivand at nhh.no
>
> _______________________________________________
> R-sig-Geo mailing list
> R-sig-Geo at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/r-sig-geo
>
More information about the R-sig-Geo
mailing list