[R] merging dataframes in a list
Ulrik Stervbo
ulrik.stervbo at gmail.com
Fri Jun 3 21:17:31 CEST 2016
You can use ldply in the plyr package to bind all the data.frames together
(a regular loop will also work). Afterwards you can summarise using ddply
Hope this helps
Ulrik
Ed Siefker <ebs15242 at gmail.com> schrieb am Fr., 3. Juni 2016 21:10:
> aggregate isn't really what I want. Maybe tapply? I still can't get
> it to work.
>
> > length(mylist)
> [1] 4
> > length(names)
> [1] 4
> > tapply(mylist, names, merge)
> Error in tapply(mylist, names, merge) : arguments must have same length
>
> I guess because a list isn't an atomic data type. What function will
> do the same on lists? lapply doesn't have a 'by' argument.
>
> On Fri, Jun 3, 2016 at 1:41 PM, Ed Siefker <ebs15242 at gmail.com> wrote:
> > I manually constructed the list of sample names and tried the
> > aggregate call I mentioned.
> > Merge works when called manually, but not when using aggregate.
> >
> >> mylist <- list(data.frame(name="sample1", red=20),
> data.frame(name="sample1", green=15), data.frame(name="sample2", red=10),
> data.frame(na me="sample2", green=30))
> >> names <- list("sample1", "sample1", "sample2", "sample2")
> >> merge(mylist[1], mylist[2])
> > name red green
> > 1 sample1 20 15
> >> merge(mylist[3], mylist[4])
> > name red green
> > 1 sample2 10 30
> >> aggregate(mylist, by=as.list(names), merge)
> > Error in as.data.frame(y) : argument "y" is missing, with no default
> >
> > What's the right way to do this?
> >
> > On Fri, Jun 3, 2016 at 1:20 PM, Ed Siefker <ebs15242 at gmail.com> wrote:
> >> I have a list of data as follows.
> >>
> >>> list(data.frame(name="sample1", red=20), data.frame(name="sample1",
> green=15), data.frame(name="sample2", red=10), data.frame(name="sample 2",
> green=30))
> >> [[1]]
> >> name red
> >> 1 sample1 20
> >>
> >> [[2]]
> >> name green
> >> 1 sample1 15
> >>
> >> [[3]]
> >> name red
> >> 1 sample2 10
> >>
> >> [[4]]
> >> name green
> >> 1 sample2 30
> >>
> >>
> >> I would like to massage this into a data frame like this:
> >>
> >> name red green
> >> 1 sample1 20 15
> >> 2 sample2 10 30
> >>
> >>
> >> I'm imagining I can use aggregate(mylist, by=samplenames, merge)
> >> right? But how do I get the list of samplenames? How do I subset
> >> each dataframe inside the list?
>
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
[[alternative HTML version deleted]]
More information about the R-help
mailing list