[R] simple subset question

William Dunlap wdunlap at tibco.com
Sun Dec 2 20:00:33 CET 2012


> I am
> still getting an error message
> >with :
> >  x <- subset(fish,Year==2012 & Total==max(Total));x
> >I get:
> >[1] IDWeek Total  Fry    Smolt  FryEq  Year
> ><0 rows> (or 0-length row.names)

The above is not an error message.  It says that there
are no rows satisfying your criteria.  Note that Total==max(Total)
returns a TRUE for each row in which the Total value
equals the maximum Total value over all the years in
the data.  Are you looking for the maximum value of Total
in each year?

> tmp <- transform(fish, YearlyMaxTotal = ave(Total, Year, FUN=max))
> subset(tmp, Total==YearlyMaxTotal)
   IDWeek  Total    Fry  Smolt  FryEq Year YearlyMaxTotal
21     47 303259  34008 269248 491733 2012         303259
39     39 157260 156909    351 157506 2011         157260
> subset(tmp, Total==YearlyMaxTotal & Year==2012)
   IDWeek  Total   Fry  Smolt  FryEq Year YearlyMaxTotal
21     47 303259 34008 269248 491733 2012         303259

Bill Dunlap
Spotfire, TIBCO Software
wdunlap tibco.com


> -----Original Message-----
> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf
> Of Felipe Carrillo
> Sent: Sunday, December 02, 2012 10:47 AM
> To: arun
> Cc: R help
> Subject: Re: [R] simple subset question
> 
> Works with the small dataset (2 years) but I get the error message with the whole
> dataset (12 years of data). I am going to have
> to check what's wrong with it...Thanks
> 
> Felipe D. Carrillo
> Supervisory Fishery Biologist
> Department of the Interior
> US Fish & Wildlife Service
> California, USA
> http://www.fws.gov/redbluff/rbdd_jsmp.aspx
> 
> 
> From: arun <smartpink111 at yahoo.com>
> >To: Felipe Carrillo <mazatlanmexico at yahoo.com>
> >Cc: R help <r-help at r-project.org>; R. Michael Weylandt
> <michael.weylandt at gmail.com>
> >Sent: Sunday, December 2, 2012 10:29 AM
> >Subject: Re: [R] simple subset question
> >
> >Hi,
> >I am getting this:
> >x<-subset(fish,Year==2012 & Total==max(Total))
> > x
> >#   IDWeek  Total   Fry  Smolt  FryEq Year
> >#21     47 303259 34008 269248 491733 2012
> >A.K.
> >
> >
> >
> >
> >----- Original Message -----
> >From: Felipe Carrillo <mazatlanmexico at yahoo.com>
> >To: R. Michael Weylandt <michael.weylandt at gmail.com>
> >Cc: "r-help at r-project.org" <r-help at r-project.org>
> >Sent: Sunday, December 2, 2012 1:25 PM
> >Subject: Re: [R] simple subset question
> >
> >Sorry, I was trying it to subset from a bigger dataset called 'winter' and forgot to
> change the variable names
> >when I asked the question. David W suggestion works but the strange part is that I am
> still getting an error message
> >with :
> >  x <- subset(fish,Year==2012 & Total==max(Total));x
> >I get:
> >[1] IDWeek Total  Fry    Smolt  FryEq  Year
> ><0 rows> (or 0-length row.names)
> >
> >I will start a fresh session to see if that helps...Thank you all
> >
> >Felipe D. Carrillo
> >Supervisory Fishery Biologist
> >Department of the Interior
> >US Fish & Wildlife Service
> >California, USA
> >http://www.fws.gov/redbluff/rbdd_jsmp.aspx
> >
> >
> >From: R. Michael Weylandt <michael.weylandt at gmail.com>
> >>To: Felipe Carrillo <mazatlanmexico at yahoo.com>
> >>Cc: "r-help at r-project.org" <r-help at r-project.org>
> >>Sent: Sunday, December 2, 2012 9:42 AM
> >>Subject: Re: [R] simple subset question
> >>
> >>On Sun, Dec 2, 2012 at 5:21 PM, Felipe Carrillo
> >><mazatlanmexico at yahoo.com> wrote:
> >>>  Hi,
> >>> Consider the small dataset below, I want to subset by two variables in
> >>> one line but it wont work...it works though if I subset separately. I have
> >>> to be missing something obvious that I did not realize before while using subset..
> >>>
> >>> fish <- structure(list(IDWeek = c(27L, 28L, 29L, 30L, 31L, 32L, 33L,
> >>> 34L, 35L, 36L, 37L, 38L, 39L, 40L, 41L, 42L, 43L, 44L, 45L, 46L,
> >>> 47L, 48L, 49L, 50L, 51L, 52L, 27L, 28L, 29L, 30L, 31L, 32L, 33L,
> >>> 34L, 35L, 36L, 37L, 38L, 39L, 40L, 41L, 42L, 43L, 44L, 45L, 46L,
> >>> 47L, 48L, 49L, 50L, 51L, 52L), Total = c(0L, 0L, 326L, 1735L,
> >>> 1807L, 2208L, 3883L, 8820L, 6060L, 19326L, 63158L, 100718L, 53015L,
> >>> 91689L, 152629L, 122708L, 61293L, 15574L, 86538L, 75365L, 303259L,
> >>> 19691L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 161L, 321L, 1000L, 4425L,
> >>> 13202L, 19726L, 30518L, 84949L, 157260L, 145691L, 85801L, 62044L,
> >>> 44439L, 23272L, 22391L, 20159L, 14854L, 35379L, 31142L, 7736L,
> >>> 13221L, 4894L), Fry = c(0L, 0L, 326L, 1735L, 1807L, 2208L, 3883L,
> >>> 8759L, 6060L, 19326L, 63119L, 100524L, 52582L, 88170L, 145564L,
> >>> 111416L, 38233L, 5248L, 17826L, 11038L, 34008L, 215L, 0L, 0L,
> >>> 0L, 0L, 0L, 0L, 0L, 0L, 161L, 321L, 1000L, 4425L, 13055L, 19488L,
> >>> 30518L, 84818L, 156909L, 144786L, 84207L, 57720L, 31049L, 6858L,
> >>> 1616L, 719L, 364L, 49L, 0L, 0L, 0L, 0L), Smolt = c(0L, 0L, 0L,
> >>> 0L, 0L, 0L, 0L, 62L, 0L, 0L, 38L, 195L, 433L, 3518L, 7067L, 11290L,
> >>> 23058L, 10327L, 68712L, 64328L, 269248L, 19479L, 0L, 0L, 0L,
> >>> 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 147L, 238L, 0L, 131L, 351L,
> >>> 905L, 1592L, 4324L, 13391L, 16414L, 20774L, 19444L, 14491L, 35330L,
> >>> 31142L, 7736L, 13221L, 4894L), FryEq = c(0L, 0L, 326L, 1735L,
> >>> 1807L, 2208L, 3883L, 8864L, 6060L, 19326L, 63185L, 100854L, 53318L,
> >>> 94151L, 157576L, 130610L, 77432L, 22805L, 134639L, 120393L, 491733L,
> >>> 33327L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 161L, 321L, 1000L, 4425L,
> >>> 13306L, 19894L, 30518L, 85042L, 157506L, 146328L, 86914L, 65073L,
> >>> 53812L, 34763L, 36931L, 33769L, 24998L, 60110L, 52938L, 13149L,
> >>> 22476L, 8319L), Year = c(2012L, 2012L, 2012L, 2012L, 2012L, 2012L,
> >>> 2012L, 2012L, 2012L, 2012L, 2012L, 2012L, 2012L, 2012L, 2012L,
> >>> 2012L, 2012L, 2012L, 2012L, 2012L, 2012L, 2012L, 2012L, 2012L,
> >>> 2012L, 2012L, 2011L, 2011L, 2011L, 2011L, 2011L, 2011L, 2011L,
> >>> 2011L, 2011L, 2011L, 2011L, 2011L, 2011L, 2011L, 2011L, 2011L,
> >>> 2011L, 2011L, 2011L, 2011L, 2011L, 2011L, 2011L, 2011L, 2011L,
> >>> 2011L)), .Names = c("IDWeek", "Total", "Fry", "Smolt", "FryEq",
> >>> "Year"), row.names = c(NA, 52L), class = "data.frame")
> >>> fish
> >>> #  Subset to get the max Total for 2012
> >>>  x <- subset(winter,Year==2012 & Total==max(Total));b  # How come one line doesn't
> work?
> >>
> >>Works fine for me if I change "winter" to fish here.
> >>
> >>subset(fish,Year==2012 & Total==max(Total))
> >>  IDWeek  Total  Fry  Smolt  FryEq Year
> >>21    47 303259 34008 269248 491733 2012
> >>
> >>>
> >>>  # It works if I subset the year first and then get the Total max from it
> >>>  xx <- subset(winter,Year==2012)
> >>> xxx <- subset(xx,Total==max(Total));xxx
> >>> xxx
> >>>
> >>> Felipe D. Carrillo
> >>> Supervisory Fishery Biologist
> >>> Department of the Interior
> >>> US Fish & Wildlife Service
> >>> California, USA
> >>> http://www.fws.gov/redbluff/rbdd_jsmp.aspx
> >>>
> >>>        [[alternative HTML version deleted]]
> >>>
> >>>
> >>> ______________________________________________
> >>> R-help at r-project.org mailing list
> >>> https://stat.ethz.ch/mailman/listinfo/r-help
> >>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> >>> and provide commented, minimal, self-contained, reproducible code.
> >>>
> >>
> >>
> >>
> >    [[alternative HTML version deleted]]
> >
> >
> >______________________________________________
> >R-help at r-project.org mailing list
> >https://stat.ethz.ch/mailman/listinfo/r-help
> >PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> >and provide commented, minimal, self-contained, reproducible code.
> >
> >
> >
> >
> 	[[alternative HTML version deleted]]




More information about the R-help mailing list