[R] Help finding the mode and maximum for a specified 'window' of time series data
jim holtman
jholtman at gmail.com
Sun Jun 15 20:05:12 CEST 2008
This will give you an idea of how you might want to approach the problem:
> sample <-cbind((1:20)*.25,c(1,1,1,2,2,3,3,3,4,5,6,4,4,3,3,3,2,1,1,1))
> sample
[,1] [,2]
[1,] 0.25 1
[2,] 0.50 1
[3,] 0.75 1
[4,] 1.00 2
[5,] 1.25 2
[6,] 1.50 3
[7,] 1.75 3
[8,] 2.00 3
[9,] 2.25 4
[10,] 2.50 5
[11,] 2.75 6
[12,] 3.00 4
[13,] 3.25 4
[14,] 3.50 3
[15,] 3.75 3
[16,] 4.00 3
[17,] 4.25 2
[18,] 4.50 1
[19,] 4.75 1
[20,] 5.00 1
> # range of timing
> s.r <- range(sample[,1])
> # create groupings for 1 minute intervals
> s.cut <- seq(from=floor(s.r[1]), to=ceiling(s.r[2]), by=1)
> s.cut
[1] 0 1 2 3 4 5
> # split the data
> s.split <- split(sample[,2], cut(sample[,1], s.cut))
> s.split
$`(0,1]`
[1] 1 1 1 2
$`(1,2]`
[1] 2 3 3 3
$`(2,3]`
[1] 4 5 6 4
$`(3,4]`
[1] 4 3 3 3
$`(4,5]`
[1] 2 1 1 1
> # determine maximum in interval
> sapply(s.split, max, na.rm=TRUE)
(0,1] (1,2] (2,3] (3,4] (4,5]
2 3 6 4 2
> # mode (maximum # of occurances)
> sapply(s.split, function(x) {
+ .tab <- table(x)
+ as.numeric(names(.tab)[which.max(.tab)])
+ })
(0,1] (1,2] (2,3] (3,4] (4,5]
1 3 4 3 1
>
>
On Sun, Jun 15, 2008 at 12:10 PM, Staples, Angela Dawn
<adstaple at indiana.edu> wrote:
> I am relatively new to R, so apologize up front for my long question,
> particularly if there is too much or too little information.
>
> I have a large time series data set where each subject's behavior was
> originally coded on .25s intervals for 3min task. I am trying to determine
> if the findings are different depending on the coding interval (i.e. Compare
> .25s to 1s to 5s to 10s).
>
> I also need to see if it makes a difference if I use the maximal value
> within the interval or the modal value within the interval. In other words,
> I do not need the average value.
>
> Here a sample of my data with time as the first column:
>
> sample <-cbind((1:20)*.25,c(1,1,1,2,2,3,3,3,4,5,6,4,4,3,3,3,2,1,1,1))
>
> [,1] [,2]
> [1,] 0.25 1
> [2,] 0.50 1
> [3,] 0.75 1
> [4,] 1.00 2
> [5,] 1.25 2
> [6,] 1.50 3
> [7,] 1.75 3
> [8,] 2.00 3
> [9,] 2.25 4
> [10,] 2.50 5
> [11,] 2.75 6
> [12,] 3.00 4
> [13,] 3.25 4
> [14,] 3.50 3
> [15,] 3.75 3
> [16,] 4.00 3
> [17,] 4.25 2
> [18,] 4.50 1
> [19,] 4.75 1
> [20,] 5.00 1
>
> I need help returning the maximum and minimum of a specified "window" such
> that the function/loop/etc would return:
>
> maximum (window=4, or 1s) would be (2, 3, 6, 4, 2)
> mode (window=4, or 1s) would be (1, 3, 4, 3, 2)
>
> Given the coding conventions in this research area, I need the values to be
> from adjacent, as opposed to overlapping, windows. There are likely to be
> situations where there is no clear mode (1,1,2,2). In those cases it would
> be fine to have 1.5 or 2 returned, but not NA. The data file is in the long
> format with each subject having 720 rows of data. I've tried playing with
> the row indices, but I cannot figure out how to 'move' the window.
>
> I would appreciate any help/suggestions. Since I'm new to this, the
> suggestion doesn't have to be pretty, I just need it to work.
>
> Sincerely,
>
> Angela
>
> ~~~~~~~~~~~~~~~~~~~~~~
> Angela Staples
> Doctoral Candidate
> Psychological and Brain Sciences
> Indiana University
> 1101 E. 10th St.
> Bloomington, IN 47405
> http://www.indiana.edu/~batessdl/
>
> The plural of anecdote is not data.
> ~ Roger Brinner
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
--
Jim Holtman
Cincinnati, OH
+1 513 646 9390
What is the problem you are trying to solve?
More information about the R-help
mailing list