[R] Quantile regression - violation of independence

Valeriano Parravicini valeriano.parravicini at unige.it
Mon May 17 19:58:51 CEST 2010


I am trying to perform  quantile regression (using quantreg package) and 
I am particularly interested to know whether the technique requires 
independence of observations.

I am an ecologist and, in particular, I collected data of abundance of a 
species in 15 location around an island. In each location abundance data 
of the species (response variable) have been collected in about 20 
replicate sampling units. Yet, I have just one measure of my explanatory 
variables (I have two explanatory variables) for each location. So, for 
each value of the explanatory variables I have 20 values of the response 
variable. Since my data have not normal distribution and do not show 
homogeneity of variance I would  use quantile regression. My main 
concern is about violation of independece because I can not consider my 
20 measure of abundance  as  independent since they have been collacted 
in the same location (i.e. they actually are replicates). However, I 
would not like to loose the information about the within-location 
variation by averaging my response variable at location level. I am not 
skilled in quantile regression; this is my first attempt to employ such 
a technique, but in a paper on econometrics I found that quantile 
regression do not require independence of observations. This is the only 
reference I found about that. Is it true? Can anyone suggests me any 
reference about that?

Moreover, I have two explanatory variables for each location and I am 
conducting quantile regression using backward selection. Particularly, I 
am performing the regression separately for a number of tau and each 
time I perform backward selection. What happens is that the "best model" 
changes accordingly with the tau setted. For instance, using tau = 0.5, 
backward selection indicate me that both explanatory variables are 
significant. Yet, setting tau = 0.75 just one of the two explanatory 
variables is found significant and backward selection drops the other 
one. Is  that a correct way to conduct quantile regression with more 
than one explanatory variable: i.e. performing  backward selection for 
each tau I am interested on ?

Thank you for your help

Valeriano



More information about the R-help mailing list