[R] Alternative to Scale Function?

Mark Difford mark_difford at yahoo.co.uk
Fri Sep 11 23:03:05 CEST 2009


>> The scale function will return the mean and sd of the data.

By default. Read ?scale: the center and scale arguments also accept numeric 
vectors, so you can supply your own values.
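
A minimal sketch of that idea, under stated assumptions: the data frames 
train and test and their columns x1 and x2 are made up for illustration and 
are not from the original question. It scales the training set, reads the 
centering and scaling values back from the attributes that scale() attaches 
to its result, and passes them to scale() again for the test set.

```r
# Hypothetical toy data; the names train, test, x1 and x2 are illustrative only.
set.seed(1)
train <- data.frame(x1 = rnorm(100, mean = 10, sd = 2),
                    x2 = rnorm(100, mean = -3, sd = 5))
test  <- data.frame(x1 = rnorm(20, mean = 10, sd = 2),
                    x2 = rnorm(20, mean = -3, sd = 5))

# Scale the training set. By default each column is centered on its mean
# and divided by its standard deviation.
train_scaled <- scale(train)

# scale() stores the values it used as attributes on the returned matrix.
train_center <- attr(train_scaled, "scaled:center")
train_scale  <- attr(train_scaled, "scaled:scale")

# Reuse the training parameters on the test set instead of the defaults.
test_scaled <- scale(test, center = train_center, scale = train_scale)

# The same transformation done by hand for one column, as a sanity check.
manual_x1 <- (test$x1 - mean(train$x1)) / sd(train$x1)
check_x1  <- all.equal(as.numeric(test_scaled[, "x1"]), manual_x1)
```

With this approach a given value maps to the same scaled result whether it 
sits in the training or the testing set, which is the behaviour asked for 
in the original question.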

Mark.


Noah Silverman-3 wrote:
> 
> I think I just answered my own question.
> 
> The scale function will return the mean and sd of the data.
> 
> So the process is fairly simple.
> scale training data variable
> note mean and sd from the scale
> then manually scale the test data using the mean and sd from the 
> training data.
> 
> That should make sure that a value is transformed the same regardless of 
> which data set it is in.
> 
> Do I have this correct, or can anybody contribute any more to the concept?
> 
> Thanks!
> 
> 
> --
> Noah
> 
> On 9/11/09 1:10 PM, Noah Silverman wrote:
>> Hi,
>>
>> Is there an alternative to the scale function where I can specify my 
>> own mean and standard deviation?
>>
>> I've come across an interesting issue where this would help.
>>
>> I'm training and testing on completely different sets of data.  The 
>> testing set is smaller than the training set.
>>
>> Using the standard scale function of R seems to introduce some error.  
>> Since it scales data WITHIN the set, it may scale the same number to a 
>> different value, since the range in the training and testing sets may be 
>> different.
>>
>> My thought was to scale the larger training set of data, then use the 
>> mean and SD of the training data to scale the testing data according 
>> to the same parameters.  That way a number will transform to the same 
>> result regardless of whether it is in the training or testing set.
>>
>> I can't be the first one to have looked at this.  Does anyone know of 
>> a function in R or if there is a scale alternative where I can control 
>> the parameters?
>>
>> Thanks!
>>
>> -- 
>> Noah
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide 
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
> 

-- 
View this message in context: http://www.nabble.com/Alternative-to-Scale-Function--tp25407625p25408289.html
Sent from the R help mailing list archive at Nabble.com.



