[R] Data import R: some explanatory variables not showing up correctly in summary

Charles C. Berry ccberry at ucsd.edu
Thu Jun 1 19:06:11 CEST 2017


On Thu, 1 Jun 2017, Rui Barradas wrote:

> Hello,
>
> In order for us to help we need to know how you've imported your data. What 
> was the file type? What instructions have you used to import it? Did you use 
> base R or a package?
> Give us a minimal but complete code example that can reproduce your 
> situation.
>
> Hope this helps,
>
> Rui Barradas

Absolutely.

It would also help to see what the unique values of each column 
*really* are. To that end run and report the results of this:

lapply(your.data.frame, function(x) unique(as.character(x)))

I'll bet you have both "combination" and "combination " as values or 
something similar where two different strings look to your eye to be the 
same when printed by summary().

HTH,

Chuck

>
> Em 01-06-2017 11:02, Tara Adcock escreveu:
>> Hi,
>> 
>> I have a question regarding data importing into R.
>> 
>> When I import my data into R and review the summary, some of my explanatory 
>> variables are being reported as if instead of being one variable, they are 
>> two with the same name. See below for an example;
>>
>>     Behav person         Behav dog               Position
>>    **combination  : 38   combination  :  4**     Bank    :372
>>    **combination  :  7   combination  :  4**   **Island  :119**
>>      fast         :123   fast         : 15     **Island  : 11**
>>      slow         :445   slow         : 95       Land    :  3
>>      stat         :111   stat         : 14       Water   :230
>> 
>> Also, all of the distances I have imported are showing up in the summary 
>> along with a line entitled "other". However, I haven't used any other 
>> distances?
>>
>>     Distance        Distance.dog
>>     2-10m  :184     <50m   : 35
>>     <50m   :156     2-10m  : 27
>>     10-20m :156     20-30m : 23
>>     20-30m : 91     30-40m : 16
>>     40-50m : 57     10-20m : 13
>>     **(Other): 82   (Other): 18**
>> 
>> I have checked my data sheet over and over again and I think standardised 
>> the data, but the issue keeps arising. I'm assuming I need to clean the 
>> data set but as a nearly complete novice in R I am not certain how to do 
>> this. Any help at all with this would be much appreciated. Thanks so much.
>> 
>> Kind Regards,
>> 
>> Tara Adcock.
>> 
>>
>> 	[[alternative HTML version deleted]]
>> 
>> ______________________________________________
>> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide 
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>> 
>

Charles C. Berry                 Dept of Family Medicine & Public Health
cberry at ucsd edu               UC San Diego / La Jolla, CA 92093-0901
http://biostat.ucsd.edu/ccberry.htm



More information about the R-help mailing list