[R] graphs, need urgent help (deadline :( )
Rosa Oliveira
rosita21 at gmail.com
Wed Jun 10 23:18:49 CEST 2015
Dear Don, thank you very much.
I really wasn’t being able to figure the problem.
You were a big (huge) help.
Seeing the graphs, I think I’ll try to put the 3 settings (sample size) in different graphs.
I’ll try to use trellis graphs :) using sample size as the “factor”
Thank you very much ;)
Atenciosamente,
Rosa Oliveira
--
____________________________________________________________________________
Rosa Celeste dos Santos Oliveira,
E-mail: rosita21 at gmail.com
Tlm: +351 939355143
Linkedin: https://pt.linkedin.com/in/rosacsoliveira
____________________________________________________________________________
"Many admire, few know"
Hippocrates
> On 10 Jun 2015, at 20:07, Don McKenzie <dmck at u.washington.edu> wrote:
>
> Here is code that IS tested. I am sending Rosa the (ugly) output in a separate file. Crazy problems with argument order; I never figured out
> exactly what was wrong.
>
>
> # therapy plot
>
>
> plot(therapy.df$Region[therapy.df$sample==50],therapy.df$factor.a[therapy.df$sample==50],xlab="Region",ylab="factor",type="l",col=4,ylim=c(0,1.5))
> lines(therapy.df$Region[therapy.df$sample==50],therapy.df$factor.b[therapy.df$sample==50],col=2)
> lines(therapy.df$Region[therapy.df$sample==50],therapy.df$factor.c[therapy.df$sample==50],col=3)
>
> lines(therapy.df$Region[therapy.df$sample==250],therapy.df$factor.a[therapy.df$sample==250],col=4,lty=2)
> lines(therapy.df$Region[therapy.df$sample==250],therapy.df$factor.b[therapy.df$sample==250],col=2,lty=2)
> lines(therapy.df$Region[therapy.df$sample==250],therapy.df$factor.c[therapy.df$sample==250],col=3,lty=2)
>
> lines(therapy.df$Region[therapy.df$sample==1000],therapy.df$factor.a[therapy.df$sample==1000],col=4,lty=3)
> lines(therapy.df$Region[therapy.df$sample==1000],therapy.df$factor.b[therapy.df$sample==1000],col=2,lty=3)
> lines(therapy.df$Region[therapy.df$sample==1000],therapy.df$factor.c[therapy.df$sample==1000],col=3,lty=3)
>
> legend(7,1.4,c("factor.a","factor.b","factor.c"),col=c(4,2,3),lty=1)
>
>
>
>> On Jun 10, 2015, at 11:03 AM, Rosa Oliveira <rosita21 at gmail.com <mailto:rosita21 at gmail.com>> wrote:
>>
>> Sorry,
>>
>> I taught I attached the cvs file :)
>>
>> <therapy.csv>
>>
>>
>> Don,
>>
>> I tried, but I got an error:
>>
>> > my.data$Region
>> [1] 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10
>> > my.data$sample
>> [1] 50 50 50 50 50 50 50 50 50 50 250 250 250 250 250 250 250 250 250 250 1000 1000 1000 1000 1000 1000 1000 1000
>> [29] 1000 1000
>> > my.data$factor.a
>> [1] 0.895 0.811 0.685 0.777 0.600 0.466 0.446 0.392 0.256 0.198 0.136 0.121 0.875 0.777 0.685 0.626 0.550 0.466 0.384 0.330 0.060 0.138 0.065
>> [24] 0.034 0.931 0.124 0.060 0.028 0.017 0.014
>>
>>
>> > plot(my.data$Region[my.data$sample==50],my.data$factor.a[my.data$sample==50],col=4,type=“l”,xlab=“Region”,ylab=“factor")
>> Error: unexpected input in "plot(my.data$Region[my.data$sample==50],my.data$factor.a[my.data$sample==50],col=4,type=�”
>>
>>
>> I’m really naive, right?
>>
>>
>> Best,
>> RO
>>
>>
>> Atenciosamente,
>> Rosa Oliveira
>>
>> --
>> ____________________________________________________________________________
>>
>> <smile.jpg>
>>
>> Rosa Celeste dos Santos Oliveira,
>>
>> E-mail: rosita21 at gmail.com <mailto:rosita21 at gmail.com>
>> Tlm: +351 939355143
>> Linkedin: https://pt.linkedin.com/in/rosacsoliveira
>> ____________________________________________________________________________
>> "Many admire, few know"
>> Hippocrates
>>
>>> On 10 Jun 2015, at 18:10, Don McKenzie <dmck at u.washington.edu> wrote:
>>>
>>> For a legend, try (untested)
>>>
>>> legend(0.15,0.9,c("factora","factorb","factorc"),col=c(4,2,3),lty=1)
>>>
>>> If it overlaps data points move the first two arguments (0.15 and 0.9) around, or change the “ylim” argument in the plot() to ~1.2.
>>>
>>> to avoid clutter, put the line-types information in the figure caption (IMO)
>>>
>>>
>>>> On Jun 10, 2015, at 10:03 AM, Don McKenzie <dmck at u.washington.edu> wrote:
>>>>
>>>>
>>>>> On Jun 10, 2015, at 9:08 AM, Rosa Oliveira <rosita21 at gmail.com> wrote:
>>>>>
>>>>> Dear All,
>>>>>
>>>>>
>>>>> I attach my data.
>>>>>
>>>>> Dear Jim,
>>>>>
>>>>> when I run your code (even the one you send me, not in my data), I get:
>>>>>
>>>>> Don't know how to automatically pick scale for object of type function. Defaulting to continuous
>>>>> Error in data.frame(x = c(0.1, 0.2, 0.1, 0.2, 0.1, 0.2, 0.1, 0.2, 0.1, :
>>>>> arguments imply differing number of rows: 24, 0
>>>>>
>>>>>
>>>>>
>>>>> Dear Don,
>>>>>
>>>>> It’s meant that I will have 12 lines:
>>>>> 3 factors - lines colors
>>>>> with 3 different values of “sample” for each - line types
>>>>>
>>>>>
>>>>> [Three colors, one for each factor,
>>>>> and three line types (lty=1,2,3), one for eachvalue of “sample - preferable dash, thin and thick).
>>>>>
>>>>>
>>>>> in the X - I should have region (because I have 10 regions)
>>>>> for each region I have the outcome of 3 different treatments (factor)
>>>>> for each region and each treatment I have 3 different sample size.
>>>>
>>>> But in your original post you had 4 sample sizes: 10,20,30,40.
>>>>>
>>>>> I need to “see” the the influence of the region in the treatment outcome for each sample size.
>>>>>
>>>>> So, at the end I should have 9 lines
>>>>> 3 red (1 dash, 1 thin, 1 thick) - concerning factor a (dash for sample size 50, thin for sample size 250 and thick for sample size 1000)
>>>>> 3 blue (1 dash, 1 thin, 1 thick) - concerning factor b (dash for sample size 50, thin for sample size 250 and thick for sample size 1000)
>>>>> 3 green (1 dash, 1 thin, 1 thick) - concerning factor c (dash for sample size 50, thin for sample size 250 and thick for sample size 1000)
>>>>>
>>>>>
>>>>>
>>>>> Hope this time is clear.
>>>>>
>>>>>
>>>>> I also though about doing 3 different graphs, each one for 1 different sample size, and in that case I should have 3 graphs each one with 3 lines
>>>>> 1 red to factor a, 1 blue to factor b and 1 green to factor c.
>>>>>
>>>>> Do you all think is better?
>>>>
>>>> A matter of style perhaps but I would use dotplots because you have only two data points for each “line”. The lines will be misleading. You also could use
>>>> panel plots, but given your skill set (unless someone wants to spend a fair bit of time with you), it’s probably best to stay as simple as possible.
>>>>
>>>> But given your original post (cleaned up) # untested: apologies for any typos
>>>>
>>>>> region sample factora factorb factorc
>>>>> 0.1 10 0.895 0.903 0.378
>>>>> 0.2 10 0.811 0.865 0.688
>>>>> 0.1 20 0.735 0.966 0.611
>>>>> 0.2 20 0.777 0.732 0.653
>>>>> 0.1 30 0.600 0.778 0.694
>>>>> 0.2 30 0.466 174.592 0.461
>>>>> 0.1 40 0.446 0.432 0.693
>>>>> 0.2 40 0.392 0.294 0.686
>>>>
>>>> plot(my.data$region[my.data$sample==10],my.data$factora[my.data$sample==10],col=4,type=“l”,ylim=c(0,1),xlab=“region”,ylab=“factor")
>>>> lines(my.data$region[my.data$sample==10],my.data$factorb[my.data$sample==10],col=2)
>>>> lines(my.data$region[my.data$sample==10],my.data$factorc[my.data$sample==10],col=3)
>>>>
>>>> lines(my.data$region[my.data$sample==20],my.data$factora[my.data$sample==20],col=4,lty=2)
>>>> lines(my.data$region[my.data$sample==20],my.data$factorb[my.data$sample==20],col=2,lty=2)
>>>> lines(my.data$region[my.data$sample==20],my.data$factorc[my.data$sample==20],col=3,lty=2)
>>>>
>>>> # Now do two more groups of 3, changing the parameter “lty” to 3 and then 4
>>>>
>>>> # Look at the syntax and note what changes and what stays constant. Do you see how this works?
>>>> # there will be what looks like a vertical line where sample = 30 and factorb = 174.592. Do you see why?
>>>>
>>>> # then you will need a legend
>>>>
>>>>> Nonetheless I can’t do it :(
>>>>>
>>>>> best,
>>>>> RO
>>>>>
>>>>>
>>>>>
>>>>> Atenciosamente,
>>>>> Rosa Oliveira
>>>>>
>>>>> --
>>>>> ____________________________________________________________________________
>>>>>
>>>>> <smile.jpg>
>>>>> Rosa Celeste dos Santos Oliveira,
>>>>>
>>>>> E-mail: rosita21 at gmail.com
>>>>> Tlm: +351 939355143
>>>>> Linkedin: https://pt.linkedin.com/in/rosacsoliveira
>>>>> ____________________________________________________________________________
>>>>> "Many admire, few know"
>>>>> Hippocrates
>>>>>
>>>>>> On 10 Jun 2015, at 14:13, John Kane <jrkrideau at inbox.com> wrote:
>>>>>>
>>>>>> Hi Jim,
>>>>>>
>>>>>> I was looking at that last night and had the same problem of visualizing what Rosa needed.
>>>>>>
>>>>>> Hi Rosa
>>>>>> This is nothing like what you wanted and I really don't understand your data but would something like this work as a substitute or am I completely lost?
>>>>>>
>>>>>>
>>>>>> dat1 <- structure(list(region = c(0.1, 0.2, 0.1, 0.2, 0.1, 0.2, 0.1,
>>>>>> 0.2), sample = c(10L, 10L, 20L, 20L, 30L, 30L, 40L, 40L), factora = c(0.895,
>>>>>> 0.811, 0.735, 0.777, 0.6, 0.466, 0.446, 0.392), factorb = c(0.903,
>>>>>> 0.865, 0.966, 0.732, 0.778, 0.592, 0.432, 0.294), factorc = c(0.37,
>>>>>> 0.688, 0.611, 0.653, 0.694, 0.461, 0.693, 0.686)), .Names = c("region",
>>>>>> "sample", "factora", "factorb", "factorc"), class = "data.frame", row.names = c(NA,
>>>>>> -8L))
>>>>>>
>>>>>>
>>>>>> mdat1 <- melt(dat1, id.var = c("region", "sample"),
>>>>>> variable.name = "factor",
>>>>>> value.name = "value")
>>>>>> str(mdat1)
>>>>>>
>>>>>> ggplot(mdat1, aes(region, value, colour = factor)) +
>>>>>> geom_line() + facet_grid(sample ~ .)
>>>>>>
>>>>>> John Kane
>>>>>> Kingston ON Canada
>>>>>>
>>>>>>
>>>>>>> -----Original Message-----
>>>>>>> From: drjimlemon at gmail.com
>>>>>>> Sent: Wed, 10 Jun 2015 20:51:52 +1000
>>>>>>> To: rosita21 at gmail.com
>>>>>>> Subject: Re: [R] graphs, need urgent help (deadline :( )
>>>>>>>
>>>>>>> Hi Rosa,
>>>>>>> Like Don, I can't work out what you want and I don't even have the
>>>>>>> picture. For example, your specification of color and line type leaves
>>>>>>> only one point for each color and line type, and the line from one
>>>>>>> point to the same point is not going to show up. Here is a possibility
>>>>>>> that may lead (eventually) to a solution.
>>>>>>>
>>>>>>> library(plotrix)
>>>>>>> par(tcl=-0.1)
>>>>>>> gap.plot(x=rep(seq(10,45,by=5),3),
>>>>>>> y=unlist(my.data[,c("factora","factorb","factorc")]),
>>>>>>> main="A plot of factorial mystery",
>>>>>>> gap=c(1.1,174),ylim=c(0,175),ylab="factor score",xlab="Group",
>>>>>>> xticlab=c(" \n0.1\n10"," \n0.2\n10"," \n0.1\n20"," \n0.2\n20",
>>>>>>> " \n0.1\n30"," \n0.2\n30"," \n0.1\n40"," \n0.2\n40"),
>>>>>>> ytics=c(0,0.5,1,174.59),pch=rep(1:3,each=8),col=rep(c(4,2,3),each=8))
>>>>>>> mtext(c("Region","Sample"),side=1,at=6,line=c(0,1))
>>>>>>> lines(seq(10,45,by=5),my.data$factora,col=4)
>>>>>>> lines(seq(10,45,by=5),my.data$factorb[c(1:5,NA,7,8)],col=2)
>>>>>>> lines(seq(10,45,by=5),my.data$factorc,col=3)
>>>>>>>
>>>>>>> Jim
>>>>>>>
>>>>>>>
>>>>>>> On Wed, Jun 10, 2015 at 10:53 AM, Rosa Oliveira <rosita21 at gmail.com>
>>>>>>> wrote:
>>>>>>>> Dear Don and all,
>>>>>>>>
>>>>>>>> I’ve read the tutorial and tried several codes before posting :)
>>>>>>>> I’m really naive.
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> what I was trying to : is something like the graph in the picture I
>>>>>>>> drawee.
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> Is it more clear now?
>>>>>>>>
>>>>>>>> Atenciosamente,
>>>>>>>> Rosa Oliveira
>>>>>>>>
>>>>>>>> --
>>>>>>>> ____________________________________________________________________________
>>>>>>>>
>>>>>>>>
>>>>>>>> Rosa Celeste dos Santos Oliveira,
>>>>>>>>
>>>>>>>> E-mail: rosita21 at gmail.com <mailto:rosita21 at gmail.com>
>>>>>>>> Tlm: +351 939355143
>>>>>>>> Linkedin: https://pt.linkedin.com/in/rosacsoliveira
>>>>>>>> <https://pt.linkedin.com/in/rosacsoliveira>
>>>>>>>> ____________________________________________________________________________
>>>>>>>> "Many admire, few know"
>>>>>>>> Hippocrates
>>>>>>>>
>>>>>>>>> On 09 Jun 2015, at 19:23, Don McKenzie <dmck at u.washington.edu
>>>>>>>>> <mailto:dmck at u.washington.edu>> wrote:
>>>>>>>>>
>>>>>>>>> The answer lies in learning to use the help (and knowing where to
>>>>>>>>> start). Did you look at the tutorial that comes with the R
>>>>>>>>> installation?
>>>>>>>>>
>>>>>>>>> ?plot
>>>>>>>>> ?lines
>>>>>>>>>
>>>>>>>>> ?par
>>>>>>>>>
>>>>>>>>> In the last, look for the descriptions of “col” and “lty”.
>>>>>>>>>
>>>>>>>>> Using plot() and lines(), and subsetting the four unique values of
>>>>>>>>> “sample”, you can create your lines.
>>>>>>>>>
>>>>>>>>> Here is a crude start, assuming your columns are part of a data frame
>>>>>>>>> called “my.data”. Untested...
>>>>>>>>>
>>>>>> plot(my.data$region[my.data$sample==10],my.data$factora[my.data$sample==10],col=4)
>>>>>>>>> # blue line, not dashed
>>>>>>>>> .
>>>>>>>>> .
>>>>>>>>> .
>>>>>> lines(my.data$region[my.data$sample==20],my.data$factorb[my.data$sample==20],col=2,lty=2)
>>>>>>>>> # red dashed line
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>> On Jun 9, 2015, at 10:36 AM, Rosa Oliveira <rosita21 at gmail.com
>>>>>>>>>> <mailto:rosita21 at gmail.com>> wrote:
>>>>>>>>>>
>>>>>>>>>> Hi,
>>>>>>>>>>
>>>>>>>>>> another naive question (i’m pretty sure :( )
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> I’m trying to plot a multiple line graph:
>>>>>>>>>>
>>>>>>>>>> region sample factora factorb
>>>>>>>>>> factorc
>>>>>>>>>> 0.1 10 0.895 0.903 0.378
>>>>>>>>>> 0.2 10 0.811 0.865 0.688
>>>>>>>>>> 0.1 20 0.735 0.966 0.611
>>>>>>>>>> 0.2 20 0.777 0.732 0.653
>>>>>>>>>> 0.1 30 0.600 0.778 0.694
>>>>>>>>>> 0.2 30 0.466 174.592 0.461
>>>>>>>>>> 0.1 40 0.446 0.432 0.693
>>>>>>>>>> 0.2 40 0.392 0.294 0.686
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> The first column should be the independent variable, the second should
>>>>>>>>>> compute a bold line for sample(10) and dash line for sample 20.
>>>>>>>>>
>>>>>>>>> What about the other two values of “sample”?
>>>>>>>>>
>>>>>>>>>> The others variables are outcomes for each of the first scenarios, and
>>>>>>>>>> so it should: the 3rd, 4th and 5th columns should be blue, red and
>>>>>>>>>> green respectively.
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Resume :)
>>>>>>>>>>
>>>>>>>>>> I should have a graph, in the x-axe should have the region and in the
>>>>>>>>>> y axe, the factor.
>>>>>>>>>> Lines:
>>>>>>>>>> 1 - blue and bold for region 0.1, sample 10 and factor a
>>>>>>>>>> 2 - blue and dash for region 0.2, sample 10 and factor a
>>>>>>>>>> 3 - red and bold for region 0.1, sample 10 and factor b
>>>>>>>>>> 4 - red and dash for region 0.2, sample 10 and factor b
>>>>>>>>>> 5 - green and bold for region 0.1, sample 10 and factor c
>>>>>>>>>> 6 - green and dash for region 0.2, sample 10 and factor c
>>>>>>>>>
>>>>>>>>> Not consistent with what you said above. These are no longer lines, but
>>>>>>>>> points.
>>>>>>>>>>
>>>>>>>>>> nonetheless the independent variable is nominal, I should plot a line
>>>>>>>>>> graph.
>>>>>>>>>>
>>>>>>>>>> Can anyone help me please?
>>>>>>>>>> I have my file as a cvs file, so I first read that file (that I know
>>>>>>>>>> how to do :)).
>>>>>>>>>>
>>>>>>>>>> But I have it in that format.
>>>>>>>>>>
>>>>>>>>>> Best,
>>>>>>>>>> RO
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Atenciosamente,
>>>>>>>>>> Rosa Oliveira
>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>>> ____________________________________________________________________________
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Rosa Celeste dos Santos Oliveira,
>>>>>>>>>>
>>>>>>>>>> E-mail: rosita21 at gmail.com <mailto:rosita21 at gmail.com>
>>>>>>>>>> Tlm: +351 939355143
>>>>>>>>>> Linkedin: https://pt.linkedin.com/in/rosacsoliveira
>>>>>>>>>> <https://pt.linkedin.com/in/rosacsoliveira>
>>>>>>>>>> ____________________________________________________________________________
>>>>>>>>>> "Many admire, few know"
>>>>>>>>>> Hippocrates
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> [[alternative HTML version deleted]]
>>>>>>>>>>
>>>>>>>>>> ______________________________________________
>>>>>>>>>> R-help at r-project.org <mailto:R-help at r-project.org> mailing list -- To
>>>>>>>>>> UNSUBSCRIBE and more, see
>>>>>>>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>>>>>>>> <https://stat.ethz.ch/mailman/listinfo/r-help>
>>>>>>>>>> PLEASE do read the posting guide
>>>>>>>>>> http://www.R-project.org/posting-guide.html
>>>>>>>>>> <http://www.r-project.org/posting-guide.html>
>>>>>>>>>> and provide commented, minimal, self-contained, reproducible code.
>>>>>>>>>
>>>>>>>>> <PastedGraphic-1.tiff>
>>>>>>>>>
>>>>>>>>
>>>>>>>> ______________________________________________
>>>>>>>> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
>>>>>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>>>>>> PLEASE do read the posting guide
>>>>>>>> http://www.R-project.org/posting-guide.html
>>>>>>>> and provide commented, minimal, self-contained, reproducible code.
>>>>>>>
>>>>>>> ______________________________________________
>>>>>>> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
>>>>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>>>>> PLEASE do read the posting guide
>>>>>>> http://www.R-project.org/posting-guide.html
>>>>>>> and provide commented, minimal, self-contained, reproducible code.
>>>>>>
>>>>>> ____________________________________________________________
>>>>>> FREE 3D MARINE AQUARIUM SCREENSAVER - Watch dolphins, sharks & orcas on your desktop!
>>>>>> Check it out at http://www.inbox.com/marineaquarium
>>>>>>
>>>>>>
>>>>>
>>>>
>>>> <PastedGraphic-1.tiff>
>>>>
>>>
>>> <PastedGraphic-1.tiff>
>>>
>>
>
> <PastedGraphic-1.tiff>
>
More information about the R-help
mailing list