[R] my R code worked well when running the first 1000 lines of R code

Rui Barradas ru|pb@rr@d@@ @end|ng |rom @@po@pt
Wed Jun 12 20:29:19 CEST 2024


Hello,

Inline.

At 19:03 on 12/06/2024, Yuan Chun Ding via R-help wrote:
> I am sorry; I know I should provide a dataset that allows others to replicate my problem.
> 
> It is a research dataset and quite large, so I cannot share it.
> 
> Both Bert and Tim guessed my problem correctly.  I also thought about conflicts between different packages and function masking.
> I just hope that someone has had a similar experience and can offer a suggestion.
> 
> For the conflict issue,
> 
> what I tried was to add dplyr::pivot_longer or tidyr::pivot_longer,


Do that for all functions coming from contributed packages. At least for
those.


summary_anno1148ft <- anno1148ft %>%
   tidyr::pivot_longer(c(t_depth, t_alt_count, t_alt_ratio),
                       names_to = "measure") %>%
   dplyr::group_by(dat, measure) %>%
   dplyr::summarize(minimum = min(value, na.rm = TRUE),
                    q25 = quantile(value, probs = 0.25, na.rm = TRUE),
                    med = median(value, na.rm = TRUE),
                    q75 = quantile(value, probs = 0.75, na.rm = TRUE),
                    maximum = max(value, na.rm = TRUE),
                    average = mean(value, na.rm = TRUE),
                    #standard_deviation = sd(value),
                    .groups = "drop"
   )


Or, simpler, there is no need for group_by anymore (dplyr 1.1.0 or later).
The grouping can be done in summarise with .by, which always returns an
ungrouped result, so .groups is dropped.


summary_anno1148ft <- anno1148ft %>%
   tidyr::pivot_longer(c(t_depth, t_alt_count, t_alt_ratio),
                       names_to = "measure") %>%
   dplyr::summarize(minimum = min(value, na.rm = TRUE),
                    q25 = quantile(value, probs = 0.25, na.rm = TRUE),
                    med = median(value, na.rm = TRUE),
                    q75 = quantile(value, probs = 0.75, na.rm = TRUE),
                    maximum = max(value, na.rm = TRUE),
                    average = mean(value, na.rm = TRUE),
                    #standard_deviation = sd(value),
                    .by = c(dat, measure)
   )
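
For anyone who wants to try the pipeline without the original data, here
is a minimal sketch on a made-up data frame. The name toy and all of its
values are invented for illustration only; the column names are taken from
the code above, and .by assumes dplyr 1.1.0 or later.

```r
library(dplyr)
library(tidyr)

# Made-up stand-in for anno1148ft: four groups in `dat`, three numeric measures.
set.seed(1)
toy <- data.frame(
  dat         = rep(c("g1", "g2", "g3", "g4"), each = 5),
  t_depth     = rpois(20, 100),
  t_alt_count = rpois(20, 10),
  t_alt_ratio = runif(20)
)

# Same shape of pipeline as above, with every contributed function namespaced.
toy_summary <- toy %>%
  tidyr::pivot_longer(c(t_depth, t_alt_count, t_alt_ratio),
                      names_to = "measure") %>%
  dplyr::summarize(minimum = min(value, na.rm = TRUE),
                   med     = median(value, na.rm = TRUE),
                   maximum = max(value, na.rm = TRUE),
                   .by = c(dat, measure))

nrow(toy_summary)  # one row per (dat, measure) pair
```

With four groups and three measures the result should have 12 rows, which
matches the 12 summaries reported in the original post.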



This is only a guess, the question cannot really be answered.
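
If it helps, one way to test the guess is to ask R which attached package
a bare function name resolves to. Below is a minimal sketch that uses only
base R and utils; the helper name who_provides is made up for this example.
One commonly seen cause, which is only an assumption here since the list of
loaded packages was not posted, is plyr attached after dplyr: plyr::summarize
does not know about dplyr groups and returns a single row.

```r
# Hypothetical helper (not part of any package): list every attached
# environment that defines a function called `fn_name`, in search-path order.
# A bare call uses the first entry.
who_provides <- function(fn_name) {
  find(fn_name, mode = "function")
}

who_provides("summarize")     # more than one entry means the name is masked
who_provides("pivot_longer")

# All names that are defined in more than one place on the search path
conflicts(detail = TRUE)
```

Running this right before the failing statement, in the session where the
2000 lines have already been executed, shows the state that matters.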


Hope this helps,

Rui Barradas

> but that still did not resolve the problem.
> 
> 
> 
> I will restart my code from the first line; it should work again, and then I will track the problem down.
> 
> 
> 
> Thank you,
> 
> Ding
> 
> 
> From: CALUM POLWART <polc1410 using gmail.com>
> Sent: Wednesday, June 12, 2024 10:52 AM
> To: Yuan Chun Ding <ycding using coh.org>
> Cc: r-help using r-project.org
> Subject: Re: [R] my R code worked well when running the first 1000 lines of R code
> 
> I sometimes think people on this list are quite rude to posters.
> 
> I'm afraid I'm likely to join in with some rudeness?
> 
> 1. "Here is some code that works but also doesn't" is probably not going to get you an answer
> 2. I provide no information about the data it works on or doesn't
> 3. I tell you I'm using a load of dependencies, but don't tell you what
> 4. I refer to 2000 lines of code but probably mean 2000 lines of data?
> 
> So. Please post a question someone can actually answer.
> 
> If the question is "why might code fail on a 2000 line dataset when it works on 1000 line dataset" then here are some thoughts:
> 
> * Is the 1000 lines being run as dataset[1:1000,] or is it dataset1 and dataset2 ?
> * Is there a structural difference in the datasets - i.e. numbers, characters or factors as columns. Often import functions guess a column type by reading the first 500/1000 lines. If the data has numbers in column 1 for 1-1000 but on line 1999 has a letter... The data type may vary.
> 
> On Wed, 12 Jun 2024, 17:28 Yuan Chun Ding via R-help, <r-help using r-project.org> wrote:
> Hi R users,
> 
> The following code worked well to summarize four data groups in a dataframe for three variables (t_depth, t_alt_count, t_alt_ratio), 12 columns of summary, see attached.
> However, after running another 2000 lines of R code using functions from more than 10 other R libraries, it only generated one column of summary.
> Do you know why?
> 
> Thank you,
> 
> Yuan Chun Ding
> 
> summary_anno1148ft <- anno1148ft %>%
>    pivot_longer(c(t_depth, t_alt_count, t_alt_ratio), names_to = "measure") %>%
>    group_by(dat, measure) %>%
>    summarize(minimum = min(value,na.rm=T),
>              q25 = quantile(value, probs = 0.25,na.rm=T),
>              med = median(value,na.rm=T),
>              q75 = quantile(value, probs = 0.75,na.rm=T),
>              maximum = max(value,na.rm=T),
>              average = mean(value,na.rm=T),
>              #standard_deviation = sd(value),
>              .groups = "drop"
>    )
> summary_anno1148ft <-t(summary_anno1148ft)
> 
> 
> 
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.


