[R] create groups from data with duplicates, such that each group has a duplicate represented once
Kevin Wamae
KW@m@e @end|ng |rom kemr|-we||come@org
Thu Jan 17 01:29:18 CET 2019
Hi, I have a sequencing run with ~3000 samples (attached dataset). The samples were initially tagged and amplified by PCR in duplicate. The tags used range from MID01 to MID26.
MID01-MID13 were used for pair 1 while MID14-MID26 were used for pair 2. The tags are re-used to allow samples to be pooled.
Pooling works by filling groups in turn: samples carrying MID01-26 are mixed into the first group, the next set of samples carrying MID01-26 into the second group, and so on.
I'm hoping to get an R script that can create these groups such that, within each group, each tag appears only once. An example is shown below.
ID   TagA   TagB   group
180  MID03  MID10  group1
181  MID04  MID06  group1
182  MID05  MID07  group1
183  MID03  MID09  group2
184  MID04  MID10  group2
185  MID05  MID06  group2
186  MID01  MID06  group3
187  MID02  MID07  group3
188  MID03  MID08  group3