[BioC] silly looping question
Benilton Carvalho
beniltoncarvalho at gmail.com
Mon Oct 3 14:06:07 CEST 2011
this isn't a BioC question and you'd be better off with the r-help mailing list.
what you want is:
ids = c( "1_880238", "1_883625", "1_1222695", "1_2255524" ,"1_887560")
newdata = example[match(ids, example$V1), ]
another approach is
newdata = subset(example, V1 %in% ids)
best
2011/10/3 Nathalie Conte <nac at sanger.ac.uk>:
> dear List,
> Sorry for the silly question, but I am a R beginner and try to sort
> something out.
> I have a vector named match containing N positions
> match=c( "1_880238", "1_883625", "1_1222695", "1_2255524" ,"1_887560")
> I have a big data frame containing a lot of infos (among those positions in
> the first column V1 in the same format as in the match vector, example.txt
> attached )
> I want to create an index to pull out only informations in the data frame
> whose position are the the vector.
> I have tried this simple loop
>
> for (i in match)
> index=example$V1==i
>
> new=example[index,]
>
> the new dataframe contain only the last true element...is there a way to get
> all TRUE ones?
> thanks
> Nathalie
>
>
>
>
> --
> The Wellcome Trust Sanger Institute is operated by Genome Research Limited,
> a charity registered in England with number 1021457 and a company registered
> in England with number 2742969, whose registered office is 215 Euston Road,
> London, NW1 2BE.
> "V1" "V2" "V3" "V4" "V5" "V6" "V7" "V8" "V9"
> "V10" "V11"
> "1" "1_753405" "1" 753405 "rs61770173" "C" "A" 99
> 0
> "AC=2;DB=3;DP=78;NC=-0.76;CQ=WITHIN_NON_CODING_GENE;AN=2;MQ=29;MZ=6;GN=FAM87B;AD=75;PS=1^1:0.88&2^1:0.30&3^1:0.00;PA=1"
> "GT:GQ:DP:FLT" "1/1:99:78:0"
> "2" "1_877831" "1" 877831 "rs6672356" "T" "C" 39
> 0
> "AC=2;DB=1;DP=4;NC=0.40;CQ=NON_SYNONYMOUS_CODING;AN=2;MQ=60;MZ=0;GN=SAMD11;AD=4;PS=1^1:1.00&2^1:1.00&3^1:1.00;PA=1"
> "GT:GQ:DP:FLT" "1/1:39:4:0"
> "3" "1_880238" "1" 880238 "rs3748592" "A" "G" 42
> 0
> "AC=2;DB=1;DP=5;NC=-3.73;CQ=INTRONIC;AN=2;MQ=55;PA=1^1:0.93&2^1:0.86&3^1:0.95;MZ=0;PS=1;GN=NOC2L;AD=5"
> "GT:GQ:DP:FLT" "1/1:42:5:0"
> "4" "1_881627" "1" 881627 "rs2272757" "G" "A" 99
> 0
> "AC=2;DB=1;DP=30;NC=2.43;CQ=SYNONYMOUS_CODING;AN=2;MQ=55;PA=1^1:0.72&2^1:0.00&3^1:0.64;MZ=0;PS=1;GN=NOC2L;AD=30"
> "GT:GQ:DP:FLT" "1/1:99:30:0"
> "5" "1_883625" "1" 883625 "rs4970378" "A" "G" 48
> 0
> "AC=2;DB=1;DP=7;NC=-4.40;CQ=INTRONIC;AN=2;MQ=57;PA=1^1:1.00&2^1:1.00&3^1:1.00;MZ=0;PS=1;GN=NOC2L;AD=7"
> "GT:GQ:DP:FLT" "1/1:48:7:0"
> "6" "1_887560" "1" 887560 "rs3748595" "A" "C" 99
> 0
> "AC=2;DB=1;DP=67;NC=-2.53;CQ=INTRONIC;AN=2;MQ=58;PA=1^1:0.93&2^1:0.75&3^1:0.95;MZ=0;PS=1;GN=NOC2L;AD=66"
> "GT:GQ:DP:FLT" "1/1:99:67:0"
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor
>
--
Successful people ask better questions, and as a result, they get
better answers. (Tony Robbins)
More information about the Bioconductor
mailing list