[R] Unique in discerning missing values NA

Pancho Mulongeni p.mulongeni at namibia.pharmaccess.org
Fri Jul 5 11:28:44 CEST 2013


Hi,
I am trying to remove duplicate Patient numbers in a clinical record, I used unique
menPatients[1:40,1]
 [1] abr1160(C)/001 ABR1363(A)/001 ABR1363(A)/001 ABR1363(A)/001 abr1772(B)/001
 [6] AFR0003/001    AFR0003/001    afr0290(C)/001 afr1861(B)/001 Aga0007/001   
[11] AGA1548(A)/001 AGA1548(A)/001 AGA1548(A)/001 AGU1680(A)/001 AGU1680(A)/001
[16] AIS0492/001    AIS0492/001    AKO4268(C)/001 AKO4268(C)/001 AKT0042(B)/001
[21] AKT0042(B)/001 AKT0042(B)/001 AKT0042(B)/001 AKT0042(B)/001 AKT0042(B)/001
[26] AKT0042(B)/001 alb4423(C)/001 ALF1651(A)/001 alf1722(B)/001 ALF1735(A)/001
[31] ALF1735(A)/001 ALP4321(C)/001 <NA>           <NA>           ALU4262(B)/001
[36] ALV4286(C)/001 ALW2579(C)/001 <NA>           ALW4330(B)/001 AMA0011/001   
3886 Levels: 0750/002 0751/001 0984/002 ABE2560(C)/001 ... zul1737(B)/001

testData<-menPatients[1:40,1]

I then used unique, please note the NA at position 32 in testData
testUnique<-unique(testData)
testUnique
 [1] abr1160(C)/001 ABR1363(A)/001 abr1772(B)/001 AFR0003/001    afr0290(C)/001
 [6] afr1861(B)/001 Aga0007/001    AGA1548(A)/001 AGU1680(A)/001 AIS0492/001   
[11] AKO4268(C)/001 AKT0042(B)/001 alb4423(C)/001 ALF1651(A)/001 alf1722(B)/001
[16] ALF1735(A)/001 ALP4321(C)/001 <NA>           ALU4262(B)/001 ALV4286(C)/001
[21] ALW2579(C)/001 ALW4330(B)/001 AMA0011/001   

The missing value NA originally at position 32 in testdata is still there, it is in position 18. Why is this? How can I prevent this?
I tried using incomprables=c(NA), but this did not work.

Thanks


Pancho Mulongeni
Research Assistant
PharmAccess Foundation
1 Fouché Street
Windhoek West
Windhoek
Namibia
 
Tel:   +264 61 419 000
Fax:  +264 61 419 001/2
Mob: +264 81 4456 286



More information about the R-help mailing list