[R] re move row if the column "date_abandoned" has a date in it
frenchcr
frenchcr at btinternet.com
Sun Nov 15 19:56:38 CET 2009
this works perfectly...
new_data5 <- new_data4[nchar(new_data4$date_abandoned) != 8, ]
...and i can now think of a few different ways to manipulate my data with
what ive learned from these tricks, thanks alot David!
David Winsemius wrote:
>
>
> On Nov 15, 2009, at 11:00 AM, frenchcr wrote:
>
>>
>>
>> Yes they are not in date format, theyre just characters.
>>
>> the earliest date is 16010000 i originally had one of 0101 00 00
>> (101 years
>> BC)...this was a software problem.
>>
>>> table(nchar(new_data4$date_abandoned))
>>
>> 2 8
>> 315732 263
>>
>> The 315732 are empty fields i thought.
>
> They are actually 2 characters wide.
>
>> The 263 are dates, i want to remove their rows.
>
> If you want to "remove" the ones that are _not_ 8 characters long, then:
>
> new_data5 <- new_data4[nchar(new_data4$date_abandoned) != 8, ]
>
> or:
>
> new_data5 <- subset(new_data4, date_abandoned != 8)
>
> --
> David.
>
>>
>>
>>
>> David Winsemius wrote:
>>>
>>>
>>> On Nov 14, 2009, at 8:43 PM, frenchcr wrote:
>>>
>>>>
>>>> sorry David,
>>>>
>>>> im really new to R (my first week) and appreciate your help. Also I
>>>> dont
>>>> always know what info to give people on the forum (although im
>>>> starting to
>>>> catch the drift).
>>>>
>>>> heres what i get...
>>>>
>>>> summary(new_data4$date_abandoned)
>>>> Min. 1st Qu. Median Mean 3rd Qu.
>>>> Max. NA's
>>>> 16010000 19980000 20010000 19930000 20040000 20090000 315732
>>>
>>> So new_data4$data_abandoned is not of type "Date" and is instead a
>>> character vector.
>>>
>>> If you are resisting turning it into a date and want to work with
>>> characters, you can, you just need to deal somehow with the items
>>> that
>>> are not 8 characters wide. What does 315732 represent? How were we
>>> supposed to interpret the starting "date" you gave of "01010000"?
>>>
>>>> nchar("1010000")
>>> [1] 7
>>>
>>> What does table(nchar(new_data4$date_abandoned)) give you?
>>>>
>>>>> ls()
>>>> [1] "data" "new_data" "new_data2" "new_data3" "new_data4"
>>>>> small <- head(new_data4, 20)
>>>>> dump("small", 20)
>>>> Error in dump("small", 20) : cannot write to this connection
>>>
>>>
>>> Well, sorry, I meant to type dump("small", stdout()) ... As per the
>>> Posting Guide.
>>>
>>> --
>>> David.
>>>>
>>>> David Winsemius wrote:
>>>>>
>>>>>
>>>>> On Nov 14, 2009, at 5:24 PM, frenchcr wrote:
>>>>>
>>>>>>
>>>>>>
>>>>>> I tried the following but it does the opposite of what i want:
>>>>>>
>>>>>> new_data5 <- subset(new_data4, date_abandoned > "01010000")
>>>>>>
>>>>>> I want to remove the rows with dates and leave just the rows
>>>>>> without
>>>>>> a date.
>>>>>>
>>>>>> This removes all the rows that dont have a date in the
>>>>>> date_abandoned column
>>>>>>
>>>>>> ...on a positive note, as i did this next...
>>>>>>
>>>>>> dim(new_data5)
>>>>>> [1] 263 80
>>>>>>
>>>>>> ....i now know that i have 263 dates in that column :)
>>>>>>
>>>>>> I want to remove the 263 rows with dates and leave just the rows
>>>>>> without a
>>>>>> date.
>>>>>
>>>>> Con=me on frenchcr. Stop making us guess. Give us enough
>>>>> information
>>>>> to work with. You asked for something which I construed as saying
>>>>> you
>>>>> wanted dates greater than the the first day of the year 101. You
>>>>> did
>>>>> not address this question.
>>>>>
>>>>> What do you get with str(new_data4) and
>>>>> summary(new_data4$date_abandoned) ? In order to know what sort of
>>>>> comparison to use we need to know what the data looks like.
>>>>>
>>>>> Even better if you offered the output from:
>>>>>
>>>>> small <- head(new_data4, 20)
>>>>> dump("small", 20),
>>>>>
>>>>> --
>>>>> David
>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> David Winsemius wrote:
>>>>>>>
>>>>>>>
>>>>>>> On Nov 14, 2009, at 1:21 PM, frenchcr wrote:
>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> I want to go through a column in data called
>>>>>>>
>>>>>>> Bad name for a data.frame. Fortunes, "dog" and all that.
>>>>>>>
>>>>>>>> date_abandoned....data["date_abandoned"]....and remove all the
>>>>>>>> rows
>>>>>>>> that
>>>>>>>> have numbers greater than 1,010,000.
>>>>>>>
>>>>>>> Are you doing archeology? Given what you say next I wondered what
>>>>>>> range you were really asking for.
>>>>>>>
>>>>>>>>
>>>>>>>> The dates are in the format 20091114 so i'm just going to treat
>>>>>>>> them
>>>>>>>> as
>>>>>>>> numbers for clean up purposes.
>>>>>>>>
>>>>>>>>
>>>>>>>> I know that i use subset but not sure how to proceed from there.
>>>>>>>
>>>>>>> subdata <- subset(data, date_abandoned > "01010000"()
>>>>>>>
>>>>>>>
>>>>>>> The problem with > "1010000" is that your specified minimum point
>>>>>>> had
>>>>>>> an insufficient number of "places" to be in YYYYMMDD format.
>>>>>>>
>>>>>>> --
>>>>>>>
>>>>>>> David Winsemius, MD
>>>>>>> Heritage Laboratories
>>>>>>> West Hartford, CT
>>>>>>>
>>>>>>> ______________________________________________
>>>>>>> R-help at r-project.org mailing list
>>>>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>>>>> PLEASE do read the posting guide
>>>>>>> http://www.R-project.org/posting-guide.html
>>>>>>> and provide commented, minimal, self-contained, reproducible
>>>>>>> code.
>>>>>>>
>>>>>>>
>>>>>>
>>>>>> --
>>>>>> View this message in context:
>>>>>> http://old.nabble.com/remove-row-if-the-column-%22date_abandoned%22-has-a-date-in-it-tp26352457p26354446.html
>>>>>> Sent from the R help mailing list archive at Nabble.com.
>>>>>>
>>>>>> ______________________________________________
>>>>>> R-help at r-project.org mailing list
>>>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>>>> PLEASE do read the posting guide
>>>>>> http://www.R-project.org/posting-guide.html
>>>>>> and provide commented, minimal, self-contained, reproducible code.
>>>>>
>>>>> David Winsemius, MD
>>>>> Heritage Laboratories
>>>>> West Hartford, CT
>>>>>
>>>>> ______________________________________________
>>>>> R-help at r-project.org mailing list
>>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>>> PLEASE do read the posting guide
>>>>> http://www.R-project.org/posting-guide.html
>>>>> and provide commented, minimal, self-contained, reproducible code.
>>>>>
>>>>>
>>>>
>>>> --
>>>> View this message in context:
>>>> http://old.nabble.com/remove-row-if-the-column-%22date_abandoned%22-has-a-date-in-it-tp26352457p26355689.html
>>>> Sent from the R help mailing list archive at Nabble.com.
>>>>
>>>> ______________________________________________
>>>> R-help at r-project.org mailing list
>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>> PLEASE do read the posting guide
>>>> http://www.R-project.org/posting-guide.html
>>>> and provide commented, minimal, self-contained, reproducible code.
>>>
>>> David Winsemius, MD
>>> Heritage Laboratories
>>> West Hartford, CT
>>>
>>> ______________________________________________
>>> R-help at r-project.org mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide
>>> http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>>>
>>>
>>
>> --
>> View this message in context:
>> http://old.nabble.com/remove-row-if-the-column-%22date_abandoned%22-has-a-date-in-it-tp26352457p26360384.html
>> Sent from the R help mailing list archive at Nabble.com.
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>
> David Winsemius, MD
> Heritage Laboratories
> West Hartford, CT
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
>
--
View this message in context: http://old.nabble.com/remove-row-if-the-column-%22date_abandoned%22-has-a-date-in-it-tp26352457p26362058.html
Sent from the R help mailing list archive at Nabble.com.
More information about the R-help
mailing list