[R] Newbie wants to compare 2 huge RDSs row by row.
Marsh Hardy ARA/RISK
mhardy at ara.com
Sat Jan 27 22:18:14 CET 2018
Hi Guys, I apologize for my rank & utter newness at R.
I used summary() and found about 95 variables, both character and numeric, all with "Length:368842" I assume is the # of records.
I'd like to know the record number (row #?) of any record where the data doesn't match in the 2 files of what should be the same output.
Thanks in advance, M.
From: Ulrik Stervbo [ulrik.stervbo at gmail.com]
Sent: Saturday, January 27, 2018 10:00 AM
To: Eric Berger
Cc: Marsh Hardy ARA/RISK; r-help at r-project.org
Subject: Re: [R] Newbie wants to compare 2 huge RDSs row by row.
Also, it will be easier to provide helpful information if you'd describe what in your data you want to compare and what you hope to get out of the comparison.
Eric Berger <ericjberger at gmail.com<mailto:ericjberger at gmail.com>> schrieb am Sa., 27. Jan. 2018, 08:18:
An RDS is not a data structure such as a data.frame. It can be anything.
For example if I want to save my objects a, b, c I could do:
> saveRDS( list(a,b,c,), file="tmp.RDS")
Then read them back later with
> myList <- readRDS( "tmp.RDS" )
Do you have additional information about your "RDSs" ?
On Sat, Jan 27, 2018 at 6:54 AM, Marsh Hardy ARA/RISK <mhardy at ara.com<mailto:mhardy at ara.com>>
> Each RDS is 40 MBs. What's a slick code to compare them row by row, IDing
> row numbers with mismatches?
> Thanks in advance.
> R-help at r-project.org<mailto:R-help at r-project.org> mailing list -- To UNSUBSCRIBE and more, see
> PLEASE do read the posting guide http://www.R-project.org/
> and provide commented, minimal, self-contained, reproducible code.
[[alternative HTML version deleted]]
R-help at r-project.org<mailto:R-help at r-project.org> mailing list -- To UNSUBSCRIBE and more, see
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
More information about the R-help