[R-sig-teaching] R Equivalent of SPSS "Split File" and "AutoRecode"

Yihui Xie xieyihui at gmail.com
Fri Nov 14 05:29:32 CET 2008


The code is so long... Why not store the labels in another file and
read them into a vector, then assign labels to columns in a loop?

To split file means you have to do the same procedure on each subset
of the data - this is easy in R: what you need are the splitting
variable and corresponding conditions, then apply your functions to
the subset of data in a loop (or use sapply(), ...)

For autorecode, it is nothing but factor() in R!

Regards,
Yihui
--
Yihui Xie <xieyihui at gmail.com>
Phone: +86-(0)10-82509086 Fax: +86-(0)10-82509086
Mobile: +86-15810805877
Homepage: http://www.yihui.name
School of Statistics, Room 1037, Mingde Main Building,
Renmin University of China, Beijing, 100872, China



On Fri, Nov 14, 2008 at 7:31 AM, Albyn Jones <jones at reed.edu> wrote:
>  I have no clue what "autorecode" means.  Selecting a subset of a file is
> easy.  Suppose your data frame is named X, assuming REGION is numeric:
>
>    attach(X)
>    X17 = X[REGION == 17,]
>
> albyn
>
> On Thu, Nov 13, 2008 at 05:18:24PM -0600, Richardson, Paul wrote:
>>
>> Greetings Fellow Teachers of R,
>>
>>
>>  *
>> I am working on a tutorial for a class to be taught at my college.
>>  *
>> First, I am going to provide a PowerPoint seminar for the faculty, and then I will create a series of Camtasia screen recordings with narration.
>>  *
>> The course is aimed at teaching those familiar with introductory level stats how to perform similar actions in R, that would normally be performed in SPSS.
>>  *
>> Is there an autorecode capability in R which is analogous to autorecode in SPSS?  I'm having trouble with item with item f below (second half) "autorecode".
>>  *
>> I am also having trouble with item "i" below, because I don't know how to split a data file, like you normally do in SPSS.
>>  *
>> Is there some equivalent way to do this in R?
>>
>> thanks much,
>> Paul Richardson
>> cell 806-543-0087
>>
>>
>> a.       Go to AEIS data site:  http://www.tea.state.tx.us/perfreport/aeis/2007/DownloadData.html
>> b.      Select Campus, Reference File, and Layout, and download; then do the same for the Reference data file.  Download Campus Financial Statistics, both layout and download.  Download Student Statistics, both layout and download.
>> c.       Import the data files into R.
>> d.      Using the layout file, create variable labels; create value labels (if possible).  Assign 0 as a missing value for variable BPETALLC.
>> e.       Merge the Reference, Financial, and Student files
>> f.       Recode variable C_RATING into a new, numeric variable named campus.rating in R.  Recode GRDTYPE in a new, numeric variable called grade.type in R.  Use autorecode (if there is something similar in R) to recode CNTYNAME into county.name.  Create a new variable called log.cperra1r, which is the logarithm of CPERRA1R.
>> g.      Do summary statistics (minimum, maximum, range, mean, median, variance, standard deviation, skewness, kurtosis).
>> h.      Run histograms, boxplots, scatterplots, scatterplot matrices
>> i.        Select only cases from Region 17 (using REGION variable) and do the analyses for j-r below; using the equivalent of split file (if possible) to do same analysis for each region separately (using REGION variable).
>> j.        Conduct 1-way ANOVA using campus.rating as IV and CPETECOP as DV
>> k.      Conduct repeated measures ANOVA using BPFEAOPRT and CPFEAOPRT (although these are measuring different things, I am having you use them for purely instructional purposes)
>> l.        Conduct factorial ANOVA using campus.rating and grade.type as IV's and CPETECOP as DV
>> m.    Conduct correlation using CPFPGSPET and CPFPGGIFT.
>> n.      Conduct bivariate regression using CPFPGGIFK and CPFPGSPEK
>> o.      Conduct multiple regression using CPFPGREGT as criterion/outcome variable and CPFPAVOCT, CPFPASPET, and CPFPABILT as predictor variables
>> p.      Conduct the following regression diagnostics
>>                                 i.            Numerical diagnostics
>> 1.      Variance inflation factors
>> 2.      Breusch-Pagan test for heteroscedasticity
>> 3.      Durbin-Watson test for autocorrelation
>> 4.      RESET test for nonlinearity
>> 5.      Bonferonni outlier test
>>                               ii.            Graphs
>> 1.      Basic diagnostic plots
>> 2.      Residual quantile comparison plot
>> 3.      Component+residuals plot
>> 4.      Added variables plot
>> 5.      Influence plot
>> 6.      Effects plot
>> q.      Conduct chi-square using campus.rating and grade.type.
>> r.        Aggregate all the variables in the file by region (using REGION) and save to a new file called cref_cfin_cstud_agg.rdat
>>
>>
>>
>>
>> Below is the actual syntax for the above tasks:
>> ======================// beginning of R syntax //=======================
>> # Import CREF
>> CREF <- sqlQuery(channel = 1, select * from [cref_tabdelimited$])
>> attach(CREF)
>> # SORT
>> CREF<- order(+CREF$campus)
>> CREF
>> # variable labels
>> library(Hmisc)
>> label(CREF$DISTNAME)<- "District Name"
>> label(CREF$CAMPNAME)<- "Campus Name"
>> label(CREF$CACKDTL)<- "Campus 2007 Ratings: GPA Acknowledgment Detail"
>> label(CREF$CAMPUS)<- "CURRENT 9-DIGIT CDC NUMBER"
>> label(CREF$DISTRICT)<- "6 Digit County District Number"
>> label(CREF$COUNTY)<- "3 Digit County Number"
>> label(CREF$REGION)<- "2 Digit ESC Region"
>> label(CREF$CNTYNAME)<- "County Name"
>> label(CREF$GRDTYPE)<- "Campus Type (B,E,M,S) Total Enrollment"
>> label(CREF$GRDSPAN)<- "Grade Span (Total Enrollment)"
>> label(CREF$CFLCHART)<- "CAMPUS 2007 Flag-Chater School (Y/N)"
>> label(CREF$PAIRCAMP)<- "CAMPUS 2007 Flag-Campus/District No. of TAKS Giver"
>> label(CREF$PAIRNAME)<- "CAMPUS 2007 Flag-Campus/District Name of TAKS Giver"
>> label(CREF$C_RATING)<- "Campus 2007 Ratings: Final Accountability Rating"
>> #Import CSTUD
>> CSTUD <- read.table("C:/Users/Matt/Desktop/TTU/Courses/EPSY 5385_Stats Packages/DATA FILES/cstud.dat", header=TRUE, sep=",", na.strings="NA", dec=".", strip.white=TRUE)
>> attach(CSTUD)
>> # SORT
>> CSTUD<- order(+CSTUD$campus)
>> CSTUD
>> # variable labels
>> library(Hmisc)
>> label(CSTUD$CA0GR06N)<- "Campus 2006 Graduates: All Students Count"
>> label(CSTUD$CA0GH06N)<- "Campus 2006 Graduates: Recom HS Pgm All Students Count"
>> label(CSTUD$CA0GM06N)<- "Campus 2006 Graduates: Min HS Pgm All Students Count"
>> label(CSTUD$CB0GR06N)<- "Campus 2006 Graduates: African American Count"
>> label(CSTUD$CH0GR06N)<- "Campus 2006 Graduates: Hispanic Count"
>> label(CSTUD$CW0GR06N)<- "Campus 2006 Graduates: White Count"
>> label(CSTUD$CI0GR06N)<- "Campus 2006 Graduates: Native American Count"
>> label(CSTUD$CP0GR06N)<- "Campus 2006 Graduates: Asian/Pacific Islander Count"
>> label(CSTUD$CS0GR06N)<- "Campus 2006 Graduates: Special Ed Count"
>> label(CSTUD$BPETALLC)<- "Group 2007 Student: All Students Count"
>> label(CSTUD$BA0GR06N)<- "Group 2006 Graduates: All Students Count"
>> label(CSTUD$BB0GR06N)<- "Group 2006 Graduates: African American Count"
>> label(CSTUD$BH0GR06N)<- "Group 2006 Graduates: Hispanic Count"
>> label(CSTUD$BW0GR06N)<- "Group 2006 Graduates: White Count"
>> label(CSTUD$BI0GR06N)<- "Group 2006 Graduates: Native American Count"
>> label(CSTUD$BP0GR06N)<- "Group 2006 Graduates: Asian/Pacific Islander Count"
>> label(CSTUD$BS0GR06N)<- "Group 2006 Graduates: Special Ed Count"
>> label(CSTUD$BA0GH06N)<- "Group 2006 Graduates: Recom HS Pgm All Students Count"
>> label(CSTUD$BA0GM06N)<- "Group 2006 Graduates: Min HS Pgm All Students Count"
>> label(CSTUD$BPETDISC)<- "Group 2006 DAEP: Count"
>> label(CSTUD$CPERRAKR)<- "Campus 2006 Retention: Regular Ed KG Rate"
>> label(CSTUD$CPERRA1R)<- "Campus 2006 Retention: Regular Ed 01 Rate"
>> label(CSTUD$CPERRA2R)<- "Campus 2006 Retention: Regular Ed 02 Rate"
>> label(CSTUD$CPERRA3R)<- "Campus 2006 Retention: Regular Ed 03 Rate"
>> label(CSTUD$CPERRA4R)<- "Campus 2006 Retention: Regular Ed 04 Rate"
>> label(CSTUD$CPERRA5R)<- "Campus 2006 Retention: Regular Ed 05 Rate"
>> label(CSTUD$CPERRA6R)<- "Campus 2006 Retention: Regular Ed 06 Rate"
>> label(CSTUD$CPERRA7R)<- "Campus 2006 Retention: Regular Ed 07 Rate"
>> label(CSTUD$CPERRA8R)<- "Campus 2006 Retention: Regular Ed 08 Rate"
>> label(CSTUD$CPERSAKR)<- "Campus 2006 Retention: Special Ed KG Rate"
>> label(CSTUD$CPERSA1R)<- "Campus 2006 Retention: Special Ed 01 Rate"
>> label(CSTUD$CPERSA2R)<- "Campus 2006 Retention: Special Ed 02 Rate"
>> label(CSTUD$CPERSA3R)<- "Campus 2006 Retention: Special Ed 03 Rate"
>> label(CSTUD$CPERSA4R)<- "Campus 2006 Retention: Special Ed 04 Rate"
>> label(CSTUD$CPERSA5R)<- "Campus 2006 Retention: Special Ed 05 Rate"
>> label(CSTUD$CPERSA6R)<- "Campus 2006 Retention: Special Ed 06 Rate"
>> label(CSTUD$CPERSA7R)<- "Campus 2006 Retention: Special Ed 07 Rate"
>> label(CSTUD$CPERSA8R)<- "Campus 2006 Retention: Special Ed 08 Rate"
>> label(CSTUD$CPETDISC)<- "Campus 2006 DAEP: Count"
>> label(CSTUD$CPETDISP)<- "Campus 2006 DAEP: Percent"
>> label(CSTUD$CPETLEPP)<- "Campus 2007 Student: LEP Percent"
>> label(CSTUD$CPETECOP)<- "Campus 2007 Student: ECO Percent"
>> label(CSTUD$CPETECOC)<- "Campus 2007 Student: ECO Count"
>> label(CSTUD$CPETLEPC)<- "Campus 2007 Student: LEP Count"
>> label(CSTUD$CPETSPEC)<- "Campus 2007 Student: SPE Count"
>> label(CSTUD$CPETGIFC)<- "Campus 2007 Student: GIF/TAL Count"
>> label(CSTUD$CPETBILC)<- "Campus 2007 Student: BIL Count"
>> label(CSTUD$CPETALLC)<- "Campus 2007 Student: All Students Count"
>> label(CSTUD$CPETVOCC)<- "Campus 2007 Student: VOC Count"
>> label(CSTUD$CPETINDC)<- "Campus 2007 Student: Native American Count"
>> label(CSTUD$CPETPACC)<- "Campus 2007 Student: Asian/Pacific Islander Count"
>> label(CSTUD$CPETBLAC)<- "Campus 2007 Student: African American Count"
>> label(CSTUD$CPETHISC)<- "Campus 2007 Student: Hispanic Count"
>> label(CSTUD$CPETWHIC)<- "Campus 2007 Student: White Count"
>> label(CSTUD$CPETGEEC)<- "Campus 2007 Student: EE Count"
>> label(CSTUD$CPETGPKC)<- "Campus 2007 Student: PK Count"
>> label(CSTUD$CPETGKNC)<- "Campus 2007 Student: KG Count"
>> label(CSTUD$CPETG01C)<- "Campus 2007 Student: 01 Count"
>> label(CSTUD$CPETG02C)<- "Campus 2007 Student: 02 Count"
>> label(CSTUD$CPETG03C)<- "Campus 2007 Student: 03 Count"
>> label(CSTUD$CPETG04C)<- "Campus 2007 Student: 04 Count"
>> label(CSTUD$CPETG05C)<- "Campus 2007 Student: 05 Count"
>> label(CSTUD$CPETG06C)<- "Campus 2007 Student: 06 Count"
>> label(CSTUD$CPETG07C)<- "Campus 2007 Student: 07 Count"
>> label(CSTUD$CPETG08C)<- "Campus 2007 Student: 08 Count"
>> label(CSTUD$CPETG09C)<- "Campus 2007 Student: 09 Count"
>> label(CSTUD$CPETG10C)<- "Campus 2007 Student: 10 Count"
>> label(CSTUD$CPETG11C)<- "Campus 2007 Student: 11 Count"
>> label(CSTUD$CPETG12C)<- "Campus 2007 Student: 12 Count"
>> label(CSTUD$CPEMALLC)<- "Campus 2007 Student: 06 Mobility Numerator (Campus Only)"
>> label(CSTUD$CPETWHIP)<- "Campus 2007 Student: White Percent"
>> label(CSTUD$CPETBLAP)<- "Campus 2007 Student: African American Percent"
>> label(CSTUD$CPETHISP)<- "Campus 2007 Student: Hispanic Percent"
>> label(CSTUD$CPETINDP)<- "Campus 2007 Student: Native American Percent"
>> label(CSTUD$CPETPACP)<- "Campus 2007 Student: Asian/Pacific Islander Percent"
>> label(CSTUD$CPETSPEP)<- "Campus 2007 Student: SPE Percent"
>> label(CSTUD$CPETVOCP)<- "Campus 2007 Student: VOC Percent"
>> label(CSTUD$CPETBILP)<- "Campus 2007 Student: BIL Percent"
>> label(CSTUD$CPETGIFP)<- "Campus 2007 Student: GIF/TAL Percent"
>> label(CSTUD$CPETGEEP)<- "Campus 2007 Student: EE Percent"
>> label(CSTUD$CPETGPKP)<- "Campus 2007 Student: PK Percent"
>> label(CSTUD$CPETGKNP)<- "Campus 2007 Student: KG Percent"
>> label(CSTUD$CPETG01P)<- "Campus 2007 Student: 01 Percent"
>> label(CSTUD$CPETG02P)<- "Campus 2007 Student: 02 Percent"
>> label(CSTUD$CPETG03P)<- "Campus 2007 Student: 03 Percent"
>> label(CSTUD$CPETG04P)<- "Campus 2007 Student: 04 Percent"
>> label(CSTUD$CPETG05P)<- "Campus 2007 Student: 05 Percent"
>> label(CSTUD$CPETG06P)<- "Campus 2007 Student: 06 Percent"
>> label(CSTUD$CPETG07P)<- "Campus 2007 Student: 07 Percent"
>> label(CSTUD$CPETG08P)<- "Campus 2007 Student: 08 Percent"
>> label(CSTUD$CPETG09P)<- "Campus 2007 Student: 09 Percent"
>> label(CSTUD$CPETG10P)<- "Campus 2007 Student: 10 Percent"
>> label(CSTUD$CPETG11P)<- "Campus 2007 Student: 11 Percent"
>> label(CSTUD$CPETG12P)<- "Campus 2007 Student: 12 Percent"
>> label(CSTUD$CPEMALLP)<- "Campus 2007 Student: 06 Mobility Percent (Campus Only)"
>> label(CSTUD$BPETWHIP)<- "Group 2007 Student: White Percent"
>> label(CSTUD$BPETBLAP)<- "Group 2007 Student: African American Percent"
>> label(CSTUD$BPETHISP)<- "Group 2007 Student: Hispanic Percent"
>> label(CSTUD$BPETINDP)<- "Group 2007 Student: Native American Percent"
>> label(CSTUD$BPETPACP)<- "Group 2007 Student: Asian/Pacific Islander Percent"
>> label(CSTUD$BPETECOP)<- "Group 2007 Student: ECO Percent"
>> label(CSTUD$BPETLEPP)<- "Group 2007 Student: LEP Percent"
>> label(CSTUD$BPETSPEP)<- "Group 2007 Student: SPE Percent"
>> label(CSTUD$BPETVOCP)<- "Group 2007 Student: VOC Percent"
>> label(CSTUD$BPETBILP)<- "Group 2007 Student: BIL Percent"
>> label(CSTUD$BPETGIFP)<- "Group 2007 Student: GIF/TAL Percent"
>> label(CSTUD$BPETGEEP)<- "Group 2007 Student: EE Percent"
>> label(CSTUD$BPETGPKP)<- "Group 2007 Student: PK Percent"
>> label(CSTUD$BPETGKNP)<- "Group 2007 Student: KG Percent"
>> label(CSTUD$BPETG01P)<- "Group 2007 Student: 01 Percent"
>> label(CSTUD$BPETG02P)<- "Group 2007 Student: 02 Percent"
>> label(CSTUD$BPETG03P)<- "Group 2007 Student: 03 Percent"
>> label(CSTUD$BPETG04P)<- "Group 2007 Student: 04 Percent"
>> label(CSTUD$BPETG05P)<- "Group 2007 Student: 05 Percent"
>> label(CSTUD$BPETG06P)<- "Group 2007 Student: 06 Percent"
>> label(CSTUD$BPETG07P)<- "Group 2007 Student: 07 Percent"
>> label(CSTUD$BPETG08P)<- "Group 2007 Student: 08 Percent"
>> label(CSTUD$BPETG09P)<- "Group 2007 Student: 09 Percent"
>> label(CSTUD$BPETG10P)<- "Group 2007 Student: 10 Percent"
>> label(CSTUD$BPETG11P)<- "Group 2007 Student: 11 Percent"
>> label(CSTUD$BPETG12P)<- "Group 2007 Student: 12 Percent"
>> label(CSTUD$BPEMALLP)<- "Group 2007 Student: 06 Mobility Percent (Campus ONLY)"
>> label(CSTUD$BPERRAKR)<- "Group 2006 Retention: Regular Ed KG Rate"
>> label(CSTUD$BPERRA1R)<- "Group 2006 Retention: Regular Ed 01 Rate"
>> label(CSTUD$BPERRA2R)<- "Group 2006 Retention: Regular Ed 02 Rate"
>> label(CSTUD$BPERRA3R)<- "Group 2006 Retention: Regular Ed 03 Rate"
>> label(CSTUD$BPERRA4R)<- "Group 2006 Retention: Regular Ed 04 Rate"
>> label(CSTUD$BPERRA5R)<- "Group 2006 Retention: Regular Ed 05 Rate"
>> label(CSTUD$BPERRA6R)<- "Group 2006 Retention: Regular Ed 06 Rate"
>> label(CSTUD$BPERRA7R)<- "Group 2006 Retention: Regular Ed 07 Rate"
>> label(CSTUD$BPERRA8R)<- "Group 2006 Retention: Regular Ed 08 Rate"
>> label(CSTUD$BPERSAKR)<- "Group 2006 Retention: Special Ed KG Rate"
>> label(CSTUD$BPERSA1R)<- "Group 2006 Retention: Special Ed 01 Rate"
>> label(CSTUD$BPERSA2R)<- "Group 2006 Retention: Special Ed 02 Rate"
>> label(CSTUD$BPERSA3R)<- "Group 2006 Retention: Special Ed 03 Rate"
>> label(CSTUD$BPERSA4R)<- "Group 2006 Retention: Special Ed 04 Rate"
>> label(CSTUD$BPERSA5R)<- "Group 2006 Retention: Special Ed 05 Rate"
>> label(CSTUD$BPERSA6R)<- "Group 2006 Retention: Special Ed 06 Rate"
>> label(CSTUD$BPERSA7R)<- "Group 2006 Retention: Special Ed 07 Rate"
>> label(CSTUD$BPERSA8R)<- "Group 2006 Retention: Special Ed 08 Rate"
>> label(CSTUD$BPETDISP)<- "Group 2006 DAEP: Percent"
>> CSTUD$BPETALLC [CSTUD$BPETALLC == 0] <- NA
>> CSTUD$BPETALLC [is.na(CSTUD$BPETALLC)] <- 0
>> # Import CFIN
>> CFIN <- read.table("C:/Users/Matt/Desktop/TTU/Courses/EPSY 5385_Stats Packages/DATA FILES/cfin.dat", header=TRUE, sep=",", na.strings="NA", dec=".", strip.white=TRUE)
>> attach(CFIN)
>> # SORT
>> CFIN<- order(+CFIN$campus)
>> CFIN
>> # variable labels
>> library(Hmisc)
>> label(CFIN$CPFEGADST)<- "Campus 2006 Finance: Expenditure by Function-School Leadership, General Fund"
>> label(CFIN$CPFEGADIT)<- "Campus 2006 Finance: Expenditure by Function-Instructional Leadership, General Fund"
>> label(CFIN$CPFEAADST)<- "Campus 2006 Finance: Expenditure by Function-School Leadership, All Funds"
>> label(CFIN$CPFEAADIT)<- "Campus 2006 Finance: Expenditure by Function-Instructional Leadership, All Funds"
>> label(CFIN$CPFPGREGT)<- "Campus 2006 Finance: Expenditure by Program-Regular, GeneralFund"
>> label(CFIN$CPFPGVOCT)<- "Campus 2006 Finance: Expenditure by Program-Career & Technology, General Fund"
>> label(CFIN$CPFPGSPET)<- "Campus 2006 Finance: Expenditure by Program-Special Education, General Fund"
>> label(CFIN$CPFPGGIFT)<- "Campus 2006 Finance: Expenditure by Program-Gifted & Talented, General Fund"
>> label(CFIN$CPFPGBILT)<- "Campus 2006 Finance: Expenditure by Program-Bilingual, General Fund"
>> label(CFIN$CPFPAREGT)<- "Campus 2006 Finance: Expenditure by Program-Regular, All Funds"
>> label(CFIN$CPFPAVOCT)<- "Campus 2006 Finance: Expenditure by Program-Career & Technology, All Funds"
>> label(CFIN$CPFPASPET)<- "Campus 2006 Finance: Expenditure by Program-Special Education, All Funds"
>> label(CFIN$CPFPAGIFT)<- "Campus 2006 Finance: Expenditure by Program-Gifted & Talented, All Funds"
>> label(CFIN$CPFPABILT)<- "Campus 2006 Finance: Expenditure by Program-Bilingual, All Funds"
>> label(CFIN$CPFEGINST)<- "Campus 2006 Finance: Expenditure by Function-Instruction, General Fund"
>> label(CFIN$CPFEAINST)<- "Campus 2006 Finance: Expenditure by Function-Instruction, All Funds"
>> label(CFIN$CPFEGOPRT)<- "Campus 2006 Finance: Expenditure by Function-Total Operating,General Fund"
>> label(CFIN$CPFEAOPRT)<- "Campus 2006 Finance: Expenditure by Function-Total Operating, All Funds"
>> label(CFIN$CPFPGALLT)<- "Campus 2006 Finance: Expenditure by Program-Total, General Fund"
>> label(CFIN$CPFPAALLT)<- "Campus 2006 Finance: Expenditure by Program-Total, All Funds"
>> label(CFIN$CPFEGINSK)<- "Campus 2006 Finance: Expenditure by Function-Instruction PerPupil, General Fund"
>> label(CFIN$CPFEGADIK)<- "Campus 2006 Finance: Expenditure by Function-Instructional Leadership Per Pupil, General Fund"
>> label(CFIN$CPFEGADSK)<- "Campus 2006 Finance: Expenditure by Function-School Leadership Per Pupil, General Fund"
>> label(CFIN$CPFEGOPRK)<- "Campus 2006 Finance: Expenditure by Function-Total OperatingPer Pupil, General Fund"
>> label(CFIN$CPFPGREGK)<- "Campus 2006 Finance: Expenditure by Program-Regular Per Pupil, General Fund"
>> label(CFIN$CPFPGVOCK)<- "Campus 2006 Finance: Expenditure by Program-Career & Technology Per Pupil, General Fund"
>> label(CFIN$CPFPGBILK)<- "Campus 2006 Finance: Expenditure by Program-Bilingual Per Pupil, General Fund"
>> label(CFIN$CPFPGGIFK)<- "Campus 2006 Finance: Expenditure by Program-Gifted & Talented Per Pupil, General Fund"
>> label(CFIN$CPFPGSPEK)<- "Campus 2006 Finance: Expenditure by Program-Special Education Per Pupil, General Fund"
>> label(CFIN$CPFPGALLK)<- "Campus 2006 Finance: Expenditure by Program-Total Per Pupil,General Fund"
>> label(CFIN$CPFEAINSK)<- "Campus 2006 Finance: Expenditure by Function-Instruction PerPupil, All Funds"
>> label(CFIN$CPFEAADIK)<- "Campus 2006 Finance: Expenditure by Function-Instructional Leadership Per Pupil, All Funds"
>> label(CFIN$CPFEAADSK)<- "Campus 2006 Finance: Expenditure by Function-School Leadership Per Pupil, All Funds"
>> label(CFIN$CPFEAOPRK)<- "Campus 2006 Finance: Expenditure by Function-Total OperatingPer Pupil, ALL Funds"
>> label(CFIN$CPFPAREGK)<- "Campus 2006 Finance: Expenditure by Program-Regular Per Pupil, All Funds"
>> label(CFIN$CPFPAVOCK)<- "Campus 2006 Finance: Expenditure by Program-Career & Technology Per Pupil, All Funds"
>> label(CFIN$CPFPABILK)<- "Campus 2006 Finance: Expenditure by Program-Bilingual Per Pupil, All Funds"
>> label(CFIN$CPFPAGIFK)<- "Campus 2006 Finance: Expenditure by Program-Gifted & Talented Per Pupil, All Funds"
>> label(CFIN$CPFPASPEK)<- "Campus 2006 Finance: Expenditure by Program-Special Education Per Pupil, All Funds"
>> label(CFIN$CPFPAALLK)<- "Campus 2006 Finance: Expenditure by Program-Total Per Pupil,All Funds"
>> label(CFIN$CPFEGRELT)<- "Campus 2006 Finance: Expenditure by Function-Instructional-Related Services, General Fund"
>> label(CFIN$CPFEARELT)<- "Campus 2006 Finance: Expenditure by Function-Instructional-Related Services, All Funds"
>> label(CFIN$CPFEGSUPT)<- "Campus 2006 Finance: Expenditure by Function-Support Services-Student, General Fund"
>> label(CFIN$CPFEASUPT)<- "Campus 2006 Finance: Expenditure by Function-Support Services-Student, All Funds"
>> label(CFIN$CPFPGCOMT)<- "Campus 2006 Finance: Expenditure by Program-Compensatory, General Fund"
>> label(CFIN$CPFPACOMT)<- "Campus 2006 Finance: Expenditure by Program-Compensatory, All Funds"
>> label(CFIN$CPFPGOTHT)<- "Campus 2006 Finance: Expenditure by Program-Other, General Fund"
>> label(CFIN$CPFPAOTHT)<- "Campus 2006 Finance: Expenditure by Program-Other, All Funds"
>> label(CFIN$CPFEGOPRP)<- "Campus 2006 Finance: Expenditure by Function-Total OperatingPercent, General Fund"
>> label(CFIN$CPFEGINSP)<- "Campus 2006 Finance: Expenditure by Function-Instruction Percent, General Fund"
>> label(CFIN$CPFEGRELP)<- "Campus 2006 Finance: Expenditure by Function-Instructional-Related Services Percent, General Fund"
>> label(CFIN$CPFEGADIP)<- "Campus 2006 Finance: Expenditure by Function-Instructional Leadership Percent, General Fund"
>> label(CFIN$CPFEGADSP)<- "Campus 2006 Finance: Expenditure by Function-School Leadership Percent, General Fund"
>> label(CFIN$CPFEGSUPP)<- "Campus 2006 Finance: Expenditure by Function-Support Services-Student Percent, General Fund"
>> label(CFIN$CPFEAOPRP)<- "Campus 2006 Finance: Expenditure by Function-Total OperatingPercent, All Funds"
>> label(CFIN$CPFEAINSP)<- "Campus 2006 Finance: Expenditure by Function-Instruction Percent, All Funds"
>> label(CFIN$CPFEARELP)<- "Campus 2006 Finance: Expenditure by Function-Instructional-Related Services Percent, All Funds"
>> label(CFIN$CPFEAADIP)<- "Campus 2006 Finance: Expenditure by Function-Instructional Leadership Percent, All Funds"
>> label(CFIN$CPFEAADSP)<- "Campus 2006 Finance: Expenditure by Function-School Leadership Percent, All Funds"
>> label(CFIN$CPFEASUPP)<- "Campus 2006 Finance: Expenditure by Function-Support Services-Student Percent, All Funds"
>> label(CFIN$CPFPGALLP)<- "Campus 2006 Finance: Expenditure by Program-Total Percent, General Fund"
>> label(CFIN$CPFPGBILP)<- "Campus 2006 Finance: Expenditure by Program-Bilingual Percent, General Fund"
>> label(CFIN$CPFPGVOCP)<- "Campus 2006 Finance: Expenditure by Program-Career & Technology Percent, General Fund"
>> label(CFIN$CPFPGCOMP)<- "Campus 2006 Finance: Expenditure by Program-Compensatory Percent, General Fund"
>> label(CFIN$CPFPGGIFP)<- "Campus 2006 Finance: Expenditure by Program-Gifted & Talented Percent, General Fund"
>> label(CFIN$CPFPGREGP)<- "Campus 2006 Finance: Expenditure by Program-Regular Percent,General Fund"
>> label(CFIN$CPFPGSPEP)<- "Campus 2006 Finance: Expenditure by Program-Special Education Percent, General Fund"
>> label(CFIN$CPFPGOTHP)<- "Campus 2006 Finance: Expenditure by Program-Other Percent, General Fund"
>> label(CFIN$CPFPAALLP)<- "Campus 2006 Finance: Expenditure by Program-Total Percent, All Funds"
>> label(CFIN$CPFPABILP)<- "Campus 2006 Finance: Expenditure by Program-Bilingual Percent, All Funds"
>> label(CFIN$CPFPAVOCP)<- "Campus 2006 Finance: Expenditure by Program-Career & Technology Percent, All Funds"
>> label(CFIN$CPFPACOMP)<- "Campus 2006 Finance: Expenditure by Program-Compensatory Percent, All Funds"
>> label(CFIN$CPFPAGIFP)<- "Campus 2006 Finance: Expenditure by Program-Gifted & Talented Percent, All Funds"
>> label(CFIN$CPFPAREGP)<- "Campus 2006 Finance: Expenditure by Program-Regular Percent,All Funds"
>> label(CFIN$CPFPASPEP)<- "Campus 2006 Finance: Expenditure by Program-Special Education Percent, All Funds"
>> label(CFIN$CPFPAOTHP)<- "Campus 2006 Finance: Expenditure by Program-Other Percent, All Funds"
>> label(CFIN$CPFEGRELK)<- "Campus 2006 Finance: Expenditure by Function-Instructional-Related Services Per Pupil, General Fund"
>> label(CFIN$CPFEARELK)<- "Campus 2006 Finance: Expenditure by Function-Instructional-Related Services Per Pupil, All Funds"
>> label(CFIN$CPFEGSUPK)<- "Campus 2006 Finance: Expenditure by Function-Support Services-Student Per Pupil, General Fund"
>> label(CFIN$CPFEASUPK)<- "Campus 2006 Finance: Expenditure by Function-Support Services-Student Per Pupil, All Funds"
>> label(CFIN$CPFPGCOMK)<- "Campus 2006 Finance: Expenditure by Program-Compensatory PerPupil, General Fund"
>> label(CFIN$CPFPACOMK)<- "Campus 2006 Finance: Expenditure by Program-Compensatory PerPupil, All Funds"
>> label(CFIN$CPFPGOTHK)<- "Campus 2006 Finance: Expenditure by Program-Other Per Pupil,General Fund"
>> label(CFIN$CPFPAOTHK)<- "Campus 2006 Finance: Expenditure by Program-Other Per Pupil,All Funds"
>> label(CFIN$BPFEAOPRT)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Total Operating, All Funds"
>> label(CFIN$BPFEAINST)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instruction, All Funds"
>> label(CFIN$BPFEARELT)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instructional-Related Services, All Funds"
>> label(CFIN$BPFEAADIT)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instructional Leadership, All Funds"
>> label(CFIN$BPFEAADST)<- "Campus 2006 Finance: Group Total-Expenditure by Function-School Leadership, All Funds"
>> label(CFIN$BPFEASUPT)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Support Services--Student, All Funds"
>> label(CFIN$BPFPAALLT)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Total, All Funds"
>> label(CFIN$BPFPABILT)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Bilingual, All Funds"
>> label(CFIN$BPFPAVOCT)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Career & Technology, All Funds"
>> label(CFIN$BPFPACOMT)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Compensatory, All Funds"
>> label(CFIN$BPFPAGIFT)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Gifted & Talented, All Funds"
>> label(CFIN$BPFPAREGT)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Regular, All Funds"
>> label(CFIN$BPFPASPET)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Special Education, All Funds"
>> label(CFIN$BPFPAOTHT)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Other, All Funds"
>> label(CFIN$BPFPAALLP)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Total Percent, All Funds"
>> label(CFIN$BPFPAREGP)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Regular Percent, All Funds"
>> label(CFIN$BPFPABILP)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Bilingual Percent, All Funds"
>> label(CFIN$BPFPACOMP)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Compensatory Percent, All Funds"
>> label(CFIN$BPFPAGIFP)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Gifted & Talented Percent, All Funds"
>> label(CFIN$BPFPAVOCP)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Career & Technology Percent, All Funds"
>> label(CFIN$BPFPASPEP)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Special Education Percent, All Funds"
>> label(CFIN$BPFPAOTHP)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Other Percent, All Funds"
>> label(CFIN$BPFEAOPRP)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Total Operating Percent, All Funds"
>> label(CFIN$BPFEAINSP)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instruction Percent, All Funds"
>> label(CFIN$BPFEARELP)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instructional-Related Services Percent, All Funds"
>> label(CFIN$BPFEAADIP)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instructional Leadership Percent, All Funds"
>> label(CFIN$BPFEAADSP)<- "Campus 2006 Finance: Group Total-Expenditure by Function-School Leadership Percent, All Funds"
>> label(CFIN$BPFEASUPP)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Support Services--Student Percent, All Funds"
>> label(CFIN$BPFEAOPRK)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Total Operating Per Pupil, All Funds"
>> label(CFIN$BPFEAINSK)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instruction Per Pupil, All Funds"
>> label(CFIN$BPFEARELK)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instructional-Related Services Per Pupil, All Funds"
>> label(CFIN$BPFEAADIK)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Instructional Leadership Per Pupil, All Funds"
>> label(CFIN$BPFEAADSK)<- "Campus 2006 Finance: Group Total-Expenditure by Function-School Leadership Per Pupil, All Funds"
>> label(CFIN$BPFEASUPK)<- "Campus 2006 Finance: Group Total-Expenditure by Function-Support Services--Student Per Pupil, All Funds"
>> label(CFIN$BPFPAALLK)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Total Per Pupil, All Funds"
>> label(CFIN$BPFPABILK)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Bilingual Per Pupil, All Funds"
>> label(CFIN$BPFPAVOCK)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Career & Technology Per Pupil, All Funds"
>> label(CFIN$BPFPACOMK)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Compensatory Per Pupil, All Funds"
>> label(CFIN$BPFPAGIFK)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Gifted & Talented Per Pupil, All Funds"
>> label(CFIN$BPFPAREGK)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Regular Per Pupil, All Funds"
>> label(CFIN$BPFPASPEK)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Special Education Per Pupil, All Funds"
>> label(CFIN$BPFPAOTHK)<- "Campus 2006 Finance: Group Total-Expenditure by Program-Other Per Pupil, All Funds"
>> # merge two datasets by CAMPUS
>> total <- merge(CREF,CSTUD, by="CAMPUS", all=T)
>> attach(total)
>> total2 <- merge(total,CFIN, by="CAMPUS", all=T)
>> attach(total2)
>> # Sort
>> total2 <- order(+total2$campus)
>> total2
>> #Recodes
>> total2$campus.rating <- recode(total2$C_RATING, '"A" = 1; "E" = 2; "R" = 3; "X" = 4; "L" = 5; ', as.factor.result=TRUE)
>> total2$campus.rating <- factor(total2$campus.rating,levels=c("1","2","3","4","5"),labels=c("Acceptable","Exemplary","Recognized","Unacceptable","Unknown"))
>> total2$grade.type <- recode(total2$GRDTYPE, '"B" = 1; "E" = 2; "M" = 3; "S" = 4; ', as.factor.result=TRUE)
>> total2$grade.type <- factor(total2$grade.type,levels=c("1","2","3","4"),labels=c("Both","Elementary","Middle","Senior"))
>> total2$CPERRA1R <- as.numeric(total2$CPERRA1R)
>> total2$log.cperra1r <- with(total2, log10(CPERRA1R))
>> # auto-recode -- county.name
>> #Summary statistics
>> summary(total2)
>> # Histogram
>> Hist(total2$BA0GH06N, scale="frequency", breaks="Sturges", col="darkgray")
>> #Scatterplot
>> scatterplot(BA0GR06N~BA0GM06N, reg.line=FALSE, smooth=FALSE, labels=FALSE, boxplots='xy', span=0.5, data=total2)
>> #Scatterplot matrices
>> scatterplot.matrix(~BA0GH06N+BA0GR06N+BH0GR06N+BP0GR06N+BPETDISC, reg.line=FALSE, smooth=FALSE, span=0.5, diagonal = 'density', data=total2)
>> # Split File -- CAN'T IN R
>> # 1-way ANOVA
>> total2$CPETECOP <- as.numeric(total2$CPETECOP)
>> .Anova<- lm(CPETECOP ~ campus.rating, data=total2)
>> anova(.Anova)
>> tapply(total2$CPETECOP, total2$campus.rating, mean, na.rm=TRUE) # means
>> tapply(total2$CPETECOP, total2$campus.rating, sd, na.rm=TRUE) # std. deviations
>> tapply(total2$CPETECOP, total2$campus.rating, function(x) sum(!is.na(x))) # counts
>> remove(.Anova)
>> # Repeated measures ANOVA?
>> GLM.1 <- glm(BPFEAOPRT  ~ CPFEAOPRT, family=binomial(logit), data=total2)
>> summary(GLM.1)
>> # Factorial ANOVA
>> # Correlation
>> library(RODBC)
>> total2$CPFPGSPET <- as.numeric(total2$CPFPGSPET)
>> total2$CPFPGGIFT <- as.numeric(total2$CPFPGGIFT)
>> cor.test(total2$CPFPGSPET , total2$CPFPGGIFT, use="pairwise.complete.obs", alternative="two.sided", method="pearson")
>> # Bivariate Regression
>> total2$CPFPGGIFK <- as.numeric(total2$CPFPGGIFK)
>> total2$CPFPGSPEK <- as.numeric(total2$CPFPGSPEK)
>> RegModel.1 <- lm(CPFPGGIFK~CPFPGSPEK, data=total2)
>> summary(RegModel.1)
>> anova(RegModel.1)
>> plot(RegModel.1)
>> termplot(RegModel.1)
>> bptest(CPFPGGIFK~CPFPGSPEK, varformula = ~ fitted.values(RegModel.1), studentize=FALSE, data=total2)
>> dwtest(CPFPGGIFK~CPFPGSPEK, alternative="greater", data=total2)
>> resettest(CPFPGGIFK~CPFPGSPEK, power=2:3, type="regressor", data=total2)
>> outlier.test(RegModel.1)
>> #Graphs
>> oldpar <- par(oma=c(0,0,3,0), mfrow=c(2,2))
>> plot(RegModel.1)
>> par(oldpar)
>> qq.plot(RegModel.1, simulate=TRUE, labels=FALSE)
>> cr.plots(RegModel.1, ask=FALSE)
>> av.plots(RegModel.1, ask=FALSE, identify.points=TRUE)
>> influencePlot(RegModel.1)
>> trellis.device(theme="col.whitebg")
>> plot(all.effects(RegModel.1), ask=FALSE)
>> # Multiple Regression
>> total2$CPFPGREGT <- as.numeric(total2$CPFPGREGT)
>> total2$CPFPAVOCT <- as.numeric(total2$CPFPAVOCT)
>> total2$CPFPASPET <- as.numeric(total2$CPFPASPET)
>> total2$CPFPABILT <- as.numeric(total2$CPFPABILT)
>> RegModel.2 <- lm(CPFPGREGT~CPFPAVOCT+CPFPASPET+CPFPABILT, data=total2)
>> summary(RegModel.2)
>> anova(RegModel.2)
>> plot(RegModel.2)
>> termplot(RegModel.2)
>> vif(RegModel.1)
>> bptest(CPFPGREGT~CPFPAVOCT+CPFPASPET+CPFPABILT, varformula = ~ fitted.values(RegModel.1), studentize=FALSE, data=total2)
>> dwtest(CPFPGREGT~CPFPAVOCT+CPFPASPET+CPFPABILT, alternative="greater", data=total2)
>> resettest(CPFPGREGT~CPFPAVOCT+CPFPASPET+CPFPABILT, power=2:3, type="regressor", data=total2)
>> outlier.test(RegModel.1)
>> #Graphs
>> oldpar <- par(oma=c(0,0,3,0), mfrow=c(2,2))
>> plot(RegModel.2)
>> par(oldpar)
>> qq.plot(RegModel.2, simulate=TRUE, labels=FALSE)
>> cr.plots(RegModel.2, ask=FALSE)
>> av.plots(RegModel.2, ask=FALSE, identify.points=TRUE)
>> influencePlot(RegModel.2)
>> trellis.device(theme="col.whitebg")
>> plot(all.effects(RegModel.2), ask=FALSE)
>> # Chi-square
>> chisq.test(total2$campus.rating, total2$grade.type))
>> # Aggregate
>> attach(total2)
>> cref_cfin_cstud_agg <-aggregate(total2, by=list(REGION),
>>   FUN=mean, na.rm=TRUE)
>>
>> _______________________________________________
>> R-sig-teaching at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-sig-teaching
>>
>
> _______________________________________________
> R-sig-teaching at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-sig-teaching
>




More information about the R-sig-teaching mailing list