From pd.mes at cbs.dk Tue Jan 1 19:07:39 2013 From: pd.mes at cbs.dk (Peter Dalgaard) Date: Tue, 1 Jan 2013 19:07:39 +0100 Subject: R version 3.0.0 Message-ID: This is no secret to those who read the NEWS file of the development version regularly, as the following has been in place since December 12th: \section{\Rlogo CHANGES IN R-devel}{ \subsection{SIGNIFICANT USER-VISIBLE CHANGES}{ \itemize{ \item It is intended that this version will be released as \R 3.0.0. .... However, it seems reasonable to supplement this with a more direct public announcement. The intended timing is to follow the annual release schedule and have R 3.0.0 around April 1 and a finalizing 2.15.3 a month earlier. Major R releases have not previously marked great landslides in terms of new features. Rather, they represent that the codebase has developed to a new level of maturity. This is not going to be an exception to the rule. Version 1.0.0 was released at a point in time when we felt that we had reached a level of completeness and stability high enough to characterize a full statistical system, which could be put to production use. Version 2.0.0 came out after strong enhancements of the memory management subsystem as well as several major features, including Sweave. Version 3.0.0, as of this writing, contains only really major new feature: The inclusion of long vectors (containing more than 2^31-1 elements!). More changes are likely to make it into the final release, but the main reason for having it as a new major release is that R over the last 8.5 years has reached a new level: we now have 64 bit support on all platforms, support for parallel processing, the Matrix package, and much more. On behalf of the R Core Team, Peter D. Happy New Year! -- Peter Dalgaard, Professor, Center for Statistics, Copenhagen Business School Solbjerg Plads 3, 2000 Frederiksberg, Denmark Phone: (+45)38153501 Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com From pd.mes at cbs.dk Fri Mar 1 11:01:32 2013 From: pd.mes at cbs.dk (Peter Dalgaard) Date: Fri, 1 Mar 2013 11:01:32 +0100 Subject: R 2.15.3 is released Message-ID: <4ED7C2EB-2B50-4236-8E9E-CC225E692C45@cbs.dk> The build system rolled up R-2.15.3.tar.gz (codename "Security Blanket") at 9:00 this morning. This is intended to be the final round-up release of the 2.15 series, and in fact of the entire 2.x.y series which started 2004-10-04. The list below details the changes in this release. You can get the source code from http://cran.r-project.org/src/base/R-2/R-2.15.3.tar.gz or wait for it to be mirrored at a CRAN site nearer to you. Binaries for various platforms will appear in due course. For the R Core Team Peter Dalgaard These are the md5sums for the freshly created files, in case you wish to check that they are uncorrupted: MD5 (AUTHORS) = cbf6da8f886ccd8d0dda0cc7ffd1b8ec MD5 (COPYING) = eb723b61539feef013de476e68b5c50a MD5 (COPYING.LIB) = a6f89e2100d9b6cdffcea4f398e37343 MD5 (FAQ) = c82ec3aa971272312ca6f3f28c58d329 MD5 (INSTALL) = 37adac6d0fbadf25b5a40e3f7535415e MD5 (NEWS) = 09e5c175b09d33e28023c655a11e9b8d MD5 (NEWS.html) = c7dccfe18e943427b85e9ddd1c7ba46b MD5 (ONEWS) = 0c3e10eef74439786e5fceddd06dac71 MD5 (OONEWS) = b0d650eba25fc5664980528c147a20db MD5 (R-latest.tar.gz) = b2f1a5d701f1f90679be0c60e1931a5c MD5 (README) = 296871fcf14f49787910c57b92655c76 MD5 (RESOURCES) = c7cb32499ebbf85deb064aab282f93a4 MD5 (THANKS) = 7a87321ccf0ecd2bece697e39dce5e67 MD5 (R-2/R-2.15.3.tar.gz) = b2f1a5d701f1f90679be0c60e1931a5c This is the relevant part of the NEWS file CHANGES IN R VERSION 2.15.3: NEW FEATURES: o lgamma(x) for very small x (in the denormalized range) is no longer Inf with a warning. o image() now sorts an unsorted breaks vector, with a warning. o The internal methods for tar() and untar() do a slightly more general job for 'ustar'-style handling of paths of more than 100 bytes. o Packages compiler and parallel have been added to the reference index (refman.pdf). o untar(tar = "internal") has some support for pax headers as produced by e.g. gnutar --posix (which seems prevalent on OpenSUSE 12.2) or bsdtar --format pax, including long path and link names. o sQuote() and dQuote() now handle 0-length inputs. (Suggestion of Ben Bolker.) o summaryRprof() returns zero-row data frames rather than throw an error if no events are recorded, for consistency. o The included version of PCRE has been updated to 8.32. o The tcltk namespace can now be re-loaded after unloading. The Tcl/Tk event loop is inhibited in a forked child from package parallel (as in e.g. mclapply()). o parallel::makeCluster() recognizes the value random for the environment variable R_PARALLEL_PORT: this chooses a random value for the port and reduces the chance of conflicts when multiple users start a cluster at the same time. UTILITIES: o The default for TAR on Windows for R CMD build has been changed to be internal if no tar command is on the path. This enables most packages to be built 'out of the box' without Rtools: the main exceptions are those which need to be installed to re-build vignettes and need Rtools for installation (usually because they contain compiled code). C-LEVEL FACILITIES: o On a 64-bit Windows platform with enough RAM, R_alloc can now allocate up to just under 32GB like other 64-bit platforms. DEPRECATED AND DEFUNCT: o Use of col2rgb(0) is deprecated (see the help page for its limitations). o The deprecated intensities component returned by hist() is no longer recognized by the plot() method and will be removed in R 3.0.0. o real(), as.real() and is.real() are now formally deprecated and give a warning. o This is formal notice that the non-API EISPACK entry points in R will be removed shortly. INSTALLATION: o The configure tests for Objective C and Objective C++ now work on Mac OS 10.8 with Xcode 4.5.2 (PR#15107). o The cairo-based versions of X11() now work with current versions of cairographics (e.g. 1.12.10). (PR#15168) A workaround for earlier versions of R is to use X11.options(type = "nbcairo"). o Configuration and R CMD javareconf now come up with a smaller set of library paths for Java on Oracle-format JDK (including OpenJDK). This helps avoid conflicts between libraries (such as libjpeg) supplied in the JDK and system libraries. This can always be overridden if needed: see the 'R Installation and Administration' manual. BUG FIXES: o beta(a, b) could overflow to infinity in its calculations when one of a and b was less than one. (PR#15075) o lbeta(a, b) no longer gives NaN if a or b is very small (in the denormalized range). o bquote() is now able to substitute default arguments in single-argument functions. (PR#15077) o browseEnv(html = FALSE) would segfault if called from R (not R.app) on a CRAN-style Mac OS X build of R. o [[<- for lists (generic vectors) needed to increment NAMED count when RHS is used more than once. (PR#15098) o On Windows, warnings about opening a file or pipe with a non-ASCII description were sometimes output in UTF-8 rather than in the current locale's character set. o The call() function did not duplicate its arguments. (PR#15115) o TukeyHSD() could give NA results with some na.action methods such as na.exclude(). (Hinted at on R-help by John Fox.) o The deprecated svd(X, LINPACK = TRUE) could alter X in R 2.15.[12]. (Reported by Bill Dunlap.) o Under Windows, file.link() and file.symlink() used the link name twice, so would always fail. (Reported by Rui Barradas/Oliver Soong). o summaryRprof(memory = "both") mixed up the units of Vcells and Ncells: it now works in bytes. (PR#15138) o tools::Rd2HTML() would sometimes delete text. (PR#15134) o plot() failed for "table" objects containing just one entry. (PR#15118) o embedFonts() needed to quote some filepaths. (PR#15149) o parallel::mccollect() handled NULL returns incorrectly (removing the element rather than setting it to NULL). o The full reference index (fullrefman.pdf) was missing packages compiler and parallel. o The report for optim(method = "L-BFGS-B", control = list(trace = 1)) reported the last completed and not the current iteration, unlike other methods and trace levels. (PR#15103) o qt(1e-12, 1.2) no longer gives NaN. o dt(1e160, 1.2, log=TRUE) no longer gives -Inf. o On Windows the untar() function now quotes the directory name when using an external tar utility, so R CMD check will handle pathnames containing spaces. o The version for Windows 8 and Windows Server 2012 is now displayed by win.version(). (Reported by Gabor Grothendieck.) o The custom Windows installer target myR in the installer Makefile did not work in 2.15.2. (Reported by Erich Neuwirth.) o aperm(matrix(1:6, 2, dimnames=list(A={}, B={})), "A") no longer segfaults. o Expressions involving user defined operators were not always deparsed faithfully. (PR#15179) o The enc2utf8() function converted NA_character_ to "NA" in non-UTF-8 locales. (PR#15201) o The exclude argument to xtabs() was ignored for "factor" arguments. o On Windows, work around an event-timing problem when the RGui console was closed from the 'X' control and the closure cancelled. (This would on some 64-bit systems crash R, typically those with a slow GPU relative to the CPU.) o On unix Rscript will pass the r_arch setting it was compiled with on to the R process so that the architecture of Rscript and that of R will match unless overridden. -- Peter Dalgaard, Professor Center for Statistics, Copenhagen Business School Solbjerg Plads 3, 2000 Frederiksberg, Denmark Phone: (+45)38153501 Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com From pd.mes at cbs.dk Wed Apr 3 12:04:52 2013 From: pd.mes at cbs.dk (Peter Dalgaard) Date: Wed, 3 Apr 2013 12:04:52 +0200 Subject: R 3.0.0 is released Message-ID: The build system rolled up R-3.0.0.tar.gz (codename "Masked Marvel") this morning. The list below details the changes in this release. You can get the source code from http://cran.r-project.org/src/base/R-3/R-3.0.0.tar.gz or wait for it to be mirrored at a CRAN site nearer to you. Binaries for various platforms will appear in due course. For the R Core Team Peter Dalgaard These are the md5sums for the freshly created files, in case you wish to check that they are uncorrupted: MD5 (AUTHORS) = cbf6da8f886ccd8d0dda0cc7ffd1b8ec MD5 (COPYING) = eb723b61539feef013de476e68b5c50a MD5 (COPYING.LIB) = a6f89e2100d9b6cdffcea4f398e37343 MD5 (FAQ) = 43fcae6a4c96e17313d11a0aaefb73f8 MD5 (INSTALL) = 37adac6d0fbadf25b5a40e3f7535415e MD5 (NEWS) = ed5405acecb3ba4a2d9a3467bbcea7e5 MD5 (NEWS.html) = baea8a4f82a3aa9d29d1a73a34238aa1 MD5 (ONEWS) = 0c3e10eef74439786e5fceddd06dac71 MD5 (OONEWS) = b0d650eba25fc5664980528c147a20db MD5 (R-latest.tar.gz) = 5fb80535b0e144a978f67aa2158015de MD5 (README) = e259ae5dd943b8547f0b7719664e815b MD5 (RESOURCES) = c7cb32499ebbf85deb064aab282f93a4 MD5 (THANKS) = d4b45e302b7cad0fc4bb50d2cfe69649 MD5 (R-3/R-3.0.0.tar.gz) = 5fb80535b0e144a978f67aa2158015de This is the relevant part of the NEWS file CHANGES IN R 3.0.0: SIGNIFICANT USER-VISIBLE CHANGES: o Packages need to be (re-)installed under this version (3.0.0) of R. o There is a subtle change in behaviour for numeric index values 2^31 and larger. These never used to be legitimate and so were treated as NA, sometimes with a warning. They are now legal for long vectors so there is no longer a warning, and x[2^31] <- y will now extend the vector on a 64-bit platform and give an error on a 32-bit one. o It is now possible for 64-bit builds to allocate amounts of memory limited only by the OS. It may be wise to use OS facilities (e.g. ulimit in a bash shell, limit in csh), to set limits on overall memory consumption of an R process, particularly in a multi-user environment. A number of packages need a limit of at least 4GB of virtual memory to load. 64-bit Windows builds of R are by default limited in memory usage to the amount of RAM installed: this limit can be changed by command-line option --max-mem-size or setting environment variable R_MAX_MEM_SIZE. o Negative numbers for colours are consistently an error: previously they were sometimes taken as transparent, sometimes mapped into the current palette and sometimes an error. NEW FEATURES: o identical() has a new argument, ignore.environment, used when comparing functions (with default FALSE as before). o There is a new option, options(CBoundsCheck=), which controls how .C() and .Fortran() pass arguments to compiled code. If true (which can be enabled by setting the environment variable R_C_BOUNDS_CHECK to yes), raw, integer, double and complex arguments are always copied, and checked for writing off either end of the array on return from the compiled code (when a second copy is made). This also checks individual elements of character vectors passed to .C(). This is not intended for routine use, but can be very helpful in finding segfaults in package code. o In layout(), the limits on the grid size have been raised (again). o New simple provideDimnames() utility function. o Where methods for length() return a double value which is representable as an integer (as often happens for package Matrix), this is converted to an integer. o Matrix indexing of dataframes by two-column numeric indices is now supported for replacement as well as extraction. o setNames() now has a default for its object argument, useful for a character result. o StructTS() has a revised additive constant in the loglik component of the result: the previous definition is returned as the loglik0 component. However, the help page has always warned of a lack of comparability of log-likelihoods for non-stationary models. (Suggested by Jouni Helske.) o The logic in aggregate.formula() has been revised. It is now possible to use a formula stored in a variable; previously, it had to be given explicitly in the function call. o install.packages() has a new argument quiet to reduce the amount of output shown. o Setting an element of the graphics argument lwd to a negative or infinite value is now an error. Lines corresponding to elements with values NA or NaN are silently omitted. Previously the behaviour was device-dependent. o Setting graphical parameters cex, col, lty, lwd and pch in par() now requires a length-one argument. Previously some silently took the first element of a longer vector, but not always when documented to do so. o Sys.which() when used with inputs which would be unsafe in a shell (e.g. absolute paths containing spaces) now uses appropriate quoting. o as.tclObj() has been extended to handle raw vectors. Previously, it only worked in the other direction. (Contributed by Charlie Friedemann, PR#14939.) o New functions cite() and citeNatbib() have been added, to allow generation of in-text citations from "bibentry" objects. A cite() function may be added to bibstyle() environments. o A sort() method has been added for "bibentry" objects. o The bibstyle() function now defaults to setting the default bibliography style. The getBibstyle() function has been added to report the name of the current default style. o scatter.smooth() now has an argument lpars to pass arguments to lines(). o pairs() has a new log argument, to allow some or all variables to be plotted on logarithmic scale. (In part, wish of PR#14919.) o split() gains a sep argument. o termplot() does a better job when given a model with interactions (and no longer attempts to plot interaction terms). o The parser now incorporates code from Romain Francois' parser package, to support more detailed computation on the code, such as syntax highlighting, comment-based documentation, etc. Functions getParseData() and getParseText() access the data. o There is a new function rep_len() analogous to rep.int() for when speed is required (and names are not). o The undocumented use rep(NULL, length.out = n) for n > 0 (which returns NULL) now gives a warning. o demo() gains an encoding argument for those packages with non-ASCII demos: it defaults to the package encoding where there is one. o strwrap() converts inputs with a marked encoding to the current locale: previously it made some attempt to pass through as bytes inputs invalid in the current locale. o Specifying both rate and scale to [dpqr]gamma is a warning (if they are essentially the same value) or an error. o merge() works in more cases where the data frames include matrices. (Wish of PR#14974.) o optimize() and uniroot() no longer use a shared parameter object across calls. (nlm(), nlminb() and optim() with numerical derivatives still do, as documented.) o The all.equal() method for date-times is now documented: times are regarded as equal (by default) if they differ by up to 1 msec. o duplicated() and unique() gain a nmax argument which can be used to make them much more efficient when it is known that there are only a small number of unique entries. This is done automatically for factors. o Functions rbinom(), rgeom(), rhyper(), rpois(), rnbinom(), rsignrank() and rwilcox() now return integer (not double) vectors. This halves the storage requirements for large simulations. o sort(), sort.int() and sort.list() now use radix sorting for factors of less than 100,000 levels when method is not supplied. So does order() if called with a single factor, unless na.last = NA. o diag() as used to generate a diagonal matrix has been re-written in C for speed and less memory usage. It now forces the result to be numeric in the case diag(x) since it is said to have 'zero off-diagonal entries'. o backsolve() (and forwardsolve()) are now internal functions, for speed and support for large matrices. o More matrix algebra functions (e.g. chol() and solve()) accept logical matrices (and coerce to numeric). o sample.int() has some support for n >= 2^31: see its help for the limitations. A different algorithm is used for (n, size, replace = FALSE, prob = NULL) for n > 1e7 and size <= n/2. This is much faster and uses less memory, but does give different results. o approxfun() and splinefun() now return a wrapper to an internal function in the stats namespace rather than a .C() or .Call() call. This is more likely to work if the function is saved and used in a different session. o The functions .C(), .Call(), .External() and .Fortran() now give an error (rather than a warning) if called with a named first argument. o Sweave() by default now reports the locations in the source file(s) of each chunk. o clearPushBack() is now a documented interface to a long-existing internal call. o aspell() gains filters for R code, Debian Control Format and message catalog files, and support for R level dictionaries. In addition, package utils now provides functions aspell_package_R_files() and aspell_package_C_files() for spell checking R and C level message strings in packages. o bibentry() gains some support for "incomplete" entries with a crossref field. o gray() and gray.colors() finally allow alpha to be specified. o monthplot() gains parameters to control the look of the reference lines. (Suggestion of Ian McLeod.) o Added support for new %~% relation ("is distributed as") in plotmath. o domain = NA is accepted by gettext() and ngettext(), analogously to stop() etc. o termplot() gains a new argument plot = FALSE which returns information to allow the plots to be modified for use as part of other plots, but does not plot them. (Contributed by Terry Therneau, PR#15076.) o quartz.save(), formerly an undocumented part of R.app, is now available to copy a device to a quartz() device. dev.copy2pdf() optionally does this for PDF output: quartz.save() defaults to PNG. o The default method of pairs() now allows text.panel = NULL and the use of .panel = NULL is now documented. o setRefClass() and getRefClass() now return class generator functions, similar to setClass(), but still with the reference fields and methods as before (suggestion of Romain Francois). o New functions bitwNot(), bitwAnd(), bitwOr() and bitwXor(), using the internal interfaces previously used for classes "octmode" and "hexmode". Also bitwShiftL() and bitwShiftR() for shifting bits in elements of integer vectors. o New option "deparse.cutoff" to control the deparsing of language objects such as calls and formulae when printing. (Suggested by a comment of Sarah Goslee.) o colors() gains an argument distinct. o New demo(colors) and demo(hclColors), with utility functions. o list.files() (aka dir()) gains a new optional argument no.. which allows to exclude "." and ".." from listings. o Multiple time series are also of class "matrix"; consequently, head(), e.g., is more useful. o encodeString() preserves UTF-8 marked encodings. Thus if factor levels are marked as UTF-8 an attempt is made to print them in UTF-8 in RGui on Windows. o readLines() and scan() (and hence read.table()) in a UTF-8 locale now discard a UTF-8 byte-order-mark (BOM). Such BOMs are allowed but not recommended by the Unicode Standard: however Microsoft applications can produce them and so they are sometimes found on websites. The encoding name "UTF-8-BOM" for a connection will ensure that a UTF-8 BOM is discarded. o mapply(FUN, a1, ..) now also works when a1 (or a further such argument) needs a length() method (which the documented arguments never do). (Requested by Herv'e Pag`es; with a patch.) o .onDetach() is supported as an alternative to .Last.lib. Unlike .Last.lib, this does not need to be exported from the package's namespace. o The srcfile argument to parse() may now be a character string, to be used in error messages. o The format() method for ftable objects gains a method argument, propagated to write.ftable() and print(), allowing more compact output, notably for LaTeX formatting, thanks to Marius Hofert. o The utils::process.events() function has been added to trigger immediate event handling. o Sys.which() now returns NA (not "") for NA inputs (related to PR#15147). o The print() method for class "htest" gives fewer trailing spaces (wish of PR#15124). Also print output from HoltWinters(), nls() and others. o loadNamespace() allows a version specification to be given, and this is used to check version specifications given in the Imports field when a namespace is loaded. o setClass() has a new argument, slots, clearer and less ambiguous than representation. It is recommended for future code, but should be back-compatible. At the same time, the allowed slot specification is slightly more general. See the documentation for details. o mget() now has a default for envir (the frame from which it is called), for consistency with get() and assign(). o close() now returns an integer status where available, invisibly. (Wish of PR#15088.) o The internal method of tar() can now store paths too long for the ustar format, using the (widely supported) GNU extension. It can also store long link names, but these are much less widely supported. There is support for larger files, up to the ustar limit of 8GB. o Local reference classes have been added to package methods. These are a technique for avoiding unneeded copying of large components of objects while retaining standard R functional behavior. See ?LocalReferenceClasses. o untar() has a new argument restore_times which if false (not the default) discards the times in the tarball. This is useful if they are incorrect (some tarballs submitted to CRAN have times in a local timezone or many years in the past even though the standard required them to be in UTC). o replayplot() cannot (and will not attempt to) replay plots recorded under R < 3.0.0. It may crash the R session if an attempt is made to replay plots created in a different build of R >= 3.0.0. o Palette changes get recorded on the display list, so replaying plots (including when resizing screen devices and using dev.copy()) will work better when the palette is changed during a plot. o chol(pivot = TRUE) now defaults to LAPACK, not LINPACK. o The parse() function has a new parameter keep.source, which defaults to options("keep.source"). o Profiling via Rprof() now optionally records information at the statement level, not just the function level. o The Rprof() function now quotes function names in in its output file on Windows, to be consistent with the quoting in Unix. o Profiling via Rprof() now optionally records information about time spent in GC. o The HTML help page for a package now displays non-vignette documentation files in a more accessible format. o To support options(stringsAsFactors = FALSE), model.frame(), model.matrix() and replications() now automatically convert character vectors to factors without a warning. o The print method for objects of class "table" now detects tables with 0-extents and prints the results as, e.g., < table of extent 0 x 1 x 2 >. (Wish of PR#15198.) o Deparsing involving calls to anonymous functions and has been made closer to reversible by the addition of extra parentheses. o The function utils::packageName() has been added as a lightweight version of methods::getPackageName(). o find.package(lib.loc = NULL) now treats loaded namespaces preferentially in the same way as attached packages have been for a long time. o In Windows, the Change Directory dialog now defaults to the current working directory, rather than to the last directory chosen in that dialog. o available.packages() gains a "license/restricts_use" filter which retains only packages for which installation can proceed solely based on packages which are guaranteed not to restrict use. o New check_packages_in_dir() function in package tools for conveniently checking source packages along with their reverse dependencies. o R's completion mechanism has been improved to handle help requests (starting with a question mark). In particular, help prefixes are now supported, as well as quoted help topics. To support this, completion inside quotes are now handled by R by default on all platforms. o The memory manager now allows the strategy used to balance garbage collection and memory growth to be controlled by setting the environment variable R_GC_MEM_GROW. See ?Memory for more details. o ('For experts only', as the introductory manual says.) The use of environment variables R_NSIZE and R_VSIZE to control the initial (= minimum) garbage collection trigger for number of cons cels and size of heap has been restored: they can be overridden by the command-line options --min-nsize and --min-vsize; see ?Memory. o On Windows, the device name for bitmap devices as reported by .Device and .Devices no longer includes the file name. This is for consistency with other platforms and was requested by the lattice maintainer. win.metafile() still uses the file name: the exact form is used by package tkrplot. o set.seed(NULL) re-initializes .Random.seed as done at the beginning of the session if not already set. (Suggestion of Bill Dunlap.) o The breaks argument in hist.default() can now be a function that returns the breakpoints to be used (previously it could only return the suggested number of breakpoints). o File share/licenses/licenses.db has some clarifications, especially as to which variants of 'BSD' and 'MIT' is intended and how to apply them to packages. The problematic licence 'Artistic-1.0' has been removed. LONG VECTORS: This section applies only to 64-bit platforms. o There is support for vectors longer than 2^31 - 1 elements. This applies to raw, logical, integer, double, complex and character vectors, as well as lists. (Elements of character vectors remain limited to 2^31 - 1 bytes.) o Most operations which can sensibly be done with long vectors work: others may return the error 'long vectors not supported yet'. Most of these are because they explicitly work with integer indices (e.g. anyDuplicated() and match()) or because other limits (e.g. of character strings or matrix dimensions) would be exceeded or the operations would be extremely slow. o length() returns a double for long vectors, and lengths can be set to 2^31 or more by the replacement function with a double value. o Most aspects of indexing are available. Generally double-valued indices can be used to access elements beyond 2^31 - 1. o There is some support for matrices and arrays with each dimension less than 2^31 but total number of elements more than that. Only some aspects of matrix algebra work for such matrices, often taking a very long time. In other cases the underlying Fortran code has an unstated restriction (as was found for complex svd()). o dist() can produce dissimilarity objects for more than 65536 rows (but for example hclust() cannot process such objects). o serialize() to a raw vector is unlimited in size (except by resources). o The C-level function R_alloc can now allocate 2^35 or more bytes. o agrep() and grep() will return double vectors of indices for long vector inputs. o Many calls to .C() have been replaced by .Call() to allow long vectors to be supported (now or in the future). Regrettably several packages had copied the non-API .C() calls and so failed. o .C() and .Fortran() do not accept long vector inputs. This is a precaution as it is very unlikely that existing code will have been written to handle long vectors (and the R wrappers often assume that length(x) is an integer). o Most of the methods for sort() work for long vectors. rank(), sort.list() and order() support long vectors (slowly except for radix sorting). o sample() can do uniform sampling from a long vector. PERFORMANCE IMPROVEMENTS: o More use has been made of R objects representing registered entry points, which is more efficient as the address is provided by the loader once only when the package is loaded. This has been done for packages base, methods, splines and tcltk: it was already in place for the other standard packages. Since these entry points are always accessed by the R entry points they do not need to be in the load table which can be substantially smaller and hence searched faster. This does mean that .C / .Fortran / .Call calls copied from earlier versions of R may no longer work - but they were never part of the API. o Many .Call() calls in package base have been migrated to .Internal() calls. o solve() makes fewer copies, especially when b is a vector rather than a matrix. o eigen() makes fewer copies if the input has dimnames. o Most of the linear algebra functions make fewer copies when the input(s) are not double (e.g. integer or logical). o A foreign function call (.C() etc) in a package without a PACKAGE argument will only look in the first DLL specified in the NAMESPACE file of the package rather than searching all loaded DLLs. A few packages needed PACKAGE arguments added. o The @<- operator is now implemented as a primitive, which should reduce some copying of objects when used. Note that the operator object must now be in package base: do not try to import it explicitly from package methods. PACKAGE INSTALLATION: o The transitional support for installing packages without namespaces (required since R 2.14.0) has been removed. R CMD build will still add a namespace, but a .First.lib() function will need to be converted. R CMD INSTALL no longer adds a namespace (so installation will fail), and a .First.lib() function in a package will be ignored (with an installation warning for now). As an exception, packages without a R directory and no NAMESPACE file can still be installed. o Packages can specify in their DESCRIPTION file a line like Biarch: yes to be installed on Windows with --force-biarch. o Package vignettes can now be processed by other engines besides Sweave; see 'Writing R Extensions' and the tools::vignetteEngine help topic for details. o The *.R tangled source code for vignettes is now included in tarballs when R CMD build is used to produce them. In R 3.0.0, *.R files not in the sources will be produced at install time, but eventually this will be dropped. o The package type "mac.binary" now looks in a path in the repository without any Mac subtype (which used to be universal or leopard): it looks in bin/macosx/contrib/3.0 rather than bin/macosx/leopard/contrib/2.15). This is the type used for the CRAN binary distribution for OS X as from R 3.0.0. o File etc/Makeconf makes more use of the macros $(CC), $(CXX), $(F77) and $(FC), so the compiler in use can be changed by setting just these (and if necessary the corresponding flags and FLIBS) in file ~/.R/Makevars. This is convenient for those working with binary distributions of R, e.g. on OS X. UTILITIES: o R CMD check now gives a warning rather than a note if it finds calls to abort, assert or exit in compiled code, and has been able to find the .o file in which the calls occur. Such calls can terminate the R process which loads the package. o The location of the build and check environment files can now be specified by the environment variables R_BUILD_ENVIRON and R_CHECK_ENVIRON, respectively. o R CMD Sweave gains a --compact option to control possibly reducing the size of the PDF file it creates when --pdf is given. o R CMD build now omits Eclipse's .metadata directories, and R CMD check warns if it finds them. o R CMD check now does some checks on functions defined within reference classes, including of .Call() etc calls. o R CMD check --as-cran notes assignments to the global environment, calls to data() which load into the global environment, and calls to attach(). o R CMD build by default uses the internal method of tar() to prepare the tarball. This is more likely to produce a tarball compatible with R CMD INSTALL and R CMD check: an external tar program, including options, can be specified _via_ the environment variable R_BUILD_TAR. o tools::massageExamples() is better protected against packages which re-define base functions such as cat() and get() and so can cause R CMD check to fail when checking examples. o R CMD javareconf has been enhanced to be more similar to the code used by configure. There is now a test that a JNI program can be compiled (like configure did) and only working settings are used. It makes use of custom settings from configuration recorded in etc/javaconf. o The --no-vignettes argument of R CMD build has been renamed to the more accurate --no-build-vignettes: its action has always been to (re)build vignettes and never omitted them. R CMD check accepts --no-build-vignettes as a preferred synonym for --no-rebuild-vignettes. DEPRECATED AND DEFUNCT: o The ENCODING argument to .C() is defunct. Use iconv() instead. o The .Internal(eval.with.vis) non-API function has been removed. o Support for the converters for use with .C() has been removed, including the oft misused non-API header R_ext/RConverters.h. o The previously deprecated uses of array() with a 0-length dim argument and tapply() with a 0-length INDEX list are now errors. o Translation packages are defunct. o Calling rep() or rep.int() on a pairlist or other non-vector object is now an error. o Several non-API entry points have been transferred to packages (e.g. R_zeroin2) or replaced by different non-API entry points (e.g. R_tabulate). o The 'internal' graphics device invoked by .Call("R_GD_nullDevice", package = "grDevices") has been removed: use pdf(file = NULL) instead. o The .Fortran() entry point "dqrls" which has not been used by R since version 2.15.1 is no longer available. o Functions traceOn() and traceOff() in package methods are now defunct. o Function CRAN.packages() is finally defunct. o Use of col2rgb(0) is defunct: use par("bg") or NA instead. o The long-defunct functions Rd_parse(), anovalist.lm(), categpry(), clearNames(), gammaCody(), glm.fit.null(), lm.fit.null(), lm.wfit.null(), manglePackageNames(), mauchley.test(), package.contents(), print.coefmat(), reshapeLong(), reshapeWide(), tkclose(), tkcmd(), tkfile.dir(), tkfile.tail(), tkopen(), tkputs(), tkread(), trySilent() and zip.file.extract() have been removed entirely (but are still documented in the help system). o The unused dataPath argument to attachNamespace() has been removed. o grid.prompt() has been removed: use devAskNewPage() instead. o The long-deprecated intensities component is no longer returned by hist(). o mean() for data frames and sd() for data frames and matrices are defunct. o chol(pivot = FALSE, LINPACK = TRUE), ch2inv(LINPACK = TRUE), eigen(EISPACK = TRUE), solve(LINPACK = TRUE) and svd(LINPACK = TRUE) are defunct: LAPACK will be used, with a warning. o The keep.source argument to library() and require() is defunct. This option needs to be set at install time. o Documentation for real(), as.real() and is.real() has been moved to 'defunct' and the functions removed. o The maxRasters argument of pdf() (unused since R 2.14.0) has been removed. o The unused fontsmooth argument has been removed from the quartz() device. o All the (non-API) EISPACK entry points in R have been removed. o chol(pivot = TRUE, LINPACK = TRUE) is deprecated. o The long-deprecated use of \synopsis in the Usage section of .Rd files will be removed in R 3.1.0. o .find.package() and .path.package() are deprecated: only the public versions without the dot have ever been in the API. o In a package's DESCRIPTION file, License: X11 is deprecated, since it includes 'Copyright (C) 1996 X Consortium' which cannot be appropriate for a current R package. Use 'MIT' or 'BSD_2_clause' instead. CODE MIGRATION: o The C code underlying base graphics has been migrated to the graphics package (and hence no longer uses .Internal() calls). o Most of the .Internal() calls used in the stats package have been migrated to C code in that package. This means that a number of .Internal() calls which have been used by packages no longer exist, including .Internal(cor) .Internal(cov), .Internal(optimhess) and .Internal(update.formula). o Some .External() calls to the base package (really to the R executable or shared library) have been moved to more appropriate packages. Packages should not have been using such calls, but some did (mainly those used by integrate()). PACKAGE parallel: o There is a new function mcaffinity() which allows getting or setting the CPU affinity mask for the current R process on systems that supports this (currently only Linux has been tested successfully). It has no effect on systems which do not support process affinity. Users are not expected to use this function directly (with the exception of fixing libraries that break affinity settings like OpenBLAS) - the function is rather intended to support affinity control in high-level parallel functions. In the future, R may supplement lack of affinity control in the OS by its own bookkeeping via mcaffinity() related to processes and threads it spawns. o mcparallel() has a new argument mc.affinity which attempts to set the affinity of the child process according to the specification contained therein. o The port used by socket clusters is chosen randomly: this should help to avoid clashes observed when two users of a multi-user machine try to create a cluster at the same time. To reproduce the previous behaviour set environment variable R_PARALLEL_PORT to 10187. C-LEVEL FACILITIES: o There has been some minor re-organization of the non-API header files. In particular, Rinternals.h no longer includes the non-API header R_exts/PrtUtil.h, and that no longer includes R_exts/Print.h. o Passing NULL to .C() is now an error. o .C() and .Fortran() now warn if "single" arguments are used with DUP = FALSE, as changes to such arguments are not returned to the caller. o C entry points R_qsort and R_qsort_I now have start and end as size_t to allow them to work with longer vectors on 64-bit platforms. Code using them should be recompiled. o A few recently added C entry points were missing the remapping to Rf_, notably [dpq]nbinom_mu. o Some of the interface pointers formerly available only to R.app are now available to front-ends on all Unix-alikes: one has been added for the interface to View(). o PACKAGE = "" is now an error in .C() etc calls: it was always contrary to the documentation. o Entry point rcont2 has been migrated to package stats and so is no longer available. o R_SVN_REVISION in Rversion.h is now an integer (rather than a string) and hence usable as e.g. #if R_SVN_REVISION < 70000. o The entry points rgb2hsv and hsv2rgb have been migrated to package grDevices and so are no longer available. o R_GE_version has been increased to 10 and name2col removed (use R_GE_str2col instead). R internal colour codes are now defined using the typedef rcolor. o The REPROTECT macro now checks that the protect index is valid. o Several non-API entry points no longer used by R have been removed, including the Fortran entry points chol, chol2inv, cg, ch and rg, and the C entry points Brent_fmin, fft_factor and fft_work. o If a .External call is registered with a number of arguments (other than -1), the number of arguments passed is checked for each call (as for other foreign function calls). o It is now possible to write custom connection implementations outside core R using R_ext/Connections.h. Please note that the implementation of connections is still considered internal and may change in the future (see the above file for details). INTERNATIONALIZATION: o The management of translations has been converted to R code: see ?tools::update_pkg_po. o The translations for the R interpreter and RGui.exe are now part of the base package (rather than having sources in directory po and being installed to share/locale). Thus the base package supports three translation domains, R-base, R and RGui. o The compiled translations which ship with R are all installed to the new package translations for easier updating. The first package of that name found on .libPaths() at the start of the R session will be used. (It is possible messages will be used before .libPaths() is set up in which case the default translations will be used: set environment variable R_TRANSLATIONS to point to the location of the intended translations package to use this right from the start.) o The translations form a separate group in the Windows installer, so can be omitted if desired. o The markup for many messages has been changed to make them easier to translate, incorporating suggestions from Lukasz Daniel. INSTALLATION: o There is again support for building without using the C 'long double' type. This is required by C99, but system implementations can be slow or flawed. Use configure option --disable-long-double. o make pdf and make install-pdf now make and install the full reference index (including all base and recommended packages). o The 'reference manual' on the Windows GUI menu and included in the installer is now the full reference index, including all base and recommended packages. o R help pages and manuals have no ISBNs because ISBN rules no longer allow constantly changing content to be assigned an ISBN. o The Windows installer no longer installs a Start Menu link to the static help pages; as most pages are generated dynamically, this led to a lot of broken links. o Any custom settings for Java configuration are recorded in file etc/javaconf for subsequent use by R CMD javareconf. o There is now support for makeinfo version 5.0 (which requires a slightly different .texi syntax). o The minimum versions for --use-system-zlib and --use-system-pcre are now tested as 1.2.5 and 8.10 respectively. o On Windows, the stack size is reduced to 16MB on 32-bit systems: misguided users were launching many threads without controlling the stack size. o configure no longer looks for file ~/.Rconfig: ~/.R/config has long been preferred. BUG FIXES: o When R CMD build is run in an encoding other than the one specified in the package's DESCRIPTION file it tries harder to expand the authors at R field in the specified encoding. (PR#14958) o If R CMD INSTALL is required to expand the authors at R field of the DESCRIPTION file, it tries harder to do so in the encoding specified for the package (rather than using ASCII escapes). o Fix in package grid for pushing a viewport into a layout cell, where the layout is within a viewport that has zero physical width OR where the layout has zero total relative width (likewise for height). The layout column widths (or row heights) in this case were being calculated with non-finite values. (Reported by Winston Chang.) o solve(A, b) for a vector b gave the answer names from colnames(A) for LINPACK = TRUE but not in the default case. o La.svd() accepts logical matrices (as documented, and as svd() did). o legend() now accepts negative pch values, in the same way points() long has. o Parse errors when installing files now correctly display the name of the file containing the bad code. o In Windows, tcltk windows were not always properly constructed. (PR#15150) o The internal functions implementing parse(), tools::parseLatex() and tools::parse_Rd() were not reentrant, leading to errors in rare circumstances such as a garbage collection triggering a recursive call. o Field assignments in reference class objects via $<- were not being checked because the magic incantation to turn methods on for that primitive operator had been inadvertently omitted. o setHook(hookname, value, action="replace") set the hook to be the value, rather than a list containing the value as documented. (PR#15167) o If a package used a NEWS.Rd file, the main HTML package index page did not link to it. (Reported by Dirk Eddelbuettel.) o The primitive implementation of @<- was not checking the class of the replacement. It now does a check, quicker but less general than slot<-. See the help. o split(x, f) now recycles classed objects x in the same way as vectors. (Reported by Martin Morgan.) o pbeta(.28, 1/2, 2200, lower.tail=FALSE, log.p=TRUE) is no longer -Inf; ditto for corresponding pt() and pf() calls, such as pt(45, df=5000, lower.tail=FALSE, log.p=TRUE). (PR#15162) o The Windows graphics device would crash R if a user attempted to load the graphics history from a variable that was not a saved history. (PR#15230) o The workspace size for the predict() method for loess() could exceed the maximum integer size. (Reported by Hiroyuki Kawakatsu.) o ftable(x, row.vars, col.vars) now also works when the *.vars arguments are (integer or character vectors) of length zero. o Calling cat() on a malformed UTF-8 string could cause the Windows GUI to lock up. (PR#15227) o removeClass(cc) gave "node stack overflow" for some class definitions containing "array" or "matrix". -- Peter Dalgaard, Professor Center for Statistics, Copenhagen Business School Solbjerg Plads 3, 2000 Frederiksberg, Denmark Phone: (+45)38153501 Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com From pd.mes at cbs.dk Mon May 6 13:58:17 2013 From: pd.mes at cbs.dk (Peter Dalgaard) Date: Mon, 6 May 2013 13:58:17 +0200 Subject: [R] Release plans: R-3.0.1 on May 16 Message-ID: <144C148F-C18F-40DD-9DF5-D292CEE88EC7@cbs.dk> We intend to have a patch release version on May 16. The nickname will be "Good Sport". Apologies for the somewhat belated announcement. -- Peter Dalgaard, Professor Center for Statistics, Copenhagen Business School Solbjerg Plads 3, 2000 Frederiksberg, Denmark Phone: (+45)38153501 Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com From pd.mes at cbs.dk Thu May 16 09:51:42 2013 From: pd.mes at cbs.dk (Peter Dalgaard) Date: Thu, 16 May 2013 09:51:42 +0200 Subject: R 3.0.1 is released Message-ID: The build system rolled up R-3.0.1.tar.gz (codename "Good Sport") this morning. The list below details the changes in this release. You can get the source code from http://cran.r-project.org/src/base/R-3/R-3.0.1.tar.gz or wait for it to be mirrored at a CRAN site nearer to you. Binaries for various platforms will appear in due course. For the R Core Team Peter Dalgaard These are the md5sums for the freshly created files, in case you wish to check that they are uncorrupted: MD5 (AUTHORS) = cbf6da8f886ccd8d0dda0cc7ffd1b8ec MD5 (COPYING) = eb723b61539feef013de476e68b5c50a MD5 (COPYING.LIB) = a6f89e2100d9b6cdffcea4f398e37343 MD5 (FAQ) = c7720d17cb5b89d93797b1a768554331 MD5 (INSTALL) = 37adac6d0fbadf25b5a40e3f7535415e MD5 (NEWS) = 15df4ea633f255947222a924a2f431ad MD5 (NEWS.html) = a2f82dc771e596813aa3524871a48c00 MD5 (ONEWS) = 0c3e10eef74439786e5fceddd06dac71 MD5 (OONEWS) = b0d650eba25fc5664980528c147a20db MD5 (R-latest.tar.gz) = 36d51544b007fff26c7fbf36b02ea5ad MD5 (README) = e259ae5dd943b8547f0b7719664e815b MD5 (RESOURCES) = c7cb32499ebbf85deb064aab282f93a4 MD5 (THANKS) = d4b45e302b7cad0fc4bb50d2cfe69649 MD5 (R-3/R-3.0.1.tar.gz) = 36d51544b007fff26c7fbf36b02ea5ad This is the relevant part of the NEWS file CHANGES IN R 3.0.1: NEW FEATURES: o chooseCRANmirror() and chooseBioCmirror() gain an ind argument (like setRepositories()). o mcparallel has a new argument mc.interactive which can modify the interactive flag in the child process. The new default is FALSE which makes child processes non-interactive by default (this prevents lock-ups due to children waiting for interactive input). o scan() now warns when end-of-file occurs within a quoted string. o count.fields() is now consistent with scan() in its handling of newlines in quoted strings. Instead of triggering an error, this results in the current line receiving NA as the field count, with the next line getting the total count of the two lines. o The default method of image() will plot axes of the class of xlim and ylim (and hence of x and y if there is a suitable range() method). Based on a suggestion of Michael Sumner. o load() now has a verbose argument for debugging support, to print the names of objects just before loading them. o When loading a serialized object encounters a reference to a namespace which cannot be loaded, this is replaced by a reference to the global environment, with a warning. o pairs() gains a line.main option for title placement. o The remaining instances in which serialization to a raw vector was limited to 2GB have been unlimited on a 64-bit platform, and in most cases serialization to a vector of more than 1GB will be substantially faster. UTILITIES: o R CMD config now make use of personal Makevars files under ~/.R and a site file Makevars.site, in the same way as R CMD SHLIB and R CMD INSTALL. This makes the utility more useful in package configure scripts. On Windows finding the personal files may require the environment variable HOME set. The old behaviour can be obtained with the new options --no-user-files and --no-site-files. PACKAGE INSTALLATION: o Alternatives to the site and user customization files Makevars.site and ~/.R/Makevars can be specified _via_ the environment variables R_MAKEVARS_SITE and R_MAKEVARS_USER respectively. These can be used to suppress the use of the default files by setting an empty value (where possible) or a non-existent path. BUG FIXES: o sys.source() did not report error locations when keep.source = TRUE. o as.POSIXct.numeric was coercing origin using the tz argument and not "GMT" as documented (PR#14973). o The active binding to assign fields in reference classes has been cleaned up to reduce dependence on the class' package environment, also fixing bug in initializing read-only fields (inspired by a report from Hadley Wickham). o str(d) no longer gives an error when names(d) contain illegal multibyte strings (PR#15247). o Profiling of built-in functions with line.profiling= TRUE did not record the line from which they were called. o citation(pkg) dropped the header and footer specified in the CITATION file (PR#15257). o Quotes were handled differently when reading the first line and reading the rest, so read.table() misread some files that contained quote characters (PR#15245). o cat() with sep a character vector of length greater than one and more than one argument was using separators inconsistently (PR#15261). o On Windows in R 3.0.0, savePlot() failed because of an incorrect check on the argument count. o unzip(list = TRUE) returned Names as a factor and not a character vector (as documented) for the internal method. (Noticed by Sean O'Riordain.) o contourLines() now checks more comprehensively for conformance of its x, y and z arguments (it was used incorrectly in package R2G2). o Saved graphics display lists are R version-specific. Attempting to load workspaces containing them (or some other version-specific objects) aborted the load in R 3.0.0 and earlier; now it does a partial load and generates a warning instead. o In R 3.0.0, identify() and locator() did not record information correctly, so replaying a graph (e.g. by copying it to another device) would fail. (PR#15271) o Calling file.copy() or dirname() with the invalid input "" (which was being used in packages, despite not being a file path) could have caused a segfault. dirname("") is now "" rather than "." (unless it segfaulted). o supsmu() could read/write outside its input vectors for very short inputs (seen in package rms for n = 4). o as.dendrogram()'s hclust method uses less memory and hence gets considerably faster for large (n ~ 1000) clusterings, thanks to Daniel M"ullner. (PR#15174) o The return value when all workers failed from parallel::mclapply(mc.presechedule = TRUE) was a list of strings and not of error objects. (Spotted by Karl Forner and Bernd Bischl.) o In R 3.0.0, when help() found multiple pages with the same alias, the HTML display of all the selections was not produced. (PR#15282) o splinefun(method="monoH.FC") now produces a function with first argument named x and allows deriv=3, as documented. (PR#15273) o summaryRprof() would only read the first chunksize lines of an Rprof file produced with line.profiling=TRUE. By default, this is the first 100 seconds. (PR#15288) o lsfit() produced an incorrect error message when argument x had more columns than rows or x had a different number of rows than y. (Spotted by Renaud Gaujoux.) o Binary operations on equal length vectors copied the class name from the second operand when the first had no class name, but did not set the object bit. (PR#15299) o The trace() method for reference generator objects failed after those objects became function definitions. o write.table() did not check that factors were constructed correctly, and so caused a segment fault when writing bad ones. (PR#15300) o The internal HTTP server no longer chokes on POST requests without body. It will also pass-through other request types for custom handlers (with the method stored in Request-Method header) instead of failing. -- Peter Dalgaard, Professor Center for Statistics, Copenhagen Business School Solbjerg Plads 3, 2000 Frederiksberg, Denmark Phone: (+45)38153501 Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com From Hadley.Wickham at r-project.org Tue Jul 2 20:32:01 2013 From: Hadley.Wickham at r-project.org (Hadley Wickham) Date: Tue, 2 Jul 2013 13:32:01 -0500 Subject: The R Journal, Volume 5, Issue 1 Message-ID: Dear all, The latest issue of The R Journal is now available at http://journal.r-project.org/archive/2013-1/ Many thanks to all contributors. Hadley -- Editor-in-chief, The R Journal From pdalgd at gmail.com Thu Aug 22 08:29:53 2013 From: pdalgd at gmail.com (peter dalgaard) Date: Thu, 22 Aug 2013 08:29:53 +0200 Subject: R-3.0.2 on September 25 Message-ID: <9354C77B-01F6-45CA-A585-EEA97B34C72A@gmail.com> Just a quick note, mainly to warn off maintainers of the recommended packages, that we intend to release R-3.0.2 on Wednesday, September 25. We'll be following the usual schedule from http://developer.r-project.org/release-checklist.html Notice in particular that new versions of recommended packages should be finalized 14 days before release. For the R Core Team Peter Dalgaard -- Peter Dalgaard, Professor, Center for Statistics, Copenhagen Business School Solbjerg Plads 3, 2000 Frederiksberg, Denmark Phone: (+45)38153501 Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com From jfox at MCMASTER.CA Fri Aug 23 00:43:55 2013 From: jfox at MCMASTER.CA (John Fox) Date: Thu, 22 Aug 2013 18:43:55 -0400 Subject: Rcmdr version 2.0-0 now on CRAN Message-ID: <002501ce9f89$15a4a250$40ede6f0$@mcmaster.ca> Dear R-help list members, Version 2.0-0 of the Rcmdr package is now on CRAN and should appear presently on the various CRAN mirrors. As its number implies, this version represents a milestone in the development of the package, which first appeared on CRAN more than 10 years ago. The transition to version 2.0-0 reflects both substantial upgrades to the Rcmdr interface in the new release (see the release notes below) as well as accumulated changes in recent versions. Of particular note beyond the interface improvements is the integration of HTML and PDF report generation via the knitr and markdown packages. >From the package NEWS file: Changes to Version 2.0-0 o New package co-author: Milan Bouchet-Valet. o Many changes to style of dialogs: tabs, Reset button, Apply button, small interface improvements. o Support for R Markdown, with script and Rmd tabs. o Expanded Options dialog and new Save Options dialog. o Better handling of default fonts. o Improved probability plots. o Introduced plotDistr(), lineplot(), indexplot() convenience functions. o New nonparametric density estimate dialog. o Use automatic point identification as default in plot dialogs. o Calls to deprecated functions .find.package() and .path.package() replaced by find.package() and path.package() (suggestion of Brian Ripley). o Partial correlations now optionally report pairwise p-values (suggestion of Aaron Swink). o Removed Sciviews support code. o Small fixes. o Updated translations (with thanks to the translators): Italian (Stefano Calza), Korean (Jong-Hwa Shin), Romanian (Adrian Dusa), Russian (Alexey Shipunov), Spanish (Manuel Munoz-Marquez). o Show menu item for English introductory manual even if a "translation" is available (suggestion of Manuel Munoz-Marquez). As usual, please report bugs or problems to jfox at mcmaster.ca. Comments and suggestions are also appreciated. Best, John and Milan ----------------------------------------------- John Fox McMaster University Hamilton, Ontario, Canada From pd.mes at cbs.dk Wed Sep 25 13:16:24 2013 From: pd.mes at cbs.dk (Peter Dalgaard) Date: Wed, 25 Sep 2013 13:16:24 +0200 Subject: R 3.0.2 is released Message-ID: The build system rolled up R-3.0.2.tar.gz (codename "Frisbee Sailing") this morning. The list below details the changes in this release. You can get the source code from http://cran.r-project.org/src/base/R-3/R-3.0.2.tar.gz or wait for it to be mirrored at a CRAN site nearer to you. Binaries for various platforms will appear in due course. For the R Core Team Peter Dalgaard These are the md5sums for the freshly created files, in case you wish to check that they are uncorrupted: MD5 (AUTHORS) = cbf6da8f886ccd8d0dda0cc7ffd1b8ec MD5 (COPYING) = eb723b61539feef013de476e68b5c50a MD5 (COPYING.LIB) = a6f89e2100d9b6cdffcea4f398e37343 MD5 (FAQ) = 77da68a9d0abfa9121d54f6ff0bced33 MD5 (INSTALL) = 3964b9119adeaab9ceb633773fc94aac MD5 (NEWS) = e01b5a01aade71ccef967d39f3738e0a MD5 (NEWS.html) = 1925b57c75bd51373adb33a04e2c18f8 MD5 (R-latest.tar.gz) = f9a8374736e7650e4848f33e2e3bbee3 MD5 (README) = e259ae5dd943b8547f0b7719664e815b MD5 (RESOURCES) = c7cb32499ebbf85deb064aab282f93a4 MD5 (THANKS) = d4b45e302b7cad0fc4bb50d2cfe69649 MD5 (R-3/R-3.0.2.tar.gz) = f9a8374736e7650e4848f33e2e3bbee3 This is the relevant part of the NEWS file CHANGES IN R 3.0.2: NEW FEATURES: * The NEWS files have been re-organized. This file contains news for R >= 3.0.0: news for the 0.x.y, 1.x.y and 2.x.y releases is in files NEWS.0, NEWS.1 and NEWS.2. The latter files are now installed when R is installed. An HTML version of news from 2.10.0 to 2.15.3 is available as doc/html/NEWS.2.html. * sum() for integer arguments now uses an integer accumulator of at least 64 bits and so will be more accurate in the very rare case that a cumulative sum exceeds 2^53 (necessarily summing more than 4 million elements). * The example() and tools::Rd2ex() functions now have parameters to allow them to ignore \dontrun markup in examples. (Suggested by Peter Solymos.) * str(x) is considerably faster for very large lists, or factors with 100,000 levels, the latter as in PR#15337. * col2rgb() now converts factors to character strings not integer codes (suggested by Bryan Hanson). * tail(warnings()) now works, via the new `[` method. * There is now support for the LaTeX style file zi4.sty which has in some distributions replaced inconsolata.sty. * unlist(x) now typically returns all non-list xs unchanged, not just the "vector" ones. Consequently, format(lst) now also works when the list lst has non-vector elements. * The tools::getVignetteInfo() function has been added to give information about installed vignettes. * New assertCondition(), etc. utilities in tools, useful for testing. * Profiling now records non-inlined calls from byte-compiled code to BUILTIN functions. * Various functions in stats and elsewhere that use non-standard evaluation are now more careful to follow the namespace scoping rules. E.g. stats::lm() can now find stats::model.frame() even if stats is not on the search path or if some package defines a function of that name. * If an invalid/corrupt .Random.seed object is encountered in the workspace it is ignored with a warning rather than giving an error. (This allows R itself to rely on a working RNG, e.g. to choose a random port.) * seq() and seq.int() give more explicit error messages if called with invalid (e.g. NaN) inputs. * When parse() finds a syntax error, it now makes partial parse information available up to the location of the error. (Request of Reijo Sund.) * Methods invoked by NextMethod() had a different dynamic parent to the generic. This was causing trouble where S3 methods invoked via lazy evaluation could lose track of their generic. (PR#15267) * Code for the negative binomial distribution now treats the case size == 0 as a one-point distribution at zero. * abbreviate() handles without warning non-ASCII input strings which require no abbreviation. * read.dcf() no longer has a limit of 8191 bytes per line. (Wish of PR#15250.) * formatC(x) no longer copies the class of x to the result, to avoid misuse creating invalid objects as in PR#15303. A warning is given if a class is discarded. * Dataset npk has been copied from MASS to allow more tests to be run without recommended packages being installed. * The initialization of the regression coefficients for non-degenerate differenced models in arima() has been changed and in some examples avoids a local maximum. (PR#15396) * termplot() now has an argument transform.x to control the display of individual terms in the plot. (PR#15329) * format() now supports digits = 0, to display nsmall decimal places. * There is a new read-only par() parameter called "page", which returns a logical value indicating whether the next plot.new() call will start a new page. * Processing Sweave and Rd documents to PDF now renders backticks and single quotes better in several instances, including in \code and \samp expressions. * utils::modifyList() gets a new argument keep.null allowing NULL components in the replacement to be retained, instead of causing corresponding components to be deleted. * tools::pkgVignettes() gains argument check; if set to TRUE, it will warn when it appears a vignette requests a non-existent vignette engine. UTILITIES: * R CMD check --as-cran checks the line widths in usage and examples sections of the package Rd files. * R CMD check --as-cran now implies --timings. * R CMD check looks for command gfile if a suitable file is not found. (Although file is not from GNU, OpenCSW on Solaris installs it as gfile.) * R CMD build (with the internal tar) checks the permissions of configure and cleanup files and adds execute permission to the recorded permissions for these files if needed, with a warning. This is useful on OSes and file systems which do not support execute permissions (notably, on Windows). * R CMD build now weaves and tangles all vignettes, so suggested packages are not required during package installation if the source tarball was prepared with current R CMD build. * checkFF() (used by R CMD check) does a better job of detecting calls from other packages, including not reporting those where a function has been copied from another namespace (e.g. as a default method). It now reports calls where .NAME is a symbol registered in another package. * On Unix-alike systems, R CMD INSTALL now installs packages group writably whenever the library (lib.loc) is group writable. Hence, update.packages() works for other group members (suggested originally and from a patch by Dirk Eddelbuettel). * R CMD javareconf now supports the use of symbolic links for JAVA_HOME on platforms which have realpath. So it is now possible to use R CMD javareconf JAVA_HOME=/usr/lib/jvm/java-1.7.0 on a Linux system and record that value rather than the frequently-changing full path such as /usr/lib/jvm/java-1.7.0-openjdk-1.7.0.25.x86_64. * (Windows only.) Rscript -e requires a non-empty argument for consistency with Unix versions of R. (Also Rterm -e and R -e.) * R CMD check does more thorough checking of declared packages and namespaces. It reports * packages declared in more than one of the Depends, Imports, Suggests and Enhances fields of the DESCRIPTION file. * namespaces declared in Imports but not imported from, neither in the NAMESPACE file nor using the :: nor ::: operators. * packages which are used in library() or requires() calls in the R code but were already put on the search path _via_ Depends. * packages declared in Depends not imported _via_ the NAMESPACE file (except the standard packages). Objects used from Depends packages should be imported to avoid conflicts and to allow correct operation when the namespace is loaded but not attached. * objects imported _via_ ::: calls where :: would do. * objects imported by :: which are not exported. * objects imported by ::: calls which do not exist. See 'Writing R Extensions' for good practice. * R CMD check optionally checks for non-standard top-level files and directories (which are often mistakes): this is enabled for --as-cran. * LaTeX style file upquote.sty is no longer included (the version was several years old): it is no longer used in R. A much later version is commonly included in LaTeX distributions but does not play well with the ae fonts which are the default for Sweave vignettes. * R CMD build makes more use of the build sub-directory of package sources, for example to record information about the vignettes. INSTALLATION and INCLUDED SOFTWARE: * The macros used for the texinfo manuals have been changed to work better with the incompatible changes made in texinfo 5.x. * The minimum version for a system xz library is now 5.0.3 (was 4.999). This is in part to avoid 5.0.2, which can compress in ways other versions cannot decompress. * The included version of PCRE has been updated to 8.33. * The included version of zlib has been updated to 1.2.8, a bug-fix release. * The included version of xz utils's liblzma has been updated to 5.0.5. * Since javareconf (see above) is used when R is installed, a stable link for JAVA_HOME can be supplied then. * Configuring with --disable-byte-compilation will override the DESCRIPTION files of recommended packages, which typically require byte-compilation. * More of the installation and checking process will work even when TMPDIR is set to a path containing spaces, but this is not recommended and external software (such as texi2dvi) may fail. PACKAGE INSTALLATION: * Installation is aborted immediately if a LinkingTo package is not installed. * R CMD INSTALL has a new option --no-byte-compile which will override a ByteCompile field in the package's DESCRIPTION file. * License BSD is deprecated: use BSD_3_clause or BSD_2_clause instead. License X11 is deprecated: use MIT or BSD_2_clause instead. * Version requirements for LinkingTo packages are now recognized: they are checked at installation. (Fields with version requirements were previously silently ignored.) * The limit of 500 S3method entries in a NAMESPACE file has been removed. * The default 'version' of Bioconductor for its packages has been changed to the upcoming 2.13, but this can be set by the environment variable R_BIOC_VERSION, e.g. in file Renviron.site. C-LEVEL FACILITIES: * Rdefines.h has been tweaked so it can be included in C++ code after R_ext/Boolean.h (which is included by R.h). Note that Rdefines.h is not kept up-to-date, and Rinternals.h is preferred for new code. * eval and applyClosure are now protected against package code supplying an invalid rho. DEPRECATED AND DEFUNCT: * The unused namespace argument to package.skeleton() is now formally deprecated and will be removed in R 3.1.0. * plclust() is deprecated: use the plot() method for class "hclust" instead. * Functions readNEWS() and checkNEWS() in package tools are deprecated (and they have not worked with current NEWS files for a long time). DOCUMENTATION: * 'An Introduction to R' has a new chapter on using R as a scripting language including interacting with the OS. BUG FIXES: * help.request() could not determine the current version of R on CRAN. (PR#15241) * On Windows, file.info() failed on root directories unless the path was terminated with an explicit ".". (PR#15302) * The regmatches<-() replacement function mishandled results coming from regexpr(). (PR#15311) * The help for setClass() and representation() still suggested the deprecated argument representation=. (PR#15312) * R CMD config failed in an installed build of R 3.0.1 (only) when a sub-architecture was used. (Reported by Berwin Turlach.) * On Windows, the installer modified the etc/Rconsole and etc/Rprofile.site files even when default options were chosen, so the MD5 sums did not refer to the installed versions. (Reported by Tal Galili.) * plot(hclust(), cex =) respects cex again (and possibly others similarly). (Reported by Peter Langfelder.) * If multiple packages were checked by R CMD check, and one was written for a different OS, it would set --no-install for all following packages as well as itself. * qr.coef() and related functions did not properly coerce real vectors to complex when necessary. (PR#15332) * ftable(a) now fixes up empty dimnames such that the result is printable. * package.skeleton() was not starting its search for function objects in the correct place if environment was supplied. (Reported by Karl Forner.) * Parsing code was changing the length field of vectors and confusing the memory manager. (PR#15345) * The Fortran routine ZHER2K in the reference BLAS had a comment-out bug in two places. This caused trouble with eigen() for Hermitian matrices. (PR#15345 and report from Robin Hankin) * vignette() and browseVignettes() did not display non-Sweave vignettes properly. * Two warning/error messages have been corrected: the (optional) warning produced by a partial name match with a pairlist, the error message from a zero-length argument to the : operator. (Found by Radford Neal; PR#15358, PR#15356) * svd() returned NULL rather than omitting components as documented. (Found by Radford Neal; PR#15360) * mclapply() and mcparallel() with silent = TRUE could break a process that uses stdout output unguarded against broken pipes (e.g., zip will fail silently). To work around such issues, they now replace stdout with a descriptor pointed to /dev/null instead. For this purpose, internal closeStdout and closeStderr functions have gained the to.null flag. * log(), signif() and round() now raise an error if a single named argument is not named x. (PR#15361) * deparse() now deparses raw vectors in a form that is syntactically correct. (PR#15369) * The jpeg driver in Sweave created a JPEG file, but gave it a .png extension. (PR#15370) * Deparsing of infix operators with named arguments is improved. (PR#15350) * mget(), seq.int() and numericDeriv() did not duplicate arguments properly. (PR#15352, PR#15353, PR#15354) * kmeans(algorithm = "Hartigan-Wong") now always stops iterating in the QTran stage. (PR#15364). * read.dcf() re-allocated incorrectly and so could segfault when called on a file with lines of more than 100 bytes. * On systems where mktime() does not set errno, the last second before the epoch could not be converted from POSIXlt to POSIXct. (Reported by Bill Dunlap.) * add1.glm() miscalculated F-statistics when df > 1. (Bill Dunlap, PR#15386). * stem() now discards infinite inputs rather than hanging. (PR#15376) * The parser now enforces C99 syntax for floating point hexadecimal constants (e.g. 0x1.1p0), rather than returning unintended values for malformed constants. (PR#15234) * model.matrix() now works with very long LHS names (more than 500 bytes). (PR#15377) * integrate() reverts to the pre-2.12.0 behaviour: from 2.12.0 to 3.0.1 it sometimes failed to achieve the requested tolerance and reported error estimates that were exceeded. (PR#15219) * strptime() now handles %W fields with value 0. (PR#15915) * R is now better protected against people trying to interact with the console in startup code. (PR#15325) * Subsetting 1D arrays often lost dimnames (PR#15301). * Unary + on a logical vector did not coerce to integer, although unary - did. * na.omit() and na.exclude() added a row to a zero-row data frame. (PR#15399) * All the (where necessary cut-down) vignettes are installed if R was configured with --without-recommended-packages. * source() did not display filenames when reporting syntax errors. * Syntax error reports misplaced the caret pointing out the bad token. * (Windows only) Starting R with R (instead of Rterm or Rgui) would lose any zero-length strings from the command line arguments. (PR#15406) * Errors in the encoding specified on the command line via --encoding=foo were not handled properly. (PR#15405) * If x is a symbol, is.vector(x, "name") now returns TRUE, since "name" and "symbol" should be synonyms. (Reported by Herv'e Pag`es.) * R CMD rtags works on platforms (such as OS X) with a XSI-conformant shell command echo. (PR#15231) * is.unsorted(NA) returns false as documented (rather than NA). * R CMD LINK did not know about sub-architectures. * system() and system2() are better protected against users who misguidedly have spaces in the temporary directory path. * file.show() and edit() are now more likely to work on file paths containing spaces. (Where external utilities are used, not the norm on Windows nor in R.app which should previously have worked.) * Packages using the methods package are more likely to work when they import it but it is not attached. (Several parts of its C code were looking for its R functions on the search path rather than in its namespace.) * lgamma(-x) is no longer NaN for very small x. * (Windows) system2() now respects specifying stdout and stderr as files if called from Rgui. (PR#15393) * Closing an x11() device whilst locator() or identify() is in progress no longer hangs R. (PR#15253) * list.dirs(full.names = FALSE) was not implemented. (PR#15170) * format() sometimes added unnecessary spaces. (PR#15411) * all.equal(check.names = FALSE) would ignore the request to ignore the names and would check them as attributes. * The symbol set by tools::Rd2txt_options(itemBullet=) was not respected in some locales. (PR#15435) * mcMap() was not exported by package parallel. (PR#15439) * plot() for TukeyHSD objects did not balance dev.hold() and dev.flush() calls on multi-page plots. (PR#15449) -- Peter Dalgaard, Professor Center for Statistics, Copenhagen Business School Solbjerg Plads 3, 2000 Frederiksberg, Denmark Phone: (+45)38153501 Email: pd.mes at cbs.dk Priv: PDalgd at gmail.com