| untar {utils} | R Documentation | 
Extract or List Tar Archives
Description
Extract files from or list the contents of a tar archive.
Usage
untar(tarfile, files = NULL, list = FALSE, exdir = ".",
      compressed = NA, extras = NULL, verbose = FALSE,
      restore_times =  TRUE,
      support_old_tars = Sys.getenv("R_SUPPORT_OLD_TARS", FALSE),
      tar = Sys.getenv("TAR"))
Arguments
| tarfile | The pathname of the tar file: tilde expansion (see
 | 
| files | A character vector of recorded filepaths to be extracted: the default is to extract all files. | 
| list | If  | 
| exdir | The directory to extract files to (the equivalent of
 | 
| compressed | (Deprecated in favour of auto-detection, used only
for an external  The external command may ignore the selected compression type and detect a type automagically. | 
| extras | 
 | 
| verbose | logical: if true echo the command used for an external
 | 
| restore_times | logical. If true (default) restore file modification times. If false, the equivalent of the -m flag. Times in tarballs are supposed to be in UTC, but tarballs have been submitted to CRAN with times in the future or far past: this argument allows such times to be discarded. Note that file times in a tarball are stored with a resolution of 1 second, and can only be restored to the resolution supported by the file system (which on a FAT system is 2 seconds). | 
| support_old_tars | logical.  If false (the default), the external
 If true, the R code calls an appropriate decompressor and pipes
the output to  | 
| tar | character string: the path to the command to be used or
 | 
Details
This is either a wrapper for a tar command or for an
internal implementation written in R.  The latter is used if
tarfile is a connection or if the argument tar is
"internal" or "" (except on Windows, when
tar.exe is tried first).
Unless otherwise stated three types of compression of the tar file are
supported: gzip, bzip2 and xz.
What options are supported will depend on the tar
implementation used: the "internal" one is intended to provide
support for most in a platform-independent way.
- GNU tar:
- Modern GNU - tarversions support compressed archives and since 1.15 are able to detect the type of compression automatically: version 1.22 added support for- xzcompression and version 1.31 for- zstdcompression.- On a Unix-alike, - configurewill set environment variable TAR, preferring GNU tar if found as- gtaror- gnutar.
- bsdtar:
- macOS 10.6 and later (and FreeBSD, NetBSD >= 9 and some other OSes) have a - tarfrom the libarchive project which detects known-to-it forms of compression automagically. However, this may rely on an external command being available: macOS has a tar which knows about- zstdcompression, but relies on a- zstdcommand which it does not supply.- This added support for - xzin 2019 and for- zstdin 2020 (if the appropriate library or external program is available).- Recent versions of Windows supply a build of - bsdtaras- tar.exe, but with compiled-in support only for- gzipcompression.
- OpenBSD:
- OpenBSD's - tardoes not detect compression automagically. It has no support for- xzbeyond reporting that the file is- xz-compressed. So- support_old_tars = TRUEis recommended.
- Older support:
- Environment variable R_GZIPCMD gives the command to decompress - gzipfiles, and R_BZIPCMD for- bzip2files. (On Unix-alikes these are set at installation if found.) An external program called- xzor- zstdis used if available: if not decompression is expected to fail.
Arguments compressed and verbose are only used for an
external tar.
Some external tar commands will detect some of
lrzip, lzma, lz4 and lzop
compression in addition to gzip, bzip2,
xz and zstd.  (For some external tar
commands, compressed tarfiles can only be read if the appropriate
utility program is available.)  For GNU tar, further
(de)compression programs can be specified by e.g. extras
  = "-I lz4".  For bsdtar this could be extras =
  "--use-compress-program lz4".  Most commands will detect (the
nowadays rarely seen) ‘.tar.Z’ archives compressed by
compress.
The internal implementation restores symbolic links as links on a
Unix-alike, and as file copies on Windows (which works only for
existing files, not for directories), and hard links as links.  If the
linking operation fails (as it may on a FAT file system), a file copy
is tried.  Since it uses gzfile to read a file it can
handle files compressed by any of the methods that function can
handle: at least compress, gzip, bzip2,
xz and zstd compression, and some types of
lzma compression.  It will create the parent directories for
directories or files in the archive if necessary.  It handles the
USTAR/POSIX, GNU and pax ways of handling file paths of
more than 100 bytes, and the GNU way of handling link targets of more
than 100 bytes.
You may see warnings from the internal implementation such as
unsupported entry type 'x'
This often indicates an invalid archive: entry types "A-Z" are
allowed as extensions, but other types are reserved.  The only thing
you can do with such an archive is to find a tar program that
handles it, and look carefully at the resulting files.  There may also
be the warning 
using pax extended headers
This indicates that additional information may have been discarded, such as ACLs, encodings ....
The former standards only supported ASCII filenames (indeed, only
alphanumeric plus period, underscore and hyphen).  untar makes
no attempt to map filenames to those acceptable on the current system,
and treats the filenames in the archive as applicable without any
re-encoding in the current locale.
The internal implementation does not special-case ‘resource
forks’ in macOS: that system's tar command does. This may
lead to unexpected files with names with prefix ‘._’.
Some external commands have ‘security’ measures to change the
extracted file paths.  For example, GNU tar and
bsdtar remove the leading slash on absolute filepaths:
specify extras = "-P" to override this. bsdtar
refuses to extract paths containing ".."  with the same
workaround.  The internal implementation removes leading slashes (with
a warning) and stops with an error for paths starting with "~"
or containing "..", again unless extras = "-P" is specified.
Extracted files will overwrite existing file unless flag -k
is included in extras.
Value
If list = TRUE, a character vector of (relative or absolute)
paths of files contained in the tar archive.
Otherwise the return code from system with an external
tar or 0L, invisibly.