[R] R: website address for the pseuso-XLS files
Barry Rowlingson
b.rowlingson at lancaster.ac.uk
Tue May 25 16:17:30 CEST 2010
On Tue, May 25, 2010 at 3:09 PM, Ted Harding
<Ted.Harding at manchester.ac.uk> wrote:
> It is OpenOffice 2.0 (as updated to openoffice.org-core,
> dfsg.2-7etch9 Sat Jan 16 2010) running on Linux (Debian Etch,
> originally installed Sept 2007), so none of it is particularly
> recent. The command to view the files is like:
>
> oocalc Prostatic_Neoplasm-miRNAs.DiseaseTargets.xls
I just tried running those HTML fragments through 'html2text'. Comes
out quite neatly, although this may get messed up by your font
settings in an email page:
$ html2text Prostatic_Neoplasm-miRNAs.DiseaseTargets.xls
Input recoding failed due to invalid input sequence. Unconverted part
of text follows.
�
|________________|Name _______|________________|____________|_____________|
|Prostatic |hsa-miR-222 |hsa-mir-222 |X |19351827 |
|Neoplasms_______|_______________|________________|____________|_____________|
|Prostatic |hsa-miR-106a |hsa-mir-106a |X |16461460 |
|Neoplasms_______|_______________|________________|____________|_____________|
|Prostatic |hsa-miR-221 |hsa-mir-221 |X |19107213 |
|Neoplasms_______|_______________|________________|____________|_____________|
etc etc. At least all the HTML tagging is gone.
Even better, doing html2text -width 500 puts each record on one line,
separated by | and padded out with _ chars.
Barry
More information about the R-help
mailing list