[R] R: website address for the pseuso-XLS files

Barry Rowlingson b.rowlingson at lancaster.ac.uk
Tue May 25 16:17:30 CEST 2010


On Tue, May 25, 2010 at 3:09 PM, Ted Harding
<Ted.Harding at manchester.ac.uk> wrote:
> It is OpenOffice 2.0 (as updated to openoffice.org-core,
> dfsg.2-7etch9 Sat Jan 16 2010) running on Linux (Debian Etch,
> originally installed Sept 2007), so none of it is particularly
> recent. The command to view the files is like:
>
>  oocalc Prostatic_Neoplasm-miRNAs.DiseaseTargets.xls

 I just tried running those HTML fragments through 'html2text'. Comes
out quite neatly, although this may get messed up by your font
settings in an email page:

$ html2text Prostatic_Neoplasm-miRNAs.DiseaseTargets.xls
Input recoding failed due to invalid input sequence. Unconverted part
of text follows.
�
|________________|Name  _______|________________|____________|_____________|
|Prostatic       |hsa-miR-222    |hsa-mir-222     |X           |19351827     |
|Neoplasms_______|_______________|________________|____________|_____________|
|Prostatic       |hsa-miR-106a   |hsa-mir-106a    |X           |16461460     |
|Neoplasms_______|_______________|________________|____________|_____________|
|Prostatic       |hsa-miR-221    |hsa-mir-221     |X           |19107213     |
|Neoplasms_______|_______________|________________|____________|_____________|

etc etc. At least all the HTML tagging is gone.

Even better, doing html2text -width 500 puts each record on one line,
separated by | and padded out with _ chars.

Barry



More information about the R-help mailing list