[BioC] error reading GSE file
Sean Davis
sdavis2 at mail.nih.gov
Tue Jul 26 14:39:42 CEST 2011
On Tue, Jul 26, 2011 at 6:38 AM, Reema Singh <reema28sep at gmail.com> wrote:
> Dear all
>
> I am trying to read a GSE file in R using GEOquery package but i am getting
> following error.Kindly tell me why i am getting this error. I have tried to
> find out on google. But no luck...
>
> u <- getGEO(filename="GSE1106_family.soft",GSEMatrix=TRUE)
> Parsing....
> Found 22 entities...
> GPL199 (1 of 22 entities)
> GSM18235 (2 of 22 entities)
> GSM18236 (3 of 22 entities)
> Error in substr(x, start = matches + patlen, stop = 1e+07) :
> invalid multibyte string at '<92>s pre'
Hi, Reema.
This is caused by an invalid character in the data from NCBI. I have
contacted them to fix the problem. In the meantime, you can try:
u = getGEO('GSE1106')
This will grab the GSEMatrix file which is apparently unaffected.
Sean
More information about the Bioconductor
mailing list