[BioC] error reading GSE file
Sean Davis
sdavis2 at mail.nih.gov
Tue Jul 26 16:16:19 CEST 2011
On Tue, Jul 26, 2011 at 8:39 AM, Sean Davis <sdavis2 at mail.nih.gov> wrote:
> On Tue, Jul 26, 2011 at 6:38 AM, Reema Singh <reema28sep at gmail.com> wrote:
>> Dear all
>>
>> I am trying to read a GSE file in R using GEOquery package but i am getting
>> following error.Kindly tell me why i am getting this error. I have tried to
>> find out on google. But no luck...
>>
>> u <- getGEO(filename="GSE1106_family.soft",GSEMatrix=TRUE)
>> Parsing....
>> Found 22 entities...
>> GPL199 (1 of 22 entities)
>> GSM18235 (2 of 22 entities)
>> GSM18236 (3 of 22 entities)
>> Error in substr(x, start = matches + patlen, stop = 1e+07) :
>> invalid multibyte string at '<92>s pre'
>
> Hi, Reema.
>
> This is caused by an invalid character in the data from NCBI. I have
> contacted them to fix the problem.
One has to love the GEO staff. They've already fixed the problem.
Thanks for the report, Reema.
Sean
> In the meantime, you can try:
>
> u = getGEO('GSE1106')
>
> This will grab the GSEMatrix file which is apparently unaffected.
>
> Sean
>
More information about the Bioconductor
mailing list