[BioC] xps import data derived from different batches

cstrato cstrato at aon.at
Mon Mar 8 21:50:52 CET 2010


Dear Steve,

Currently, CEL-files can exist in three formats, i.e. text, xda and 
calvin. Both xda and calvin are binary formats; however, both have 
header lines which are text only.

Your output for DK_A3_03.CEL shows the header of a calvin file. In 
contrast, the output for DK_A1_01.CEL is binary only, thus the complete 
header is missing. I cannot even say that this is a CEL-file at all.

I am afraid that you cannot use this CEL-file. If you have access to the 
DK_A1_01.DAT file, you should be able to recreate the corresponding CEL-file.
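
To check the remaining CEL-files before importing them, you could look at the leading bytes of each file. The following is only a minimal sketch and is not part of xps. It assumes that text CEL-files start with "[CEL]", that xda files start with the 32-bit little-endian integer 64, and that calvin files start with the byte 59 (";") followed by a version byte 1. These magic values and the function name cel_format are my assumptions, so please verify them against the Affymetrix file format documentation.

```shell
#!/bin/sh
# Classify a CEL-file by its first bytes.
# The magic values below are assumptions, see the note above.
cel_format() {
    # Hex dump of the first 5 bytes, without offsets, spaces or newlines.
    magic=$(head -c 5 "$1" | od -An -tx1 | tr -d ' \n')
    case "$magic" in
        5b43454c5d) echo "text" ;;     # "[CEL]" (Version=3 text file)
        40000000*)  echo "xda" ;;      # 32-bit little-endian integer 64
        3b01*)      echo "calvin" ;;   # magic byte 59, version byte 1
        *)          echo "unknown" ;;  # no recognizable header
    esac
}

# Print one "file<TAB>format" line for every file given on the command line.
for f in "$@"; do
    printf '%s\t%s\n' "$f" "$(cel_format "$f")"
done
```

If you save this as, say, check_cel.sh (a hypothetical name), you can run it as "sh check_cel.sh *.CEL" in the directory containing the CEL-files. Any file reported as "unknown" should be left out of the celfiles list until it has been recreated.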

Best regards
Christian


Steve Shen wrote:
> Hi Christian,
>
> It's possible. I looked at the time stamps of the files, and the bad one 
> has a different time stamp too. Here is what I got after running the commands.
>
> Best,
> Steve
>
> new-host:PhaseII ssshen$ head -n17 DK_A1_01.CEL
> [about 100 lines of unreadable binary data, with no readable header text anywhere in the output]
>
>
>
> new-host:PhaseII ssshen$ head -n17 DK_A3_03.CEL
> ;={ffymetrix-calvin-intensity60000029318-1257442883-0000021238-0000007719-0000000038en-US+affymetrix-algorithm-nameDFeature 
> Extraction Cell Generation
> text/plainaffymetrix-algorithm-version1.0.0.676
> text/plainaffymetrix-array-type?MoGene-1_0-st-v1
> text/plainaffymetrix-library-package?Universal
> text/plainaffymetrix-cel-rowstext/x-calvin-integer-32affymetrix-cel-colstext/x-calvin-integer-32program-company 
> Affymetrix, Inc.
> text/plain
>           program-nameFAffymetrix Genechip Command Console
> text/plain
> program-id0.0.0.676
> text/plain,affymetrix-algorithm-param-FeatureExtractioTRUE
> text/plain1affymetrix-algorithm-param-IgnoreShiftRowOutliers
> FALSE
> text/plain(affymetrix-algorithm-param-FixedCellSizTRUE
> text/plain&affymetrix-algorithm-param-UseSubgridTRUE
> text/plain*affymetrix-algorithm-param-RandomizePixels
> FALSE
> text/plain%affymetrix-algorithm-param-ErrorBasisStdvMean
> text/plain+affymetrix-algorithm-param-PercentileSpreadAptext/x-calvin-float"affymetrix-algorithm-param-StdMult??text/x-calvin-float*affymetrix-algorithm-param-ExtendPoolWidthtext/x-calvin-integer-32+affymetrix-algorithm-param-ExtendPoolHeighttext/x-calvin-integer-32%affymetrix-algorithm-param-OutlierLow???text/x-calvin-float&affymetrix-algorithm-param-OutlierHigh??text/x-calvin-float4affymetrix-algorithm-param-OutlierRatioLowPercentile7text/x-calvin-integer-325affymetrix-algorithm-param-OutlierRatioHighPercentileKtext/x-calvin-integer-32%affymetrix-algorithm-param-CellMargintext/x-calvin-integer-32.affymetrix-algorithm-param-HalfCellRowsDivisortext/x-calvin-integer-320affymetrix-algorithm-param-HalfCellRowsRemaindertext/x-calvin-integer-327affymetrix-algorithm-param-CellIntensityCalculationTypePercentile
> new-host:PhaseII ssshen$
>
>
> On Mon, Mar 8, 2010 at 2:46 PM, cstrato <cstrato at aon.at> wrote:
>
>     Dear Steve,
>
>     As you have mentioned in your last mail, you got a corrupt
>     CEL-file which is more than 10 times the usual size.
>     This could be caused by a missing EOF.
>     However, the error message you get should only appear for
>     Version=3 CEL-files, i.e. text files, which could explain the
>     large size. Could it be that someone has converted this CEL-file
>     from binary to text format?
>
>     Could you please run the following commands:
>     $ head -n17 DK_A1_01.CEL
>     $ head -n17 DK_A3_03.CEL
>     and send me the output.
>
>     Best regards
>     Christian
>
>
>     Steve Shen wrote:
>
>         The problem is indeed this CEL file. The import went well when I
>         removed this .CEL file from the import list. The question is what is
>         wrong with this file. It can be read as a normal one by Affymetrix
>         tools (e.g. Expression Console). If anyone is interested in this
>         problem, I can attach both the good and the bad CEL files.
>
>         Thanks,
>         Steve
>
>         On Sun, Mar 7, 2010 at 10:00 PM, Steve Shen <sshen at bu.edu> wrote:
>
>            The problem seems to be related to one of the cel files. The
>            error says:
>
>            Error: Header line of file
>            <~/Projects/DKotton_Gustavo/PhaseII/DK_A1_01.CEL> is not correct.
>
>            What does this mean? To me, all the .cel files look the same.
>
>            Best,
>            Steve
>
>
>
>            On Sun, Mar 7, 2010 at 9:51 PM, Steve Shen <sshen at bu.edu> wrote:
>
>                Sorry, misclick!
>
>                On Sun, Mar 7, 2010 at 9:46 PM, Steve Shen <sshen at bu.edu> wrote:
>
>                    Dear Christian,
>
>                    I couldn't figure out exactly what the problem is, but
>                    maybe you can point it out easily. I have a set of cel
>                    files which are derived from two different batches. I
>                    tried to import the data with one command line, but it
>                    didn't work (the error message is below), and
>                    getTreeNames gives me an error that says "object
>                    'xxx.root' not found", although the root file is
>                    actually in the folder.
>
>                    I then imported the first batch of cel files and used
>                    the addData command to import the second batch. It
>                    turned out that the import.data command worked as
>                    expected, but the addData command generated exactly the
>                    same error message as the first time (see below).
>                    However, this time getTreeNames worked.
>
>                    The commands and errors are appended. Thanks,
>
>                    Steve
>
>                
>                    ###import two batches
>
>                    > data.xps <- import.data(scheme.mogene10st, "gm_dk_phase2", filedir=datdir, celfiles=celfiles, celnames=celnames)
>
>                Errors, (see below)
>                > getTreeNames(rootFile(data.xps))
>                Error: object 'data.xps' not found
>                Error in rootFile(data.xps) :
>                  error in evaluating the argument 'object' in selecting a method for function 'rootFile'
>
>                ###import first batch first, then add
>                > data.gm.dk.xps <- import.data(scheme.mogene10st, "gm_dk_phase2", filedir=datdir, celdir=celdir, celfiles=celfiles, celnames=celnames)
>                #no errors
>                > data.gm.dk.xps <- addData(data.gm.dk.xps, celdir=celdir, celfiles=celfiles02, celnames=celnames02)
>                Error in .local(object, ...) : error in function ‘ImportData’
>                Opening file <~/XPS/scheme/Scheme_MoGene10stv1r4_na30.root> in <READ> mode...
>                Opening file <~/XPS/ROOTdata/gm_dk_phase2_cel.root> in <UPDATE> mode...
>                Importing <~/Projects/DKotton_Gustavo/PhaseII/DK_A7_07.CEL> as <DK_A7_07.cel>...
>                   hybridization statistics:
>                      1 cells with minimal intensity 18
>                      1 cells with maximal intensity 11987
>                Importing <~/Projects/DKotton_Gustavo/PhaseII/DK_A8_08.CEL> as <DK_A8_08.cel>...
>                   hybridization statistics:
>                      17 cells with minimal intensity 21
>                      1 cells with maximal intensity 13337
>                Importing <~/DKotton_Gustavo/PhaseII/DK_B1_09.CEL> as <DK_B1_09.cel>...
>                   hybridization statistics:
>                      2 cells with minimal intensity 19
>                      1 cells with maximal intensity 11798
>                Importing <~/Projects/DKotton_Gustavo/PhaseII/DK_A4_04.CEL> as <DK_A4_04.cel>...
>                   hybridization statistics:
>                      27 cells with minimal intensity 19
>                      1 cells with maximal intensity 14787
>                Importing <~/Projects/DKotton_Gustavo/PhaseII/DK_A5_05.CEL> as <DK_A5_05.cel>...
>                Error: Header line of file </Users/ssshen/US/BU_CTSI/Projects/DKotton_Gustavo/PhaseII/DK_A1_01.CEL> is not correct.
>                   hybridization statistics:
>                      7 cells with minimal intensity 20
>                      1 cells with maximal intensity 15474
>                Importing <~/Projects/DKotton_Gustavo/PhaseII/DK_A3_03.CEL> as <DK_A3_03.cel>...
>                   hybridization statistics:
>                      1 cells with minimal intensity 20
>                      1 cells with maximal intensity 12507
>
>
>
>
>


