[R] vlmc - "In vlmc(traffic.clusters.stationary, cutoff = i) : alphabet with >1-letter strings; trying to abbreviate"
Constantinos Antoniou
constantinos.antoniou.rlists at gmail.com
Wed Jun 30 21:08:58 CEST 2010
Dear Martin,
Thank you very much for the (reassuring and helpful) response.
Best regards,
Costas
On Jun 30, 2010, at 3:58 PM, Martin Maechler wrote:
>>>>>> "CA" == Constantinos Antoniou <constantinos.antoniou.rlists at gmail.com>
>>>>>> on Wed, 30 Jun 2010 12:07:16 +0300 writes:
>>>>>> "CA" == Constantinos Antoniou <constantinos.antoniou.rlists at gmail.com>
>>>>>> on Wed, 30 Jun 2010 12:07:16 +0300 writes:
>
> CA> Dear all (copying the package author),
>
> CA> I have a question on the vlmc package. I am trying to
> CA> model a time series, where each element can take one of
> CA> 11 values (the result of some clustering). When I run
> CA> the following command (synthetic data to facilitate
> CA> self-contained example)
>
> (very good)
>
>> I get the following warning: ("alphabet with >1-letter strings; trying to
>> abbreviate")
>
>> +++ START+++
>>> library(VLMC)
>>> a <- floor(runif(1000,0,11))
>>> vc <- vlmc(a,cutoff=5)
>> Warning message:
>> In vlmc(a, cutoff = 5) :
>> alphabet with >1-letter strings; trying to abbreviate
>>> vc
>> 'vlmc' a Variable Length Markov Chain;
>> alphabet 'abcdefghijk', |alphabet| = 11, n = 1000.
>> Call: vlmc(dts = a, cutoff.prune = 5)
> -> extensions (= $size ) :
>> ord.MC context nr.leaves total
>> 2 72 61 1608
>> AIC = 5247
>>>
>> +++ END+++
>
>> The questions are:
>> 1. What is it trying to do?
>
> Your contains values 0 1 2 .. 10
> it tries to match them to 1-letter strings, but '10' "is 2
> letters"
>
>> 2. How is it abbreviating?
>
> (not really important: using abbreviate()
>
>> 3. How much should I worry about it?
>
> not at all. The warning is just to inform you that your input
> looks a bit "unusual" to vlmc.
>
> I do agree however, that one could argue that vlmc() should work
> for inputs with values
> 0:m
> or 1:n
> without a warning.
>
>> 4. What can I do?
>
> You could use
>
> vc <- vlmc(letters[1+a], cutoff=5)
>
> to get the exact same model, but without a warning.
> or
> vc <- vlmc(a, cutoff=5, quiet = TRUE)
> or
> vc <- vlmc(a, cutoff=5, code1char = FALSE)
>
>
>
>> I have looked at the documentation plus
>> M<c3><a4>chler M. and B<c3><bc>hlmann P. (2004) Variable Length Markov Chains:
>> Methodology, Computing, and Software. _J. Computational and
>> Graphical Statistics_ *2*, 435-455.
>
> That's good.
> The examples there all have a character vector (of
> strings with 1 letter/character) as input.
>
>
>> Thanks for any feedback,
>
> You're welcome!
> Martin Maechler, ETH Zurich
>
>
>> --
>> Constantinos Antoniou, Ph.D., Assistant Professor
>> National Technical University of Athens
>> Laboratory of Transportation Engineering
>> School of Rural and Surveying Engineering
>> 9 Heroon Politechniou st., 15780-Zografou, Athens, Greece
>> T: +30 210 7722783 - F: +30 210 7722629
>> antoniou at central.ntua.gr - http://users.ntua.gr/antoniou
>
>
>> Dear all (copying the package author), I have a question
>> on the vlmc package. I am trying to model a time series,
>> where each element can take one of 11 values (the result
>> of some clustering). When I run the following command
>> (synthetic data to facilitate self-contained example) I
>> get the following warning: ("alphabet with >1-letter
>> strings; trying to abbreviate")
>
>
>
>> +++ START+++
>>> library(VLMC) a <- floor(runif(1000,0,11)) vc <-
>>> vlmc(a,cutoff=5)
>> Warning message: In vlmc(a, cutoff = 5) : alphabet with
>>> 1-letter strings; trying to abbreviate
>>> vc
>> 'vlmc' a Variable Length Markov Chain; alphabet
>> 'abcdefghijk', |alphabet| = 11, n = 1000. Call:
>> vlmc(dts = a, cutoff.prune = 5)
> -> extensions (= $size ) :
>> ord.MC context nr.leaves total 2 72 61 1608 AIC =
>> 5247
>>>
>> +++ END+++
>
>> The questions are: 1. What is it trying to do? 2. How
>> is it abbreviating? 3. How much should I worry about
>> it? 4. What can I do?
>
>> I have looked at the documentation plus M<c3><a4>chler
>> M. and B<c3><bc>hlmann P. (2004) Variable Length Markov
>> Chains: Methodology, Computing, and Software.
>> _J. Computational and Graphical Statistics_ *2*,
>> 435-455.
>
>
>> Thanks for any feedback, Costas
>
>
>
>> -- Constantinos Antoniou, Ph.D., Assistant Professor
>> National Technical University of Athens Laboratory of
>> Transportation Engineering School of Rural and Surveying
>> Engineering 9 Heroon Politechniou st., 15780-Zografou,
>> Athens, Greece T: +30 210 7722783 - F: +30 210 7722629
>> antoniou at central.ntua.gr - http://users.ntua.gr/antoniou
>
>
More information about the R-help
mailing list