[R] Text Pattern Recognition - Model

Bert Gunter bgunter.4567 at gmail.com
Mon Aug 17 18:20:30 CEST 2015


Wrong list. This list is about R, not about statistics/statistical
learning. Post to a stats list like stats.stackexchange.com for
methods issues. Once you figure out what you want to do, R almost
certainly can do it -- search to find out what fuctions/packages to
use.

Cheers,
Bert
Bert Gunter

"Data is not information. Information is not knowledge. And knowledge
is certainly not wisdom."
   -- Clifford Stoll


On Mon, Aug 17, 2015 at 7:27 AM, Anshuk Pal Chaudhuri
<anshuk.p at motivitylabs.com> wrote:
> Hi,
>
> I have a training dataset which has two columns which has around 70 values.
>
> 1.       "PNRNo" whose values like UT768G, CXKA, 4IOI59, 4BV7TW...(typical PNR number patterns)
>
> 2.       I have created one more factor variable mentioning (IsPNR) - so all the values are 1 (true)
>
> My first objective is to create a model on this training set which would recognize the text pattern.
>
> Second objective: The  model would then be used to predict IsPNR with new set of test values like "Anshuk", "4EL58S"...as 0 and 1...
>
> Which model would be best for recognizing such kind of pattern and having decent accuracy? I tried naiveBayes, but I don't think it is all doing a good job. Its predicting all the test values as true. I suppose bayes is not meant for this.
>
>
> Regards,
> Anshuk Pal Chaudhuri
>
>
>         [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list