[R] Inf in nnet final value for validation data

Thu Jun 11 23:28:28 CEST 2009

Andrea,

You can calculate predictions for your validation data based on nnet objects using the predict function (the predict function can also be used for regressions, quantile regressions, etc.)

If you create a neural net with the following code: 

library(nnet)

# 3 hidden neurons, for classification (linout = F), and not a skip layer network (skip = F, or T if you want)

mynet.nn <- nnet(dependent_variable ~ ., data = train, size = 3, decay = 1e-3, linout = F, skip = F, maxit = 1000, Hess = T)  

# calculate predictions for your training data and append to data frame called train

train$predictions <- predict(mynet.nn)

# calculate predictions for your validation data and append to data frame called valid

valid$predictions  <- predict(mynet.nn, valid)  # you need to pass your neural net object and your validation dataset to the predict function

To just get the predictions for your validation dataset this is all you need. I do not know why you need to calculate the log likelihood.

Hope this helps.

Jude

Andrea wrote:

Hi,

I use nnet for my classification problem and have a problem concerning the calculation of the final value for my validation data.(nnet only calculates the final value for the training data). I made my own final value formula (for the training data I get the same value as nnet):

    # prob-matrix

    pmatrix <- cat*fittedValues

    tmp <- rowSums(pmatrix) 

    # -log likelihood

    finalValue <- sum(-log(tmp))

    # add penalty term

    finalValue + sum(decay * weights^2)

where cat is a matrix with cols for each possible category and a row for each data record. The values are 1 for the target categories of a data record and 0 otherwise.

My problem is, that I get Inf-values for some validation data records, because the rowsum of cat*fittedValues gets 0 and the log gets Inf.

Has anyone an idea how to deal with that problem properly? How does nnet?

I´m thinking of a penalty value for those values. That means if cat*fittedValues == 0 not to calculate the log but add e.g. 100 instead of "-log(tmp)" to the finalValue-sum??

But how to determine the penalty value???

I´m looking forwar for all suggestions,

Andrea.

___________________________________________
Jude Ryan
Director, Client Analytical Services
Strategy & Business Development
UBS Financial Services Inc.
1200 Harbor Boulevard, 4th Floor
Weehawken, NJ 07086-6791
Tel. 201-352-1935
Fax 201-272-2914
Email: jude.ryan at ubs.com

-------------- next part --------------
Please do not transmit orders or instructions regarding a UBS 
account electronically, including but not limited to e-mail, 
fax, text or instant messaging. The information provided in 
this e-mail or any attachments is not an official transaction 
confirmation or account statement. For your protection, do not 
include account numbers, Social Security numbers, credit card 
numbers, passwords or other non-public information in your e-mail. 
Because the information contained in this message may be privileged, 
confidential, proprietary or otherwise protected from disclosure, 
please notify us immediately by replying to this message and 
deleting it from your computer if you have received this 
communication in error. Thank you. 

UBS Financial Services Inc. 
UBS International Inc. 
UBS Financial Services Incorporated of Puerto Rico 
UBS AG

UBS reserves the right to retain all messages. Messages are protected
and accessed only in legally justified cases.