The arboretum procedure

§ PROC DMNEURL: Approximation to PROC NEURAL

Yüklə 3,07 Mb.

Pdf görüntüsü

səhifə	62/148
tarix	30.04.2018
ölçüsü	3,07 Mb.
	#40673

1 ... 58 59 60 61 62 63 64 65 ... 148

WEIGHT or WEIGHTS Statement WEIGHT
Scoring the Model Using the OUTEST= Data set
SQUARE
GAUSS
IDENT
The DMREG Procedure Overview Procedure Syntax
Details Examples

PROC DMNEURL: Approximation to PROC NEURAL

WEIGHT or WEIGHTS Statement

WEIGHT onevar ;

WEIGHTS onevar ;

One numeric (interval scaled) variable may be speciﬁed as a WEIGHT variable. It is

recommended to specify the WEIGHT variable already in the PROC DMDB invoca-

tion. Then the information is saved in the catalog and that variable is used automati-

cally as a FREQ variable in PROC DMNEURL.

Scoring the Model Using the OUTEST= Data set

The score value

is computed for each observation

with nonmissing

value of the target (response) variable

of the input data set. All information needed

for scoring an observation of the DMDB data set is contained in the output of the

OUTEST= data set. First an observation from the input data set is mapped into a

vector

Ú

of

new values in which

1. CLASS predictor variables with

categories are replaced by

dummy (binary) variables, depending on the fact whether the variable has miss-

ing values or not.

2. Missing values in interval predictor variables are replaced by the mean value of

this variable in the DMDB data set. This mean value is taken from the catalog

of the DMDB data set.

3. The values of a WEIGHT or FREQ variable are multiplied into the observation.

4. For an interval target variable

its value is transformed into the interval [0,1]

by the relationship

5. All predictor variables are transformed into values with zero mean and unit

standard deviation by

Ò´Ü

´Ü

The values for

Ò´Ü

and

´Ü

are listed in the OUTEST= data set.

This means, that in the presence of CLASS variables the n-vector

has more entries

than the observation in the data set.

The scoring is additive across the stages. The following information is available for

scoring each stage

components (eigenvectors)

each of dimension

the best activation function

and a speciﬁed link function

the

optimal parameter estimates

Purpose of PROC DMNEURL

For each component

we compute the component score

similar to principal component analysis. With those values

the model can be ex-

pressed as

Ò×Ø

×Ø

´Ù

µµ

where

is the best activation function and

is the speciﬁed link function.

In other words, this means, that given the

the value

is computed from

´Ù

where

and

are two of the

¾

£

optimal parameters

and

is deﬁned as

SQUARE

Ùµ

Ù

TANH

£

Ø

Ùµ

ARCTAN

Ò´

Ùµ

LOGIST

ÜÔ´

Ùµ

´½

ÜÔ´

Ùµµ

GAUSS

ÜÔ´ ´

Ùµ

µ

SIN

£

×

Ò´

Ùµ

COS

Ó×´

Ùµ

EXP

ÜÔ´

Ùµ

For the ﬁrst component

and

, for the second component

and

, and for the last component

Ô ½

and

are used.

The link function

is applied on

and yields to

IDENT

Û

LOGIST

ÜÔ´Û

µ

´½

ÜÔ´Û

µ

RECIPR

Across all stages the values of

are added to the predicted value (posterior)

The DMREG Procedure

The DMREG Procedure

Overview

Procedure Syntax

PROC DMREG Statement

CLASS Statement

CODE Statement

DECISION Statement

FREQ Statement

MODEL Statement

NLOPTIONS Statement

REMOTE Statement

SCORE Statement

Details

Examples

Example 1: Linear and Quadratic Logistic Regression with an Ordinal Target (Rings Data)

Example 2: Performing a Stepwise OLS Regression (DMREG Baseball Data)

Example 3: Comparison of the DMREG and LOGISTIC Procedures when Using a Categorical Input

Variable

References

Yüklə 3,07 Mb.

Dostları ilə paylaş:

1 ... 58 59 60 61 62 63 64 65 ... 148