Download BinClass: A Software Package for Classifying Binary Vectors User's
Transcript
4.1.2 Joining (PNN+GLA) The module \join" has an implementation of the PNN + GLA -A, RPNN + GLA [17] and SC-join -J algorithms. The SC-join algorithm does not include GLA. The PNN + GLA and RPNN + GLA algorithms use GLA after the distance has dropped below the threshold value set by switch -T. The program continues joining classes until there is no decrease in the value of the stochastic complexity. With big datasets which actually have a small number of classes, this algorithm is slow, but is known to give good results on certain type of datasets. command: joingla switches -q quiet mode (no screen output) -A use deterministic version (PNN instead of RPNN) -EFF set epsilon to FF -TFF GLA starting threshold -JXX Join with SC criterion until XX classes remain inputs .data data set .partition1 input partition if -J set .header format le outputs .output log le .partition best partition (classication) found .partition2 best partition (classication) found, if -J set 4.1.3 Splitting The \split" module is an implementation of the Split+GLA [17] algorithm. It works just the opposite to the joining method. There are both randomized and deterministic versions -A of the algorithm. Splitting is continued until there is no decrease in the value of SC in a prescribed number of steps -S. The split method yields results very quickly and thus can be used to pre-screen a dataset before the use of the automatic SC minimizer in the classication module. command: splitgla switches 35