good4 {nomclust}R Documentation

Goodall 4 (G4) Measure

Description

A function for calculation of a proximity (dissimilarity) matrix based on the G4 similarity measure.

Usage

good4(data)

Arguments

data

A data.frame or a matrix with cases in rows and variables in colums.

Details

The Goodall 4 similarity measure was presented in (Boriah et al., 2008). It is a simple modification of the original Goodall measure (Goodall, 1966). It assigns higher weights to the frequent categories matches.

Value

The function returns an object of class "dist".

Author(s)

Zdenek Sulc.
Contact: zdenek.sulc@vse.cz

References

Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation. In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, p. 243-254.

Goodall V.D. (1966). A new similarity index based on probability. Biometrics, 22(4), p. 882.

See Also

eskin, good1, good2, good3, iof, lin, lin1, of, sm, ve, vm.

Examples

# sample data
data(data20)

# dissimilarity matrix calculation
prox.good4 <- good4(data20)


[Package nomclust version 2.2.1 Index]