sm {nomclust} | R Documentation |
A function for calculation of a proximity (dissimilarity) matrix based on the SM similarity measure.
sm(data)
data |
A data.frame or a matrix with cases in rows and variables in colums. |
The simple matching coefficient (Sokal, 1958) represents the simplest way of measuring similarity. It does not impose any weights. By a given variable, it assigns the value 1 in case of match and value 0 otherwise.
The function returns an object of class "dist".
Zdenek Sulc.
Contact: zdenek.sulc@vse.cz
Boriah S., Chandola V., Kumar V. (2008). Similarity measures for categorical data: A comparative evaluation.
In: Proceedings of the 8th SIAM International Conference on Data Mining, SIAM, p. 243-254.
Sokal R., Michener C. (1958). A statistical method for evaluating systematic relationships. In: Science bulletin, 38(22),
The University of Kansas.
eskin
,
good1
,
good2
,
good3
,
good4
,
iof
,
lin
,
lin1
,
of
,
ve
,
vm
.
# sample data data(data20) # dissimilarity matrix calculation prox.sm <- sm(data20)