d (i, x |2 ) i1 j1 i2 j2 ip jp
– d(i,i) = 0
– d(i,j) = d(j,i) – d(i,j) d(i,k) + d(k,j)
2019/1/28 AI&DM BUPT 16
– Calculate the standardized measurement (z-score)
xif m f zif sf
2019/1/28 AI&DM BUPT 18
4.2 Binary Variables (二值变量)
• A contingency table (相依表)for binary data
where i = (xi1, xi2, …, xip) and j = (xj1, xj2, …, xjp) are two p-dimensional data objects, and q is a positive integer
2019/1/28 AI&DM BUPT 17
4.1 Interval-valued variables (cont. 2)
Object j
1
Object i
0 b d
sum a b cd p
1 0
a c
sum a c b d
• Simple matching coefficient (if the binary variable is
symmetric (对称的)):
d (i, j)
bc a bc d bc a bc
2019/1/28
AI&DM BUPT
4
Example
Price($)
7 20 22 50 51 53