TY - GEN
T1 - TOP-MATA
T2 - 7th International Conference on Service Systems and Service Management, ICSSSM'10
AU - Zhu, Shiwei
AU - Wu, Junjie
AU - Xia, Guoping
AU - Li, Min
PY - 2010
Y1 - 2010
N2 - Recent years have witnessed an increased interest in computing cosine similarities between documents (or commodities). Most previous studies require the specification of a minimum similarity threshold to perform cosine similarity search. However, it is usually difficult for users to provide an appropriate threshold in practice. Instead, in this paper, we propose to search top-K strongly related pairs of objects as measured by the cosine similarity. Specifically, we first define the cosine similarity measure from the association analysis point of view and identify the monotone property of an upper bound of the cosine measure, then exploit a Max-First traversal strategy for developing the TOP-MATA algorithm. Compared with previous TOP-DATA method, TOP-MATA has the advantage of saving the computations for false-positive item pairs. Finally, experimental results demonstrate the computational efficiency of the algorithm.
AB - Recent years have witnessed an increased interest in computing cosine similarities between documents (or commodities). Most previous studies require the specification of a minimum similarity threshold to perform cosine similarity search. However, it is usually difficult for users to provide an appropriate threshold in practice. Instead, in this paper, we propose to search top-K strongly related pairs of objects as measured by the cosine similarity. Specifically, we first define the cosine similarity measure from the association analysis point of view and identify the monotone property of an upper bound of the cosine measure, then exploit a Max-First traversal strategy for developing the TOP-MATA algorithm. Compared with previous TOP-DATA method, TOP-MATA has the advantage of saving the computations for false-positive item pairs. Finally, experimental results demonstrate the computational efficiency of the algorithm.
KW - Anti-monotone property
KW - Association analysis
KW - Cosine similarity
KW - Interestingness measure
UR - https://www.scopus.com/pages/publications/77955943645
U2 - 10.1109/ICSSSM.2010.5530100
DO - 10.1109/ICSSSM.2010.5530100
M3 - 会议稿件
AN - SCOPUS:77955943645
SN - 9781424464876
T3 - 2010 7th International Conference on Service Systems and Service Management, Proceedings of ICSSSM' 10
SP - 994
EP - 998
BT - 2010 7th International Conference on Service Systems and Service Management, Proceedings of ICSSSM' 10
Y2 - 28 June 2010 through 30 June 2010
ER -