Similarity function recommender service using incremental user knowledge acquisition. (English)
Kappel, Gerti (ed.) et al., Service-oriented computing. 9th international conference, ICSOC 2011, Paphos, Cyprus, December 5‒8, 2011 Proceedings. Berlin: Springer (ISBN 978-3-642-25534-2/pbk). Lecture Notes in Computer Science 7084, 219-234 (2011).
Summary: Similar entity search is the task of identifying entities that most closely resemble a given entity (e.g., a person, a document, or an image). Although many techniques for estimating similarity have been proposed in the past, little work has been done on the question of which of the presented techniques are most suitable for a given similarity analysis task. Knowing the right similarity function is important as the task is highly domain- and data-dependent. In this paper, we propose a recommender service that suggests which similarity functions (e.g., edit distance or jaccard similarity) should be used for measuring the similarity between two entities. We introduce the notion of “similarity function recommendation rule” that captures user knowledge about similarity functions and their usage contexts. We also present an incremental knowledge acquisition technique for building and maintaining a set of similarity function recommendation rules.