×

Integration of candidate hash trees in concurrent processing of frequent itemset queries using Apriori. (English) Zbl 1235.93093

Summary: Frequent itemset mining is often regarded as advanced querying where a user specifies the source dataset and pattern constraints using a given constraint model. In this paper, we address the problem of processing batches of frequent itemset queries using the Apriori algorithm. The best solution of this problem proposed so far is Common Counting, which consists in concurrent execution of the queries using Apriori with the integration of scans of the parts of the database shared among the queries. We propose a new method - Common Candidate Tree, offering a more tight integration of the concurrently processed queries by sharing memory data structures, i.e., candidate hash trees. The experiments show that Common Candidate Tree outperforms Common Counting in terms of execution time. Moreover, thanks to smaller memory consumption, Common Candidate Tree can be applied to larger batches of queries.

MSC:

93B40 Computational methods in systems theory (MSC2010)
68T10 Pattern recognition, speech recognition
93-04 Software, source code, etc. for problems pertaining to systems and control theory
PDFBibTeX XMLCite
Full Text: EuDML Link