| publication name | Prism: A primal-encoding approach for frequent sequence mining. ICDM07 |
|---|---|
| Authors | Karam Gouda, Mosab Hassaan, Mohammed J Zaki |
| year | 2007 |
| keywords | |
| journal | Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on |
| volume | Not Available |
| issue | Not Available |
| pages | 487 - 492 |
| publisher | Not Available |
| Local/International | International |
| Paper Link | 10.1109/ICDM.2007.33 |
| Full paper | download |
| Supplementary materials | Not Available |
Abstract
Sequence mining is one of the fundamental data mining tasks. In this paper we present a novel approach, called Prism, for mining frequent sequences. Prism utilizes a vertical approach for enumeration and support counting, based on the novel notion of prime block encoding, which in turn is based on prime factorization theory. Via an extensive evaluation on both synthetic and real datasets, we show that Prism outperforms popular sequence mining methods like SPADE [10], PrefixSpan [6] and SPAM [2] by an order of magnitude or more.
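
The abstract does not spell out the encoding, so the following is only a minimal sketch of the general idea behind a prime-based block encoding, not the authors' implementation. It assumes a fixed block of 8 positions, assigns one prime to each position, and represents a block as the product of the primes at its set positions. Under that assumption, the positions common to two blocks can be recovered with a greatest common divisor. The names `PRIMES`, `encode_block`, `decode_block`, and `intersect` are hypothetical and chosen for illustration.

```python
from math import gcd

# Assumption: one distinct prime per position in an 8-position block.
PRIMES = [2, 3, 5, 7, 11, 13, 17, 19]


def encode_block(bits):
    """Encode 0/1 flags as the product of the primes at the set positions."""
    value = 1
    for bit, prime in zip(bits, PRIMES):
        if bit:
            value *= prime
    return value


def decode_block(value):
    """Recover the 0/1 flags by testing divisibility by each position's prime."""
    return [1 if value % prime == 0 else 0 for prime in PRIMES]


def intersect(a, b):
    """Positions set in both blocks: the gcd keeps only the shared prime factors."""
    return gcd(a, b)


# Two blocks, e.g. the positions at which two items occur.
a = encode_block([1, 0, 1, 1, 0, 0, 0, 0])  # 2 * 5 * 7
b = encode_block([0, 0, 1, 1, 0, 1, 0, 0])  # 5 * 7 * 13
common = intersect(a, b)
common_bits = decode_block(common)
```

Because each prime appears at most once in a product, every encoded value is square-free and maps back to exactly one bit pattern, which is what makes the gcd behave like a bitwise AND in this sketch.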