Theme-Logo
  • Login
  • Home
  • Course
  • Publication
  • Theses
  • Reports
  • Published books
  • Workshops / Conferences
  • Supervised PhD
  • Supervised MSc
  • Supervised projects
  • Education
  • Language skills
  • Positions
  • Memberships and awards
  • Committees
  • Experience
  • Scientific activites
  • In links
  • Outgoinglinks
  • News
  • Gallery
publication name Fast Vertical Mining Using Diffsets. KDD 2003
Authors Mohammed J. Zaki and Karam Gouda
year 2003
keywords
journal
volume Not Available
issue Not Available
pages Not Available
publisher Not Available
Local/International International
Paper Link Not Available
Full paper download
Supplementary materials Not Available
Abstract

A number of vertical mining algorithms have been proposed recently for association mining, which have shown to be very e ective and usually outperform horizontal approaches. The main advantage of the vertical format is support for fast frequency counting via intersection operations on transaction ids (tids) and automatic pruning of irrelevant data. The main problem with these approaches is when intermediate results of vertical tid lists become too large for memory, thus a ecting the algorithm scalability. In this paper we present a novel vertical data representation called Di set, that only keeps track of di erences in the tids of a candidate pattern from its generating frequent patterns. We show that di sets drastically cut down the size of memory required to store intermediate results. We show how di sets, when incorporated into previous vertical mining methods, increase the performance signi cantly.

Benha University © 2023 Designed and developed by portal team - Benha University