Using a Hash-Based Method with Transaction Trimming and Database Scan Reduction
seminar topics
24-03-2010, 07:21 PM

Using a Hash-Based Method with Transaction Trimming and Database Scan Reduction for Mining Association Rules

Presented By:
Jong Soo Park, Ming-Syan Chen and Philip S. Yu


In this paper, we examine the issue of mining association rules among items in a large database of sales transactions. Mining association rules means, given a database of sales transactions, discovering all associations among items such that the presence of some items in a transaction implies the presence of other items in the same transaction. The mining of association rules can be mapped into the problem of discovering large itemsets, where a large itemset is a group of items that appear in a sufficient number of transactions. The problem of discovering large itemsets can be solved by first constructing a candidate set of itemsets and then identifying, within this candidate set, those itemsets that meet the large itemset requirement. Generally this is done iteratively for each large k-itemset in increasing order of k, where a large k-itemset is a large itemset with k items. Determining large itemsets from a huge number of candidate itemsets in early iterations is usually the dominating factor in overall data mining performance. To address this issue, we develop an effective algorithm for candidate set generation. It is a hash-based algorithm and is especially effective for generating the candidate set for large 2-itemsets. Explicitly, the number of candidate 2-itemsets generated by the proposed algorithm is orders of magnitude smaller than that generated by previous methods, thus resolving the performance bottleneck. Note that generating smaller candidate sets enables us to effectively trim the transaction database at a much earlier stage of the iterations, thereby significantly reducing the computational cost of later iterations. The proposed algorithm also gives us an opportunity to reduce the amount of disk I/O required. An extensive simulation study is conducted to evaluate the performance of the proposed algorithm.
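To make the idea in the abstract concrete, here is a minimal Python sketch of hash-based filtering of candidate 2-itemsets followed by transaction trimming. It is an illustration under stated assumptions, not the authors' implementation: the function names, the hash function in `bucket_of`, the bucket count and the simple trimming rule are all hypothetical choices made for this example, and items are assumed to be small integers. The filter is safe because a bucket's count is an upper bound on the support of every pair hashed into it, so a pair whose bucket falls below the minimum support cannot be large.

```python
from collections import Counter
from itertools import combinations


def bucket_of(pair, num_buckets):
    # Illustrative hash only (an assumption); expects small integer item ids.
    a, b = pair
    return (a * 10 + b) % num_buckets


def first_pass(transactions, num_buckets):
    """Count single items and hash every 2-item subset into a bucket."""
    item_counts = Counter()
    bucket_counts = [0] * num_buckets
    for t in transactions:
        items = sorted(set(t))
        item_counts.update(items)
        for pair in combinations(items, 2):
            bucket_counts[bucket_of(pair, num_buckets)] += 1
    return item_counts, bucket_counts


def candidate_pairs(item_counts, bucket_counts, min_support):
    """Keep pairs of large items whose hash bucket also reaches min_support."""
    large_items = sorted(i for i, c in item_counts.items() if c >= min_support)
    n = len(bucket_counts)
    return [p for p in combinations(large_items, 2)
            if bucket_counts[bucket_of(p, n)] >= min_support]


def trim(transactions, candidates):
    """Simplified trimming (an assumption): keep only items that occur in
    some candidate pair inside the transaction; drop transactions left
    with fewer than two items."""
    cand = set(candidates)
    trimmed = []
    for t in transactions:
        kept = set()
        for pair in combinations(sorted(set(t)), 2):
            if pair in cand:
                kept.update(pair)
        if len(kept) >= 2:
            trimmed.append(sorted(kept))
    return trimmed


def count_large_pairs(transactions, candidates, min_support):
    """Second pass: count only the surviving candidates."""
    cand = set(candidates)
    counts = Counter(p for t in transactions
                     for p in combinations(sorted(set(t)), 2) if p in cand)
    return {p: c for p, c in counts.items() if c >= min_support}


# Toy database: four transactions over integer item ids.
db = [[1, 3, 4], [2, 3, 5], [1, 2, 3, 5], [2, 5]]
items, buckets = first_pass(db, num_buckets=7)
cands = candidate_pairs(items, buckets, min_support=2)
large = count_large_pairs(trim(db, cands), cands, min_support=2)
```

On this toy database, pairing the four large items directly would give six candidate 2-itemsets, while the bucket filter leaves four, and item 4 is trimmed from the first transaction before the second pass. With a larger database the number of buckets would need to grow accordingly, since collisions only make the filter less selective, never incorrect.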
