Prakash's portfolio (Data Architect: Business Intelligence & Data Professional)
Association Rules
Introduction
Terminology
Algorithm
Apriori
FP-growth
Data Set
Demo
References
Itemset
A collection of one or more items
– Example: {Milk, Bread, Diaper}
k-itemset
– An itemset that contains k items
Support count (σ)
Number of transactions that contain an itemset
– E.g. σ({Milk, Bread, Diaper}) = 2
Support
Fraction of transactions that contain an itemset
– E.g. s({Milk, Bread, Diaper}) = 2/5
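Both measures can be computed directly from a list of transactions. A minimal Python sketch follows. The five transactions are an assumed illustrative data set, chosen so that the counts match the examples above, and the function names support_count and support are hypothetical.

```python
# Assumed example data: five market-basket transactions, picked so that
# {Milk, Bread, Diaper} appears in exactly two of them.
TRANSACTIONS = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]


def support_count(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)


def support(itemset, transactions):
    """Fraction of transactions that contain `itemset`."""
    return support_count(itemset, transactions) / len(transactions)


basket = {"Milk", "Bread", "Diaper"}
print(support_count(basket, TRANSACTIONS))  # 2
print(support(basket, TRANSACTIONS))        # 0.4
```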
Frequent Itemset
An itemset whose support is greater than or equal to a minsup (minimum support) threshold set by a domain expert
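To make the definition concrete, the sketch below lists every frequent itemset by brute force: it tests all combinations of items against minsup. This is not Apriori or FP-growth, only a direct reading of the definition, and it is practical only for a handful of distinct items. The transactions, the minsup value of 3/5, and the function name frequent_itemsets are assumptions for illustration.

```python
from fractions import Fraction
from itertools import combinations

# Assumed example data (five market-basket transactions).
TRANSACTIONS = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]


def frequent_itemsets(transactions, minsup):
    """Return {itemset: support} for every itemset with support >= minsup."""
    n = len(transactions)
    items = sorted(set().union(*transactions))
    result = {}
    for k in range(1, len(items) + 1):          # k-itemsets, k = 1, 2, ...
        for candidate in combinations(items, k):
            count = sum(1 for t in transactions if set(candidate) <= t)
            sup = Fraction(count, n)            # exact fraction, no rounding
            if sup >= minsup:
                result[frozenset(candidate)] = sup
    return result


# Assumed threshold: an itemset must occur in at least 3 of 5 transactions.
frequent = frequent_itemsets(TRANSACTIONS, Fraction(3, 5))
for itemset, sup in sorted(frequent.items(),
                           key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), sup)
```

With minsup = 3/5 on this assumed data, four single items and four pairs qualify, and no 3-itemset does.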
Confidence
Confidence(A => B) = P(B|A) = support_count(A ∪ B) / support_count(A)
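A worked example of the confidence formula, as a minimal Python sketch. The transactions and the rule {Milk, Diaper} => {Beer} are assumed for illustration, and the helper names are hypothetical.

```python
from fractions import Fraction

# Assumed example data (five market-basket transactions).
TRANSACTIONS = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]


def support_count(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if set(itemset) <= t)


def confidence(antecedent, consequent, transactions):
    """Confidence(A => B) = support_count(A ∪ B) / support_count(A).

    Raises ZeroDivisionError if the antecedent never occurs.
    """
    a = set(antecedent)
    both = a | set(consequent)
    return Fraction(support_count(both, transactions),
                    support_count(a, transactions))


# {Milk, Diaper} occurs in 3 transactions, 2 of which also contain Beer.
print(confidence({"Milk", "Diaper"}, {"Beer"}, TRANSACTIONS))  # 2/3
```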
Lift
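Lift is the ratio of a rule's confidence to the support of its consequent:
Lift(A => B) = Confidence(A => B) / Support(B)
A lift greater than 1 means A and B occur together more often than would be expected if they were independent.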
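Lift can be computed from the same counts. The sketch below is a minimal Python illustration; the transactions, the two example rules, and the function names are assumptions, not part of the original material.

```python
from fractions import Fraction

# Assumed example data (five market-basket transactions).
TRANSACTIONS = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]


def support_count(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if set(itemset) <= t)


def lift(antecedent, consequent, transactions):
    """Lift(A => B) = Confidence(A => B) / Support(B)."""
    a, b = set(antecedent), set(consequent)
    conf = Fraction(support_count(a | b, transactions),
                    support_count(a, transactions))
    sup_b = Fraction(support_count(b, transactions), len(transactions))
    return conf / sup_b


print(lift({"Milk", "Diaper"}, {"Beer"}, TRANSACTIONS))  # 10/9
print(lift({"Bread"}, {"Milk"}, TRANSACTIONS))           # 15/16
```

On this assumed data the first rule has a lift slightly above 1 and the second slightly below 1. Swapping antecedent and consequent leaves the lift unchanged, since the formula is symmetric in A and B.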