Best frequent itemset package in python

Question

Could anyone please recommend a good frequent itemset package in python? I only need to find frequent itemset, no need of finding the association rules.

In my personal exp, I found R's apriori and FP-growth much better than their Python alternatives. So, if you're open to considering R, you should try them :) — Dawny33, Mar 09 '17 at 06:09

score 11 · Answer 1 · edited Jun 16 '20 at 11:08

I also recommend MLXtend library for frequent itemsets.

usage example:

dataset = [['Milk', 'Onion', 'Nutmeg', 'Kidney Beans', 'Eggs', 'Yogurt'],
           ['Dill', 'Onion', 'Nutmeg', 'Kidney Beans', 'Eggs', 'Yogurt'],
           ['Milk', 'Apple', 'Kidney Beans', 'Eggs'],
           ['Milk', 'Unicorn', 'Corn', 'Kidney Beans', 'Yogurt'],
           ['Corn', 'Onion', 'Onion', 'Kidney Beans', 'Ice cream', 'Eggs']]

te = TransactionEncoder()

te_ary = te.fit(dataset).transform(dataset)

df = pd.DataFrame(te_ary, columns=te.columns_)

frequent_itemsets = apriori(df, min_support=0.1, use_colnames=True)

print frequent_itemsets

this package has memory error when you have too many distinct items. Not recommended for Big Data — Snow, Jul 07 '20 at 12:40

score 4 · Answer 2 · answered Feb 07 '17 at 00:41

4

Orange3-Associate package provides frequent_itemsets() function based on FP-growth algorithm.

answered Feb 07 '17 at 00:41

K3---rnc

3,442
1
12
12

score 3 · Answer 3 · answered Dec 19 '17 at 20:50

MLXtend library has been really useful for me. In its docummentation there is an Apriori implementation that outputs the frequent itemset.

Please check the first example available in http://rasbt.github.io/mlxtend/user_guide/frequent_patterns/apriori/.

Best frequent itemset package in python

3 Answers3

usage example: