How to test the confidence for a rule based system?

Question

I have a multi-class dataset from which I generated rules. That is, if certain features are seen, then the sample must belong to a certain class. I kept only rules with a precision of 1 (with respect to the whole dataset).

It is worth mentioning that the dataset is highly imbalanced. The majority class has 60K samples, while the other classes have around 1K samples each.

Now, consider a rule that applies to, let's say, 17 endpoints. My question is, which test should I apply to check if I can be confident about this rule? I guess the size of the class will have an effect on it.

score 1 · Answer 1 · answered Jul 04 '22 at 07:15

A confidence interval is the main tool for assessing how confident you can be in a rule. Its width depends on the volume of data, as you suspect, but also on whether the endpoints themselves are reliable enough.

You therefore need to test the data at each endpoint or set of endpoints. The success rate required for a given confidence interval can be set either manually, based on known business constraints, or automatically, using a Gaussian (normal) approximation.
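As a rough sketch of what such a check could look like (an illustration, not necessarily the exact test meant above), the snippet below computes a Wilson score interval for a rule's precision. Wilson is swapped in for the plain Gaussian (Wald) interval because the Wald interval collapses to zero width when the observed precision is exactly 1. The function names and the 95% confidence level are assumptions made for the example.

```python
from math import sqrt
from statistics import NormalDist


def wald_interval(successes, n, confidence=0.95):
    """Plain Gaussian (Wald) interval for a proportion."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    p_hat = successes / n
    half = z * sqrt(p_hat * (1 - p_hat) / n)
    return max(0.0, p_hat - half), min(1.0, p_hat + half)


def wilson_interval(successes, n, confidence=0.95):
    """Wilson score interval for a proportion; stays informative at p_hat = 1."""
    if n <= 0:
        raise ValueError("n must be positive")
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    p_hat = successes / n
    denom = 1 + z * z / n
    centre = (p_hat + z * z / (2 * n)) / denom
    half = z * sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return max(0.0, centre - half), min(1.0, centre + half)


# Hypothetical rule from the question: it fires on 17 samples, all correct.
wald_lo, wald_hi = wald_interval(17, 17)
wilson_lo, wilson_hi = wilson_interval(17, 17)
print(f"Wald:   [{wald_lo:.3f}, {wald_hi:.3f}]")
print(f"Wilson: [{wilson_lo:.3f}, {wilson_hi:.3f}]")
```

For 17 correct out of 17, the Wald interval is degenerate, while the Wilson lower bound comes out at roughly 0.82. If the business requirement were, say, a precision of at least 0.9, 17 samples alone would not be enough to support the rule at this confidence level.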

Nicolas Martin

Thanks. Can you help me describe the test? – greenButMellow Jul 04 '22 at 16:19
Could you create in your question a small but representative example? – Nicolas Martin Jul 04 '22 at 17:15
