I have an upcoming publication, where I present an algorithm for clustering tabular datasets with missing values. I want to do quantitative evaluations of my algorithm and qualitative evaluation. In this question I am asking for ideas for attractive qualitative experiments that "wow, the audience".
- Quantitative evaluation: An obvious experiment is to take a complete dataset, remove some values and compare the clustering results of the complete dataset with the incomplete dataset. With this nice quantitative evaluation can be done, that is why I will do it. However this is not very attractive.
- Qualitative evaluation: Ignoring the missing values problem, an attractive use-case for clustering is image segmentation like this and that. However, there would be little meaning values in those dataset - I mean, in practice all values are available. There might be noise (regarding the color values), but not missing data. Setting noise to NA, would first require a noise detector, but that is not part of my publication. I have trouble finding another use-case that one can "see" and is not the setup I mentioned in 1.