I have a distance metric on a collection of strings on the order of tens of thousands. What would be an intuitive way to summarize how 'different' these strings are or when they overlap?
My goal is, to visually ensure high entropy and be able to recognize clustering regions and the strings associated with it.
I envision a kind of clustering plot where there is some radius around each string that captures its neighbors.... but this requires having a meaningful coordinate system.