Replies: 2 comments 7 replies
-
I don't think this is actually expensive to do with at most a couple of hundred topic embeddings. Running something like the cosine similarity on the topic embeddings with one another is quite fast and I believe does not take that much memory. |
Beta Was this translation helpful? Give feedback.
7 replies
-
Decided to go with a slightly modified version of: |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hey, I'm wondering if there's a good way to measure a topic's variance.
Variance being how different documents within the topic are relative to each other, and the corpus as a whole. Doing pairwise similarity comparisons is expensive so I'd like to avoid that. I know OCTIS has a diversity measure, but this is calculated over the entire topic model, rather than per topic.
Any ideas?
Beta Was this translation helpful? Give feedback.
All reactions