New Features - Zero-shot or Seed Words? #1662
-
I am curious about the differences between Zero-shot and Seed Word features? Both sound great, and require me to provide info on the 'expected' topics via suggested words/topics. Are there use cases where one of these should be chosen over the other, or would you suggest trying both in models and choosing based on results. Any advice welcome, thanks. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
In my project, the use of seed words has resulted in these seed words appearing more often (as keywords) in topic representations. It seems that seed words would affect a wide scope of topics. On the other hand, keywords in "expected topics" seem to only affect topics if they contain the specified set of keywords. Just my two cents and curious as to what insights that @MaartenGr may have on this. |
Beta Was this translation helpful? Give feedback.
There are large differences between the two methods:
Seed Words gives the topic model complete freedom to discover any topic regardless of the seed word you gave it. The only thing seed words does is when a topic is creating, ranking certain words higher.
Zero-shot topic modeling, in constrast, is a method that allows you to guide the topic creation towards specific topics that you created before. Compared to seed words, this will actually change the topics that are being created and influence the topic assignment.
To sum up: