Skip to content

Online Topic Modeling Placing Similar Documents into Different Topics #929

Answered by MaartenGr
vantubbe asked this question in Q&A
Discussion options

You must be logged in to vote

@vantubbe In part, it depends on the sub-models that you used to perform the online topic modeling such as the embedding model, dimensionality reduction, clustering, etc. That all can greatly influence how the topics are being clustered. Perhaps the embedding model is trained to focus on a specific part of the text and less on the context, perhaps the dimensionality reduction algorithm needs more or le

It is difficult to say without actually seeing the code and knowing which sub-models are being used. Having said that, it might be worthwhile to check out some of the parameter tunings here. Also, you could use the .clusters attribute in the river algorithm to checkout some of the clusters …

Replies: 2 comments 3 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
3 replies
@vantubbe
Comment options

@MaartenGr
Comment options

@vantubbe
Comment options

Answer selected by vantubbe
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants