-
In the example shown here: https://maartengr.github.io/BERTopic/getting_started/merge/merge.html#example, 3 models are being merged together using the Each of the models uses the same UMAP model. I have a few questions connected to this:
-
Nope, UMAP is not merged at all and is completely removed from the merged model. Both HDBSCAN and UMAP are removed and only the topic representations themselves are being saved. The result is a similar representation as when you save a model with
This is not wrong conceptually since the fitted UMAP models are not merged but ignored and removed from the final topic model.
Same as above.
The reduced embeddings are not used in the merged model, only the original embeddings. This means that there is no effect here with respect to different parameters of HDBSCAN/UMAP aside from the topic representations (the actual labels/keywords) that are being generated.
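To make the point above concrete, here is a minimal toy sketch in plain Python. It is not BERTopic's implementation: `ToyTopicModel`, `merge_toy_models`, and the example vectors are hypothetical names and data invented for illustration, and the 0.7 threshold is only assumed to mirror the `min_similarity` argument of `BERTopic.merge_models`. The idea it illustrates is that topics are compared by cosine similarity in the original embedding space, and the fitted UMAP/HDBSCAN objects are simply not carried over into the merged result.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class ToyTopicModel:
    """Hypothetical stand-in for a fitted topic model (not BERTopic's class)."""
    topic_embeddings: dict         # topic label -> vector in the ORIGINAL embedding space
    umap_model: object = None      # placeholder for a fitted dimensionality-reduction step
    hdbscan_model: object = None   # placeholder for a fitted clustering step


def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b)))


def merge_toy_models(models, min_similarity=0.7):
    """Keep the first model's topics; add topics from later models only if
    they are not similar enough to any topic already kept."""
    merged = dict(models[0].topic_embeddings)  # assumes the first model has topics
    for model in models[1:]:
        for label, vector in model.topic_embeddings.items():
            best = max(cosine(vector, kept) for kept in merged.values())
            if best < min_similarity:
                merged[label] = vector
    # The fitted UMAP/HDBSCAN placeholders are deliberately not copied over.
    return ToyTopicModel(topic_embeddings=merged)


# Two toy models fitted with different (placeholder) UMAP/HDBSCAN settings.
model_a = ToyTopicModel({"sports": [1.0, 0.0, 0.0]},
                        umap_model="umap(n_neighbors=15)", hdbscan_model="hdbscan_a")
model_b = ToyTopicModel({"football": [0.9, 0.1, 0.0], "cooking": [0.0, 1.0, 0.0]},
                        umap_model="umap(n_neighbors=5)", hdbscan_model="hdbscan_b")

merged = merge_toy_models([model_a, model_b])
print(sorted(merged.topic_embeddings))            # ['cooking', 'sports']
print(merged.umap_model, merged.hdbscan_model)    # None None
```

In this toy example "football" is absorbed into "sports" because the two vectors are nearly parallel, while "cooking" is added as a new topic. The different UMAP settings of the two inputs play no role in the merge itself; they only matter upstream, in which topics each model found in the first place.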