Replies: 1 comment 1 reply
-
Yes, you can pre-calculate the embeddings for your documents and then supply them to BERTopic as follows: from bertopic import BERTopic
topic_model = BERTopic()
topics, probs = topic_model.fit_transform(docs, embeddings) |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I would like to use the BERTopic on the fly where the users could get topics in a reasonable amount of time for a large dataset, i.e. over 10k documents. In this case, it would make sense if I calculate the embedding vectors in advance, save them somewhere and when a new querry comes in, I only pass the embedding vectors to the BERTopic. Is such a functionality already implemented?
Thank you very much.
Beta Was this translation helpful? Give feedback.
All reactions