Does BERTopic come with any methods that help users determine the number of topics (k)? #1235

chentitus · 2023-05-06T12:56:27Z

Dear Maarten,

I am pretty new to Python and BERTopic. I wonder if BERTopic has any methods that can help users decide the number of topics/clusters, such as a perplexity score. Any help is appreciated!

MaartenGr (Maintainer) · 2023-05-07T13:00:49Z

Choosing the number of topics depends heavily on the underlying clustering algorithm, as each one behaves differently. For instance, k-Means, a centroid-based technique, handles clustering differently from a density-based algorithm like HDBSCAN, so the answer depends on the model you use. I would personally advise involving human evaluation as much as possible, as grid-searching over evaluation metrics often leads to misleading results.
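To illustrate how the clustering model changes the question, here is a minimal sketch, not an official recipe. It assumes `bertopic`, `scikit-learn`, and `hdbscan` are installed. The helper `build_topic_model` and its arguments `strategy`, `n_topics`, and `min_topic_size` are hypothetical names made up for this example. With k-Means the number of topics is fixed up front, while HDBSCAN derives it from the density of the data and `min_cluster_size` only nudges it.

```python
def build_topic_model(strategy="hdbscan", n_topics=20, min_topic_size=15):
    """Return an unfitted BERTopic model for the chosen clustering strategy.

    Hypothetical helper for illustration; the default values are arbitrary.
    """
    if strategy not in {"hdbscan", "kmeans"}:
        raise ValueError(f"unknown strategy: {strategy!r}")

    # Imported lazily so the helper can be defined before the heavy
    # dependencies are loaded.
    from bertopic import BERTopic

    if strategy == "kmeans":
        from sklearn.cluster import KMeans

        # k-Means needs the number of clusters in advance, so the number
        # of topics is simply whatever n_clusters is set to.
        cluster_model = KMeans(n_clusters=n_topics, random_state=42)
    else:
        from hdbscan import HDBSCAN

        # HDBSCAN infers the number of clusters from density;
        # min_cluster_size influences it but does not set it directly.
        cluster_model = HDBSCAN(
            min_cluster_size=min_topic_size,
            metric="euclidean",
            cluster_selection_method="eom",
            prediction_data=True,
        )

    # BERTopic takes the clustering model through its hdbscan_model argument,
    # even when that model is not HDBSCAN.
    return BERTopic(hdbscan_model=cluster_model)
```

After fitting with `topics, probs = build_topic_model("kmeans", n_topics=30).fit_transform(docs)`, where `docs` is your own list of strings, `get_topic_info()` on the fitted model returns a table of topics that can be reviewed by hand, which fits the advice above to rely on human evaluation.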
