-
Notifications
You must be signed in to change notification settings - Fork 774
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Chosen represented Topic #2048
Comments
Thank you for the kind words!
It's difficult to say without seeing the full code, versions, output of
Is this a representative document or the topic representation? |
This is the full code : def get_topic_modeling(df, prompt, model, tokenizer):
|
The Prompt : example_prompt = """
The topic is described by the following keywords: 'meat, beef, eat, eating, emissions, steak, food, health, processed, chicken'. Based on the information about the topic above, please create a short label of this topic. Make sure you to only return the label and nothing more. [/INST] Environmental impacts of eating meat The topic is described by the following keywords: '[KEYWORDS]'. Based on the information about the topic above, please create a short label of this topic. Make sure you to only return the label and nothing more. |
it's a topic representation as you see the first one is good since it's general topic/ label but second one isn't represbatble , do you think the problem is with the prompt ? |
It might be but it depends on the LLM that you are using. It's not in the code specifically but it seems you are using Llama 2 (can't see which version). You could also use Llama 3 which is quite a bit better or other newer models like Mistral, Phi-3, Command R+, Qwen2, etc. Note that you can also track the prompts with: |
First I would like to thank you for your great tool.
I have a question,
This is one of the topic Representation in my Documents :
"President's son trial in Manhattan"
However most of the documents under this topic aren't related to Hunter Biden but yes mostly talk about politics,
is there a way to make the representation more general ?
The text was updated successfully, but these errors were encountered: