-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: add issue comment TFIDF similarity metrics and issue comment Ja… #1242
Conversation
hi @bifenglin , if this pr has not completed, you can convert it to draft PR |
…ccard similarity Signed-off-by: bifenglin <[email protected]>
@@ -15,6 +16,10 @@ def getRelatedUsers(config): | |||
return related_users.getRelatedUsers(config) | |||
def getAttention(config): | |||
return attention.getAttention(config) | |||
def getIssueCommentTFIDFSimilarity(config): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It seems that config
has never been used
""" | ||
TODO: get Selected Acotrs | ||
""" | ||
sql = 'SELECT DISTINCT(actor_id) FROM opensource.gh_events') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we can add a limit clause such as limit 10
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The function aims to find all users if have no selecting choice.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
DISTINCT
selection from the whole table may lead to a OOM exception since the ClickHouse only has 128GB memory, we should definitely avoid this kind of SQL.
hi, I think this pr can be converted into a case in |
hi @bifenglin. I'm adding this to the
|
This PR adds the metric about actors' comment TFIDF similarity and actors' Jaccard similarity.