Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

And does the full text help??? #3

Open
jambo6 opened this issue Oct 29, 2021 · 1 comment
Open

And does the full text help??? #3

jambo6 opened this issue Oct 29, 2021 · 1 comment

Comments

@jambo6
Copy link

jambo6 commented Oct 29, 2021

Asking because I'm doing a project where I was looking at full abstracts, and have recently switched to single sentences for relation extraction because labelling was faster + abstracts seemed like it might be overkill. Very interested on thoughts of the best way to do this.

@danich1
Copy link
Contributor

danich1 commented Nov 1, 2021

Asking because I'm doing a project where I was looking at full abstracts, and have recently switched to single sentences for relation extraction because labelling was faster + abstracts seemed like it might be overkill. Very interested on thoughts of the best way to do this.

So long story short full text doesn't improve performance as performance is mainly reliant on the label functions generated. If I'm following correctly you are looking at the whole abstracts for relation extraction rather than individual sentences? If so, then you will still end up with examining individual sentences for extraction. Now if you are trying to circumvent the manual effort of hand labeling sentences, then using the my constructed label functions could help.

One strong warning is that performance for this approach relies on the label functions themselves to be effective. Depending on the task, you could end up in a trap where you have to constantly go back and read a whole lot of sentences to generate more label functions just to improve model performance. If this happens manually labeling sentences themselves and not using label functions could be a lot faster.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants