Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to generate alignment based on token level? #13

Open
michelleqyhqyh opened this issue Sep 27, 2022 · 2 comments
Open

How to generate alignment based on token level? #13

michelleqyhqyh opened this issue Sep 27, 2022 · 2 comments

Comments

@michelleqyhqyh
Copy link

I can use your command to generate alignment based on bpe level. But how to generate alignment based on token level?

@carboncoo
Copy link
Collaborator

carboncoo commented Sep 28, 2022

By default, the generated alignment is already word-level, as indicated by https://github.com/THUNLP-MT/Mask-Align/blob/main/thualign/utils/alignment.py#L168 with the remove_bpe parameter of the weights_to_align method.

@carboncoo
Copy link
Collaborator

carboncoo commented Sep 28, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants