You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The dediacritic tool doesn't seem to work within Google Colab with Python version 3.7. I tried to manually modify the Emoji library but to no result.
from google.colab import output
output.enable_custom_widget_manager()
from google.colab import drive
drive.mount('/content/drive/')
!pip install camel-tools==1.4.1 -f https://download.pytorch.org/whl/torch_stable.html
os.environ['CAMELTOOLS_DATA'] = '/content/drive/MyDrive/SAAL/EnAr/CAMeL'
!camel_data -i all
from camel_tools.utils.dediac import dediac_ar
After loading the tokenizer using Hugging Face's AutoTokenizer, I have to set the tokenizer model_max_length maually to 512, otherwise the value is an extremely large integer > 1e10
The text was updated successfully, but these errors were encountered:
Hello,
The text was updated successfully, but these errors were encountered: