Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hacktoberfest: new languages pt.2 #992

Merged
merged 3 commits into from
Oct 5, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
106 changes: 105 additions & 1 deletion hacktoberfest_challenges/datasets_without_language.csv
Original file line number Diff line number Diff line change
Expand Up @@ -171,4 +171,108 @@ status,pr_url,hub_id,downloads,likes
,,[stas/wmt14-en-de-pre-processed](https://huggingface.co/datasets/stas/wmt14-en-de-pre-processed),423,1
,,[Jackmin108/c4-en-validation](https://huggingface.co/datasets/Jackmin108/c4-en-validation),1131,0
,,[cfilt/iitb-english-hindi](https://huggingface.co/datasets/cfilt/iitb-english-hindi),1147,11
,,[argilla/databricks-dolly-15k-curated-en](https://huggingface.co/datasets/argilla/databricks-dolly-15k-curated-en),9651261,9
,,[argilla/databricks-dolly-15k-curated-en](https://huggingface.co/datasets/argilla/databricks-dolly-15k-curated-en),9651261,9
,,[jegormeister/dutch-snli](https://huggingface.co/datasets/jegormeister/dutch-snli),90,0
,,[Iskaj/dutch_corpora_parliament_processed](https://huggingface.co/datasets/Iskaj/dutch_corpora_parliament_processed),88,0
,,[AgentWaller/dutch-formatted-oasst1](https://huggingface.co/datasets/AgentWaller/dutch-formatted-oasst1),0,0
,,[AgentWaller/dutch-oasst1-qlora-format](https://huggingface.co/datasets/AgentWaller/dutch-oasst1-qlora-format),0,0
,,[BramVanroy/stackoverflow-chat-dutch-llamav2-format](https://huggingface.co/datasets/BramVanroy/stackoverflow-chat-dutch-llamav2-format),0,0
,,[manu/french_librispeech_text_only](https://huggingface.co/datasets/manu/french_librispeech_text_only),76,0
,,[tbboukhari/Alpaca-in-french](https://huggingface.co/datasets/tbboukhari/Alpaca-in-french),8,0
,,[ismailiismail/multi_paraphrasing_french](https://huggingface.co/datasets/ismailiismail/multi_paraphrasing_french),6,0
,,[FreedomIntelligence/alpaca-gpt4-french](https://huggingface.co/datasets/FreedomIntelligence/alpaca-gpt4-french),4,0
,,[FreedomIntelligence/sharegpt-french](https://huggingface.co/datasets/FreedomIntelligence/sharegpt-french),2,0
,,[vekkt/french_CEFR](https://huggingface.co/datasets/vekkt/french_CEFR),1,0
,,[Harsit/xnli2.0_train_french](https://huggingface.co/datasets/Harsit/xnli2.0_train_french),0,0
,,[Makxxx/french_CEFR](https://huggingface.co/datasets/Makxxx/french_CEFR),0,0
,,[sugam11/french-snli](https://huggingface.co/datasets/sugam11/french-snli),0,0
,,[Brendan/nlp244_french_snli](https://huggingface.co/datasets/Brendan/nlp244_french_snli),0,0
,,[pvisnrt/french-snli](https://huggingface.co/datasets/pvisnrt/french-snli),0,0
,,[pranjali97/french_translated_snli](https://huggingface.co/datasets/pranjali97/french_translated_snli),0,0
,,[FreedomIntelligence/evol-instruct-french](https://huggingface.co/datasets/FreedomIntelligence/evol-instruct-french),0,0
,,[gollumeo/french-litterature](https://huggingface.co/datasets/gollumeo/french-litterature),0,0
,,[nielsr/datacomp_small_french_captions](https://huggingface.co/datasets/nielsr/datacomp_small_french_captions),0,0
,,[manu/french_5p](https://huggingface.co/datasets/manu/french_5p),0,0
,,[germank/hh-generated_flan_t5_large_with_features2](https://huggingface.co/datasets/germank/hh-generated_flan_t5_large_with_features2),681,0
,,[germank/hh-rlhf_with_features_flan_t5_large](https://huggingface.co/datasets/germank/hh-rlhf_with_features_flan_t5_large),336,0
,,[german-nlp-group/german_common_crawl](https://huggingface.co/datasets/german-nlp-group/german_common_crawl),116,7
,,[mtc/german_seahorse_dataset_with_articles](https://huggingface.co/datasets/mtc/german_seahorse_dataset_with_articles),87,0
,,[roskoN/stereoset_german](https://huggingface.co/datasets/roskoN/stereoset_german),74,0
,,[serbog/job_listing_german_cleaned_bert](https://huggingface.co/datasets/serbog/job_listing_german_cleaned_bert),20,0
,,[germank/hh-generated_flan_t5_large_with_features2_flan_t5_large](https://huggingface.co/datasets/germank/hh-generated_flan_t5_large_with_features2_flan_t5_large),16,0
,,[AgentWaller/german-formatted-oasst1](https://huggingface.co/datasets/AgentWaller/german-formatted-oasst1),15,1
,,[serbog/job_listing_german_cleaned](https://huggingface.co/datasets/serbog/job_listing_german_cleaned),2,0
,,[erebos/germanZickleinLLAMA2Dataset](https://huggingface.co/datasets/erebos/germanZickleinLLAMA2Dataset),2,0
,,[thisserand/health_care_german](https://huggingface.co/datasets/thisserand/health_care_german),1,0
,,[philschmid/prompted-germanquad](https://huggingface.co/datasets/philschmid/prompted-germanquad),0,0
,,[philschmid/test_german_squad](https://huggingface.co/datasets/philschmid/test_german_squad),0,2
,,[Harsit/xnli2.0_german](https://huggingface.co/datasets/Harsit/xnli2.0_german),0,1
,,[Harsit/xnli2.0_train_german](https://huggingface.co/datasets/Harsit/xnli2.0_train_german),0,0
,,[akash418/german_europarl](https://huggingface.co/datasets/akash418/german_europarl),0,0
,,[joelniklaus/german_rental_agreements](https://huggingface.co/datasets/joelniklaus/german_rental_agreements),0,1
,,[fathyshalab/Dialogsum-german](https://huggingface.co/datasets/fathyshalab/Dialogsum-german),0,1
,,[fathyshalab/Dialogsum-german-kurz](https://huggingface.co/datasets/fathyshalab/Dialogsum-german-kurz),0,2
,,[fathyshalab/google-presto-german](https://huggingface.co/datasets/fathyshalab/google-presto-german),0,0
,,[dvilasuero/alpaca-german-validation](https://huggingface.co/datasets/dvilasuero/alpaca-german-validation),0,0
,,[fathyshalab/germanquad_qg_qg_dataset](https://huggingface.co/datasets/fathyshalab/germanquad_qg_qg_dataset),0,0
,,[fathyshalab/germanquad_qaeval_dataset](https://huggingface.co/datasets/fathyshalab/germanquad_qaeval_dataset),0,0
,,[AgentWaller/german-oasst1-qlora-format](https://huggingface.co/datasets/AgentWaller/german-oasst1-qlora-format),0,0
,,[AgentWaller/german-oasst1-qa-format](https://huggingface.co/datasets/AgentWaller/german-oasst1-qa-format),0,0
,,[Jakelolipopp/truthful_qa-validation-german_q_n_a](https://huggingface.co/datasets/Jakelolipopp/truthful_qa-validation-german_q_n_a),0,0
,,[germank/hh-rlhf_with_features](https://huggingface.co/datasets/germank/hh-rlhf_with_features),0,0
,,[germank/hh-rlhf_with_features_flan_t5_large-no_eos](https://huggingface.co/datasets/germank/hh-rlhf_with_features_flan_t5_large-no_eos),0,0
,,[germank/hh-rlhf_with_features_flan_t5_large_lll_relabeled](https://huggingface.co/datasets/germank/hh-rlhf_with_features_flan_t5_large_lll_relabeled),0,0
,,[germank/hh-rlhf_with_features_flan_t5_large_rx](https://huggingface.co/datasets/germank/hh-rlhf_with_features_flan_t5_large_rx),0,0
,,[typevoid/german-company-addresses](https://huggingface.co/datasets/typevoid/german-company-addresses),0,1
,,[paoloitaliani/news_articles](https://huggingface.co/datasets/paoloitaliani/news_articles),40,0
,,[pere/italian_tweets_500k](https://huggingface.co/datasets/pere/italian_tweets_500k),14,0
,,[pere/italian_tweets_10M](https://huggingface.co/datasets/pere/italian_tweets_10M),11,0
,,[thomasavare/italian-dataset-deepl2](https://huggingface.co/datasets/thomasavare/italian-dataset-deepl2),3,0
,,[FreedomIntelligence/sharegpt-italian](https://huggingface.co/datasets/FreedomIntelligence/sharegpt-italian),2,0
,,[thomasavare/italian-dataset-helsinki](https://huggingface.co/datasets/thomasavare/italian-dataset-helsinki),2,0
,,[scribis/italian-literature-corpus-mini](https://huggingface.co/datasets/scribis/italian-literature-corpus-mini),1,0
,,[FreedomIntelligence/alpaca-gpt4-italian](https://huggingface.co/datasets/FreedomIntelligence/alpaca-gpt4-italian),1,0
,,[FreedomIntelligence/evol-instruct-italian](https://huggingface.co/datasets/FreedomIntelligence/evol-instruct-italian),0,1
,,[flxclxc/english-norwegian-bible-set](https://huggingface.co/datasets/flxclxc/english-norwegian-bible-set),0,0
,,[NbAiLab/norwegian-xsum](https://huggingface.co/datasets/NbAiLab/norwegian-xsum),0,4
,,[afkfatih/turkishdataset](https://huggingface.co/datasets/afkfatih/turkishdataset),48,0
,,[merve/turkish_instructions](https://huggingface.co/datasets/merve/turkish_instructions),36,4
,,[W4nkel/turkish-sentiment-dataset](https://huggingface.co/datasets/W4nkel/turkish-sentiment-dataset),16,0
,,[kmkarakaya/turkishReviews-ds-mini](https://huggingface.co/datasets/kmkarakaya/turkishReviews-ds-mini),4,0
,,[erkanxyzalaca/turkishKuran](https://huggingface.co/datasets/erkanxyzalaca/turkishKuran),4,0
,,[nanelimon/turkish-social-media-bullying-dataset](https://huggingface.co/datasets/nanelimon/turkish-social-media-bullying-dataset),3,5
,,[kmkarakaya/turkishReviews-ds](https://huggingface.co/datasets/kmkarakaya/turkishReviews-ds),0,1
,,[volkanaltintas/turkishTradeReviews-ds-mini-4000](https://huggingface.co/datasets/volkanaltintas/turkishTradeReviews-ds-mini-4000),0,0
,,[cansen88/turkishReviews_5_topic](https://huggingface.co/datasets/cansen88/turkishReviews_5_topic),0,0
,,[orhanxakarsu/turkishReviews-ds-mini](https://huggingface.co/datasets/orhanxakarsu/turkishReviews-ds-mini),0,0
,,[orhanxakarsu/turkishPoe-ds-mini1](https://huggingface.co/datasets/orhanxakarsu/turkishPoe-ds-mini1),0,0
,,[orhanxakarsu/turkishPoe-ds-mini2](https://huggingface.co/datasets/orhanxakarsu/turkishPoe-ds-mini2),0,0
,,[orhanxakarsu/turkishPoe-generation](https://huggingface.co/datasets/orhanxakarsu/turkishPoe-generation),0,0
,,[orhanxakarsu/turkishPoe-generation-1](https://huggingface.co/datasets/orhanxakarsu/turkishPoe-generation-1),0,0
,,[orhanxakarsu/turkish-poem-generation](https://huggingface.co/datasets/orhanxakarsu/turkish-poem-generation),0,0
,,[Harsit/xnli2.0_turkish](https://huggingface.co/datasets/Harsit/xnli2.0_turkish),0,0
,,[Harsit/xnli2.0_train_turkish](https://huggingface.co/datasets/Harsit/xnli2.0_train_turkish),0,0
,,[eminecg/turkishReviews-ds-mini](https://huggingface.co/datasets/eminecg/turkishReviews-ds-mini),0,0
,,[erkanxyzalaca/turkishReviews-ds-mini](https://huggingface.co/datasets/erkanxyzalaca/turkishReviews-ds-mini),0,0
,,[ozz/turkishReviews-ds-mini](https://huggingface.co/datasets/ozz/turkishReviews-ds-mini),0,0
,,[erytrn/turkishReviews-ds-mini](https://huggingface.co/datasets/erytrn/turkishReviews-ds-mini),0,0
,,[erytrn/turkishReviews-ds-mini2](https://huggingface.co/datasets/erytrn/turkishReviews-ds-mini2),0,0
,,[ramazank2000/turkishReviews-ds-mini1](https://huggingface.co/datasets/ramazank2000/turkishReviews-ds-mini1),0,0
,,[Hilalcelik/turkishReviews-ds-mini](https://huggingface.co/datasets/Hilalcelik/turkishReviews-ds-mini),0,0
,,[sebinbusra/turkishReviews-ds-mini](https://huggingface.co/datasets/sebinbusra/turkishReviews-ds-mini),0,0
,,[kaaniince/turkishReviews-project](https://huggingface.co/datasets/kaaniince/turkishReviews-project),0,0
,,[kaaniince/turkishReviews-ds-textGeneration](https://huggingface.co/datasets/kaaniince/turkishReviews-ds-textGeneration),0,0
,,[AzerKBU/turkishReviews-ds-mini](https://huggingface.co/datasets/AzerKBU/turkishReviews-ds-mini),0,0
,,[bosnakdev/turkishReviews-ds-mini](https://huggingface.co/datasets/bosnakdev/turkishReviews-ds-mini),0,0
,,[yankihue/tweets-turkish](https://huggingface.co/datasets/yankihue/tweets-turkish),0,0
,,[yankihue/turkish-news-categories](https://huggingface.co/datasets/yankihue/turkish-news-categories),0,0
,,[Mursel/turkishReviews-ds-mini](https://huggingface.co/datasets/Mursel/turkishReviews-ds-mini),0,0
,,[Veyselbyte/turkishReviews-ds-mini](https://huggingface.co/datasets/Veyselbyte/turkishReviews-ds-mini),0,0
,,[cagrimehmet/turkishReviews-ds-mini](https://huggingface.co/datasets/cagrimehmet/turkishReviews-ds-mini),0,0
,,[styraist/turkishReview-ds-mini](https://huggingface.co/datasets/styraist/turkishReview-ds-mini),0,0
,,[serkandyck/turkish_instructions](https://huggingface.co/datasets/serkandyck/turkish_instructions),0,0
,,[Memis/turkishReviews-ds-mini](https://huggingface.co/datasets/Memis/turkishReviews-ds-mini),0,0
,,[PulsarAI/turkish_movie_sentiment](https://huggingface.co/datasets/PulsarAI/turkish_movie_sentiment),0,0
,,[ahmet1338/turkishReviews-ds-mini](https://huggingface.co/datasets/ahmet1338/turkishReviews-ds-mini),0,0
,,[nogyxo/question-answering-ukrainian](https://huggingface.co/datasets/nogyxo/question-answering-ukrainian),1,1
,,[nogyxo/question-answering-ukrainian-json-answers](https://huggingface.co/datasets/nogyxo/question-answering-ukrainian-json-answers),0,0
Loading