Skip to content

Spring_2023

Kelly Marchisio edited this page Apr 23, 2023 · 28 revisions
Week Topic Presenter Paper (With Link)
January 23, 2023 Welcome and Topic Selection David Yarowsky and Kenton Murray
January 30, 2023 Low-Resource Kenton Murray FLEURS with slides and some FLoRes-101 and v1.0 and a bit of FLORES-200
February 6, 2023 Typology Rachel Wicks From Phonology to Syntax: Unsupervised Linguistic Typology at Different Levels with Language Embeddings with maybe some background
February 13, 2023 Phonology Henry Li Investigating phonological theories with crowd-sourced data: The Inventory Size Hypothesis in the light of Lingua Libre and Domain-Informed Probing of wav2vec 2.0 Embeddings for Phonetic Features
February 20, 2023 Code-switching Stella Li Are Multilingual Models Effective in Code-Switching? and Language Models for Code-switch Detection of te reo Maori and English in a Low-resource Setting
February 27, 2023 Codeswitching Bismarck Odoom End-to-End Speech Translation for Code Switched Speech
March 6, 2023 Representation Learning Tianjian Li Discovering Low-rank Subspaces for Language-agnostic Multilingual Representations (Xie et al., EMNLP 2022) and slides
March 13, 2023 Representation Learning Kenton Murray IsoScore: Measuring the Uniformity of Embedding Space Utilization and slides
March 20, 2023 Spring Break
March 27, 2023
April 3, 2023 Representation Learning Ujvala Pradeep An Isotropy Analysis in the Multilingual BERT Embedding Space and slides
April 10, 2023 Multilingual Transfer Neha Verma Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability and slides
April 17, 2023 Multilingual Modeling Nikhil Sharma XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models
April 24, 2023 Corpus Quality Kelly Marchisio Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets and Does Corpus Quality Really Matter for Low-Resource Languages?
Clone this wiki locally