Skip to content

Commit

Permalink
chk2 21 Jun
Browse files Browse the repository at this point in the history
  • Loading branch information
Roman Koshkin authored and Roman Koshkin committed Jun 21, 2024
1 parent db7fee4 commit 8961bc3
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions data/projects.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -16,10 +16,10 @@ title: Projects
## Recent

<div className="grid sm:grid-cols-2 gap-6">
<ProjectWithBadges url="https://github.com/RomanKoshkin/transllama" title="🔪toLLMatch" badges={["NLP", "pytorch", "machine translation", "AWS"]}>
<ProjectWithBadges url="https://github.com/RomanKoshkin/toLLMatch" title="🔪toLLMatch" badges={["NLP", "pytorch", "machine translation", "AWS"]}>
Zero-shot context-aware simultaneous machine translation leveraging an open-source ASR engine and LLM.
</ProjectWithBadges>
<ProjectWithBadges url="https://github.com/RomanKoshkin/toLLMatch" title="🦙TransLLaMa" badges={["NLP", "LLMOps", "machine translation", "MLOps"]}>
<ProjectWithBadges url="https://github.com/RomanKoshkin/transllama" title="🦙TransLLaMa" badges={["NLP", "LLMOps", "machine translation", "MLOps"]}>
LLM-based simultaneous speech-to-text machine translation. Decoder-only large language models (LLMs) have recently demonstrated impressive capabilities in text generation and reasoning. Nonetheless, they have limited applications in simultaneous machine translation (SiMT), currently dominated by encoder-decoder transformers. This study demonstrates that, after fine-tuning on a small dataset comprising causally aligned source and target sentence pairs, a pre-trained open-source LLM can control input segmentation directly by generating a special "wait" token. This obviates the need for a separate policy and enables the LLM to perform English-German and English-Russian SiMT tasks with BLEU scores that are comparable to those of specific state-of-the-art baselines. We also evaluated closed-source models such as GPT-4, which displayed encouraging results in performing the SiMT task without prior training (zero-shot), indicating a promising avenue for enhancing future SiMT systems.
</ProjectWithBadges>
<ProjectWithBadges url="https://github.com/RomanKoshkin/conv-seq" title="🧠convSeq" badges={["DL", "computational neuroscience"]}>
Expand Down

0 comments on commit 8961bc3

Please sign in to comment.