Skip to content
View ai-nikolai's full-sized avatar
🍉
Building Stuff
🍉
Building Stuff

Block or report ai-nikolai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ai-nikolai/README.md

👋 Hi there!

I am a CS PhD student in LLM agents Imperial College London with Marek Rei. That's my latest work: StateAct, which outperforms ReAct by ~10%.

I work on:

  • 🧠 LLM Reasoning
  • 🛠️ LLMs + Tools
  • 🤖 LLM agents

Specifically, investigating 🔭:

  1. How to acquire new skills (automatically) for LLM Agents.
  2. How to make LLM agents follow instructions better (fine-tuning, RAG, ++).
  3. How to help LLM agents recover from mistakes.

Get in touch with me:

  • 📫 via email.
  • 🌱 to collaborate.

🧰 Toolkit

Python C++ CUDA JavaScript LateX

PyTorch TensorFlow HuggingFace

NumPy Pandas SciPy matplotlib SpaCy scikit-learn Jupyter Anaconda

Linux Git

Pinned Loading

  1. LLamp LLamp Public

    Larage Language Model Planning (LLAMP)

    Jupyter Notebook

  2. barl barl Public

    Bayesian Approximate Reinforcement Learning (BARL)

    Python 1

  3. Wluper/matilda Wluper/matilda Public

    MATILDA: Multi-AnnoTator multi-language Interactive Lightweight Dialogue Annotator

    JavaScript 149 28

  4. Wluper/Retrograph Wluper/Retrograph Public

    Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers

    Python 20 5

  5. StateAct StateAct Public

    StateAct

  6. ai-nikolai.github.io ai-nikolai.github.io Public

    Forked from alshedivat/al-folio

    Nikolai Rozanov's personal homepage.

    HTML