Haosheng Zou (邹昊晟) HaoshengZou

LLM alignment@360, prev.@miHoYo & 4Paradigm. PhD@THU, advised by Prof. Jun Zhu.

37 followers · 5 following

360
Beijing
https://scholar.google.com/citations?user=zrqzQswAAAAJ

Pinned

minGPT24 Public

Python
reversi-alpha-zero Public

Forked from mokemokechicken/reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.

Python
tianshou Public

Forked from thu-ml/tianshou

An elegant PyTorch deep reinforcement learning platform.

Python
trl Public

Python
schroederdewitt/multiagent_mujoco Public

Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.

Python · 334 stars · 34 forks
taufikxu/youtube Public archive

Youtube-8M challenge on Kaggle

Python · 3 stars · 1 fork