CUHK ARISE Lab

10 repositories

GAMABench
Public
Benchmarking LLMs' Gaming Ability in Multi-Agent Environments
Jupyter Notebook
GNU General Public License v3.0
0 • 42 • 0 • 0 • Updated Nov 25, 2024
PsychoBench
Public
Benchmarking LLMs' Psychological Portrayal
Python
GNU General Public License v3.0
2 • 68 • 0 • 1 • Updated Nov 21, 2024
EmotionBench
Public
Benchmarking LLMs' Emotional Alignment with Humans
Python
GNU General Public License v3.0
4 • 69 • 1 • 1 • Updated Sep 25, 2024
LLMPersonality
Public
Code and Results of the Paper Titled: Revisiting the Reliability of Psychological Scales on Large Language Models
Python
0 • 29 • 0 • 0 • Updated Sep 24, 2024
MAS-Resilience
Public
Code and data for our paper "On the Resilience of Multi-Agent Systems with Malicious Agents"
Python
GNU General Public License v3.0
0 • 13 • 0 • 0 • Updated Aug 5, 2024
ECHO
Public
Evaluating AI Chatbots’ Role-Play Ability
Python
GNU General Public License v3.0
0 • 2 • 0 • 0 • Updated Apr 30, 2024
3100-PJ-TUT-3
Public
HTML
2 • 1 • 0 • 0 • Updated Feb 13, 2023
3100-PJ-TUT-2
Public
Python
3 • 1 • 0 • 0 • Updated Jan 29, 2023
AEON
Public
An automated tool to evaluate the quality of textual adversarial examples.
Python
MIT License
1 • 8 • 0 • 0 • Updated Jul 19, 2022
ml4code-dataset
Public
A collection of datasets for machine learning for big code
MIT License
5 • 46 • 0 • 0 • Updated Oct 8, 2021