Change the repository type filter
All
Repositories list
17 repositories
Spider2
PublicOSWorld
Public[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer EnvironmentsPai-Megatron-Patch
PublicOpenAgents
Public[COLM 2024] OpenAgents: An Open Platform for Language Agents in the WildDS-1000
Public[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".BRIGHT
Publictext2reward
Public[ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"Spider2-V
Public[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?instructor-embedding
Public[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddingsarks
Publicxlang-paper-reading
PublicPaper collection on building and evaluating language model agents via executable language groundingbatch-prompting
Public.github
PublicBinder
Public[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"UnifiedSKG
Public[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language modelsicl-selective-annotation
Public[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"diagrams_toolkit
Public