"We are a group of students who have just begun our research careers. These articles are a collection that we found helpful after reading, and we hope they can be of assistance to everyone."
- A Comprehensive Survey of Retrieval-Augmented Generation (RAG) Evolution, Current Landscape and Future Directions
- A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration
- A pathology foundation model for cancer diagnosis and prognosis prediction
- A Survey on Evaluation of Large Language Models
- A Survey on Multimodal Large Language Models
- A Survey on Self-Evolution of Large Language Models
- Adaptive Collaboration Strategy for LLMs in Medical Decision Making
- Agent Hospital- A Simulacrum of Hospital with Evolvable Medical Agents
- Agent-as-a-Judge Evaluate Agents with Agents
- AgentCoord multi agent collaboration
- AgentScope A Flexible yet Robust Multi-Agent Platform
- AgentVerse
- AIPatient Simulating Patients with EHRs and LLM Powered Agentic Workflow
- Are Language Models Actually Useful for Time Series Forecasting
- Are More LLM Calls All You Need
- Are Multi-Agent Discussions the Key
- Attention Is All You Need
- AUTODETECT Towards a Unified Framework for Automated Weakness
- AutoGen
- Autonomous Agents for Collaborative Task under Information Asymmetry
- Autonomous Agents for Collaborative Task under Information Asymmetry_camera_ready
- BattleAgentBench A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems
- BERT
- BERT
- BEYOND HUMAN TRANSLATION
- camel
- Chain of Code Reasoning with a Language Model-Augmented Code Emulator
- Chain-of-thought
- ChatDev
- chatdev_communicative_agents
- CHATEVAL
- Cognition _ A review of OpenAI o1 and how we evaluate coding agents
- Computing machinery and intelligence
- CONFLICTBANK A Benchmark for Evaluating Knowledge Conflicts in Large Language Models
- Cooperate or Collapse
- Cooperation and Punishment in Public Goods Experiments
- CRITICBENCH Evaluating Large Language Models as Critic
- CToolEval A Chinese Benchmark for LLM-Powered Agent Evaluation in Real-World API Interactions
- Decoding Echo Chambers LLM-Powered Simulations Revealing Polarization in Social Networks
- Deep learning
- Deep Residual Learning for Image Recognition
- Deep_Residual_Learning_CVPR_2016_paper
- Do Language Models Enjoy Their Own Stories
- DPO-2023-direct-preference-optimization-your-language-model-is-secretly-a-reward-model-Paper-Conference
- Dynamic domain testing with multi-agent Markov chain Monte Carlo
- DYNAMIC LLM-AGENT NETWORK
- Edu Agents
- EHRAgent_Code_Empowers_Lar
- Empowering Biomedical Discovery with AI Agents
- Enhancing Code Translation in Language Models with Few-Shot Learning via Retrieval-Augmented Generation
- EoT
- Evaluating LLMs Playing
- EvaluLLM LLM Assisted Evaluation of Generative Outputs
- Experiential Co-Learning of Software-Developing Agents
- Exploring Large Language Models for Communication Games
- Generating Reasonable and Diversified Story Ending
- Genie Generative Interactive Environments
- Gpt-4 technical
- GPT-4 Technical Report
- GPT2
- GPT2
- GPTSwarm
- Graph of Thoughts
- Graph of Thoughts Solving Elaborate Problems with Large Language Models
- HDFLOW ENHANCING LLM COMPLEX PROBLEM-SOLVING WITH HYBRID THINKING AND DYNAMIC WORKFLOWS
- human_cooperation
- HYPER-CONNECTIONS
- Hypothetical Minds
- ImageNet
- Improving Automatic Source Code
- Improving Diversity of Commonsense Generation
- Improving Diversity of Commonsense Generation by
- INTERNET OF AGENTS
- Into the Unknown Unknowns Engaged Human Learning through Participation in Language Model Agent Conversations
- Iterative Experience Refinement of Software-Developing Agents
- Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
- KAN
- Knowledge Editing for Large Language Models A Survey
- Language Agents Foundations, Prospects, and Risks
- Language Models are Few-Shot Learners
- Large language models in medicine
- LENGTH DESENSITIZATION IN DIRECTED PREFERENCE OPTIMIZATION
- Length-Controlled AlpacaEval A Simple Way to Debias Automatic Evaluators
- LLM-BASED MULTI-AGENT POETRY GENERATION IN NON-COOPERATIVE ENVIRONMENTS
- LLM-BLENDER
- LLM-empowered Chatbots for Psychiatrist and Patient Simulation Application and Evaluation
- Make Your LLM Fully Utilize the Context
- MAKING PRETRAINED LANGUAGE MODELS GREAT ON TABULAR PREDICTION
- Mamba Linear-Time Sequence Modeling with Selective State Spaces
- MAO A Framework for Process Model Generation with Multi-Agent Orchestration
- MedAgents- Large Language Models as Collaborators for Zero-shot Medical Reasoning
- Medical Education - 2009 - McGaghie - A critical review of simulation%E2%80%90based medical education research 2003 2009
- MEDICAL GRAPH RAG TOWARDS SAFE MEDICAL
- MetaGPT
- MiniCPM
- MINPROMPT
- MoE_2017
- MoE_2023
- More Agents Is All You Need
- Multi-Agent Software Development through Cross-Team Collaboration
- Multi-Modal and Multi-Agent Systems Meet Rationality- A Survey
- ODYSSEY Empowering Agents with Open World Skills
- OpenGraph
- Overview-of-standardized-pat
- PRACT Optimizing Principled Reasoning and Acting of LLM Agent
- PrefCLM Enhancing Preference-based Reinforcement Learning
- Proportional Aggregation of Preferences for Sequential Decision Making
- RAGAs
- RAGCHECKER A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
- RepoBench
- Roleplay-doh Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles
- SAM2
- Scaling Large-Language-Model-based Multi-Agent Collaboration
- sefl-instruct
- Self Organized Agents
- Self-collaboration Code Generation via ChatGPT
- SELF-EVOLVING MULTI-AGENT COLLABORATION NETWORKS FOR SOFTWARE DEVELOPMENT
- SportsMetrics
- Standardized Patient Encounters A Method for Teaching and Evaluation
- Tell me more
- The Benefits and Risks of Being a Standardized Patient A Narrative Review of the Literature
- The Earth is Flat because... Investigating LLMs’ Belief towards Misinformation via Persuasive Conversation
- Think-on-Process Dynamic Process Generation for Collaborative Development of Multi-Agent System
- TOT-tree-of-thoughts-deliberate-problem-solving-with-large-language-models-Paper-Conference
- Towards a Client-Centered Assessment of LLM Therapists by Client Simulation
- Towards Evaluating and Building Versatile Large Language Models for Medicine
- Towards Multimodal Surgical Assistant via Structured Surgical Video Learning
- Turn Every Application into an Agent Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents
- Two Tales of Persona in LLMs- A Survey of Role-Playing and Personalization
- ULTRAFEEDBACK Boosting Language Models with Scaled AI Feedback
- Uncovering the Strategic Reasoning Limitations of
- Virtual Patient Simulation at U.S. and Canadian Medical Schools
- Writing AI Conference Papers A Handbook for Beginners
⭐" Join us in improving this repository!"