Skip to content

Latest commit

 

History

History
188 lines (95 loc) · 10.9 KB

llm-tool-use.md

File metadata and controls

188 lines (95 loc) · 10.9 KB

Paper collection for LLM tool use

Introduction

Papers

  1. TALM: Tool Augmented Language Models. Arxiv

    Aaron Parisi, Yao Zhao, Noah Fiedel [pdf], 2022.5

  2. Binding Language Models in Symbolic Languages. ICLR 2023

    Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu [pdf], 2022.10

  3. Synergizing Reasoning and Acting in Language Models. ICLR 2023

    Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao [pdf], 2022.10

  4. PAL: Program-aided Language Models. ICML 2023

    Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig [pdf], 2022.11

  5. Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks. Arxiv

    Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen [pdf], 2022.11

  6. Planning with Large Language Models via Corrective Re-prompting. Neurips 2023 workshop

    Shreyas Sundara Raman, Vanya Cohen, Eric Rosen, Ifrah Idrees, David Paulius, Stefanie Tellex [pdf], 2022.11

  7. Large language models are versatile decomposers: Decompose evidence and questions for table-based reasoning. SIGIR 2023

    Yunhu Ye, Binyuan Hui, Min Yang, Binhua Li, Fei Huang, Yongbin Li [pdf], 2023.1

  8. Augmented Language Models: a Survey. Arxiv

    Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-Yu, Asli Celikyilmaz, Edouard Grave, Yann LeCun, Thomas Scialom [pdf], 2023.2

  9. Collaborating with language models for embodied reasoning. NeurIPS 2022 LaReL workshop

    Ishita Dasgupta, Christine Kaeser-Chen, Kenneth Marino, Arun Ahuja, Sheila Babayan, Felix Hill, Rob Fergus [pdf], 2023.2

  10. Toolformer: Language Models Can Teach Themselves to Use Tools. Arxiv

    Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, Thomas Scialom [pdf], 2023.2

  11. Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models. Arxiv

    Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan [pdf], 2023.3

  12. ViperGPT: Visual Inference via Python Execution for Reasoning. Arxiv

    Dídac Surís, Sachit Menon, Carl Vondrick [pdf], 2023.3

  13. ART: Automatic multi-step reasoning and tool-use for large language models. Arxiv

    Bhargavi Paranjape, Scott Lundberg, Sameer Singh, Hannaneh Hajishirzi, Luke Zettlemoyer, Marco Tulio Ribeiro [pdf], 2023.3

  14. TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs. Arxiv

    Yaobo Liang, Chenfei Wu, Ting Song, Wenshan Wu, Yan Xia, Yu Liu, Yang Ou, Shuai Lu, Lei Ji, Shaoguang Mao, Yun Wang, Linjun Shou, Ming Gong, Nan Duan [pdf], 2023.3

  15. HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace. Arxiv

    Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang [pdf], 2023.3

  16. OpenAGI: When LLM Meets Domain Experts. Arxiv

    Yingqiang Ge, Wenyue Hua, Kai Mei, Jianchao Ji, Juntao Tan, Shuyuan Xu, Zelong Li, Yongfeng Zhang [pdf], 2023.4

  17. API-Bank: A Benchmark for Tool-Augmented LLMs. Arxiv

    Minghao Li, Feifan Song, Bowen Yu, Haiyang Yu, Zhoujun Li, Fei Huang, Yongbin Li [pdf], 2023.4

  18. Tool Learning with Foundation Models. Arxiv

    Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun [pdf], 2023.4

  19. GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information. Arxiv

    Qiao Jin, Yifan Yang, Qingyu Chen, Zhiyong Lu [pdf], 2023.4

  20. Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models. Arxiv

    Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Jianfeng Gao [pdf], 2023.4

  21. LLM+P: Empowering Large Language Models with Optimal Planning Proficiency. Arxiv

    Bo Liu, Yuqian Jiang, Xiaohan Zhang, Qiang Liu, Shiqi Zhang, Joydeep Biswas, Peter Stone [pdf], 2023.4

  22. Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks. Arxiv

    Shicheng Xu, Liang Pang, Huawei Shen, Xueqi Cheng, Tat-Seng Chua [pdf], 2023.4

  23. ToolCoder: Teach Code Generation Models to use API search tools. Arxiv

    Kechi Zhang, Ge Li, Jia Li, Zhuo Li, Zhi Jin [pdf], 2023.5

  24. Small models are valuable plug-ins for large language models. Arxiv

    Canwen Xu, Yichong Xu, Shuohang Wang, Yang Liu, Chenguang Zhu, Julian McAuley [pdf], 2023.5

  25. ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings. Arxiv

    Shibo Hao, Tianyang Liu, Zhen Wang, Zhiting Hu [pdf], 2023.5

  26. CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing. Arxiv

    Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen [pdf], 2023.5

  27. Making Language Models Better Tool Learners with Execution Feedback. Arxiv

    Shuofei Qiao, Honghao Gui, Huajun Chen, Ningyu Zhang [pdf], 2023.5

  28. PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents. Arxiv

    Simeng Sun, Yang Liu, Shuohang Wang, Chenguang Zhu, Mohit Iyyer [pdf], 2023.5

  29. ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models. Arxiv

    Binfeng Xu, Zhiyuan Peng, Bowen Lei, Subhabrata Mukherjee, Yuchen Liu, Dongkuan Xu [pdf], 2023.5

  30. Gorilla: Large Language Model Connected with Massive APIs. Arxiv

    Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez [pdf], 2023.5

  31. On the Tool Manipulation Capability of Open-source Large Language Models. Arxiv

    Qiantong Xu, Fenglu Hong, Bo Li, Changran Hu, Zhengyu Chen, Jian Zhang [pdf], 2023.5

  32. Large Language Models as Tool Makers. Arxiv

    Tianle Cai, Xuezhi Wang, Tengyu Ma, Xinyun Chen, Denny Zhou [pdf], 2023.5

  33. GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction. Arxiv

    Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan [pdf], 2023.5

  34. CREATOR: Disentangling Abstract and Concrete Reasonings of Large Language Models through Tool Creation. Arxiv

    Cheng Qian, Chi Han, Yi R. Fung, Yujia Qin, Zhiyuan Liu, Heng Ji [pdf], 2023.5

  35. Modular Visual Question Answering via Code Generation. ACL 2023

    Sanjay Subramanian, Medhini Narasimhan, Kushal Khangaonkar, Kevin Yang, Arsha Nagrani, Cordelia Schmid, Andy Zeng, Trevor Darrell, Dan Klein [pdf], 2023.6

  36. ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases. Arxiv

    Qiaoyu Tang, Ziliang Deng, Hongyu Lin, Xianpei Han, Qiao Liang, Le Sun [pdf], 2023.6

  37. Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow. Arxiv

    Wenqi Zhang, Yongliang Shen, Weiming Lu, Yueting Zhuang [pdf], 2023.6

  38. ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs Arxiv

    Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Runchu Tian, Ruobing Xie, Jie Zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun [pdf], 2023.7

  39. LLM Powered Autonomous Agents Blog

    Lilian Weng, [blog] 2023.7

  40. Language Agents in the Digital World: Opportunities and Risks Blog

    Shunyu Yao and Karthik Narasimhan, [blog] 2023.7

  41. ExpeL: LLM Agents Are Experiential Learners Arxiv

    Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-Jin Liu, Gao Huang, [pdf] 2023.8

  42. Agents: An Open-source Framework for Autonomous Language Agents Arxiv

    Wangchunshu Zhou, Yuchen Eleanor Jiang, Long Li, Jialong Wu, Tiannan Wang, Shi Qiu, Jintian Zhang, Jing Chen, Ruipu Wu, Shuai Wang, Shiding Zhu, Jiyu Chen, Wentao Zhang, Ningyu Zhang, Huajun Chen, Peng Cui, Mrinmaya Sachan, [pdf] 2023.9

  43. MindAgent: Emergent Gaming Interaction Arxiv

    Ran Gong, Qiuyuan Huang, Xiaojian Ma, Hoi Vo, Zane Durante, Yusuke Noda, Zilong Zheng, Song-Chun Zhu, Demetri Terzopoulos, Li Fei-Fei, Jianfeng Gao, [pdf] 2023.9

  44. Multimodal Foundation Models: From Specialists to General-Purpose Assistants Arxiv

    Chunyuan Li, Zhe Gan, Zhengyuan Yang, Jianwei Yang, Linjie Li, Lijuan Wang, Jianfeng Gao, [pdf] 2023.9

  45. Identifying the Risks of LM Agents with an LM-Emulated Sandbox Arxiv

    Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto, [pdf] 2023.9

  46. Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Arxiv Andy Zhou, Kai Yan, Michal Shlapentokh-Rothman, Haohan Wang, Yu-Xiong Wang, [pdf] 2023.9