Narratron: Collaborative Writing and Shadow-playing of Children Stories with Large Language Models (2023.10.29)
Yubo Zhao, Xiying Bao . - 【Adjunct Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology】
CodeFusion: A Pre-trained Diffusion Model for Code Generation (2023.10.26)
Mukul Singh, J. Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, etc
GraphGPT: Graph Instruction Tuning for Large Language Models (2023.10.19)
Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Lixin Su, etc . - 【arXiv.org】
Creative Robot Tool Use with Large Language Models (2023.10.19)
Mengdi Xu, Peide Huang, Wenhao Yu, Shiqi Liu, Xilun Zhang, etc . - 【arXiv.org】
MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models (2023.10.18)
Dingyao Yu, Kaitao Song, Peiling Lu, Tianyu He, Xu Tan, etc . - 【arXiv.org】
BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation (2023.10.16)
Ji Qi, Kaixuan Ji, Jifan Yu, Duokang Wang, Bin Xu, etc . - 【arXiv.org】
JMedLoRA: Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuning (2023.10.16)
Issey Sukeda, Masahiro Suzuki, Hiroki Sakaji, Satoshi Kodera . - 【arXiv.org】
Table-GPT: Table-tuned GPT for Diverse Table Tasks (2023.10.13)
Peng Li, Yeye He, Dror Yashar, Weiwei Cui, Song Ge, etc . - 【arXiv.org】
MemGPT: Towards LLMs as Operating Systems (2023.10.12)
Charles Packer, Vivian Fang, Shishir G. Patil, Kevin Lin, Sarah Wooders, etc
Ferret: Refer and Ground Anything Anywhere at Any Granularity (2023.10.11)
Haoxuan You, Haotian Zhang, Zhe Gan, Xianzhi Du, Bowen Zhang, etc
Understanding the Effects of RLHF on LLM Generalisation and Diversity (2023.10.10)
Robert Kirk, Ishita Mediratta, Christoforos Nalmpantis, Jelena Luketina, Eric Hambro, etc
Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading (2023.10.08)
Howard Chen, Ramakanth Pasunuru, Jason Weston, Asli Celikyilmaz . - 【arXiv.org】
xVal: A Continuous Number Encoding for Large Language Models (2023.10.04)
Siavash Golkar, Mariel Pettee, Michael Eickenberg, Alberto Bietti, M. Cranmer, etc . - 【arXiv.org】
How FaR Are Large Language Models From Agents with Theory-of-Mind? (2023.10.04)
Pei Zhou, Aman Madaan, Srividya Pranavi Potharaju, Aditya Gupta, Kevin R. McKee, etc
MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens (2023.10.03)
Kaizhi Zheng, Xuehai He, Xin Eric Wang . - 【arXiv.org】
PB-LLM: Partially Binarized Large Language Models (2023.09.29)
Yuzhang Shang, Zhihang Yuan, Qiang Wu, Zhen Dong . - 【arXiv.org】
Integration of Large Language Models within Cognitive Architectures for Autonomous Robots (2023.09.26)
Miguel Ángel González Santamarta, F. J. Lera, Ángel Manuel Guerrero Higueras, Vicente Matellán Olivera . - 【arXiv.org】
Effective Distillation of Table-based Reasoning Ability from LLMs (2023.09.22)
Bohao Yang, Chen Tang, Kangning Zhao, Chenghao Xiao, Chenghua Lin . - 【arXiv.org】
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs (2023.09.22)
Justin Chih-Yao Chen, Swarnadeep Saha, Mohit Bansal . - 【arXiv.org】
Chain-of-Verification Reduces Hallucination in Large Language Models (2023.09.20)
S. Dhuliawala, M. Komeili, Jing Xu, Roberta Raileanu, Xian Li, etc . - 【arXiv.org】
Kosmos-2.5: A Multimodal Literate Model (2023.09.20)
Tengchao Lv, Yupan Huang, Jingye Chen, Lei Cui, Shuming Ma, etc . - 【arXiv.org】
DreamLLM: Synergistic Multimodal Comprehension and Creation (2023.09.20)
Runpei Dong, Chunrui Han, Yuang Peng, Zekun Qi, Zheng Ge, etc . - 【arXiv.org】
SwitchGPT: Adapting Large Language Models for Non-Text Outputs (2023.09.14)
Xinyu Wang, Bohan Zhuang, Qi Wu . - 【arXiv.org】
NExT-GPT: Any-to-Any Multimodal LLM (2023.09.11)
Shengqiong Wu, Hao Fei, Leigang Qu, Wei Ji, Tat-Seng Chua . - 【arXiv.org】
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting (2023.09.08)
Griffin Adams, Alexander R. Fabbri, Faisal Ladhak, Eric Lehman, Noémie Elhadad . - 【arXiv.org】
Large Language Models as Optimizers (2023.09.07)
Chengrun Yang, Xuezhi Wang, Yifeng Lu, Hanxiao Liu, Quoc V. Le, etc
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models (2023.09.07)
Yung-Sung Chuang, Yujia Xie, Hongyin Luo, Yoon Kim, James R. Glass, etc . - 【arXiv.org】
YaRN: Efficient Context Window Extension of Large Language Models (2023.08.31)
Bowen Peng, Jeffrey Quesnelle, Honglu Fan, Enrico Shippole . - 【arXiv.org】
MVDream: Multi-view Diffusion for 3D Generation (2023.08.31)
Yichun Shi, Peng Wang, Jianglong Ye, Mai Long, Kejie Li, etc . - 【arXiv.org】
PE-MED: Prompt Enhancement for Interactive Medical Image Segmentation (2023.08.26)
Ao Chang, Xing Tao, Xin Yang, Yuhao Huang, Xinrui Zhou, etc . - 【arXiv.org】
DARWIN Series: Domain Specific Large Language Models for Natural Science (2023.08.25)
Tong Xie, Yuwei Wan, Wei Huang, Yufei Zhou, Yixuan Liu, etc . - 【arXiv.org】
ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation (2023.08.22)
Jianghao Lin, Rongjie Shan, Chenxu Zhu, Kounianhua Du, Bo Chen, etc . - 【arXiv.org】
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding (2023.08.21)
Tianyu Yu, Chengyue Jiang, Chao Lou, Shen Huang, Xiaobin Wang, etc . - 【arXiv.org】
Giraffe: Adventures in Expanding Context Lengths in LLMs (2023.08.21)
Arka Pal, Deep Karkhanis, Manley Roberts, S. Dooley, Arvind Sundararajan, etc . - 【arXiv.org】
ExpeL: LLM Agents Are Experiential Learners (2023.08.20)
Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Y. Liu, etc . - 【arXiv.org】
The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation (2023.08.14)
Patrick Fernandes, Daniel Deutsch, M. Finkelstein, Parker Riley, André F. T. Martins, etc . - 【arXiv.org】
Accelerating LLM Inference with Staged Speculative Decoding (2023.08.08)
Benjamin Spector, Christal Re . - 【arXiv.org】
Shepherd: A Critic for Language Model Generation (2023.08.08)
Tianlu Wang, Ping Yu, Xiaoqing Tan, Sean O'Brien, Ramakanth Pasunuru, etc . - 【arXiv.org】
AgentBench: Evaluating LLMs as Agents (2023.08.07)
Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, etc . - 【arXiv.org】
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models (2023.08.03)
Zheng Yuan, Hongyi Yuan, Cheng Li, Guanting Dong, Chuanqi Tan, etc . - 【arXiv.org】
Advancing Beyond Identification: Multi-bit Watermark for Language Models (2023.08.01)
Kiyoon Yoo, W. Ahn, N. Kwak . - 【arXiv.org】
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? (2023.07.31)
Qipeng Zhao, Ce Zhang, Shijie Wang, Changcheng Fu, Nakul Agarwal, etc . - 【arXiv.org】
A Private Watermark for Large Language Models (2023.07.30)
Aiwei Liu, Leyi Pan, Xuming Hu, Shuang Li, Lijie Wen, etc . - 【arXiv.org】
Robust Distortion-free Watermarks for Language Models (2023.07.28)
Rohith Kuditipudi, John Thickstun, Tatsunori Hashimoto, Percy Liang . - 【arXiv.org】
Publisher Correction: Large language models encode clinical knowledge. (2023.07.27)
K. Singhal, Shekoofeh Azizi, Tao Tu, S. S. Mahdavi, Jason Wei, etc . - 【Nature】
Med-Flamingo: a Multimodal Medical Few-shot Learner (2023.07.27)
Michael Moor, Qian Huang, Shirley Wu, Michihiro Yasunaga, C. Zakka, etc . - 【arXiv.org】
Med-Flamingo: a Multimodal Medical Few-shot Learner (2023.07.27)
Michael Moor, Qian Huang, Shirley Wu, Michihiro Yasunaga, C. Zakka, etc . - 【arXiv.org】
ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning (2023.07.18)
Liang Zhao, En Yu, Zheng Ge, Jinrong Yang, Hao-Ran Wei, etc . - 【arXiv.org】
TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT (2023.07.17)
Liangyu Zha, Junlin Zhou, Liyao Li, Rui Wang, Qingyi Huang, etc . - 【arXiv.org】
Self-consistency for open-ended generations (2023.07.11)
Siddhartha Jain, Xiaofei Ma, Anoop Deoras, Bing Xiang . - 【arXiv.org】
LongNet: Scaling Transformers to 1, 000, 000, 000 Tokens (2023.07.05)
Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, etc . - 【arXiv.org】
Mitigating the Learning Bias towards Repetition by Self-Contrastive Training for Open-Ended Generation (2023.07.04)
Jian Guan, Minlie Huang . - 【Annual Meeting of the Association for Computational Linguistics】
Conformer LLMs - Convolution Augmented Large Language Models (2023.07.02)
Prateek Verma . - 【arXiv.org】
Inferring the Goals of Communicating Agents from Actions and Instructions (2023.06.28)
Lance Ying, Tan Zhi-Xuan, Vikash K. Mansinghka, J. Tenenbaum . - 【arXiv.org】
Kosmos-2: Grounding Multimodal Large Language Models to the World (2023.06.26)
Zhiliang Peng, Wenhui Wang, Li Dong, Y. Hao, Shaohan Huang, etc . - 【arXiv.org】
AudioPaLM: A Large Language Model That Can Speak and Listen (2023.06.22)
Paul K. Rubenstein, Chulayuth Asawaroengchai, D. Nguyen, Ankur Bapna, Zalán Borsos, etc . - 【arXiv.org】
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models (2023.06.14)
Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Kaifeng Bi, Xiaotao Gu, etc . - 【arXiv.org】
XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models (2023.06.13)
Omkar Thawakar, Abdelrahman M. Shaker, Sahal Shaji Mullappilly, Hisham Cholakkal, R. Anwer, etc . - 【arXiv.org】
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena (2023.06.09)
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, etc . - 【arXiv.org】
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena (2023.06.09)
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, etc . - 【arXiv.org】
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance (2023.06.08)
Qianqian Xie, Weiguang Han, Xiao Zhang, Yanzhao Lai, Min Peng, etc . - 【arXiv.org】
ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory (2023.06.06)
Chenxu Hu, Jie Fu, Chenzhuang Du, Simian Luo, J. Zhao, etc . - 【arXiv.org】
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only (2023.06.01)
Guilherme Penedo, Quentin Malartic, Daniel Hesslow, Ruxandra-Aimée Cojocaru, Alessandro Cappelli, etc . - 【arXiv.org】
Baselines for Identifying Watermarked Large Language Models (2023.05.29)
Leonard Tang, Gavin Uberti, Tom Shlomi . - 【arXiv.org】
Undetectable Watermarks for Language Models (2023.05.25)
Miranda Christ, S. Gunn, Or Zamir . - 【IACR Cryptology ePrint Archive】
Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective (2023.05.24)
Guhao Feng, Yuntian Gu, Bohang Zhang, Haotian Ye, Di He, etc
Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering (2023.05.24)
Avi Caciularu, Matthew E. Peters, Jacob Goldberger, Ido Dagan, Arman Cohan
Context-Aware Transformer Pre-Training for Answer Sentence Selection (2023.05.24)
Luca Di Liello, Siddhant Garg, Alessandro Moschitti
Gorilla: Large Language Model Connected with Massive APIs (2023.05.24)
Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez
Visual Programming for Text-to-Image Generation and Evaluation (2023.05.24)
Jaemin Cho, Abhay Zala, Mohit Bansal
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model (2023.05.24)
Zirui Liu, Guanchu Wang, Shaochen Zhong, Zhaozhuo Xu, Daochen Zha, etc
LMs with a Voice: Spoken Language Modeling beyond Speech Tokens (2023.05.24)
Eliya Nachmani, Alon Levkovitch, Julian Salazar, Chulayutsh Asawaroengchai, Soroosh Mariooryad, etc
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator (2023.05.24)
Ziwei He, Meng Yang, Minwei Feng, Jingcheng Yin, Xinbing Wang, etc
CSTS: Conditional Semantic Textual Similarity (2023.05.24)
Ameet Deshpande, Carlos E. Jimenez, Howard Chen, Vishvak S. Murahari, Victoria Graf, etc
STAR: Boosting Low-Resource Event Extraction by Structure-to-Text Data Generation with Large Language Models (2023.05.24)
Mingyu Derek Ma, Xiaoxuan Wang, Po-Nien Kung, P. Jeffrey Brantingham, Nanyun Peng, etc
Contrastive Learning of Sentence Embeddings from Scratch (2023.05.24)
Junlei Zhang, Zhenzhong Lan, Junxian He
Meta-Learning Online Adaptation of Language Models (2023.05.24)
Nathan J. Hu, Eric Mitchell, Christopher D. Manning, Chelsea Finn
Who Wrote this Code? Watermarking for Code Generation (2023.05.24)
Taehyun Lee, Seokhee Hong, Jaewoo Ahn, Ilgee Hong, Hwaran Lee, etc
Reasoning over Hierarchical Question Decomposition Tree for Explainable Question Answering (2023.05.24)
Jiajie Zhang, Shulin Cao, Tingjia Zhang, Xin Lv, Jiaxin Shi, etc
Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis (2023.05.24)
Alessandro Stolfo, Yonatan Belinkov, Mrinmaya Sachan
Active Learning for Natural Language Generation (2023.05.24)
Yotam Perlitz, Ariel Gera, Michal Shmueli-Scheuer, Dafna Sheinwald, Noam Slonim, etc
SmartTrim: Adaptive Tokens and Parameters Pruning for Efficient Vision-Language Models (2023.05.24)
Zekun Wang, Jingchang Chen, Wangchunshu Zhou, Ming Liu, Bing Qin
How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (2023.05.24)
Xinpeng Wang, Leonie Weissweiler, Hinrich Schutze, Barbara Plank
ChatAgri: Exploring Potentials of ChatGPT on Cross-linguistic Agricultural Text Classification (2023.05.24)
Biao Zhao, Weiqiang Jin, Javier Del Ser, Guang Yang
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models (2023.05.24)
Gen Luo, Yiyi Zhou, Tianhe Ren, Shengxin Chen, Xiaoshuai Sun, etc
Unlocking Temporal Question Answering for Large Language Models Using Code Execution (2023.05.24)
Xingxuan Li, Liying Cheng, Qingyu Tan, Hwee Tou Ng, Shafiq Joty, etc
Bactrian-X : A Multilingual Replicable Instruction-Following Model with Low-Rank Adaptation (2023.05.24)
Haonan Li, Fajri Koto, Minghao Wu, Alham Fikri Aji, Timothy Baldwin
Injecting Knowledge into Biomedical Pre-trained Models via Polymorphism and Synonymous Substitution (2023.05.24)
Hongbo Zhang, Xiang Wan, Benyou Wang
LLMDet: A Large Language Models Detection Tool (2023.05.24)
Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng, Tat-Seng Chua
The Art of SOCRATIC QUESTIONING: Zero-shot Multimodal Reasoning with Recursive Thinking and Self-Questioning (2023.05.24)
Jingyuan Qi, Zhiyang Xu, Ying Shen, Minqian Liu, Di Jin, etc
Reasoning with Language Model is Planning with World Model (2023.05.24)
Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, etc
Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers (2023.05.24)
Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang, etc
Improving Factuality of Abstractive Summarization without Sacrificing Summary Quality (2023.05.24)
Tanay Dixit, Fei Wang, Muhao Chen
OverPrompt: Enhancing ChatGPT Capabilities through an Efficient In-Context Learning Approach (2023.05.24)
Jiazheng Li, Runcong Zhao, Yulan He, Lin Gui
MMNet: Multi-Mask Network for Referring Image Segmentation (2023.05.24)
Yichen Yan, Xingjian He, Wenxuan Wan, Jing Liu
Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks (2023.05.24)
Abhinav Rao, Sachin Vashistha, Atharva Naik, Somak Aditya, Monojit Choudhury
Editing Commonsense Knowledge in GPT (2023.05.24)
Anshita Gupta, Debanjan Mondal, Akshay Krishna Sheshadri, Wenlong Zhao, Xiang Lorraine Li, etc
Cross-lingual Data Augmentation for Document-grounded Dialog Systems in Low Resource Languages (2023.05.24)
Qi Gou, Zehua Xia, Wen-Hau Du
Trade-Offs Between Fairness and Privacy in Language Modeling (2023.05.24)
Cleo Matzken, Steffen Eger, Ivan Habernal
Frugal Prompting for Dialog Models (2023.05.24)
Bishal Santra, Sakya Basak, Abhinandan De, Manish Gupta, Pawan Goyal
In-Context Demonstration Selection with Cross Entropy Difference (2023.05.24)
Dan Iter, Reid Pryzant, Ruochen Xu, Shuohang Wang, Yang Liu, etc
A Causal View of Entity Bias in (Large) Language Models (2023.05.24)
Fei Wang, Wenjie Mo, Yiwei Wang, Wenxuan Zhou, Muhao Chen
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models (2023.05.23)
Z. Chen, Kun Zhou, Beichen Zhang, Zheng Gong, Wayne Xin Zhao, etc
Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning (2023.05.20)
Liangming Pan, Alon Albalak, Xinyi Wang, William Yang Wang
LogiCoT: Logical Chain-of-Thought Instruction-Tuning Data Collection with GPT-4 (2023.05.20)
Hanmeng Liu, Zhiyang Teng, Leyang Cui, Chaoli Zhang, Qiji Zhou, etc
SelfzCoT: a Self-Prompt Zero-shot CoT from Semantic-level to Code-level for a Better Utilization of LLMs (2023.05.19)
IokTong Lei, ZhiDong Deng . - 【arXiv.org】
RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought (2023.05.19)
Tianci Xue, Ziqi Wang, Zhenhailong Wang, Chi Han, Pengfei Yu, etc . - 【arXiv.org】
Do Models Really Learn to Follow Instructions? An Empirical Study of Instruction Tuning (2023.05.19)
Po-Nien Kung, Nanyun Peng . - 【arXiv.org】
AutoTrial: Prompting Language Models for Clinical Trial Design (2023.05.19)
Zifeng Wang, Cao Xiao, Jimeng Sun . - 【arXiv.org】
Efficient Prompting via Dynamic In-Context Learning (2023.05.18)
Wangchunshu Zhou, Yuchen Jiang, Ryan Cotterell, Mrinmaya Sachan . - 【arXiv.org】
Emergent and Predictable Memorization in Large Language Models (2023.04.21)
Stella Rose Biderman, Usvsn Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin G. Anthony, etc
SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks (2023.03.01)
Kai-Wei Chang, Yu-Kai Wang, Hua Shen, Iu-thing Kang, W. Tseng, etc . - 【ArXiv】
Soft Prompt Guided Joint Learning for Cross-Domain Sentiment Analysis (2023.03.01)
Jingli Shi, Weihua Li, Quan-wei Bai, Yi Yang, Jianhua Jiang . - 【ArXiv】
EvoPrompting: Language Models for Code-Level Neural Architecture Search (2023.02.28)
Angelica Chen, David Dohan, David R. So . - 【ArXiv】
Kai Greshake, Sahar Abdelnabi, Shailesh Mishra, C. Endres, Thorsten Holz, etc . - 【ArXiv】
Grimm in Wonderland: Prompt Engineering with Midjourney to Illustrate Fairytales (2023.02.17)
M. Ruskov . - 【ArXiv】
LabelPrompt: Effective Prompt-based Learning for Relation Classification (2023.02.16)
W. Zhang, Xiaoning Song, Zhenhua Feng, Tianyang Xu, Xiaojun Wu . - 【ArXiv】
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition (2023.02.16)
Minsu Kim, Hyungil Kim, Y. Ro . - 【ArXiv】
Prompting for Multimodal Hateful Meme Classification (2023.02.08)
Rui Cao, R. Lee, Wen-Haw Chong, Jing Jiang . - 【Conference on Empirical Methods in Natural Language Processing】
Toxicity Detection with Generative Prompt-based Inference (2022.05.24)
Yau-Shian Wang, Y. Chang . - 【ArXiv】
Learning to Transfer Prompts for Text Generation (2022.05.03)
Junyi Li, Tianyi Tang, J. Nie, Ji-rong Wen, Wayne Xin Zhao . - 【North American Chapter of the Association for Computational Linguistics】
RelationPrompt: Leveraging Prompts to Generate Synthetic Data for Zero-Shot Relation Triplet Extraction (2022.03.17)
Yew Ken Chia, Lidong Bing, Soujanya Poria, Luo Si . - 【Findings】
QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition (2022.03.03)
Andy T. Liu, Wei Xiao, Henghui Zhu, Dejiao Zhang, Shang-Wen Li, etc . - 【ArXiv】
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts (2022.02.02)
Stephen H. Bach, Victor Sanh, Zheng Xin Yong, Albert Webson, Colin Raffel, etc . - 【Annual Meeting of the Association for Computational Linguistics】
Few-Shot Bot: Prompt-Based Learning for Dialogue Systems (2021.10.15)
Andrea Madotto, Zhaojiang Lin, Genta Indra Winata, Pascale Fung . - 【ArXiv】
SentiPrompt: Sentiment Knowledge Enhanced Prompt-Tuning for Aspect-Based Sentiment Analysis (2021.09.17)
Chengxi Li, Feiyu Gao, Jiajun Bu, Lu Xu, Xiang Chen, etc . - 【ArXiv】
LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting (2021.08.31)
Xiang Chen, Lei Li, Shumin Deng, Chuanqi Tan, Changliang Xu, etc . - 【International Conference on Computational Linguistics】
Program Synthesis with Large Language Models (2021.08.16)
Jacob Austin, Augustus Odena, Maxwell Nye, Maarten Bosma, H. Michalewski, etc . - 【ArXiv】
Evaluating Large Language Models Trained on Code (2021.07.07)
Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde, etc . - 【ArXiv】
KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction (2021.04.15)
Xiang Chen, Ningyu Zhang, Ningyu Zhang, Xin Xie, Shumin Deng, etc . - 【The Web Conference】
Language Models as Knowledge Bases? (2019.09.01)
Fabio Petroni, Tim Rocktäschel, Patrick Lewis, A. Bakhtin, Yuxiang Wu, etc . - 【Conference on Empirical Methods in Natural Language Processing】
AdaPrompt: Adaptive Prompt-based Finetuning for Relation Extraction
Xiang Chen, Xin Xie, Ningyu Zhang, Jiahuan Yan, Shumin Deng, etc