From a050602ef18ae20141c7e053df3a6409789bb0a1 Mon Sep 17 00:00:00 2001
From: Stephanie Schoch
## People Hallucinate Too
There are three types of hallucinations according to the _Siren's Song in the AI Ocean_ paper: input-conflicting, context-conflicting, and fact-conflicting hallucination.
_Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models_
## Hallucination Risks
Group 1 focused on "input-conflicting hallucination". One member narrated a story involving two characters, where one character murdered the other, yet ChatGPT drew the opposite conclusion. Another member tried to exploit the model by mixing two distinct languages that share similar-looking words.
Group 2 concentrated on "context-conflicting hallucination". They described four to five fictitious characters and their interrelationships. Some relationships were deducible, yet the model frequently failed to make the complete set of deductions until explicitly prompted to be more thorough.
Group 3 delved into "fact-conflicting hallucination". An illustrative example was when ChatGPT was queried with the fraction "⅓": it offered "0.333" as an approximation, but when subsequently asked to multiply "0.3333" by "3", it confidently responded with "1". Additional tests included translations between two languages.
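As a quick check of that arithmetic (an illustration added here, not part of the paper or the class exercise), the truncated decimal does not multiply back to 1:

```python
from decimal import Decimal

approx = Decimal("0.3333")   # the truncated approximation of 1/3 used in the prompt
print(approx * 3)            # 0.9999
print(approx * 3 == 1)       # False -- so confidently answering "1" is a fact-conflicting hallucination
```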
## Mitigation Strategies

_Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models_
For inference, one strategy is to reduce the snowballing of hallucinations by designing a dynamic p-value that starts off large and shrinks as more tokens are generated. Furthermore, new or external knowledge can be introduced at two different positions: before and after generation.
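Assuming the p-value here refers to the nucleus (top-p) sampling threshold, a minimal sketch of a decoding loop with a decaying p might look like the following; the `next_token_probs` interface, decay rate, and floor value are illustrative assumptions, not details from the survey.

```python
import numpy as np

def dynamic_top_p_sample(probs: np.ndarray, p: float) -> int:
    """Sample a token id from the smallest set of tokens whose total mass reaches p."""
    order = np.argsort(probs)[::-1]                    # token ids sorted by probability, descending
    cumulative = np.cumsum(probs[order])
    cutoff = int(np.searchsorted(cumulative, p)) + 1   # size of the nucleus
    nucleus = order[:cutoff]
    nucleus_probs = probs[nucleus] / probs[nucleus].sum()
    return int(np.random.choice(nucleus, p=nucleus_probs))

def generate_with_decaying_p(next_token_probs, max_tokens=50,
                             p_start=0.9, decay=0.95, p_floor=0.3):
    """Decoding loop where the nucleus threshold p shrinks after every token,
    so later tokens are sampled more conservatively (less snowballing)."""
    tokens, p = [], p_start
    for _ in range(max_tokens):
        probs = next_token_probs(tokens)   # assumed model interface: prefix -> next-token distribution
        tokens.append(dynamic_top_p_sample(probs, p))
        p = max(p_floor, p * decay)        # shrink p as more tokens are generated
    return tokens
```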
## Decoding by Contrasting Layers

_DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models_

Building on recent trends, the concept of contrastive decoding is introduced. For example, how should the model decide between "Seattle" and "Olympia" when generating an answer? Treating the last layer as a mature layer, it is beneficial to contrast it with the preceding layers, which can be deemed premature. For each of these layers, the difference between the mature and premature next-token probability distributions can be measured using the Jensen-Shannon Divergence. Such an approach amplifies the factual knowledge the model has acquired in its later layers, thereby enhancing its output generation.
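The following is a rough sketch of the contrast step only, assuming we already have next-token distributions from several candidate (premature) layers and the final (mature) layer; it is illustrative rather than the authors' implementation, and the function names are assumptions.

```python
import numpy as np

def js_divergence(p, q, eps=1e-12):
    """Jensen-Shannon divergence between two next-token distributions."""
    p, q = np.asarray(p), np.asarray(q)
    m = 0.5 * (p + q)
    kl = lambda a, b: np.sum(a * np.log((a + eps) / (b + eps)))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def dola_contrast(layer_probs):
    """Contrast the mature (final) layer against the most divergent premature layer.

    layer_probs: list of next-token distributions, one per candidate layer,
    with the final (mature) layer last.
    """
    mature = np.asarray(layer_probs[-1])
    # Pick the premature layer whose distribution differs most from the mature one (max JSD).
    premature = np.asarray(max(layer_probs[:-1], key=lambda q: js_divergence(mature, q)))
    # Amplify what the model "learned" between the premature and mature layers.
    contrast = np.log(mature + 1e-12) - np.log(premature + 1e-12)
    # Softmax back into a distribution over the vocabulary.
    exp = np.exp(contrast - contrast.max())
    return exp / exp.sum()
```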
_In-Context Retrieval-Augmented Language Models_
The model parameters are kept frozen. Instead of feeding text directly into the model, the approach first uses retrieval to find relevant documents in external sources, and the retrieved text is then concatenated with the original input. Re-ranking the retrieval results provides additional benefits; the exact perplexities can be found on the slide. Smaller retrieval strides enhance performance, albeit at the cost of increased runtime. The authors also observed that the tokens at the end of the prefix are typically the most relevant for output generation, and that shorter retrieval queries tend to outperform longer ones.
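A minimal sketch of this retrieve-and-concatenate loop is shown below, assuming hypothetical `retrieve` and `generate_tokens` helpers standing in for the retriever and the frozen language model; the stride and query lengths are illustrative, not the paper's settings.

```python
def ralm_generate(prompt, retrieve, generate_tokens,
                  max_tokens=128, stride=4, query_len=32):
    """Retrieval-augmented generation with a frozen LM.

    retrieve:        hypothetical helper, query string -> ranked list of documents
    generate_tokens: hypothetical helper, (augmented prompt, n) -> list of n new tokens
    stride:          re-retrieve every `stride` tokens (smaller stride = better output, slower)
    query_len:       number of most recent tokens used as the retrieval query
    """
    generated = []
    while len(generated) < max_tokens:
        prefix = (prompt + " " + " ".join(generated)).strip()
        query = " ".join(prefix.split()[-query_len:])   # recent tokens are most informative
        top_doc = retrieve(query)[0]                    # optionally re-rank candidates here
        augmented = top_doc + "\n\n" + prefix           # concatenate retrieved text with the input
        generated.extend(generate_tokens(augmented, stride))
    return " ".join(generated)
```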
## Benefits of Hallucinations
- Group 2:
  - Discussed two papers from this week's readings, which highlighted the use of semantic search and the introduction of external context to aid the model. This approach, while useful for diminishing hallucination, depends heavily on external information, which is not effective in generic cases.
  - Further strategies discussed were automated prompt engineering, optimization of user-provided context (noting that very long contexts can induce hallucination), and using filtering or attention mechanisms to limit the tokens the model processes.
  - From the model's perspective, it is beneficial to employ red-teaming, explore corner cases, and pinpoint domains where hallucinations are prevalent.
  - Notably, responses can vary for an identical prompt. A proposed solution is to generate multiple responses to the same prompt and aggregate them, perhaps through majority voting, to eliminate low-probability hallucinations (see the sketch after these group summaries).
- Group 1:
  - Discussed the scarcity of alternatives to the current training datasets.
  - Like Group 2, explored the idea of generating multiple responses, but suggested allowing the user to select from the array of choices.
  - Another approach discussed was having the model admit uncertainty, stating "I don't know", rather than producing a hallucination.
- Group 3:
  - Addressed inconsistencies in the training data.
  - Emphasized the importance of fine-tuning and ensuring the use of up-to-date data, while noting that fine-tuning does not guarantee the exclusion of outdated data.
  - Advised sourcing data solely from credible sources.
  - An interesting perspective discussed was using a larger model to verify a smaller model's outputs for hallucinations. But a caveat arises: how can one ensure the larger model's accuracy? And if the larger model is deemed superior, why not employ it directly?
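As a rough illustration of the multiple-responses idea raised by Groups 1 and 2 (a sketch under the assumption of a hypothetical `ask_model` function that returns one sampled response per call, not something specified in the readings):

```python
from collections import Counter

def majority_vote_answer(prompt, ask_model, n_samples=7):
    """Sample several independent responses and keep the most common answer,
    so low-probability hallucinations tend to get voted out."""
    answers = [ask_model(prompt).strip().lower() for _ in range(n_samples)]
    answer, count = Counter(answers).most_common(1)[0]
    if count <= n_samples // 2:
        # No clear majority: admitting uncertainty beats returning a likely hallucination.
        return "I don't know"
    return answer
```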
## Discussion: _What are the potential advantages of hallucinations in Large Language Models (LLMs)?_

- One advantage discussed was that hallucinations "train" users not to blindly trust model outputs; if such models are blindly trusted, there is a much greater risk associated with their use.
- If users can conclusively discern that the produced information is fictitious, it could help foster new ideas or fresh perspectives on a given topic.
- Furthermore, while fake data has potential utility in synthetic data generation, there is a pressing need to remain vigilant about the accuracy and plausibility of the data produced.
# Readings

### For the first class (10/9)