Consideration - Compress LLM template/query optimization #31

converseKarl · 2024-04-23T22:46:35Z

there is another git project from Microsoft for LLM Prompt Query compression thatr reduces query prompt by 20% and optimizes it but also reduces the LLM costing and speeds up performance.

https://github.com/microsoft/LLMLingua

Could this be something useful to hook in as an optimization on the LLM template and query?

adhityan · 2024-04-24T03:46:04Z

Yes, this is certainly useful.

I checked this out when they put it out. I want to enable it as a simple flag you set to activate during build. Unfortunately Microsoft has only provided a Python implementation for this library atm. I am keeping track of this and we can reopen this thread once there is support for it in JS/TS.

github-actions bot assigned adhityan Apr 23, 2024

adhityan closed this as completed Apr 24, 2024

adhityan added the enhancement New feature or request label Apr 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consideration - Compress LLM template/query optimization #31

Consideration - Compress LLM template/query optimization #31

converseKarl commented Apr 23, 2024

adhityan commented Apr 24, 2024

Consideration - Compress LLM template/query optimization #31

Consideration - Compress LLM template/query optimization #31

Comments

converseKarl commented Apr 23, 2024

adhityan commented Apr 24, 2024