forked from huggingface/chat-ui
-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test New LLMs (Llama2, CodeLlama, etc.) on Chat-UI? #3
Open
krrishdholakia
wants to merge
27
commits into
CENIA-DEV:main
Choose a base branch
from
krrishdholakia:patch-2
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This reverts commit 87c6937.
* Fix reuqest body * update webSearchQueryPromptTemplate * update generate google query parser * Add today's date to google search query creator * crawl top stories if exts; remove answer_box & knowledgeGraph * Create paragraph chunks from top articles * flattened paragprah chunks * update status texts * add gradio client * call gradio app for RAG * Web scrape only "p, li, span" els * add MAX_N_CHUNKS * gradio result typing * parse only <p> elements * rm dev change * update typing WebSearch * buld RAG prompt * Rm dev change * change websearch context msg from user to assisntat type * use hosted gradio app * fix lint * prompt engineering * more prompt engineering * MAX_N_PAGES_SCRAPE = 10 * better error msg * more prompt engineering * revert websearch prompt to previous * rm `top_stories` from websearch as the results are not good * Stop using gradio client, use regular fetch * chore * Rm websearchsummary references as it is no longer used * update readme * Apply suggestions from code review Co-authored-by: Julien Chaumond <[email protected]> * Use tfjs to do embeddings in server node * fix websearch component disapperar after finishing generation * Show sources of closest embeddings used in RAG * fix prompting and also add current date * add comment * comment for search query * sources * hide www * using hostname direclty * Show successful web pages instead of failed ones * rm noisy messages * google query generation using previous messaages as context * handle falcon generation * bring back Browsing webpage msg --------- Co-authored-by: Julien Chaumond <[email protected]> Co-authored-by: Victor Mustar <[email protected]>
* Update README.md * add description of websearch on readme * Apply suggestions from code review Co-authored-by: Victor Muštar <[email protected]> * Update README.md --------- Co-authored-by: Mishig Davaadorj <[email protected]> Co-authored-by: Mishig <[email protected]>
* adjustments and mobile modal * use dvh unit * margin
* Add latex support with marked-katex-extension * Add renderer * Fix marked default option problem * Fix linting error * Fix lock error
* Bump mongodb from 5.3.0 to 5.8.0 Bumps [mongodb](https://github.com/mongodb/node-mongodb-native) from 5.3.0 to 5.8.0. - [Release notes](https://github.com/mongodb/node-mongodb-native/releases) - [Changelog](https://github.com/mongodb/node-mongodb-native/blob/v5.8.0/HISTORY.md) - [Commits](mongodb/node-mongodb-native@v5.3.0...v5.8.0) --- updated-dependencies: - dependency-name: mongodb dependency-type: direct:production ... Signed-off-by: dependabot[bot] <[email protected]> * Store IP in messageEvents * IP based rate limit * Revert "IP based rate limit" This reverts commit 87c6937. * ip rate limit * move rate limit event to top * Add rate limiting to websearch and title summary (huggingface#433) * [Websearch] update (huggingface#427) * Fix reuqest body * update webSearchQueryPromptTemplate * update generate google query parser * Add today's date to google search query creator * crawl top stories if exts; remove answer_box & knowledgeGraph * Create paragraph chunks from top articles * flattened paragprah chunks * update status texts * add gradio client * call gradio app for RAG * Web scrape only "p, li, span" els * add MAX_N_CHUNKS * gradio result typing * parse only <p> elements * rm dev change * update typing WebSearch * buld RAG prompt * Rm dev change * change websearch context msg from user to assisntat type * use hosted gradio app * fix lint * prompt engineering * more prompt engineering * MAX_N_PAGES_SCRAPE = 10 * better error msg * more prompt engineering * revert websearch prompt to previous * rm `top_stories` from websearch as the results are not good * Stop using gradio client, use regular fetch * chore * Rm websearchsummary references as it is no longer used * update readme * Apply suggestions from code review Co-authored-by: Julien Chaumond <[email protected]> * Use tfjs to do embeddings in server node * fix websearch component disapperar after finishing generation * Show sources of closest embeddings used in RAG * fix prompting and also add current date * add comment * comment for search query * sources * hide www * using hostname direclty * Show successful web pages instead of failed ones * rm noisy messages * google query generation using previous messaages as context * handle falcon generation * bring back Browsing webpage msg --------- Co-authored-by: Julien Chaumond <[email protected]> Co-authored-by: Victor Mustar <[email protected]> * bump to 0.6.0 (huggingface#434) * Update README.md (huggingface#435) * Update README.md * add description of websearch on readme * Apply suggestions from code review Co-authored-by: Victor Muštar <[email protected]> * Update README.md --------- Co-authored-by: Mishig Davaadorj <[email protected]> Co-authored-by: Mishig <[email protected]> * Mobile: fix model selection (huggingface#448) * adjustments and mobile modal * use dvh unit * margin * fix lint on main * Add latex support with marked-katex-extension (huggingface#450) * Add latex support with marked-katex-extension * Add renderer * Fix marked default option problem * Fix linting error * Fix lock error --------- Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Nathan Sarrazin <[email protected]> Co-authored-by: Mishig <[email protected]> Co-authored-by: Julien Chaumond <[email protected]> Co-authored-by: Victor Mustar <[email protected]> Co-authored-by: Mishig Davaadorj <[email protected]> Co-authored-by: Blanchon <[email protected]>
…gingface#451) * feat: Improve error handling and parsing of MODELS environment variable * Add more verbose parsing error * Lint * improve message * lint * refactor error handling and default values in models * improve * format --------- Co-authored-by: Nathan Sarrazin <[email protected]>
* Use `gte-base` as the emebdding model * use `bge-small-en-v1.5` * Revert "use `bge-small-en-v1.5`" This reverts commit 8cfe084. * Use `gte-small`
This reverts commit f88542b.
…ted (huggingface#451)" This reverts commit 8ce8b63.
This reverts commit 1061bc2.
* wip: complete refactor of streaming backend * working refactoring * fix missing first token & perf regression in output quality * lint * Fix websearch loading from db * fix loading * fix invalidate * remove logs * fix SSR error * typo: paragraphs * fixed save on abort * lint * lint * remove debug log in console * lint for real
* Refactor summarization * get rid of debug log * remove old todo
* fix JSON.parse for summerize When serving with TGI, summerize calls this function and it errors with `SyntaxError: Unexpected token d in JSON at position 0` This PR fixes the problem and keeps existing behaviour. * fix types --------- Co-authored-by: Nathan Sarrazin <[email protected]>
* add-copytoclipboardbtn for the all message * fix padding * fix padding * Fix styling * Move before like and dislike button * position and spacing * mobile fix --------- Co-authored-by: Victor Mustar <[email protected]>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Hi @pipepkr @FranLucchini,
Notice you forked chat-ui. if you're trying to test other LLMs (codellama, wizardcoder, etc.) with it, I just wrote a 1-click proxy to translate openai calls to huggingface, anthropic, togetherai, etc. api calls.
code
Here's the PR on adding openai to chat-ui: huggingface#452
I'd love to know if this solves a problem for you