feat: stream tokens usage #4415

mintyleaf · 2024-12-16T19:31:08Z

Description

This PR fixes #4334

Notes for Reviewers

Got some time to take a look at log of issue with hanging webui response - found that llama sends the empty message from the start for some reason

There was an unnecessary usage of such case to break the channel loop, which was overlooked
Still can't reproduce that by myself, since i can't really run more advanced models, than phi2 on my 8/256 machine
Yet, tokens count still works like before and code from http/openai/chat remains untouched

@mudler can you confirm that everything is working now?

…to get the proper usage data in reply streaming mode at the last [DONE] frame

Seems like that empty message marker trick was unnecessary

netlify · 2024-12-16T19:31:26Z

✅ Deploy Preview for localai ready!

Name	Link
🔨 Latest commit	`4f7b373`
🔍 Latest deploy log	https://app.netlify.com/sites/localai/deploys/676260f7c3dde00008fea92f
😎 Deploy Preview	https://deploy-preview-4415--localai.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

mudler · 2024-12-18T08:48:31Z

Just did tested this and seems to work, thanks!

mintyleaf and others added 4 commits November 28, 2024 02:25

Use pb.Reply instead of []byte with Reply.GetMessage() in llama grpc …

2931ea4

…to get the proper usage data in reply streaming mode at the last [DONE] frame

Merge branch 'master' into fix/stream_tokens_usage

8e8e05d

Merge branch 'master' into fix/stream_tokens_usage

e459118

Fix 'hang' on empty message from the start

f98775e

Seems like that empty message marker trick was unnecessary

mudler and others added 2 commits December 17, 2024 09:25

Merge branch 'master' into fix/stream_tokens_usage

9f6be2b

Merge branch 'master' into fix/stream_tokens_usage

4f7b373

mudler changed the title ~~Fix/stream tokens usage~~ feat: stream tokens usage Dec 18, 2024

mudler approved these changes Dec 18, 2024

View reviewed changes

mudler merged commit 2bc4b56 into mudler:master Dec 18, 2024
28 of 30 checks passed

mudler added the enhancement New feature or request label Jan 10, 2025

BrewTestBot mentioned this pull request Jan 10, 2025

localai 2.25.0 Homebrew/homebrew-core#203887

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: stream tokens usage #4415

feat: stream tokens usage #4415

mintyleaf commented Dec 16, 2024 •

edited by mudler

Loading

netlify bot commented Dec 16, 2024 •

edited

Loading

mudler commented Dec 18, 2024

feat: stream tokens usage #4415

feat: stream tokens usage #4415

Conversation

mintyleaf commented Dec 16, 2024 • edited by mudler Loading

netlify bot commented Dec 16, 2024 • edited Loading

✅ Deploy Preview for localai ready!

mudler commented Dec 18, 2024

mintyleaf commented Dec 16, 2024 •

edited by mudler

Loading

netlify bot commented Dec 16, 2024 •

edited

Loading