huggingface / text-generation-inference Public

Issues: huggingface/text-generation-inference

153 Open 1,242 Closed

Issues list

Tool Calling using Vercel's AI SDK not working as intended

#2864 opened Dec 23, 2024 by kldzj

2 of 4 tasks

Dynamically serve LoRA modules

#2860 opened Dec 20, 2024 by rikardradovac

Make version tag selectable on docs page dropdown

#2857 opened Dec 19, 2024 by andrewrreed

text-generation-inference:latest-trtllm is missing dependencies to run models

#2854 opened Dec 18, 2024 by selalipop

2 of 4 tasks

Entire system crashes when get to warm up model

#2853 opened Dec 17, 2024 by ad-astra-video

1 of 4 tasks

random text generation from Qwen2-VL-7B-Instruct with TGI3

#2851 opened Dec 17, 2024 by DongyoungKim2

2 of 4 tasks

Docs for LoRA Availability and support for Qwen models

#2847 opened Dec 16, 2024 by joaomsimoes

Cohere2 aka Cohere2ForCausalLM

#2843 opened Dec 14, 2024 by kno10

2 tasks done

TGI hangs when running two extremely long prompts at once

#2842 opened Dec 14, 2024 by JohnTheNerd

2 of 4 tasks

Model warmup fails after adding Triton indexing kernels

#2838 opened Dec 13, 2024 by YaserJaradeh

2 of 4 tasks

Load model weight fast

#2836 opened Dec 13, 2024 by Zzzz1111

Server stucks at model warming phase for codestral-22b on 4xH100

#2835 opened Dec 13, 2024 by phymbert

2 of 4 tasks

use pip install TGI3.0

#2832 opened Dec 12, 2024 by xiezhipeng-git

Security aspects in TGI

#2830 opened Dec 12, 2024 by vitalyshalumov

1 of 4 tasks

TypeError: '>=' not supported between instances of 'NoneType' and 'int'

#2828 opened Dec 11, 2024 by KartDriver

2 of 4 tasks

Error for Qwen2-VL-2B-Instruct using v3.0.0

#2823 opened Dec 11, 2024 by tobiasvanderwerff

2 of 4 tasks

Unkown compute for card nvidia-a100-80gb-pcie

#2822 opened Dec 11, 2024 by ferreroal

2 of 4 tasks

BUILD_EXTENSIONS=False make install error！！！

#2821 opened Dec 11, 2024 by tangliangwu

4 tasks

[broken-compatibility] chat completion breaks base64 standard / openAI spec

#2820 opened Dec 11, 2024 by lucyknada

2 of 4 tasks

Failure when start the model using TGI 3

#2819 opened Dec 10, 2024 by hahmad2008

2 of 4 tasks

text-generation-inference False make install exception

#2805 opened Dec 6, 2024 by tangliangwu

1 of 4 tasks

integration-test failures on MI300

#2804 opened Dec 6, 2024 by itej89

2 of 4 tasks

Qwen2-VL-7B does not run properly

#2801 opened Dec 5, 2024 by jvhgit

2 of 4 tasks

NotImplementedError: 4bit quantization is not supported for AutoModel

#2800 opened Dec 4, 2024 by Jaimin-Nividous

TGI choice tool does not match OpenAI response

#2794 opened Dec 2, 2024 by thehs29
