Releases: 0cc4m/GPTQ-for-LLaMa

GPTQ-KoboldAI 0.0.6

12 Jun 06:23
Add v2 with bias support (e.g. for Tulu-30b)
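For context on what v2 bias support means in practice, here is a minimal sketch of a quantized linear forward pass that applies an optional bias after the matmul. The class name, buffer names, and the use of an already-dequantized float weight are illustrative assumptions, not the repository's actual QuantLinear implementation.

```python
import torch
import torch.nn as nn

class QuantLinearSketch(nn.Module):
    """Illustrative stand-in for a quantized linear layer with optional bias.

    Real GPTQ layers store packed integer weights plus scales/zeros; here the
    weight is kept as a plain float tensor so the bias handling stays visible.
    """

    def __init__(self, in_features: int, out_features: int, has_bias: bool = True):
        super().__init__()
        self.register_buffer("weight", torch.zeros(out_features, in_features))
        if has_bias:
            self.register_buffer("bias", torch.zeros(out_features))
        else:
            self.register_buffer("bias", None)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out = x @ self.weight.t()          # stand-in for the dequantized matmul
        if self.bias is not None:          # v2 checkpoints may carry a bias tensor
            out = out + self.bias
        return out
```

Models whose linear layers carry biases (Tulu-30b is the example given in the release note) need this extra tensor in the checkpoint; the earlier format simply omitted it.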

GPTQ-KoboldAI 0.0.5

23 May 04:57
81469a4
Merge pull request #14 from TehVenomm/latestmerge

Fixes an incorrect bit shift applied to 8-bit models
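For context on the fix: quantized values are packed several to an int32, and the unpacking shift must step by the bit width (8 for 8-bit values, 4 for 4-bit). Below is a minimal sketch of that unpacking, assuming a simple row-wise int32 layout; the function name and exact packing order are illustrative, not the repository's actual kernel.

```python
import torch

def unpack_int32(qweight: torch.Tensor, bits: int) -> torch.Tensor:
    """Unpack `bits`-wide values from int32-packed storage.

    Each int32 holds 32 // bits values, so the shift must advance by `bits`
    per value: 0, 8, 16, 24 for 8-bit weights, not the 4-bit stride.
    """
    per_int = 32 // bits                                   # 4 values per int32 at 8 bits
    mask = (1 << bits) - 1                                 # 0xFF for 8-bit
    shifts = torch.arange(per_int, dtype=torch.int32) * bits
    # (rows, cols, 1) >> (per_int,) broadcasts to (rows, cols, per_int)
    unpacked = (qweight.unsqueeze(-1) >> shifts) & mask
    return unpacked.reshape(qweight.shape[0], -1)

# Example: two int32 words, each packing four 8-bit values
packed = torch.tensor([[0x04030201, 0x08070605]], dtype=torch.int32)
print(unpack_int32(packed, bits=8))  # tensor([[1, 2, 3, 4, 5, 6, 7, 8]])
```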

GPTQ-KoboldAI 0.0.4

19 May 19:50
2023-05-19-2

Bump version

GPTQ-KoboldAI 0.0.3

18 May 19:47
2023-05-18-2

Fix setup.py

GPTQ-KoboldAI 0.0.2

09 May 20:15
Add support for the upstream GPTQ CUDA version

Co-authored-by: qwopqwop200 <[email protected]>

GPTQ Python module

06 May 18:46
2023-05-06-2

Add MPT support

quant_cuda for CUDA 11.8/ROCm 5.4.2

02 May 20:06
Revert "Add wheel links file for pip"

This reverts commit 539af97b13e2b8ab0da6b26733159c22d3fe0963.

quant_cuda for CUDA 11.7

27 Apr 14:02
Revert "Add wheel links file for pip"

This reverts commit 539af97b13e2b8ab0da6b26733159c22d3fe0963.

2023-04-10

10 Apr 20:18

First wheel release