Skip to content

Commit

Permalink
fix some typos
Browse files Browse the repository at this point in the history
  • Loading branch information
hodlen committed Dec 15, 2023
1 parent 9d2d6a1 commit 0302435
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@

https://github.com/hodlen/PowerInfer/assets/34213478/b782ccc8-0a2a-42b6-a6aa-07b2224a66f7

<sub>The demo running environment consists of a single 4090 GPU, the model is Falcon (ReLU)-40B, and the precision is FP16.</sub>
<sub>The demo is running with a single 24G 4090 GPU, the model is Falcon (ReLU)-40B, and the precision is FP16.</sub>

---
## Abstract
Expand Down Expand Up @@ -108,7 +108,7 @@ In order to build PowerInfer you have two different options.

![github-eval-2080ti-q4](https://github.com/SJTU-IPADS/PowerInfer/assets/34213478/0fc1bfc4-aafc-4e82-a865-bec0143aff1a)

PowerInfer achieves up to 11x and 8x speedup for fp16 and int4 model!
PowerInfer achieves up to 11x and 8x speedup for FP16 and INT4 model!

## TODOs
We will release the code and data in the following order, please stay tuned!
Expand All @@ -127,7 +127,7 @@ We will release the code and data in the following order, please stay tuned!
If you find PowerInfer useful or relevant to your project and research, please kindly cite our paper:

```bibtex
Stay Tune
Stay tuned!
```

## Acknowledgement
Expand Down

0 comments on commit 0302435

Please sign in to comment.