Skip to content

Commit

Permalink
Merge pull request #306 from utilityai/update-llama-cpp-2024-05-19
Browse files Browse the repository at this point in the history
  • Loading branch information
MarcusDunn authored May 19, 2024
2 parents 60ee3ff + 7b8341f commit dce5451
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion llama-cpp-sys-2/llama.cpp
Submodule llama.cpp updated 85 files
+8 −8 .devops/nix/package.nix
+73 −0 .github/labeler.yml
+169 −63 .github/workflows/build.yml
+12 −0 .github/workflows/labeler.yml
+4 −3 .github/workflows/server.yml
+54 −21 CMakeLists.txt
+45 −0 CMakePresets.json
+3 −3 Makefile
+23 −7 README.md
+0 −95 ci/run.sh
+16 −0 cmake/arm64-windows-llvm.cmake
+6 −0 cmake/arm64-windows-msvc.cmake
+11 −1 common/common.cpp
+1 −0 common/common.h
+1 −1 common/grammar-parser.cpp
+6 −6 common/json-schema-to-grammar.cpp
+5 −5 common/log.h
+40 −49 convert-hf-to-gguf-update.py
+49 −59 convert-hf-to-gguf.py
+0 −150 convert-lora-to-ggml.py
+155 −25 convert.py
+20 −4 docs/debugging-tests.md
+3 −0 examples/CMakeLists.txt
+1 −0 examples/embedding/embedding.cpp
+10 −8 examples/llama.android/app/src/main/cpp/CMakeLists.txt
+21 −6 examples/llava/llava-cli.cpp
+0 −15 examples/llava/llava.cpp
+59 −1 examples/perplexity/README.md
+2 −2 examples/perplexity/perplexity.cpp
+3 −1 examples/quantize/README.md
+2 −0 examples/rpc/CMakeLists.txt
+74 −0 examples/rpc/README.md
+134 −0 examples/rpc/rpc-server.cpp
+5 −6 examples/server/README.md
+14 −0 examples/server/server.cpp
+5 −2 examples/server/tests/features/steps/steps.py
+1 −1 examples/server/utils.hpp
+0 −1 ggml-backend.c
+81 −32 ggml-cuda.cu
+1 −0 ggml-cuda.h
+29 −0 ggml-cuda/common.cuh
+162 −0 ggml-cuda/fattn-common.cuh
+312 −0 ggml-cuda/fattn-tile-f16.cu
+3 −0 ggml-cuda/fattn-tile-f16.cuh
+309 −0 ggml-cuda/fattn-tile-f32.cu
+3 −0 ggml-cuda/fattn-tile-f32.cuh
+326 −0 ggml-cuda/fattn-vec-f16.cu
+5 −0 ggml-cuda/fattn-vec-f16.cuh
+275 −0 ggml-cuda/fattn-vec-f32.cu
+3 −0 ggml-cuda/fattn-vec-f32.cuh
+78 −577 ggml-cuda/fattn.cu
+2 −11 ggml-cuda/softmax.cu
+33 −30 ggml-cuda/upscale.cu
+7 −0 ggml-impl.h
+48 −35 ggml-metal.m
+33 −41 ggml-metal.metal
+2,214 −40 ggml-quants.c
+1,032 −0 ggml-rpc.cpp
+24 −0 ggml-rpc.h
+5 −24 ggml-sycl.cpp
+1,043 −1,028 ggml-vulkan-shaders.hpp
+65 −129 ggml-vulkan.cpp
+589 −354 ggml.c
+16 −2 ggml.h
+59 −27 ggml_vk_generate_shaders.py
+1 −0 gguf-py/gguf/__init__.py
+11 −5 gguf-py/gguf/gguf_writer.py
+20 −9 gguf-py/gguf/lazy.py
+109 −0 gguf-py/gguf/quants.py
+267 −123 llama.cpp
+3 −0 llama.h
+0 −1 requirements.txt
+0 −2 requirements/requirements-convert-lora-to-ggml.txt
+176 −90 scripts/debug-test.sh
+116 −46 scripts/gen-unicode-data.py
+4 −0 scripts/sync-ggml-am.sh
+1 −1 scripts/sync-ggml.last
+2 −0 scripts/sync-ggml.sh
+63 −17 tests/test-backend-ops.cpp
+46 −0 tests/test-grammar-integration.cpp
+60 −53 tests/test-tokenizer-random.py
+6,969 −2,169 unicode-data.cpp
+15 −12 unicode-data.h
+89 −111 unicode.cpp
+44 −12 unicode.h

0 comments on commit dce5451

Please sign in to comment.