flash-attention-h2o Public
Combine flash attention with heavy hitter
Python BSD 3-Clause "New" or "Revised" License Updated Sep 14, 2024
kv.run Public
Forked from mlsys-io/kv.run
A model serving framework for various research and production scenarios. Seamlessly built upon the PyTorch and HuggingFace ecosystem.
C++ Apache License 2.0 Updated Aug 8, 2024
ipex-llm Public
Forked from intel-analytics/ipex-llm
Accelerate LLM with low-bit (INT3 / INT4 / NF4 / INT5 / INT8) optimizations using bigdl-llm
Python Apache License 2.0 Updated Apr 25, 2024
bigdl-project.github.io Public
Forked from bigdl-project/bigdl-project.github.io
HTML Updated Mar 25, 2024
bigdl-llm-tutorial Public
Forked from intel/ipex-llm-tutorial
Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using bigdl-llm
Jupyter Notebook Apache License 2.0 Updated Jan 29, 2024
FastChat_BigDL-LLM_Adapted Public
Adapted FastChat with BigDL-LLM
Python Apache License 2.0 Updated Jan 18, 2024
GitHub repo for the Deep Learning Spring 2023 course final project competition
Python Updated May 3, 2023
SimVP Public
Forked from peiqi-liu/SimVP
The official implementation of the CVPR'2022 paper SimVP: Simpler Yet Better Video Prediction.
Python Updated Apr 29, 2023
Prototypical network based on ResNet-50
Python Updated Jun 6, 2022