-
Notifications
You must be signed in to change notification settings - Fork 0
/
基座模型相关
60 lines (46 loc) · 1.72 KB
/
基座模型相关
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
欢迎 Gemma: Google 最新推出的开源大型语言模型
https://zhuanlan.zhihu.com/p/683295570
谷歌开源可商用的大语言模型Gemma
https://zhuanlan.zhihu.com/p/683278643
https://huggingface.co/google/gemma-7b-it
ChatGLM3
https://github.com/THUDM/ChatGLM3
https://huggingface.co/THUDM/chatglm3-6b
glm-4-9b
https://huggingface.co/THUDM/glm-4-9b-chat-1m
Mixtral-8x7B
https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1
llama2
https://huggingface.co/meta-llama/Llama-2-70b-chat-hf
llama3
如何看待 Meta 发布 Llama3,并将推出 400B+ 版本?
https://www.zhihu.com/question/653373334
https://github.com/meta-llama/llama3
https://llama.meta.com/llama3/
llama3.1
https://huggingface.co/collections/meta-llama/llama-31-669fc079a0c406a149a5738f
https://huggingface.co/blog/llama31
https://ai.meta.com/blog/meta-llama-3-1/
https://ai.meta.com/research/publications/the-llama-3-herd-of-models/
llama3.2
媲美GPT-4o mini的小模型,Meta Llama 3.2模型全面解读!
https://zhuanlan.zhihu.com/p/803861772
qwen1.5
https://github.com/QwenLM/Qwen1.5
qwen2
https://qwenlm.github.io/blog/qwen2/
https://huggingface.co/Qwen/Qwen2-72B-Instruct
Qwen2 Technical Report
https://arxiv.org/abs/2407.10671
qwen2.5
https://qwenlm.github.io/blog/qwen2.5/
https://huggingface.co/spaces/Qwen/Qwen2.5
Qwen 2.5 技术报告(中文速通版)
https://zhuanlan.zhihu.com/p/14710836610
https://huggingface.co/Qwen/QVQ-72B-Preview
https://huggingface.co/spaces/Qwen/QVQ-72B-preview
Gemma 2
【LLM技术报告】《Gemma 2: Improving Open Language Models at a Practical Size》——Gemma 2技术报告(全文)
https://zhuanlan.zhihu.com/p/706063275
【LLM技术报告】《Phi-4技术报告》
https://zhuanlan.zhihu.com/p/12270688172