基座模型相关

欢迎 Gemma: Google 最新推出的开源大型语言模型
https://zhuanlan.zhihu.com/p/683295570
谷歌开源可商用的大语言模型Gemma
https://zhuanlan.zhihu.com/p/683278643
https://huggingface.co/google/gemma-7b-it

ChatGLM3
https://github.com/THUDM/ChatGLM3
https://huggingface.co/THUDM/chatglm3-6b

glm-4-9b
https://huggingface.co/THUDM/glm-4-9b-chat-1m

Mixtral-8x7B
https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1

llama2
https://huggingface.co/meta-llama/Llama-2-70b-chat-hf

llama3
如何看待 Meta 发布 Llama3，并将推出 400B+ 版本？
https://www.zhihu.com/question/653373334
https://github.com/meta-llama/llama3
https://llama.meta.com/llama3/

llama3.1
https://huggingface.co/collections/meta-llama/llama-31-669fc079a0c406a149a5738f
https://huggingface.co/blog/llama31
https://ai.meta.com/blog/meta-llama-3-1/
https://ai.meta.com/research/publications/the-llama-3-herd-of-models/


llama3.2
媲美GPT-4o mini的小模型，Meta Llama 3.2模型全面解读！
https://zhuanlan.zhihu.com/p/803861772

qwen1.5
https://github.com/QwenLM/Qwen1.5

qwen2
https://qwenlm.github.io/blog/qwen2/
https://huggingface.co/Qwen/Qwen2-72B-Instruct
Qwen2 Technical Report
https://arxiv.org/abs/2407.10671

qwen2.5
https://qwenlm.github.io/blog/qwen2.5/
https://huggingface.co/spaces/Qwen/Qwen2.5
Qwen 2.5 技术报告（中文速通版）
https://zhuanlan.zhihu.com/p/14710836610

https://huggingface.co/Qwen/QVQ-72B-Preview
https://huggingface.co/spaces/Qwen/QVQ-72B-preview

Gemma 2
【LLM技术报告】《Gemma 2: Improving Open Language Models at a Practical Size》——Gemma 2技术报告（全文）
https://zhuanlan.zhihu.com/p/706063275

【LLM技术报告】《Phi-4技术报告》
https://zhuanlan.zhihu.com/p/12270688172