Skip to content
Change the repository type filter

All

    Repositories list

    • 大规模高质量中文网络文本,具有多维和细粒度信息(带训练分类模型代码🌟🌟🌟)
      Python
      2000Updated Dec 2, 2024Dec 2, 2024
    • 1.5-Pints

      Public
      一个在9天内用高质量数据预训练的紧凑型大型语言模型(非常好🌟🌟🌟)
      Python
      MIT License
      24000Updated Nov 28, 2024Nov 28, 2024
    • OLMo

      Public
      Modeling, training, eval, and inference code for OLMo(完全开源)
      Python
      Apache License 2.0
      510000Updated Nov 28, 2024Nov 28, 2024
    • DeepBI

      Public
      LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.(非常好的BI项目🌟🌟🌟)
      Python
      MIT License
      414000Updated Nov 27, 2024Nov 27, 2024
    • marker

      Public
      Convert PDF to markdown quickly with high accuracy(推荐🌟🌟🌟)
      Python
      GNU General Public License v3.0
      1.1k000Updated Nov 26, 2024Nov 26, 2024
    • EasyRAG

      Public
      Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案
      Python
      MIT License
      29000Updated Nov 17, 2024Nov 17, 2024
    • learn-llm

      Public
      从零预训练LLAMA3的完整指南:一个文件,探索Scaling Law
      Jupyter Notebook
      17000Updated Nov 10, 2024Nov 10, 2024
    • LightRAG

      Public
      "LightRAG: Simple and Fast Retrieval-Augmented Generation"LightRAG将GraphRAG落地门槛打下来了!
      Python
      MIT License
      1.7k000Updated Oct 30, 2024Oct 30, 2024
    • JioNLP

      Public
      中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
      Python
      Apache License 2.0
      415000Updated Oct 27, 2024Oct 27, 2024
    • Emu3

      Public
      Next-Token Prediction is All You Need(下一词预测)
      Python
      Apache License 2.0
      77000Updated Oct 24, 2024Oct 24, 2024
    • 最全最棒的训练/评测/推理框架 🌟🌟🌟
      Python
      Apache License 2.0
      418000Updated Oct 17, 2024Oct 17, 2024
    • 「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!
      Python
      Apache License 2.0
      386000Updated Oct 15, 2024Oct 15, 2024
    • swarm

      Public
      lightweight multi-agent orchestration. (OpenAI的智能体🌟)
      Python
      MIT License
      1.7k000Updated Oct 15, 2024Oct 15, 2024
    • rtp-llm

      Public
      RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.(大模型lora动态推理框架🌟🌟🌟)
      C++
      Apache License 2.0
      53000Updated Oct 14, 2024Oct 14, 2024
    • Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
      Python
      561000Updated Oct 13, 2024Oct 13, 2024
    • Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
      Python
      250000Updated Oct 10, 2024Oct 10, 2024
    • minimind

      Public
      【大模型】3小时完全从0训练一个仅有26M的小参数GPT,最低仅需2G显卡即可推理训练!(非常好🌟🌟🌟)
      Python
      Apache License 2.0
      386000Updated Sep 27, 2024Sep 27, 2024
    • InternVL

      Public
      [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
      Python
      MIT License
      509000Updated Sep 19, 2024Sep 19, 2024
    • 基于Agent的金融问答系统(非常好的学习项目🌟🌟)
      Python
      Apache License 2.0
      7000Updated Sep 13, 2024Sep 13, 2024
    • XuanYuan

      Public
      轩辕:度小满中文金融对话大模型
      Python
      101000Updated Sep 6, 2024Sep 6, 2024
    • Easy-RAG

      Public
      一个适合学习、使用、自主扩展的RAG【检索增强生成】系统!可联网做AI搜索
      Python
      40000Updated Sep 4, 2024Sep 4, 2024
    • evalscope

      Public
      一个高效的大模型评估和性能基准测试的简化可定制框架(非常好🌟🌟)
      Python
      Apache License 2.0
      38000Updated Aug 28, 2024Aug 28, 2024
    • 总结Prompt&LLM论文,开源数据&模型,AIGC应用
      282000Updated Aug 28, 2024Aug 28, 2024
    • RAGLAB

      Public
      RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
      Python
      34000Updated Aug 26, 2024Aug 26, 2024
    • gpt-neox

      Public
      基于Megatron和DeepSpeed库,在GPU上实现模型并行Transformers模型训练。(非常好)
      Python
      Apache License 2.0
      1k000Updated Aug 24, 2024Aug 24, 2024
    • A simple, easy-to-hack GraphRAG implementation(非常好🌟)
      Python
      192000Updated Aug 17, 2024Aug 17, 2024
    • Transformer Explained: Learn How LLM Transformer Models Work with Interactive Visualization
      JavaScript
      MIT License
      325000Updated Aug 15, 2024Aug 15, 2024
    • 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
      Jupyter Notebook
      Other
      279000Updated Aug 15, 2024Aug 15, 2024
    • blog

      Public
      Public repo for HF blog posts(最全最好的大模型知识 🌟🌟🌟)
      Jupyter Notebook
      779000Updated Aug 14, 2024Aug 14, 2024
    • 🚀 秘塔AI搜索逆向API【特长:超强检索超长输出】,支持高速流式输出、超强联网搜索(全网or学术以及简洁、深入、研究三种模式),零配置部署,多路token支持,仅供测试,如需商用请前往官方开放平台。
      TypeScript
      GNU General Public License v3.0
      114000Updated Aug 13, 2024Aug 13, 2024