ModelScope

All

28 repositories

modelscope-studio
Public
A third-party component library based on Gradio.
python ui gradio antd-design modelscope gradio-custom-component modelscope-studio
Python
•
Apache License 2.0
•8•73•3•0•Updated Jan 26, 2025Jan 26, 2025
data-juicer
Public
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
nlp data-science opendata data-visualization pytorch dataset chinese data-analysis llama gpt
Python
•
Apache License 2.0
•197•3.5k•29•7•Updated Jan 26, 2025Jan 26, 2025
ms-swift
Public
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
agent deploy llama lora liger peft multimodal sft distill dpo
Python
•
Apache License 2.0
•445•5.2k•356•15•Updated Jan 26, 2025Jan 26, 2025
evalscope
Public
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
performance evaluation vlm rag llm
Python
•
Apache License 2.0
•45•377•23•1•Updated Jan 24, 2025Jan 24, 2025
FunASR
Public
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model
Python
•
Other
•816•7.8k•234•9•Updated Jan 24, 2025Jan 24, 2025
DiffSynth-Studio
Public
Enjoy the magic of Diffusion models!
Python
•
Apache License 2.0
•629•6.8k•121•0•Updated Jan 24, 2025Jan 24, 2025
dash-infer
Public
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
cpu cuda llm llm-inference native-engine guided-decoding
C
•
Apache License 2.0
•22•220•5•0•Updated Jan 24, 2025Jan 24, 2025
modelscope
Public
ModelScope: bring the notion of Model-as-a-Service to life.
nlp science cv speech multi-modal python machine-learning deep-learning
Python
•
Apache License 2.0
•755•7.3k•8•3•Updated Jan 23, 2025Jan 23, 2025
ClearerVoice-Studio
Public
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
audio deep-learning speech pytorch speech-separation speech-enhancement noise-suppression speaker-extraction bandwidth-extension speech-super-resolution
Python
•
Apache License 2.0
•150•2.1k•10•3•Updated Jan 23, 2025Jan 23, 2025
PromptScope
Public
Enjoy easier conversations with LLM
prompt multi-modal gpt-4 in-context-learning large-language-models prompt-engineering llms
Python
•
Apache License 2.0
•1•8•0•0•Updated Jan 20, 2025Jan 20, 2025
3D-Speaker
Public
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker cnceleb sdpn
Python
•
Apache License 2.0
•131•1.6k•1•0•Updated Jan 17, 2025Jan 17, 2025
agentscope
Public
Start building LLM-empowered multi-agent applications in an easier way.
agent drag-and-drop chatbot multi-agent multi-modal distributed-agents gpt-4 large-language-models llm llm-agent
Python
•
Apache License 2.0
•364•6k•34•18•Updated Jan 13, 2025Jan 13, 2025
modelscope-agent
Public
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
agent data-science code chatbot android-application multi-agents rag mobile-agents gpts llm
Python
•
Apache License 2.0
•328•2.9k•69•3•Updated Jan 8, 2025Jan 8, 2025
modelscope-classroom
Public
Jupyter Notebook
•
Apache License 2.0
•76•643•1•0•Updated Dec 31, 2024Dec 31, 2024
langchain-modelscope
Public
Langchain integration for ModelScope
Python
•
Apache License 2.0
•1•4•0•0•Updated Dec 27, 2024Dec 27, 2024
facechain
Public
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Jupyter Notebook
•
Apache License 2.0
•866•9.2k•12•2•Updated Dec 10, 2024Dec 10, 2024
scepter
Public
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
generative-model scedit aigc lar-gen stylebooth
Python
•
Apache License 2.0
•27•456•10•2•Updated Dec 7, 2024Dec 7, 2024
MemoryScope
Public
Python
•
Apache License 2.0
•38•377•4•0•Updated Nov 21, 2024Nov 21, 2024
comfyscope
Public
Collection of various Comfy components.
Python
•
Apache License 2.0
•1•4•0•2•Updated Nov 20, 2024Nov 20, 2024
richdreamer
Public
[CVPR2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo：https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
Python
•
Apache License 2.0
•20•436•17•0•Updated Sep 27, 2024Sep 27, 2024
motionagent
Public
MotionAgent is your AI assistent to convert ideas into motion pictures.
Python
•
Apache License 2.0
•36•289•3•1•Updated Sep 2, 2024Sep 2, 2024
FunClip
Public
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm
Python
•
MIT License
•458•4.1k•26•2•Updated Aug 22, 2024Aug 22, 2024
lite-sora
Public
An initiative to replicate Sora
Python
•
Apache License 2.0
•7•103•3•0•Updated Apr 10, 2024Apr 10, 2024
normal-depth-diffusion
Public
Python
•
Apache License 2.0
•9•127•6•0•Updated Feb 7, 2024Feb 7, 2024
FunCodec
Public
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
tts speech-synthesis codec speech-to-text audio-generation encodec voicecloning audio-quantization
Python
•
MIT License
•32•384•20•1•Updated Jan 25, 2024Jan 25, 2024
KAN-TTS
Public
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
modelscope speech tts speech-synthesis
Python
•
MIT License
•85•499•42•1•Updated Dec 28, 2023Dec 28, 2023
AdaSeq
Public
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
natural-language-processing information-extraction chinese-nlp word-segmentation bert sequence-labeling relation-extraction natural-language-understanding entity-typing token-classification
Python
•
Apache License 2.0
•39•429•31•0•Updated Nov 15, 2023Nov 15, 2023
kws-training-suite
Public
Python
•
MIT License
•20•99•7•0•Updated May 26, 2023May 26, 2023