ModelScope

All

26 repositories

ms-swift
Public
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
agent reflection deploy llama lora liger peft multimodal sft megatron
Python
•
Apache License 2.0
•377•4.3k•218•9•Updated Nov 18, 2024Nov 18, 2024
evalscope
Public
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
performance evaluation vlm rag llm
Python
•
Apache License 2.0
•31•248•31•2•Updated Nov 18, 2024Nov 18, 2024
modelscope-studio
Public
A third-party component library based on Gradio.
Python
•
Apache License 2.0
•5•44•1•0•Updated Nov 18, 2024Nov 18, 2024
DiffSynth-Studio
Public
Enjoy the magic of Diffusion models!
Python
•
Apache License 2.0
•600•6.6k•109•0•Updated Nov 18, 2024Nov 18, 2024
facechain
Public
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Jupyter Notebook
•
Apache License 2.0
•856•9.1k•16•3•Updated Nov 18, 2024Nov 18, 2024
data-juicer
Public
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！
nlp data-science opendata data-visualization pytorch dataset chinese data-analysis llama gpt
Python
•
Apache License 2.0
•178•2.9k•27•16•Updated Nov 18, 2024Nov 18, 2024
FunASR
Public
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model
Python
•
Other
•744•7k•161•7•Updated Nov 15, 2024Nov 15, 2024
modelscope
Public
ModelScope: bring the notion of Model-as-a-Service to life.
nlp science cv speech multi-modal python machine-learning deep-learning
Python
•
Apache License 2.0
•721•7k•11•1•Updated Nov 14, 2024Nov 14, 2024
ClearVoice
Public
ClearVoice
Apache License 2.0
•0•7•0•0•Updated Nov 14, 2024Nov 14, 2024
modelscope-agent
Public
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
agent data-science code chatbot android-application multi-agents rag mobile-agents gpts llm
Python
•
Apache License 2.0
•310•2.7k•68•0•Updated Nov 13, 2024Nov 13, 2024
agentscope
Public
Start building LLM-empowered multi-agent applications in an easier way.
agent drag-and-drop chatbot multi-agent multi-modal distributed-agents gpt-4 large-language-models llm llm-agent
Python
•
Apache License 2.0
•324•5.3k•28•21•Updated Nov 11, 2024Nov 11, 2024
scepter
Public
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
generative-model scedit aigc lar-gen stylebooth
Python
•
Apache License 2.0
•22•427•7•0•Updated Nov 7, 2024Nov 7, 2024
modelscope-classroom
Public
Jupyter Notebook
•
Apache License 2.0
•58•503•0•0•Updated Nov 1, 2024Nov 1, 2024
3D-Speaker
Public
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker rdino cnceleb
Python
•
Apache License 2.0
•104•1.2k•2•0•Updated Oct 29, 2024Oct 29, 2024
MemoryScope
Public
Python
•
Apache License 2.0
•26•307•3•0•Updated Oct 21, 2024Oct 21, 2024
comfyscope
Public
Collection of various Comfy components.
Python
•
Apache License 2.0
•1•3•0•2•Updated Oct 8, 2024Oct 8, 2024
richdreamer
Public
Live Demo：https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
Python
•
Apache License 2.0
•18•411•16•0•Updated Sep 27, 2024Sep 27, 2024
motionagent
Public
MotionAgent is your AI assistent to convert ideas into motion pictures.
Python
•
Apache License 2.0
•35•284•3•1•Updated Sep 2, 2024Sep 2, 2024
dash-infer
Public
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.
cpu llm llm-inference native-engine
C++
•
Apache License 2.0
•15•137•5•0•Updated Aug 27, 2024Aug 27, 2024
FunClip
Public
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm
Python
•
MIT License
•404•3.7k•20•2•Updated Aug 22, 2024Aug 22, 2024
lite-sora
Public
An initiative to replicate Sora
Python
•
Apache License 2.0
•6•99•3•0•Updated Apr 10, 2024Apr 10, 2024
normal-depth-diffusion
Public
Python
•
Apache License 2.0
•8•125•5•0•Updated Feb 7, 2024Feb 7, 2024
FunCodec
Public
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
tts speech-synthesis codec speech-to-text audio-generation encodec voicecloning audio-quantization
Python
•
MIT License
•30•369•20•1•Updated Jan 25, 2024Jan 25, 2024
KAN-TTS
Public
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
modelscope speech tts speech-synthesis
Python
•
MIT License
•79•494•41•1•Updated Dec 28, 2023Dec 28, 2023
AdaSeq
Public
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
natural-language-processing information-extraction chinese-nlp word-segmentation bert sequence-labeling relation-extraction natural-language-understanding entity-typing token-classification
Python
•
Apache License 2.0
•38•416•29•0•Updated Nov 15, 2023Nov 15, 2023
kws-training-suite
Public
Python
•
MIT License
•17•81•7•0•Updated May 26, 2023May 26, 2023