Change the repository type filter
All
Repositories list
26 repositories
- Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
- A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
- ModelScope: bring the notion of Model-as-a-Service to life.
- ModelScope-Agent: An agent framework connecting models in ModelScope with the world
agentscope
PublicStart building LLM-empowered multi-agent applications in an easier way.- SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
MemoryScope
Publicdash-infer
PublicDashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.- Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
lite-sora
Public- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
- AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models