    Repositories list

    • ML models for Uberduck
      Roff
      Apache License 2.0
      6137838 Updated Oct 17, 2024
    • openduck

      Public
      Building an open-source interactive AI plush toy.
      Python
      MIT License
      1902 Updated Apr 5, 2024
    • TypeScript
      0000 Updated Feb 10, 2024
    • TypeScript
      MIT License
      0000 Updated Jan 27, 2024
    • The official implementation of HierSpeech++
      Python
      Other
      134100 Updated Dec 19, 2023
    • Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
      Python
      MIT License
      2.1k000 Updated Dec 16, 2023
    • RWKV-infctx for audio generation
      Jupyter Notebook
      Apache License 2.0
      28000 Updated Nov 27, 2023
    • Simple text to phones converter for multiple languages
      Python
      GNU General Public License v3.0
      172000 Updated Nov 24, 2023
    • yomikata

      Public
      Disambiguate Japanese heteronyms
      Python
      MIT License
      4000 Updated Oct 3, 2023
    • rvc

      Public
      Python
      0100 Updated Jun 12, 2023
    • Voice data <= 10 mins can also be used to train a good VC model!
      Python
      MIT License
      3.6k300 Updated Apr 27, 2023
    • whisper

      Public
      Robust Speech Recognition via Large-Scale Weak Supervision
      Python
      MIT License
      8.4k000 Updated Apr 24, 2023
    • demucs

      Public
      Code for the paper Hybrid Spectrogram and Waveform Source Separation
      Python
      MIT License
      1.1k000 Updated Apr 24, 2023
    • radtts

      Public
      Provides training, inference, and voice conversion recipes for RADTTS and RADTTS++: flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over Low-Dimensional (F0 and Energy) Speech Attributes.
      Roff
      MIT License
      40200 Updated Apr 6, 2023
    • Tileable Stable Diffusion - Cog model
      Python
      324000 Updated Apr 4, 2023
    • An STFT/iSTFT for PyTorch.
      Python
      BSD 3-Clause "New" or "Revised" License
      50000 Updated Mar 29, 2023
    • NeMo

      Public
      NeMo: a toolkit for conversational AI
      Jupyter Notebook
      Apache License 2.0
      2.5k500 Updated Mar 29, 2023
    • riffusion

      Public
      Stable diffusion for real-time music generation
      Python
      MIT License
      390100 Updated Mar 27, 2023
    • Tools to train a generative model on arbitrary audio samples
      Jupyter Notebook
      MIT License
      175000 Updated Mar 21, 2023
    • g2p

      Public
      g2p: English Grapheme To Phoneme Conversion
      Python
      Apache License 2.0
      129100 Updated Mar 15, 2023
    • uberduct

      Public
      CMU US English Dictionary
      Python
      Other
      148501 Updated Mar 15, 2023
    • Python
      MIT License
      3900 Updated Mar 15, 2023
    • PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
      Jupyter Notebook
      Apache License 2.0
      51300 Updated Jan 19, 2023
    • Monotonic Alignment Search
      Cython
      MIT License
      14000 Updated Sep 6, 2022
    • Morph Text in Remotion
      TypeScript
      3100 Updated Apr 7, 2022
    • Remotion adaptation of https://github.com/winkerVSbecks/3d-particle-effects-demo as requested in Discord
      TypeScript
      MIT License
      4000 Updated Mar 29, 2022
    • 3d-text

      Public
      TypeScript
      3100 Updated Mar 27, 2022
    • 🎶 Spotify Wrapped recreated in Remotion 🎥
      TypeScript
      11100 Updated Mar 26, 2022
    • audiogram

      Public
      Turn audio into a shareable video.
      JavaScript
      MIT License
      335100 Updated Jan 14, 2022
    • Streamlit app to visualize and edit TTS datasets
      Python
      51400 Updated Dec 15, 2021