speaker-encoder

Here are 10 public repositories matching this topic...

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Aug 16, 2024
Python

mozilla / TTS

Star

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

python text-to-speech deep-learning speech pytorch tts vocoder tacotron tensorflow2 tacotron2 melgan speaker-encoder dataset-analysis glow-tts multiband-melgan gantts

Updated Nov 9, 2023
Jupyter Notebook

yistLin / dvector

Star

Speaker embedding (d-vector) trained with GE2E loss

pytorch speaker-verification speaker-embedding ge2e torchscript speaker-encoder dvector

Updated Jan 8, 2024
Python

gooofy / zerovox

Star

zero-shot realtime TTS system, fully offline, free and open source

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-synthesis voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts hifigan tts-model

Updated Apr 18, 2025
Python

shaojinding / Adversarial-Many-to-Many-VC

Star

[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna

voice-conversion vctk speaker-encoder adversarial-speaker-recognition speaker-identity

Updated Mar 24, 2023
Python

Mannalol999 / AGI_HER_TTS

Star

🎤 Create high-quality text-to-speech audio using FastSpeech 2 in PyTorch. Easily customize for English and Mandarin with multi-speaker support.

react sorting grid deep-learning encoder vue-table table datatable encode react-grid ai-agents tacotron speaker-encoder glow-tts agentic-workflow agent-frontend ag-ui-protocol

Updated Jun 28, 2026
Python

sekalf / MioTTS-llama.cpp

Star

Create fast, lightweight text-to-speech audio on your CPU with MioTTS-llama.cpp. Convert text to WAV files using customizable voice options.

rust ai deep-learning smart-home hass header-only memory-mapped-file vocoder tacotron miio file-view miot custom-component speaker-encoder home-assistant-integration miot-spec hifigan

Updated Jun 28, 2026
C++

gantasmo / Real-Time-Voice-Cloning

Sponsor

Star

Real-Time-Voice-Cloning using Generative Adversarial Networks. It combines text and speaker and generates natural sounding audio. Some Traditional Methods like Convolutional Neural Networks can contain some limitation it requires large datasets and lengthy processing time, to overcome these limitations we are using GANs to maximize space use.

text-to-speech real-time deep-learning pytorch tts speech-synthesis gan voice-conversion voice-cloning speaker-encoder audio-generation

Updated Jan 28, 2025
Python

JudgedDani / ComfyUI-OmniVoice-TTS

Star

Generate zero-shot multilingual TTS nodes for ComfyUI with voice cloning and voice design, supporting 600+ languages

python agent ocr nintendo cpp emulation speech tts extract-data cemu layout-analysis comfy melgan speaker-encoder glow-tts stable-diffusion comfyui

Updated Jun 28, 2026

Birgittawarming489 / voxtral-tts.c

Star

Run pure C inference for Mistral Voxtral-4B-TTS with mmap safetensors, BF16 weights, and 24kHz WAV output.

Updated Jun 28, 2026
C

Improve this page

Add a description, image, and links to the speaker-encoder topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speaker-encoder topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speaker-encoder

Here are 10 public repositories matching this topic...

coqui-ai / TTS

mozilla / TTS

yistLin / dvector

gooofy / zerovox

shaojinding / Adversarial-Many-to-Many-VC

Mannalol999 / AGI_HER_TTS

sekalf / MioTTS-llama.cpp

gantasmo / Real-Time-Voice-Cloning

JudgedDani / ComfyUI-OmniVoice-TTS

Birgittawarming489 / voxtral-tts.c

Improve this page

Add this topic to your repo