From 75a192514683e97f49de5a6e0c7998481c087b2a Mon Sep 17 00:00:00 2001 From: Alex Date: Tue, 19 May 2026 01:54:54 -0400 Subject: [PATCH] Revise showcase entries in README Updated the README to improve the description of Kyutai PocketTTS and Supertonic-3. --- README.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index c53b4a81d..43cf288d1 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,4 @@ -![banner.png](banner.png) +d![banner.png](banner.png) # FluidAudio - Transcription, Text-to-speech, VAD, Speaker diarization with CoreML Models @@ -58,8 +58,8 @@ Want to convert your own model? Check [möbius](https://github.com/FluidInferenc | **[Action Phrase Live Production Control](https://www.youtube.com/watch?v=ykcvdTHHmrk)** | Voice-controlled live production workflow using FluidAudio's ASR and speaker diarization to trigger cameras, graphics, and layouts with natural voice commands | | **[talat - VAD, ASR, Speaker ID](https://www.youtube.com/watch?v=OjP4Adrv9_E)** | A video demo showcasing FluidAudio's VAD, two different ASR models, and speaker diarization during a talat.app meeting recording | | **[MagpieTTS Multilingual on iPhone 17 Pro](https://x.com/sach1n/status/2030521943263436934)** | NVIDIA's MagpieTTS Multilingual model converted to CoreML and running on an iPhone 17 Pro | -| **[Kyutai PocketTTS on ANE](https://x.com/sach1n/status/2051375097408360596)** | Kyutai labs PocketTTS in CoreML running on the ANE — fast and background-capable on iOS | -| **[Supertonic-3 on iPhone 17 Pro ANE](https://x.com/sach1n/status/2056476851825250685)** | Supertonic-3 running on iPhone 17 Pro via ANE/CoreML — 2 minutes of audio generated in 3 seconds with low RAM and background support | +| **[Kyutai PocketTTS on ANE](https://x.com/sach1n/status/2051375097408360596)** | Kyutai labs PocketTTS in iOS running fast on the ANE & background-capable | +| **[Supertonic-3 on iPhone 17 Pro ANE](https://x.com/sach1n/status/2056476851825250685)** | Supertonic-3 running on iPhone 17 Pro via ANE/CoreML with 2 minutes of audio generated in 3 seconds with low RAM & background support | ## Showcase