Featured on Locally Hosted: Run Full LLMs Locally with Microsoft Foundry Local 🎬 #744

ryanstorandt · 2026-05-29T21:47:12Z

ryanstorandt
May 29, 2026

I recently hosted an episode of my Locally Hosted podcast where we dove deep into your work on Foundry Local. We really appreciated how you’ve simplified running full LLMs locally, and we highlighted its seamless support for chat-completions, GPU acceleration, and private AI workflows.

For production, we ran our entire pipeline locally on an NVIDIA DGX Spark. We generated the AI dialogue with Qwen 3.6, created the visuals using CogVideoX-2B, and synthesized the narration with Edge-TTS voices—all without leaving the machine.

We’re also working on open-sourcing our complete pipeline soon so the community can build on it. Thank you so much for your incredible contribution to the local AI ecosystem. It’s truly inspiring to see projects like yours empowering developers to run powerful models privately and efficiently. Looking forward to future updates!

🎬 Catch the episode here: https://youtu.be/EXLAV7vAFwI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Featured on Locally Hosted: Run Full LLMs Locally with Microsoft Foundry Local 🎬 #744

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Uh oh!

Featured on Locally Hosted: Run Full LLMs Locally with Microsoft Foundry Local 🎬 #744

Uh oh!

ryanstorandt May 29, 2026

Replies: 0 comments

ryanstorandt
May 29, 2026