Machine Learning Systems
-
Updated
May 24, 2026 - Python
Machine Learning Systems
Production Android AI with ExecuTorch 1.0 - Deploy PyTorch models to mobile with NPU acceleration and 50KB footprint
LLM inference on mobile via Capacitor — run quantized GGUF models on-device
Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
ONNX model execution on iOS and Android via Capacitor
📱 Optimized ML for edge devices. Showcasing efficient model deployment, GPU-CPU memory transfer optimization, and real-world edge AI applications. 🤖
On device streaming voice activity detection (Silero VAD v5) for Android. ~424 KB native binary, NEON-accelerated arm64-v8a, RTF ~3% on Snapdragon 662.
Claude Code skill for Google LiteRT - on-device AI/ML deployment framework
Android ML model server — download management, session caching, accelerator probing
Standalone ONNX runtime session management and preprocessing for Dust — iOS/macOS
Android ONNX runtime session management and preprocessing for Dust
Model download and serving orchestration for Dust — Capacitor bridge
Largest list of models for Core ML (for iOS 11+)
On-device text embedding generation for iOS and Android via Capacitor
Standalone tokenizers and embedding runtime primitives for on-device text embeddings
magnitude-based pruning of MobileNetV2 for learning and production-oriented iOS mobile inference.
Core Capacitor bridge for the Dust on-device ML framework — iOS and Android
Add a description, image, and links to the mobile-ml topic page so that developers can more easily learn about it.
To associate your repository with the mobile-ml topic, visit your repo's landing page and select "manage topics."