mlc-llm
Here are 14 public repositories matching this topic...
codebase for "MELTing Point: Mobile Evaluation of Language Transformers"
-
Updated
Jul 19, 2024 - Python
A React Native application that enables on-device Large Language Model (LLM) chat functionality with intelligent context management. Built with TypeScript and modern React Native practices.
-
Updated
Aug 26, 2025 - TypeScript
Privacy-first on-device LLM inference for Android. Real-time AI conversations without data leaving your phone. Powered by MLC-LLM.
-
Updated
Feb 13, 2026 - Kotlin
Pre-built TVM Windows x64 binaries with LLVM support - enables MLC-LLM model conversion on Windows
-
Updated
Nov 15, 2025
Inference runtime benchmarks, configs, and production setup for a Jetson Orin Nano Super 8GB — llama.cpp, ONNX Runtime, MLC-LLM, Gemma 4, multimodal
-
Updated
Apr 23, 2026 - Python
Android agent runtime and workflow compiler for local LLM-powered screen/API automation
-
Updated
May 31, 2026
Reproducible research lab for browser-local small-model RAG with WebGPU/WebLLM runtimes, evidence-packet compression, latency benchmarks, and rights-aware archive fixtures.
-
Updated
Jun 11, 2026 - JavaScript
Improve this page
Add a description, image, and links to the mlc-llm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the mlc-llm topic, visit your repo's landing page and select "manage topics."