High-performance Android SDK for on-device LLM inference (GGUF). Privacy-focused, offline-first, and powered by llama.cpp with a clean Kotlin Coroutines API.
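The entry above advertises a Kotlin Coroutines API over llama.cpp. As a rough sketch of what such a binding usually looks like, the blocking native (JNI) inference call is wrapped in a `suspend` function so callers get structured concurrency instead of callbacks. All names here (`NativeEngine`, `completeBlocking`, `generate`) are hypothetical, not the SDK's actual API:

```kotlin
import kotlin.coroutines.resume
import kotlin.coroutines.suspendCoroutine

// Stand-in for a JNI binding to a native llama.cpp engine.
// In a real SDK this would load a GGUF model and run inference off-device-cloud,
// entirely on the phone; here it just emits fixed tokens for illustration.
class NativeEngine {
    fun completeBlocking(prompt: String, onToken: (String) -> Unit) {
        for (tok in listOf("Hello", ",", " world")) onToken(tok)
    }
}

// Suspend wrapper: bridges the synchronous native call into coroutine land,
// so UI code can `launch { engine.generate(...) }` without blocking the main thread.
suspend fun NativeEngine.generate(prompt: String): String =
    suspendCoroutine { cont ->
        val sb = StringBuilder()
        completeBlocking(prompt) { sb.append(it) }
        cont.resume(sb.toString())
    }

suspend fun main() {
    val engine = NativeEngine()
    println(engine.generate("Say hi"))  // → Hello, world
}
```

In a production binding the native call would run on a background dispatcher and support cancellation; this sketch only shows the callback-to-suspend bridge.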
KnoLo Core is a local-first knowledge base engine built for small language models (SLMs). It packages your documents into a compact .knolo file and enables fully deterministic querying: no embeddings, no vector databases, no cloud services required. Designed for on-device and edge LLM deployments.
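Deterministic, embedding-free retrieval of the kind KnoLo Core describes can be sketched with a plain inverted index: documents are ranked by exact term overlap, with ties broken by document id so results are fully reproducible. This is an illustrative sketch, not KnoLo's actual API or file format:

```kotlin
// Illustrative inverted-index retrieval: no vectors, no cloud calls,
// identical input always yields identical output.
fun tokenize(text: String): List<String> =
    text.lowercase().split(Regex("[^a-z0-9]+")).filter { it.isNotEmpty() }

class KnowledgeIndex(docs: Map<String, String>) {
    // term -> set of document ids containing that term
    private val postings = mutableMapOf<String, MutableSet<String>>()

    init {
        for ((id, body) in docs)
            for (term in tokenize(body).toSet())
                postings.getOrPut(term) { mutableSetOf() }.add(id)
    }

    // Rank by number of distinct query terms matched; break ties by id
    // so the ordering is deterministic across runs and devices.
    fun query(q: String): List<String> {
        val scores = mutableMapOf<String, Int>()
        for (term in tokenize(q).toSet())
            for (id in postings[term].orEmpty())
                scores.merge(id, 1, Int::plus)
        return scores.entries
            .sortedWith(
                compareByDescending<Map.Entry<String, Int>> { it.value }
                    .thenBy { it.key }
            )
            .map { it.key }
    }
}

fun main() {
    val index = KnowledgeIndex(mapOf(
        "doc1" to "Quantization shrinks model weights for edge devices",
        "doc2" to "Edge deployment of small language models",
        "doc3" to "Cloud services and vector databases"
    ))
    println(index.query("edge model quantization"))  // → [doc1, doc2]
}
```

The absence of embeddings trades semantic matching for exact reproducibility, which is often the right call when the retrieved passages feed a small on-device model.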
Documentation for MobileTransformers - a lightweight, modular framework based on ONNX Runtime for running and adapting large language models (LLMs) directly on mobile and edge devices. It supports on-device fine-tuning (PEFT), efficient inference, quantization, weight merging, and direct inference from merged models.
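The "weight merging" step mentioned above is, in LoRA-style PEFT, just the arithmetic W' = W + (α / r) · B·A: once the low-rank adapter (B, A) is folded into the base weight, inference runs on W' alone with no adapter overhead. A minimal sketch of that arithmetic (this is not MobileTransformers code):

```kotlin
typealias Matrix = Array<DoubleArray>

// Naive dense matrix product, sufficient for the small adapter matrices here.
fun matmul(b: Matrix, a: Matrix): Matrix {
    val out = Array(b.size) { DoubleArray(a[0].size) }
    for (i in b.indices)
        for (k in a.indices)
            for (j in a[0].indices)
                out[i][j] += b[i][k] * a[k][j]
    return out
}

// Fold a LoRA adapter into the base weight: W' = W + (alpha / rank) * (B x A).
// After merging, B and A can be discarded and inference uses W' directly.
fun mergeLora(w: Matrix, bMat: Matrix, aMat: Matrix, alpha: Double, rank: Int): Matrix {
    val delta = matmul(bMat, aMat)
    val scale = alpha / rank
    return Array(w.size) { i ->
        DoubleArray(w[0].size) { j -> w[i][j] + scale * delta[i][j] }
    }
}

fun main() {
    val w = arrayOf(doubleArrayOf(1.0, 0.0), doubleArrayOf(0.0, 1.0))  // 2 x 2 base weight
    val b = arrayOf(doubleArrayOf(1.0), doubleArrayOf(2.0))            // 2 x r, rank r = 1
    val a = arrayOf(doubleArrayOf(0.5, 0.5))                           // r x 2
    val merged = mergeLora(w, b, a, alpha = 2.0, rank = 1)
    println(merged.joinToString { it.joinToString(prefix = "[", postfix = "]") })
    // → [2.0, 1.0], [2.0, 3.0]
}
```

Merging matters on mobile because it removes the per-layer adapter matmul at inference time, which is exactly the "direct inference from merged models" capability the description lists.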