Distribute and run LLMs with a single file.
Open-source AI camera skills platform, AI NVR, and CCTV surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, and YOLO26. An LLM-powered agentic security camera that watches, understands, remembers, and guards your home via Telegram, Discord, or Slack. Pluggable AI skills; works with OpenAI, Google, Anthropic, or local AI. Runs on a Mac Mini or an AI PC.
Maid is a free and open source application for interfacing with llama.cpp models locally, and with Anthropic, DeepSeek, Ollama, Mistral and OpenAI models remotely.
Lucebox: an LLM inference server built for speed on specific consumer hardware.
The Swiss Army Knife of Offline AI. Chat, speak, and generate images - privacy first, zero internet. Download an LLM and use it on your mobile device; no data ever leaves your phone. Supports text-to-text, vision, and text-to-image.
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
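Generation-level schema enforcement means the sampler is constrained to tokens that keep the output valid, rather than validating text after the fact. A minimal sketch, assuming the node-llama-cpp v3 API (getLlama, createGrammarForJsonSchema, LlamaChatSession); the model path, prompt, and schema are illustrative placeholders and may need adjusting for your version of the library:

```ts
import {getLlama, LlamaChatSession} from "node-llama-cpp";

// Load a local GGUF model through the llama.cpp bindings.
// The path is a placeholder; any chat-tuned GGUF file should work.
const llama = await getLlama();
const model = await llama.loadModel({modelPath: "models/model.gguf"});
const context = await model.createContext();
const session = new LlamaChatSession({contextSequence: context.getSequence()});

// Build a grammar from a JSON schema so only tokens that keep the
// output valid against the schema can be sampled.
const grammar = await llama.createGrammarForJsonSchema({
    type: "object",
    properties: {
        title: {type: "string"},
        rating: {type: "number"}
    }
} as const);

const answer = await session.prompt(
    "Give this project a short title and a rating from 1 to 10.",
    {grammar}
);

// The reply parses cleanly because invalid tokens were never sampled.
console.log(grammar.parse(answer));
```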
Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG
Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.
Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.
llama.cpp Rust bindings.
On-device AI for Android — LLM chat (GGUF/llama.cpp), vision models (VLM), image generation (Stable Diffusion), tool calling, AI personas, RAG knowledge packs, TTS/STT. Fully offline, zero subscriptions, open-source.