
Your entire knowledge base, instantly askable.
A private, grounded AI assistant for your second brain. Primarily grounded in your local notes, with optional web search for the gaps.
Native integration with every frontier model, plus fully local models
Bring your own API key — or run 100% offline with a local model.
Notes are where great ideas go to hide.
You've spent years building a digital garden in Obsidian or Markdown. But when you actually need an answer, you're stuck digging through folders, chasing broken tags, and losing your flow. Your “Second Brain” has become a digital junk drawer.
You don't need more notes.
You need a way to talk to the ones you already have.
One keypress to remember everything.
QuickAI brings the power of a large language model to your local file system. No training on your private data. No server of ours in the path. Just instant recall of every thought you've ever recorded.
How it works
Five primitives, one keypress.
Every answer traces back to a file.
Grounded, not guessed. Every claim ships with a clickable link to the exact markdown file it came from.
Today's chat is tomorrow's context.
Save any chat back to your notes as a summarized markdown file. Your assistant's memory compounds.
Highlight, then reply.
Highlight text anywhere on your Mac, hit Reply, and drop it in as context.
We never see your notes.
On-device by default. Bring your own key (BYOK) for frontier models. No server of ours in the path.
Your model, your key.
Plug in GPT-5.4, Claude 4.7, Gemini 3.1, or whatever you already pay for. Or run fully offline with a local MoE model via Ollama. Your API keys stay in your keychain. We never see them.
- GPT-5.4 · OpenAI · your key
- Claude 4.7 Opus · Anthropic · your key
- Gemini 3.1 Pro · Google · your key
- Llama 4 Scout · Ollama · local · offline
- Qwen 3.6 Agent · Ollama · local · offline

Stop searching. Start synthesizing.
This is the workflow you were promised. A silent partner that knows exactly what you know.
Pricing
One-time payment. Yours forever.
Every tier is a lifetime license with one year of free updates. We never see your notes.
Local-first Inference
Run fully offline via Ollama, or bring your own key when you want frontier power. We never route your notes through a server of ours — because we don't run one.
Instant Recall
Find answers across thousands of files in milliseconds. No cloud latency, no loading spinners.
Zero Subscriptions
Pay once, own it forever. We believe premium tools shouldn't be a rent-trap for your knowledge.
- Lifetime license
- 1 year of free updates
- Unlimited knowledge bases
- Unlimited Save to knowledge
- Multi-model picker
- Direct line to the founder
- Lifetime license
- 1 year of free updates
- Unlimited knowledge bases
- Unlimited Save to knowledge
- Multi-model picker
Unlocks when Early Bird sells out.
- Lifetime license
- 1 year of free updates
- Unlimited knowledge bases
- Unlimited Save to knowledge
- Multi-model picker
Unlocks when Early Bird sells out.
Common questions.
Everything you need to know about privacy, licensing, and local AI.
How does 'Local Inference' actually work?
With a local model, QuickAI runs inference on your Mac's hardware and searches your files locally. When you ask a question, the 'thinking' happens on your CPU/GPU, not in the cloud. If you bring your own key, your request goes straight to the provider you chose. Either way, we never route your notes through a server of ours, because we don't have one.
Which AI models do you support?
Run fully offline with Llama 4 Scout, Qwen 3.6, or Gemma via Ollama or LM Studio. Or 'Bring Your Own Key' for GPT-5.4, Claude 4.7 Opus, or Gemini 3.1 Pro when you want frontier power. Either way, we never see your traffic.
What does 'Lifetime License' really mean?
You pay once, and the version of QuickAI you buy is yours forever. No monthly 'rent' for your own knowledge. This license includes one full year of feature updates. After that, you can keep using your version forever or renew for another year of updates.
Does it work with my existing notes?
Yes. QuickAI is a 'read-only' layer that sits on top of your existing folders. It works with Obsidian, Apple Notes, and any folder of Markdown or PDF files. No vendor lock-in, ever.