Mental Model A skill is a Markdown file that extends Claude’s behavior. Think of it as a persistent instruction set Claude loads on demand — not a script that runs automatically, but a detailed playbook Claude reads and follows when the right task comes up.
How Claude finds and loads skills:
At session start, Claude scans all registered skill directories and loads only each skill’s name and description (the frontmatter). Full content is not loaded yet.
What Is the Shadow Stack? The “shadow stack” is a local inference layer that runs alongside your cloud API usage. Instead of every prompt hitting OpenAI or Anthropic, lightweight or private workloads run on GPUs you already own. You choose the right tier per task.
Three deployment tiers:
Cloud APIs — Claude, GPT-4o, Gemini. Highest quality, per-token cost, zero ops. Local inference — Llama 3, Mistral, Phi-3 on your hardware. Fixed cost after setup, full data sovereignty.
One of the most common requests I get is “Can you pull a list of all users in AD with X information?” Instead of manually clicking through ADUC, let’s automate it.