v0.1 Open source / MCP-native / Apache 2.0

Insights inside
your cluster.

One MCP server. Every pillar of your cluster: cost, security, reliability, policy. Ask the questions your team actually asks. Get answers that cross product boundaries, not another dashboard to babysit.

Join as contributor → Star on GitHub

◆ v0.1 tools one tool per intent

◆ SRE · investigate root cause
01 investigate_namespace 5 sources
◆ FinOps · spend cost only
02 namespace_cost opencost
◆ Audit · compliance framework-based
03 audit_namespace pss · cis

      → investigate_namespace("payments")
      ↳ 3 pods unhealthy · deploy d4c7f19
      ↳ api-gateway restart loop · OOMKilled
      ↳ elevated errors from 14:32 UTC
      ↳ ✓ likely cause + rollback ready
    

01Architecture

One prompt. Every source. One answer.

Insigh8s sits between your AI assistant and your Kubernetes stack. Each team asks the question they care about. The MCP fans out to the right sources, joins the data, and returns one answer with remediation guidance.

Workload

Network

Metrics

Cost

Security

Insigh8s MCP

Developer

my deploy failed. what broke?

investigate_namespace · ~3s

SRE

something's wrong in payments

investigate_namespace · ~6s

Security

audit payments against CIS

audit_namespace(cis) · ~4s

FinOps

what does payments cost us?

namespace_cost · ~2s

02The tools

One tool per intent.

Each tool answers one clear question. An SRE investigating a problem, a FinOps engineer tracking spend, and a compliance reviewer running an audit all want different answers, so they get different tools. No god-tool that tries to do everything.

◆ 01

Investigate: what's wrong?

For SREs and platform engineers · root-cause triage

i.1

investigate_namespace(namespace, window)

Unhealthy pods, recent deploys that correlate, error log patterns, unusual flows, admission denials. The 2am triage call, correlated.

kubectl prometheus hubble logs

◆ 02

Spend: what does this cost?

For FinOps and platform leads · cost-only, nothing else

c.1

namespace_cost(namespace, window)

Spend for this namespace over the window, broken down by workload. Week-over-week delta. Top cost drivers ranked.

opencost

◆ 03

Audit: is this compliant?

For security and compliance · pass/fail against a named framework

a.1

audit_namespace(namespace, framework)

Check compliance against a specific framework. v0.1 ships with pod-security-standards-restricted and cis-kubernetes-benchmark. Returns pass/fail per control, violator pods, remediation patches.

kubectl pss cis

a.2

list_audit_frameworks()

Returns the frameworks this server knows how to check, with short descriptions. Your AI calls this first when the user hasn't specified which framework.

metadata

Not raw data. Real triage.

Anyone can call an API. The hard part is knowing what to look at, in what order, with what thresholds, and which finding actually matters. Insigh8s encodes that judgement into every tool.

01 One tool per intent. Investigation returns root cause. Cost returns spend. Audit returns compliance. No tool mixes concerns.
02 Root cause before symptoms. When investigate_namespace finds unhealthy pods, it also shows the deploy that caused them and the error pattern in logs.
03 Audit requires a framework. No "just audit this" catch-all. The caller picks CIS or PSS. If they haven't, the AI asks which one via list_audit_frameworks().
04 Findings ranked by blast radius. Not alphabetical, not severity alone. What actually hurts if you don't fix it today comes first.
05 Remediation commands, not just "look here". Copy-paste patches, policy YAML, kubectl commands. Ship the fix, not the homework.

03Why architecture matters

Judgment belongs in code.
Not in the AI's head.

With a pile of raw MCPs, the AI has to guess which tools to call, how to stitch the results, and what matters. That guess changes every time. With Insigh8s, the orchestration lives inside a tested tool, so the answer is the same whether you're on Claude, GPT, Gemini, or a local Llama.

Six raw MCPs

Intelligence in the AI's head.

Judgment lives in: LLM reasoning

× Different AIs give different answers. Claude, GPT-4, Gemini, and Llama all choose tools and weigh findings differently.
× Same AI, same question, different answer tomorrow. "Audit payments" and "check the payments namespace" produce two different results.
× Can't be tested, versioned, or audited. A compliance team can't review an LLM's reasoning. They need code they can read.
× AI forgets steps. Picks wrong tools. Writes bad queries. Raw JSON dumps back to the AI's context window. Slow, expensive, unreliable.

Insigh8s MCP

Intelligence in the code.

Judgment lives in: tested Go, versioned, auditable

✓ Deterministic across every model. Claude, GPT, Gemini, local Llama: same question in, same answer out.
✓ Prompt variations don't change the result. The AI calls one tool. The orchestration happens in code, not in interpretation.
✓ Versioned, testable, auditable. audit_namespace v1.2 is a reviewable diff. Your security team can read what it checks.
✓ One tool call. Pre-joined, pre-ranked, pre-formatted. Tokens spent on the answer, not on raw JSON bouncing through the AI's context.

Kubernetes triage shouldn't live in an LLM's guess. It should live in code your team can read, test, and trust.

· why we're building this ·

Every team has its own dashboard. Every dashboard answers one question. And when something breaks at 2am, you're still the correlation engine.

You could install a dozen MCP servers. But raw data isn't triage. Answers are.

beforeRaw MCPs

You drive every step.

# you, manually:
kubectl_get_pods("payments")
→ which pods are failing?

kubectl_describe("api-gw-...")
→ OOMKilled, but why now?

kubectl_rollout_history(...)
→ 3 recent deploys, which one?

prometheus_query("rate(...)")
# you're writing PromQL at 2am.

Raw primitives. You still need to know the thresholds, the priorities, what "good" looks like.

with insigh8sOne call

Intent-based tool, pre-joined output.

# your AI, once:
investigate_namespace(
  namespace="payments",
  window="15m"
)

↳ api-gateway OOMKilled x3
↳ correlates to deploy
    d4c7f19 (14:32 UTC)
↳ error rate: 3% → 47%
↳ likely cause + rollback
# ~6 seconds.

One intent, one tool. Cost and compliance questions have their own tools. Each answers what was actually asked.

Built by practitioners · Open source from day one · Apache 2.0

Insights inside
your cluster.

One prompt. Every source. One answer.

One tool per intent.

Not raw data. Real triage.

Judgment belongs in code.
Not in the AI's head.

Intelligence in the AI's head.

Intelligence in the code.

Notes from the build.

Open source from day one.
Help build it.

Insights inside your cluster.

One prompt. Every source. One answer.

One tool per intent.

Not raw data. Real triage.

Judgment belongs in code.Not in the AI's head.

Intelligence in the AI's head.

Intelligence in the code.

Notes from the build.

Open source from day one.Help build it.

Insights inside
your cluster.

Judgment belongs in code.
Not in the AI's head.

Open source from day one.
Help build it.