TokenGuard Operator

A Kubernetes operator that continuously audits ServiceAccount (SA) permissions to enforce least-privilege security. It cross-references what permissions are granted via RBAC against what permissions are actually used (from the Kubernetes audit log), producing a live Least Privilege Score and flagging anomalies such as external IP token usage.

Overview

Over-permissioned ServiceAccounts are one of the most common Kubernetes misconfigurations. TokenGuard automates the hard work of identifying permission creep by:

Scanning RBAC — Walks all RoleBindings and ClusterRoleBindings to build a complete list of permissions granted to every ServiceAccount in a target namespace.
Consuming the audit log — Receives Kubernetes audit events via a webhook and records which permissions each ServiceAccount actually exercises.
Scoring — Computes a Least Privilege Score (0–100). A score of 100 means every granted permission is actively used. A low score indicates excess permissions that should be removed.
Alerting — Detects tokens being used from external (non-private) IP addresses, which can indicate a supply-chain compromise or credential leak.

How It Works

┌─────────────────────────────────────────────────────────────────┐
│  Kubernetes Cluster                                             │
│                                                                 │
│   ┌──────────────────┐       Audit Events (POST /audit)        │
│   │  API Server       │ ─────────────────────────────────────► │
│   │  (Audit Webhook)  │                                        │
│   └──────────────────┘       ┌──────────────────────────────┐  │
│                               │   TokenGuard Operator        │  │
│   ┌──────────────────┐        │                              │  │
│   │  RoleBindings /  │ ──────►│  audit.Receiver  (port 9443) │  │
│   │  ClusterRoles    │        │  rbac.Evaluator              │  │
│   └──────────────────┘        │  SAAuditorReconciler         │  │
│                               │  report.Server   (port 9090) │  │
│                               └──────────────┬───────────────┘  │
│                                              │ updates status    │
│                               ┌──────────────▼───────────────┐  │
│                               │  SAAuditor CR                │  │
│                               │  .status.currentScore        │  │
│                               │  .status.usedPermissions     │  │
│                               │  .status.unusedPermissions   │  │
│                               │  .status.anomalies           │  │
│                               └──────────────────────────────┘  │
└─────────────────────────────────────────────────────────────────┘

The reconcile loop runs every 2 minutes by default (configurable via scoringInterval).

The SAAuditor CRD

SAAuditor is a namespaced custom resource in the security.tokenguard.io/v1 API group.

Spec

Field	Type	Required	Description
`targetNamespace`	string	Yes	Namespace whose ServiceAccounts are monitored
`scoringInterval`	string	No	How often to recalculate the score (e.g. `"5m"`)
`alertThreshold`	integer (0–100)	No	Minimum score before triggering an alert

Status (written by the operator)

Field	Type	Description
`currentScore`	integer	Least Privilege Score — `(used / granted) * 100`
`usedPermissions`	[]string	Permissions actually exercised, per ServiceAccount
`unusedPermissions`	[]string	Granted permissions that have never been observed in the audit log
`anomalies`	[]string	Critical findings, e.g. external-IP token usage

Example Resource

apiVersion: security.tokenguard.io/v1
kind: SAAuditor
metadata:
  name: prod-namespace-audit
  namespace: security-ops
spec:
  targetNamespace: production
  scoringInterval: "5m"
  alertThreshold: 80

After the first reconcile, the status will be populated:

status:
  currentScore: 62
  usedPermissions:
    - "my-app: get /core/pods"
    - "my-app: list /apps/deployments"
  unusedPermissions:
    - "my-app: delete /core/pods"
    - "my-app: create /core/secrets"
  anomalies:
    - "CRITICAL: External IP 203.0.113.42 used SA my-app token"

Architecture

my-crd-operator/
├── api/v1/
│   ├── saauditor_types.go        # CRD type definitions (Spec, Status)
│   └── zz_generated.deepcopy.go  # Auto-generated DeepCopy methods
├── cmd/
│   └── main.go                   # Entrypoint — wires together manager, audit receiver, RBAC evaluator
├── internal/controller/
│   └── saauditor_controller.go   # Reconcile loop — scores SAs, writes status
├── pkg/
│   ├── audit/
│   │   └── webhook.go            # HTTP server (:9443) that receives K8s audit events
│   ├── report/
│   │   └── server.go             # HTTP server (:9090) serving the HTML report at /report
│   └── rbac/
│       └── evaluator.go          # Walks RoleBindings/ClusterRoleBindings to compute granted permissions
├── config/
│   ├── crd/                      # Generated CRD manifests
│   ├── rbac/                     # RBAC manifests for the operator itself
│   ├── manager/                  # Deployment manifests
│   ├── default/                  # Kustomize overlay (metrics, webhooks)
│   └── samples/                  # Example SAAuditor resource
└── test/
    ├── e2e/                      # End-to-end tests (Kind)
    └── utils/                    # Test helpers

Key components

audit.Receiver — Implements manager.Runnable. Starts an HTTP server on :9443 (configurable via --audit-webhook-bind-address) that accepts POST /audit requests from the Kubernetes API server's audit webhook backend. Parses EventList payloads, extracts the verb+resource+apiGroup per ServiceAccount, and stores a deduplicated set of UsedPermissions along with all observed source IPs (thread-safe via sync.RWMutex).

report.Server — Implements manager.Runnable. Starts an HTTP server on :9090 (configurable via --report-bind-address) serving a live HTML dashboard at /report. Queries all SAAuditor resources and renders scores, used/unused permissions, and anomalies in a dark-themed UI. Access it via kubectl port-forward or the tokenguard-operator-report Service.

rbac.Evaluator — Walks all RoleBindings (namespace-scoped) and ClusterRoleBindings (cluster-wide) that reference a given ServiceAccount. Resolves both Role and ClusterRole references and formats rules as "verb /apiGroup/resource" strings for direct comparison with audit data.

SAAuditorReconciler — The main controller. For each reconcile:

Lists all ServiceAccounts in spec.targetNamespace
Calls rbac.Evaluator for granted permissions
Calls audit.Receiver for observed permissions
Computes score = (totalUsed / totalGranted) * 100
Checks source IPs for non-private addresses (anomaly detection)
Writes results to SAAuditor.Status and requeues after 2 minutes

Prerequisites

Go 1.25+
Kubernetes cluster v1.35+ (or Kind for local dev)
kubectl configured against your target cluster
Docker (for building images)
The Kubernetes API server configured with an audit webhook pointing to http://<operator-service>:9443/audit

Installation

Using Helm (recommended)

helm upgrade --install tokenguard-operator \
  oci://ghcr.io/didiberman/tokenguard-operator/charts/tokenguard-operator \
  --namespace tokenguard-system --create-namespace

After installation, view the HTML report:

kubectl port-forward svc/tokenguard-operator-report -n tokenguard-system 9090:9090
# open http://localhost:9090/report

Using Kustomize

# Install the CRD
make install

# Build and push the operator image
make docker-build docker-push IMG=<your-registry>/tokenguard:latest

# Deploy to the cluster
make deploy IMG=<your-registry>/tokenguard:latest

Generate a single install manifest

make build-installer IMG=<your-registry>/tokenguard:latest
kubectl apply -f dist/install.yaml

Configure the Kubernetes Audit Webhook

Add the following to your API server's audit policy (--audit-webhook-config-file):

apiVersion: v1
kind: Config
clusters:
  - name: tokenguard
    cluster:
      server: http://<tokenguard-service>.<namespace>.svc.cluster.local:9443/audit
users:
  - name: tokenguard
contexts:
  - name: tokenguard
    context:
      cluster: tokenguard
      user: tokenguard
current-context: tokenguard

Usage

Deploy the operator (see Installation).
Create a SAAuditor resource targeting the namespace you want to audit:

kubectl apply -f - <<EOF
apiVersion: security.tokenguard.io/v1
kind: SAAuditor
metadata:
  name: my-audit
  namespace: default
spec:
  targetNamespace: default
  alertThreshold: 75
EOF

Wait for the first reconcile (up to 2 minutes), then inspect the status:

kubectl get saauditor my-audit -o yaml

Identify unused permissions and tighten RBAC accordingly:

kubectl get saauditor my-audit -o jsonpath='{.status.unusedPermissions}' | tr ',' '\n'

Check for anomalies:

kubectl get saauditor my-audit -o jsonpath='{.status.anomalies}'

View the live HTML report in your browser:

kubectl port-forward svc/tokenguard-operator-report -n <namespace> 9090:9090
# then open http://localhost:9090/report

Anomaly Detection

TokenGuard flags any ServiceAccount token usage originating from a non-private IP address. The following ranges are considered private/internal:

10.x.x.x (RFC 1918)
192.168.x.x (RFC 1918)
127.0.0.1 / ::1 (loopback)
fd... (ULA IPv6)

Any source IP outside these ranges generates a CRITICAL anomaly entry:

CRITICAL: External IP 203.0.113.42 used SA payment-processor token

This pattern targets supply chain compromise scenarios where a malicious dependency, CI runner, or stolen credential is using a ServiceAccount token from outside the cluster.

Metrics

The operator exposes Prometheus metrics over HTTPS on :8443 (secured with mTLS by default). Metrics are protected with Kubernetes authentication and authorization.

Flag	Default	Description
`--audit-webhook-bind-address`	`:9443`	Address the audit webhook receiver binds to
`--report-bind-address`	`:9090`	Address the HTML report server binds to
`--metrics-bind-address`	`0` (disabled)	Set to `:8443` (HTTPS) or `:8080` (HTTP) to enable
`--metrics-secure`	`true`	Serve metrics over HTTPS
`--health-probe-bind-address`	`:8081`	Liveness/readiness probe address
`--leader-elect`	`false`	Enable leader election for HA deployments
`--enable-http2`	`false`	Enable HTTP/2 (disabled by default due to CVE-2023-44487)

Development

Run locally against a cluster

make run

Run unit tests

make test

Run end-to-end tests (requires Kind)

make test-e2e

This creates a local Kind cluster (my-crd-operator-test-e2e), runs the full e2e suite, and tears down the cluster.

Lint

make lint        # Run golangci-lint
make lint-fix    # Auto-fix lint issues

Regenerate manifests and DeepCopy methods

make generate   # Regenerate DeepCopy methods
make manifests  # Regenerate CRD/RBAC manifests from kubebuilder markers

Available Make targets

make help

CI/CD

Workflow	Trigger	Description
`ci.yml`	Push / PR	Full build + unit tests
`lint.yml`	Push / PR	golangci-lint
`test.yml`	Push / PR	Unit + integration tests with envtest
`test-e2e.yml`	Push / PR	End-to-end tests on Hetzner (secure ephemeral cluster)
`release.yml`	Tag push	Build and publish container image

License

Apache License 2.0. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.devcontainer		.devcontainer
.github		.github
api/v1		api/v1
charts/tokenguard-operator		charts/tokenguard-operator
cmd		cmd
config		config
hack		hack
internal/controller		internal/controller
pkg		pkg
test		test
.custom-gcl.yml		.custom-gcl.yml
.dockerignore		.dockerignore
.gitignore		.gitignore
.golangci.yml		.golangci.yml
Dockerfile		Dockerfile
Makefile		Makefile
PROJECT		PROJECT
README.md		README.md
e2e_logs.txt		e2e_logs.txt
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TokenGuard Operator

Table of Contents

Overview

How It Works

The SAAuditor CRD

Spec

Status (written by the operator)

Example Resource

Architecture

Key components

Prerequisites

Installation

Using Helm (recommended)

Using Kustomize

Generate a single install manifest

Configure the Kubernetes Audit Webhook

Usage

Anomaly Detection

Metrics

Development

Run locally against a cluster

Run unit tests

Run end-to-end tests (requires Kind)

Lint

Regenerate manifests and DeepCopy methods

Available Make targets

CI/CD

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TokenGuard Operator

Table of Contents

Overview

How It Works

The SAAuditor CRD

Spec

Status (written by the operator)

Example Resource

Architecture

Key components

Prerequisites

Installation

Using Helm (recommended)

Using Kustomize

Generate a single install manifest

Configure the Kubernetes Audit Webhook

Usage

Anomaly Detection

Metrics

Development

Run locally against a cluster

Run unit tests

Run end-to-end tests (requires Kind)

Lint

Regenerate manifests and DeepCopy methods

Available Make targets

CI/CD

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages