Cloud Native Stack


Cloud Native Stack (CNS) provides validated configuration guidance for deploying GPU-accelerated Kubernetes infrastructure. It captures known-good combinations of software, configuration, and system requirements and makes them consumable as documentation and generated deployment artifacts.

Why We Built This

Running GPU-accelerated Kubernetes clusters reliably is hard. Small differences in kernel versions, drivers, container runtimes, operators, and Kubernetes releases can cause failures that are difficult to diagnose and expensive to reproduce.

Historically, this knowledge has lived in internal validation pipelines, playbooks, and tribal knowledge. Cloud Native Stack exists to externalize that experience. Its goal is to make validated configurations visible, repeatable, and reusable across environments.

What Cloud Native Stack Is (and Is Not)

Cloud Native Stack is a source of validated configuration knowledge for NVIDIA-accelerated Kubernetes environments.

It is:

  • A curated set of tested and validated component combinations
  • A reference for how NVIDIA-accelerated Kubernetes clusters are expected to be configured
  • A foundation for generating reproducible deployment artifacts
  • Designed to integrate with existing provisioning, CI/CD, and GitOps workflows

It is not:

  • A Kubernetes distribution
  • A cluster provisioning or lifecycle management system
  • A managed control plane or hosted service
  • A replacement for cloud provider or OEM platforms

How It Works

Cloud Native Stack separates validated configuration knowledge from how that knowledge is consumed.

  • Human-readable documentation lives under docs/.
  • Version-locked configuration definitions (“recipes”) capture known-good system states.
  • Those definitions can be rendered into concrete artifacts such as Helm values, Kubernetes manifests, or install scripts.
  • Recipes can be validated against actual system configurations to verify compatibility.

This separation allows the same validated configuration to be applied consistently across different environments and automation systems.

For example, a configuration validated for gb200 on Ubuntu 22.04 with Kubernetes 1.29 can be rendered into Helm values and manifests suitable for use in an existing GitOps pipeline.
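To make the idea concrete, a recipe might pin component versions for a given platform. The fragment below is a hypothetical sketch; the field names and versions are illustrative only and do not reflect the actual CNS recipe schema or any validated guidance:

```yaml
# Hypothetical recipe sketch (illustrative fields, not the CNS schema).
platform: gb200
os: ubuntu-22.04
kubernetes: "1.29"
components:
  gpu-driver: "550.54.15"     # example version, not validated guidance
  container-toolkit: "1.15.0" # example version, not validated guidance
  gpu-operator: "v24.3.0"     # example version, not validated guidance
```

A renderer would turn a definition like this into Helm values or manifests that an existing pipeline consumes unchanged.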

Get Started

Some tooling and APIs are under active development; documentation reflects current and near-term capabilities.

Installation

macOS (Homebrew):

brew install mchmarny/cloud-native-stack/cnsctl

Linux/macOS (script):

curl -sfL https://raw.githubusercontent.com/nvidia/cloud-native-stack/refs/heads/main/install | bash -s --

See Installation Guide for manual installation, building from source, and container images.
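After installing by either method, you can confirm the binary is reachable. The `cnsctl version` call below is an assumption based on common CLI conventions, so adjust to whatever the CLI Reference documents:

```shell
# Check that cnsctl resolved onto PATH; report either way.
if command -v cnsctl >/dev/null 2>&1; then
  status="cnsctl installed at $(command -v cnsctl)"
  cnsctl version || true   # assumed subcommand; see the CLI Reference
else
  status="cnsctl not on PATH; check the install target directory"
fi
echo "$status"
```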

Quick Start

Get started quickly with CNS:

  1. Review the documentation under docs/ to understand supported platforms and required components.
  2. Identify your target environment:
    • GPU architecture
    • Operating system and kernel
    • Kubernetes distribution and version
    • Workload intent (for example, training or inference)
  3. Apply the validated configuration guidance using your existing tools (Helm, kubectl, CI/CD, or GitOps).
  4. Validate and iterate as platforms and workloads evolve.
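As a sketch of step 3, rendered configuration can be handed to Helm in an existing pipeline. The values below are illustrative placeholders rather than validated CNS output, and the commented chart invocation assumes the standard NVIDIA GPU Operator chart:

```shell
# Write illustrative values (placeholder content, not validated CNS output).
cat > cns-values.yaml <<'EOF'
driver:
  version: "550.54.15"   # placeholder driver version
toolkit:
  enabled: true
EOF

# Hand the values file to your existing tooling, for example:
#   helm upgrade --install gpu-operator nvidia/gpu-operator \
#     --namespace gpu-operator --create-namespace -f cns-values.yaml
echo "wrote cns-values.yaml"
```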

Get Started by Use Case

These use cases reflect common ways teams interact with Cloud Native Stack.

Platform and Infrastructure Operators

You are responsible for deploying and operating GPU-accelerated Kubernetes clusters.

  • Installation Guide – Install the cnsctl CLI (automated script, manual, or build from source)
  • CLI Reference – Complete command reference with examples
  • API Reference – Complete API reference with examples
  • Agent Deployment – Deploy the Kubernetes agent to get automated configuration snapshots

Developers and Contributors

You are contributing code, extending functionality, or working on CNS internals.

Integrators and Automation Engineers

You are integrating CNS into CI/CD pipelines, GitOps workflows, or a larger product or service.

Project Structure

  • api/ — OpenAPI specifications for the REST API
  • cmd/ — Entry points for CLI (cnsctl) and API server (cnsd)
  • deployments/ — Kubernetes manifests for agent deployment
  • docs/ — User-facing documentation, guides, and architecture docs
  • examples/ — Example snapshots, recipes, and comparisons
  • infra/ — Infrastructure as code (Terraform) for deployments
  • pkg/ — Core Go packages (collectors, recipe engine, bundlers, serializers)
  • tools/ — Build scripts, E2E testing, and utilities

Documentation & Resources

  • Documentation – Guides, references, and examples
  • Roadmap – Feature priorities and development timeline
  • Overview – Detailed system overview and glossary
  • Security – Security-related resources
  • Releases – Binaries, SBOMs, and other artifacts
  • Issues – Bugs, feature requests, and questions

Contributing

Contributions are welcome. See contributing for development setup, contribution guidelines, and the pull request process.
