Open Source

Building AI infrastructure tools in the open. These projects aim to solve real problems in the ML/AI platform space and are designed for potential CNCF contribution.

Featured Projects

In ProgressFeatured

Kortex

Kubernetes-native AI inference gateway for multi-model routing, A/B testing, and intelligent failover. Features circuit breakers with exponential backoff, OpenTelemetry tracing, smart routing (cost/latency/context-length), and configuration hot-reload. CNCF Sandbox candidate.

Progress

CRDsRoutingA/B TestingFallbacksMetricsRate LimitingCost TrackingOpenTelemetrySmart RoutingCircuit BreakersE2E TestsHelm Chart

GoKubebuilderKubernetesKServeOpenTelemetry+1 more

GitHub

CompletedFeatured

AI Infrastructure FinOps Platform

Production-ready cost optimization platform for AI/ML workloads. Features GPU utilization monitoring, budget forecasting with alerts, ML-based anomaly detection, automated right-sizing recommendations, and multi-cloud billing integration. All 3 phases complete.

Progress

MVPBudget & AlertsML AnalyticsChargeback ReportsAWS Billing APIRight-Sizing Engine

NVIDIA DCGMOpenCostPrometheusGrafanaPython/Flask+2 more

GitHub

CompletedFeatured

MLOps Platform on Kubernetes

Production-ready multi-cloud MLOps platform on AWS EKS, Azure AKS, and GCP GKE with defense-in-depth security and full-stack observability. Enables data science teams to deploy ML models, HuggingFace transformers, and LLMs from experimentation to production in 15 minutes with full auditability, drift detection, and GitOps-driven infrastructure.

Progress

CoreMLflowKServeCI/CDGPUSecurityDrift DetectionAzure SupportGCP SupportLLM InferenceHuggingFaceGitOpsObservabilityBackup & DRChaos TestingDocs

Argo WorkflowsMLflow 3.xKServevLLMHuggingFace+15 more

GitHub

PlannedFeatured

SpotTensor

GPU compute price aggregator — "Trivago for ML training". Arbitrages spot pricing across AWS, RunPod, and Lambda Labs to find the cheapest GPU instances for batch training jobs.

Progress

AWS Spot ConnectorRunPod IntegrationLambda Labs IntegrationGPU Normalization SchemaPrice Comparison CLIRecommendation Engine

GoAWS Price List APIREST APIsCLIPostgreSQL

GitHub

PlannedFeatured

AgentFile

Docker Compose for AI Agents — Declarative spec that deploys AI agent stacks to Kubernetes. GitOps-native with Kortex integration for inference governance. Transparent abstraction: generates readable K8s manifests you own.

Progress

Schema ParserCLI ScaffoldRAG StackManifest GeneratorKind SupportKortex Integration

GoKubernetesKustomizeHelmKServe+2 more

GitHub

Contributions

Contributions to CNCF and other open source projects coming soon.

Currently focused on building these projects to production-ready status before contributing upstream.

Reference Architecture

Complete MLOps platform showing how all the pieces fit together.

View All Repositories

Check out my GitHub profile for more projects and contributions.

GitHub Profile

Open Source

Building AI infrastructure tools in the open. These projects aim to solve real problems in the ML/AI platform space and are designed for potential CNCF contribution.

Featured Projects

In ProgressFeatured

Kortex

Progress

CRDsRoutingA/B TestingFallbacksMetricsRate LimitingCost TrackingOpenTelemetrySmart RoutingCircuit BreakersE2E TestsHelm Chart

GoKubebuilderKubernetesKServeOpenTelemetry+1 more

GitHub

CompletedFeatured

AI Infrastructure FinOps Platform

Progress

MVPBudget & AlertsML AnalyticsChargeback ReportsAWS Billing APIRight-Sizing Engine

NVIDIA DCGMOpenCostPrometheusGrafanaPython/Flask+2 more

GitHub

CompletedFeatured

MLOps Platform on Kubernetes

Progress

CoreMLflowKServeCI/CDGPUSecurityDrift DetectionAzure SupportGCP SupportLLM InferenceHuggingFaceGitOpsObservabilityBackup & DRChaos TestingDocs

Argo WorkflowsMLflow 3.xKServevLLMHuggingFace+15 more

GitHub

PlannedFeatured

SpotTensor

GPU compute price aggregator — "Trivago for ML training". Arbitrages spot pricing across AWS, RunPod, and Lambda Labs to find the cheapest GPU instances for batch training jobs.

Progress

AWS Spot ConnectorRunPod IntegrationLambda Labs IntegrationGPU Normalization SchemaPrice Comparison CLIRecommendation Engine

GoAWS Price List APIREST APIsCLIPostgreSQL

GitHub

PlannedFeatured

AgentFile

Progress

Schema ParserCLI ScaffoldRAG StackManifest GeneratorKind SupportKortex Integration

GoKubernetesKustomizeHelmKServe+2 more

GitHub

Contributions

Contributions to CNCF and other open source projects coming soon.

Currently focused on building these projects to production-ready status before contributing upstream.

Reference Architecture

Complete MLOps platform showing how all the pieces fit together.

View All Repositories

Check out my GitHub profile for more projects and contributions.

GitHub Profile