DeepLoop AI Gateway

A programmable AI gateway library for Go, fully compatible with the OpenAI API and OpenResponses API specifications.

Features

Dual API Compatibility: OpenAI API + OpenResponses specification
OpenResponses Endpoint: POST /v1/responses with semantic streaming events
OpenAI Endpoints: Chat Completions, Embeddings, Images
Real Streaming Support: Server-Sent Events with OpenResponses semantic events
Flexible Hook System: Extend request/response processing at any stage
Provider Abstraction: Support multiple LLM providers with dynamic routing
Model Management: Model name rewriting and provider mapping
Library-first Design: Embed directly into your Go application

Installation

go get github.com/deeplooplabs/ai-gateway

Quick Start

package main

import (
    "log"
    "net/http"

    "github.com/deeplooplabs/ai-gateway/gateway"
    "github.com/deeplooplabs/ai-gateway/model"
    "github.com/deeplooplabs/ai-gateway/provider"
)

func main() {
    // Setup provider
    openAI := provider.NewHTTPProviderWithBaseURLAndPath(
        "https://api.openai.com/v1",
        "your-api-key",
        "/v1",  // Strip /v1 from endpoint
    )

    // Configure models
    registry := model.NewMapModelRegistry()
    registry.Register("gpt-4", openAI)

    // Create gateway
    gw := gateway.New(
        gateway.WithModelRegistry(registry),
    )

    // Serve
    http.Handle("/v1/", gw)
    log.Fatal(http.ListenAndServe(":8080", nil))
}

Supported Endpoints

OpenResponses API

The gateway implements the OpenResponses specification with full streaming support:

# Non-streaming
curl http://localhost:8080/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4",
    "input": "Hello, how are you?"
  }'

# Streaming
curl http://localhost:8080/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4",
    "input": "Count to 10",
    "stream": true
  }'

OpenResponses Features:

Semantic streaming events (response.created, response.in_progress, response.output_text.delta, etc.)
Message items with role-based content
Tool calling support
Proper error format with type, message, param fields
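
As a rough illustration of the error format mentioned above, the sketch below decodes an error body into a Go value. Only the field names type, message, and param come from this README. The {"error": {...}} envelope, the APIError and ParseError names, and the sample values are assumptions made for the example, not the gateway's confirmed wire format.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// APIError carries the three fields named in the feature list.
type APIError struct {
	Type    string `json:"type"`
	Message string `json:"message"`
	Param   string `json:"param,omitempty"`
}

// Error implements the error interface so an APIError can be returned directly.
func (e *APIError) Error() string {
	if e.Param != "" {
		return fmt.Sprintf("%s: %s (param %q)", e.Type, e.Message, e.Param)
	}
	return fmt.Sprintf("%s: %s", e.Type, e.Message)
}

// ParseError decodes an error body.
// Assumption: the error object is wrapped in an "error" key.
func ParseError(body []byte) (*APIError, error) {
	var envelope struct {
		Error *APIError `json:"error"`
	}
	if err := json.Unmarshal(body, &envelope); err != nil {
		return nil, err
	}
	if envelope.Error == nil {
		return nil, fmt.Errorf("no error object in body")
	}
	return envelope.Error, nil
}

func main() {
	// Illustrative payload only; the values are made up for this sketch.
	body := []byte(`{"error":{"type":"invalid_request_error","message":"model is required","param":"model"}}`)
	apiErr, err := ParseError(body)
	if err != nil {
		panic(err)
	}
	fmt.Println(apiErr)
}
```

If the gateway returns the error object without an envelope, decoding straight into APIError would be the equivalent step.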

OpenAI API (Compatible)

# Chat Completions
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

# Embeddings
curl http://localhost:8080/v1/embeddings \
  -H "Content-Type: application/json" \
  -d '{
    "input": "Hello world",
    "model": "text-embedding-3-small"
  }'

# Images (DALL-E)
curl http://localhost:8080/v1/images/generations \
  -H "Content-Type: application/json" \
  -d '{
    "model": "dall-e-3",
    "prompt": "a cat"
  }'

Streaming

OpenResponses Streaming

OpenResponses uses semantic events for streaming:

event: response.created
data: {"type":"response.created","response_id":"resp_abc123",...}

event: response.output_text.delta
data: {"type":"response.output_text.delta","delta":"Hello","sequence_number":1,...}

event: response.completed
data: {"type":"response.completed",...}

data: [DONE]

OpenAI Streaming

Traditional OpenAI-style streaming is also supported:

req := openai.ChatCompletionRequest{
    Model:    "gpt-4",
    Messages: messages,
    Stream:   true,
}

OpenResponses Request Format

{
  "model": "gpt-4",
  "input": "Your prompt here",
  "stream": false,
  "temperature": 0.7,
  "max_output_tokens": 1000,
  "tools": [...],
  "tool_choice": "auto",
  "truncation": "auto"
}
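
For Go callers, the request body above can be built from a typed struct instead of a hand-written JSON string. This is a sketch under assumptions: ResponsesRequest and Encode are hypothetical names rather than types exported by the library, and the tools array is omitted because its schema is not shown here.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// ResponsesRequest mirrors the scalar fields of the request body shown above.
// The tools array is left out because its schema is not shown in this README.
type ResponsesRequest struct {
	Model           string   `json:"model"`
	Input           string   `json:"input"`
	Stream          bool     `json:"stream"`
	Temperature     *float64 `json:"temperature,omitempty"` // pointer so an explicit 0 can be sent
	MaxOutputTokens int      `json:"max_output_tokens,omitempty"`
	ToolChoice      string   `json:"tool_choice,omitempty"`
	Truncation      string   `json:"truncation,omitempty"`
}

// Encode serializes the request to a JSON body.
func (r ResponsesRequest) Encode() ([]byte, error) {
	return json.Marshal(r)
}

func main() {
	temp := 0.7
	req := ResponsesRequest{
		Model:           "gpt-4",
		Input:           "Your prompt here",
		Temperature:     &temp,
		MaxOutputTokens: 1000,
	}
	body, err := req.Encode()
	if err != nil {
		panic(err)
	}
	fmt.Println(string(body))
}
```

Optional fields use omitempty so that unset values are not sent and the server-side defaults apply.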

OpenResponses Response Format

{
  "id": "resp_abc123",
  "object": "response",
  "status": "completed",
  "created_at": 1234567890,
  "completed_at": 1234567895,
  "model": "gpt-4",
  "output": [
    {
      "id": "msg_xyz789",
      "type": "message",
      "status": "completed",
      "role": "assistant",
      "content": [
        {
          "type": "output_text",
          "text": "Response text here"
        }
      ]
    }
  ],
  "usage": {
    "input_tokens": 10,
    "output_tokens": 20,
    "total_tokens": 30
  }
}
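
To pull the assistant text out of a response shaped like the one above, a client walks output, keeps items of type message, and concatenates their output_text parts. The sketch below does that with the standard library; the struct and function names are hypothetical and only the fields needed for the walk are declared.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// ContentPart is one entry of an output item's content array.
type ContentPart struct {
	Type string `json:"type"`
	Text string `json:"text"`
}

// OutputItem is one entry of the response's output array.
type OutputItem struct {
	Type    string        `json:"type"`
	Role    string        `json:"role"`
	Content []ContentPart `json:"content"`
}

// Response declares only the fields this sketch reads.
type Response struct {
	ID     string       `json:"id"`
	Status string       `json:"status"`
	Output []OutputItem `json:"output"`
}

// OutputText decodes a response body and joins all output_text parts
// found in message items, in order.
func OutputText(body []byte) (string, error) {
	var resp Response
	if err := json.Unmarshal(body, &resp); err != nil {
		return "", err
	}
	var sb strings.Builder
	for _, item := range resp.Output {
		if item.Type != "message" {
			continue
		}
		for _, part := range item.Content {
			if part.Type == "output_text" {
				sb.WriteString(part.Text)
			}
		}
	}
	return sb.String(), nil
}

func main() {
	body := []byte(`{"id":"resp_abc123","status":"completed","output":[{"type":"message","role":"assistant","content":[{"type":"output_text","text":"Response text here"}]}]}`)
	text, err := OutputText(body)
	if err != nil {
		panic(err)
	}
	fmt.Println(text)
}
```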

Hook System

Hooks allow you to customize request/response processing:

Authentication Hook

// Requires the standard "context" package and the gateway's hook package.
type AuthHook struct{}

func (h *AuthHook) Name() string { return "auth" }

func (h *AuthHook) Authenticate(ctx context.Context, apiKey string) (bool, string, error) {
    // Validate the API key here; the return values are (success, tenantID, error).
    return true, "tenant-id", nil
}

// Register the hook so the gateway can call it.
hooks := hook.NewRegistry()
hooks.Register(&AuthHook{})

Request/Response Hooks

type LoggingHook struct{}

func (h *LoggingHook) BeforeRequest(ctx context.Context, req *openai.ChatCompletionRequest) error {
    log.Printf("Request: %+v", req)
    return nil
}

func (h *LoggingHook) AfterRequest(ctx context.Context, req *openai.ChatCompletionRequest, resp *openai.ChatCompletionResponse) error {
    log.Printf("Response: %+v", resp)
    return nil
}

Providers

HTTP Provider (Generic)

import "github.com/deeplooplabs/ai-gateway/provider"

// Standard OpenAI-compatible provider.
// The variables are not named "provider" so they do not shadow the package.
openAI := provider.NewHTTPProviderWithBaseURL(
    "https://api.openai.com/v1",
    "your-api-key",
)

// Provider with BasePath (for APIs that include /v1 in the base URL)
siliconFlow := provider.NewHTTPProviderWithBaseURLAndPath(
    "https://api.siliconflow.cn/v1",  // BaseURL includes /v1
    "your-api-key",
    "/v1",  // Strip /v1 from endpoint to avoid duplication
)
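
The effect of the BasePath argument can be illustrated with a small URL-joining rule. This is a sketch of the behaviour the comments describe, not the library's implementation; joinURL is a hypothetical helper.

```go
package main

import (
	"fmt"
	"strings"
)

// joinURL is a hypothetical helper. It strips basePath from the front of the
// gateway endpoint before appending the endpoint to baseURL, so a base URL
// that already ends in /v1 does not produce /v1/v1/... upstream.
func joinURL(baseURL, basePath, endpoint string) string {
	if basePath != "" && strings.HasPrefix(endpoint, basePath+"/") {
		endpoint = strings.TrimPrefix(endpoint, basePath)
	}
	return strings.TrimRight(baseURL, "/") + endpoint
}

func main() {
	// With BasePath "/v1" the duplicated segment is removed.
	fmt.Println(joinURL("https://api.siliconflow.cn/v1", "/v1", "/v1/chat/completions"))
	// prints https://api.siliconflow.cn/v1/chat/completions

	// Without a BasePath the segment appears twice.
	fmt.Println(joinURL("https://api.siliconflow.cn/v1", "", "/v1/chat/completions"))
	// prints https://api.siliconflow.cn/v1/v1/chat/completions
}
```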

Configuration Options

config := provider.NewProviderConfig("my-provider").
    WithBaseURL("https://api.example.com/v1").
    WithBasePath("/v1").  // Strip from endpoint
    WithAPIKey("your-key").
    WithAPIType(provider.APITypeAll).
    WithTimeout(30 * time.Second)

// Named httpProvider so the variable does not shadow the provider package.
httpProvider := provider.NewHTTPProvider(config)

Model Registry

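registry := model.NewMapModelRegistry()

// openAI is a provider value created as in Quick Start. It is not named
// "provider" because that would shadow the provider package used below.

// Simple registration
registry.Register("gpt-4", openAI)

// With options
registry.RegisterWithOptions("gpt-4", openAI,
    model.WithModelRewrite("gpt-4-turbo"),      // Rewrite model name
    model.WithPreferredAPI(provider.APITypeChatCompletions),
)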
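
Conceptually, model name rewriting is a lookup from the name a client sends to a provider plus the name forwarded upstream. The toy registry below sketches that idea with a plain map; Registry, Register, and Resolve here are hypothetical and are not the library's MapModelRegistry.

```go
package main

import "fmt"

// route records where a public model name is sent and under which upstream name.
type route struct {
	providerName  string
	upstreamModel string
}

// Registry is a toy stand-in for a model registry.
type Registry struct {
	routes map[string]route
}

// NewRegistry returns an empty registry.
func NewRegistry() *Registry {
	return &Registry{routes: make(map[string]route)}
}

// Register maps a public model name to a provider.
// An empty rewrite keeps the model name unchanged.
func (r *Registry) Register(model, providerName, rewrite string) {
	if rewrite == "" {
		rewrite = model
	}
	r.routes[model] = route{providerName: providerName, upstreamModel: rewrite}
}

// Resolve returns the provider and the model name to send upstream.
func (r *Registry) Resolve(model string) (string, string, bool) {
	rt, ok := r.routes[model]
	return rt.providerName, rt.upstreamModel, ok
}

func main() {
	reg := NewRegistry()
	reg.Register("gpt-4", "openai", "gpt-4-turbo")
	reg.Register("text-embedding-3-small", "openai", "")

	p, m, ok := reg.Resolve("gpt-4")
	fmt.Println(p, m, ok) // prints openai gpt-4-turbo true
}
```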

Docker Support

# Build
docker build -t ai-gateway-example -f example/Dockerfile .

# Run
docker run -p 8083:8083 \
  -e OPENAI_BASE_URL=https://api.openai.com/v1 \
  -e OPENAI_API_KEY=your-key \
  ai-gateway-example

See example/ directory for a complete working example.
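
The docker run command passes the upstream URL and key as environment variables. A program embedding the gateway could read them as sketched below. Config and LoadConfig are hypothetical names, and both the default base URL and the rule that a missing key is an error are assumptions of this sketch, not documented behaviour of the example image.

```go
package main

import (
	"errors"
	"fmt"
	"os"
)

// Config holds the two settings passed to the container.
type Config struct {
	BaseURL string
	APIKey  string
}

// LoadConfig reads settings through getenv, which makes it easy to test.
// Assumption: a missing base URL falls back to the public OpenAI endpoint.
func LoadConfig(getenv func(string) string) (Config, error) {
	cfg := Config{
		BaseURL: getenv("OPENAI_BASE_URL"),
		APIKey:  getenv("OPENAI_API_KEY"),
	}
	if cfg.BaseURL == "" {
		cfg.BaseURL = "https://api.openai.com/v1"
	}
	if cfg.APIKey == "" {
		return Config{}, errors.New("OPENAI_API_KEY is not set")
	}
	return cfg, nil
}

func main() {
	cfg, err := LoadConfig(os.Getenv)
	if err != nil {
		fmt.Println("config error:", err)
		return
	}
	fmt.Println("forwarding to", cfg.BaseURL)
}
```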

License

MIT
