Monitor your production code with TraceKit Code Monitoring. Set live breakpoints, capture variable state, and debug without redeploying.

Code Monitoring

Debug production code without stopping your application. Set non-breaking breakpoints and capture variable state in real-time.

Production Debugging Without Downtime -- Set breakpoints in production and capture variables, stack traces, and context without redeploying. Built-in PII scrubbing, crash isolation, circuit breakers, and real-time SSE updates. Less than 5ms overhead. Supports Go, Java, PHP, Laravel, Node.js, Python, .NET and Ruby.

What is Code Monitoring?

Non-Breaking Breakpoints -- Create breakpoints that capture data without stopping your application.
Capture Variable State -- See all variable values at the exact moment the breakpoint was hit.
Full Stack Traces -- Complete call stack showing how code reached that breakpoint.
Request Context -- HTTP headers, trace IDs, and more to understand what triggered execution.

How It Works

Automatic Code Discovery

TraceKit automatically indexes your code from traces you're already sending. When traces contain stack traces (from errors or instrumentation), we extract file paths, functions, and line numbers. No extra instrumentation needed.

Browse your discovered code in the Code Monitoring page, "Browse Code" tab.
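
As a rough illustration of the discovery step described above, the sketch below pulls file paths, line numbers, and function names out of a Python-style traceback string. The `CodeLocation` type, the `extract_locations` helper, and the regex are hypothetical and only show the idea. TraceKit's actual indexer is not described on this page.

```python
import re
from typing import List, NamedTuple


class CodeLocation(NamedTuple):
    file: str
    line: int
    function: str


# Matches the frame lines of a standard CPython traceback, e.g.
#   File "/app/orders.py", line 42, in process_order
_FRAME_RE = re.compile(
    r'File "(?P<file>[^"]+)", line (?P<line>\d+), in (?P<func>\S+)'
)


def extract_locations(stack_trace: str) -> List[CodeLocation]:
    """Return the unique code locations found in a traceback, in order."""
    seen = set()
    locations = []
    for match in _FRAME_RE.finditer(stack_trace):
        loc = CodeLocation(match["file"], int(match["line"]), match["func"])
        if loc not in seen:
            seen.add(loc)
            locations.append(loc)
    return locations


sample = '''Traceback (most recent call last):
  File "/app/server.py", line 10, in handle
    process_order(order)
  File "/app/orders.py", line 42, in process_order
    raise ValueError("bad total")
ValueError: bad total'''

for loc in extract_locations(sample):
    print(f"{loc.file}:{loc.line} ({loc.function})")
```

Each extracted location corresponds to one entry you would expect to see under "Browse Code".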

Step-by-Step

Send Traces -- Your existing traces automatically index code. Stack traces reveal file paths and functions.
Browse and Set Breakpoints -- Click "Browse Code" to see discovered files/functions, then click "Set Breakpoint" on any location.
Capture and Debug -- When that code runs, we capture variables, stack trace, and context automatically. View snapshots in the UI.

Recommended: Use CheckAndCaptureWithContext() for automatic breakpoint registration. The SDK handles file detection, line tracking, and breakpoint creation for you.

Quick Start

Step 1: Install and Enable Code Monitoring

Choose your language:

Go:

go get github.com/Tracekit-Dev/go-sdk/tracekit

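sdk, err := tracekit.NewSDK(&tracekit.Config{
    APIKey:               os.Getenv("TRACEKIT_API_KEY"),
    ServiceName:          "order-service",
    EnableCodeMonitoring: true,
})
if err != nil {
    // Fail fast if the SDK cannot be initialized
    log.Fatalf("tracekit: init failed: %v", err)
}
defer sdk.Shutdown(context.Background())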

sdk.CheckAndCaptureWithContext(ctx, "order-processing", map[string]interface{}{
    "orderID": orderID,
    "total":   total,
    "status":  "validated",
})

Full Go Code Monitoring Docs

Python:

pip install tracekit-apm

import os

import tracekit

client = tracekit.init(
    api_key=os.getenv("TRACEKIT_API_KEY"),
    service_name="my-flask-app",
    enable_code_monitoring=True,  # default: False
)

client.capture_snapshot("order-processing", {
    "order_id": order["id"],
    "total": order["total"],
    "user_id": user.id,
})

Full Python Code Monitoring Docs

Node.js:

npm install @tracekit/node-apm

import * as tracekit from '@tracekit/node-apm';

const client = tracekit.init({
    apiKey: process.env.TRACEKIT_API_KEY,
    serviceName: 'my-app',
    enableCodeMonitoring: true,
});

await client.captureSnapshot('checkout-validation', {
    userId,
    amount,
    timestamp: new Date().toISOString(),
});

Full Node.js Code Monitoring Docs

PHP:

composer require tracekit/php-apm

$tracekit = new TracekitClient([
    'api_key' => getenv('TRACEKIT_API_KEY'),
    'service_name' => 'my-php-app',
    'endpoint' => 'https://your-app.com/v1/traces',
    'code_monitoring_enabled' => true,
]);

$tracekit->captureSnapshot('checkout-validation', [
    'user_id' => $userId,
    'cart_items' => count($cart['items']),
    'total_amount' => $cart['total'],
]);

Full PHP Code Monitoring Docs

Laravel:

composer require tracekit/laravel-apm

TRACEKIT_CODE_MONITORING_ENABLED=true
TRACEKIT_CODE_MONITORING_POLL_INTERVAL=30

tracekit_snapshot('checkout-start', [
    'user_id' => $userId,
    'cart_total' => $cartTotal,
    'items_count' => count($items),
]);

Full Laravel Code Monitoring Docs

Java:

<dependency>
    <groupId>dev.tracekit</groupId>
    <artifactId>tracekit-core</artifactId>
</dependency>

tracekit:
    enable-code-monitoring: true

tracekit.captureSnapshot("order_processing",
    Map.of(
        "orderId", order.getId(),
        "customerId", order.getCustomerId(),
        "total", order.getTotal()
    )
);

Full Java Code Monitoring Docs

Step 3: Add Checkpoints (Automatic)

Recommended: Automatic Breakpoint Registration -- Breakpoints are automatically created and updated when you call CheckAndCaptureWithContext. No manual UI setup required.

// Automatic file/line detection + auto-creates breakpoint!
sdk.CheckAndCaptureWithContext(ctx, "payment-processing", map[string]interface{}{
    "userID": userID,
    "amount": amount,
})

// The SDK will:
// 1. Detect file path and line number automatically
// 2. Auto-create/update the breakpoint in TraceKit
// 3. Capture snapshot when breakpoint is active

Step 4: View and Manage (Optional)

Breakpoints are automatically created and enabled. You can optionally:

View captured snapshots in the UI at /snapshots
Adjust conditions or sampling rates
Browse auto-discovered code
Disable/enable breakpoints as needed

Advanced: Manual Breakpoint Creation

For advanced users who want full control, you can manually create breakpoints in the UI first. Go to Code Monitoring and create a breakpoint for payment.go:42.

Production Safety

Code monitoring is built for production from day one. Every SDK includes multiple layers of protection designed to keep overhead minimal and to stop problems in the monitoring path from affecting your application, even when the TraceKit backend or the SDK itself fails.

PII Scrubbing (Default On)

13 built-in patterns automatically redact sensitive data before it leaves your application. Emails, SSNs, credit cards, API keys, JWTs, and more are replaced with typed markers like [REDACTED:email]. Enabled by default. Add custom patterns or disable per-service.
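
The redaction step can be pictured as a recursive pass over the captured variables. The sketch below is an illustration under stated assumptions: the three regexes and the `scrub` helper are hypothetical and are not TraceKit's 13 built-in patterns. Only the `[REDACTED:type]` marker format comes from this page.

```python
import re
from typing import Any

# Illustrative patterns only; NOT the SDK's built-in pattern set.
_PATTERNS = [
    ("email", re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")),
    ("ssn", re.compile(r"\b\d{3}-\d{2}-\d{4}\b")),
    ("credit_card", re.compile(r"\b\d(?:[ -]?\d){12,15}\b")),
]


def scrub(value: Any) -> Any:
    """Recursively replace sensitive substrings with typed markers."""
    if isinstance(value, str):
        for kind, pattern in _PATTERNS:
            value = pattern.sub(f"[REDACTED:{kind}]", value)
        return value
    if isinstance(value, dict):
        return {key: scrub(item) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        return [scrub(item) for item in value]
    return value  # numbers, booleans, and None pass through unchanged


captured = {
    "email": "alice@example.com",
    "note": "SSN 123-45-6789, card 4111 1111 1111 1111",
    "total": 42,
}
print(scrub(captured))
```

Running the scrubber before serialization is what keeps the raw values from leaving the process.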

Crash Isolation

Every SDK entry point is wrapped in language-idiomatic recovery handlers. A bug in TraceKit's snapshot code will never crash your application -- the SDK recovers silently and continues. Go: defer/recover. Node: try/catch. Java: catch(Throwable). And more.
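
In Python, the same idea can be sketched as a decorator around each entry point. The `isolated` decorator and the failing `capture_snapshot` stub below are hypothetical and exist only to show the pattern. They are not the SDK's code.

```python
import functools
import logging
from typing import Any, Callable, Optional, TypeVar

logger = logging.getLogger("monitoring-sketch")
T = TypeVar("T")


def isolated(func: Callable[..., T]) -> Callable[..., Optional[T]]:
    """Wrap an entry point so its failures never reach the caller."""

    @functools.wraps(func)
    def wrapper(*args: Any, **kwargs: Any) -> Optional[T]:
        try:
            return func(*args, **kwargs)
        except Exception:
            # Swallow and log; the host application keeps running.
            logger.debug("snapshot capture failed", exc_info=True)
            return None

    return wrapper


@isolated
def capture_snapshot(label: str, variables: dict) -> str:
    # Simulated bug inside the monitoring code.
    raise RuntimeError("serializer exploded")


def handle_request() -> str:
    capture_snapshot("checkout", {"amount": 10})  # fails internally, silently
    return "200 OK"  # business logic is unaffected


print(handle_request())
```

The trade-off is that failures become invisible to the application, so logging them at a debug level (as above) is the only trace they leave.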

Circuit Breaker

If the TraceKit backend is unreachable, the SDK automatically stops sending snapshots after 3 failures in 60 seconds. It re-enables after a 5-minute cooldown. No manual intervention needed. Thresholds are configurable per SDK instance.
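
The behavior described above (3 failures within 60 seconds, then a 5-minute cooldown) matches a sliding-window circuit breaker. The class below is a minimal sketch under that reading. The `CircuitBreaker` name, its methods, and the injectable `clock` are assumptions for illustration and are not the SDK's internals.

```python
import time
from collections import deque
from typing import Callable, Deque, Optional


class CircuitBreaker:
    """Trips after max_failures within window_s; recovers after cooldown_s."""

    def __init__(self, max_failures: int = 3, window_s: float = 60.0,
                 cooldown_s: float = 300.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.max_failures = max_failures
        self.window_s = window_s
        self.cooldown_s = cooldown_s
        self._clock = clock  # injectable so the timing can be tested
        self._failures: Deque[float] = deque()
        self._opened_at: Optional[float] = None

    def allow(self) -> bool:
        """Return True if a snapshot may be sent right now."""
        if self._opened_at is None:
            return True
        if self._clock() - self._opened_at >= self.cooldown_s:
            self._opened_at = None  # cooldown elapsed: close the circuit
            self._failures.clear()
            return True
        return False

    def record_failure(self) -> None:
        """Register a failed send and trip the breaker if needed."""
        now = self._clock()
        self._failures.append(now)
        # Drop failures that fell out of the sliding window.
        while self._failures and now - self._failures[0] > self.window_s:
            self._failures.popleft()
        if len(self._failures) >= self.max_failures:
            self._opened_at = now


breaker = CircuitBreaker()
for _ in range(3):
    breaker.record_failure()
print("sending allowed:", breaker.allow())
```

A caller would check `allow()` before each send and call `record_failure()` whenever a send fails, so a dead backend costs at most a few failed requests per cooldown period.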

Remote Kill Switch

Instantly disable all code monitoring for a service from the dashboard. The kill switch propagates to all connected SDKs in real-time via SSE, or within 60 seconds via polling. One click in the dashboard to stop all captures immediately.

All safety features are enabled by default across all 8 SDKs. No configuration required -- just enable code monitoring and you're protected.

Real-Time Updates

Breakpoint changes propagate to your SDKs in under 1 second using Server-Sent Events (SSE). No more waiting for the next 30-second poll cycle.

Auto-Discovery -- When your SDK polls for breakpoints, the server returns an sse_endpoint URL. The SDK automatically connects.
Real-Time Streaming -- Breakpoint creates, updates, deletes, and kill switch commands stream instantly to connected SDKs. Polling pauses while SSE is active.
Automatic Fallback -- If the SSE connection drops, the SDK seamlessly falls back to polling and reconnects SSE on the next successful poll.
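
To make the streaming step concrete, the sketch below parses Server-Sent Events text and applies the resulting events to a local breakpoint table. The SSE framing (`event:` and `data:` fields, a blank line ending each event) follows the standard format. The event names `breakpoint_updated`, `breakpoint_deleted`, and `kill_switch`, and the JSON payload shape, are hypothetical and not taken from the TraceKit protocol. Connecting to the `sse_endpoint` and falling back to polling are left out.

```python
import json
from typing import Dict, Iterable, Iterator, Tuple


def _field_value(line: str, prefix: str) -> str:
    """Return the field value, dropping one optional leading space."""
    value = line[len(prefix):]
    return value[1:] if value.startswith(" ") else value


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event_name, data) pairs from Server-Sent Events text lines."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":  # a blank line dispatches the buffered event
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment or keep-alive line
        elif line.startswith("event:"):
            event = _field_value(line, "event:")
        elif line.startswith("data:"):
            data.append(_field_value(line, "data:"))


def apply_events(breakpoints: Dict[str, dict],
                 events: Iterable[Tuple[str, str]]) -> bool:
    """Apply events to the local table; return False if the kill switch is on."""
    enabled = True
    for name, data in events:
        payload = json.loads(data)
        if name == "breakpoint_updated":      # hypothetical event name
            breakpoints[payload["id"]] = payload
        elif name == "breakpoint_deleted":    # hypothetical event name
            breakpoints.pop(payload["id"], None)
        elif name == "kill_switch":           # hypothetical event name
            enabled = not payload["active"]
    return enabled


stream = [
    ": keep-alive\n",
    "event: breakpoint_updated\n",
    'data: {"id": "bp1", "file": "payment.go", "line": 42}\n',
    "\n",
    "event: kill_switch\n",
    'data: {"active": true}\n',
    "\n",
]
table: Dict[str, dict] = {}
enabled = apply_events(table, parse_sse(stream))
print(table, enabled)
```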

Advanced Configuration

Go:

// boolPtr is a small local helper, e.g. func boolPtr(b bool) *bool { return &b }
sdk, _ := tracekit.NewSDK(&tracekit.Config{
    APIKey:               os.Getenv("TRACEKIT_API_KEY"),
    ServiceName:          "order-service",
    EnableCodeMonitoring: true,
    CaptureConfig: &tracekit.CaptureConfig{
        CaptureDepth:   10,              // Max nesting depth (0 = unlimited)
        MaxPayload:     65536,           // Max payload bytes (0 = unlimited)
        CaptureTimeout: 5 * time.Second, // Capture timeout (0 = none)
        PIIScrubbing:   boolPtr(true),   // Default: enabled
        CircuitBreaker: &tracekit.CircuitBreakerConfig{
            MaxFailures: 3,      // Failures before tripping (default: 3)
            WindowMs:    60000,  // Failure window in ms (default: 60s)
            CooldownMs:  300000, // Auto-recovery after (default: 5min)
        },
    },
})

Node.js:

const client = tracekit.init({
    apiKey: process.env.TRACEKIT_API_KEY,
    serviceName: 'order-service',
    enableCodeMonitoring: true,
    captureConfig: {
        captureDepth: 10,         // Max nesting depth (undefined = unlimited)
        maxPayload: 65536,        // Max payload bytes (undefined = unlimited)
        captureTimeout: 5000,     // Capture timeout in ms (undefined = none)
        piiScrubbing: true,       // Default: true
        circuitBreaker: {
            maxFailures: 3,       // Failures before tripping (default: 3)
            windowMs: 60000,      // Failure window in ms (default: 60s)
            cooldownMs: 300000,   // Auto-recovery after (default: 5min)
        },
    },
});

Python:

client = tracekit.init(
    api_key=os.getenv("TRACEKIT_API_KEY"),
    service_name="order-service",
    enable_code_monitoring=True,
    capture_config={
        "capture_depth": 10,         # Max nesting depth (None = unlimited)
        "max_payload": 65536,        # Max payload bytes (None = unlimited)
        "capture_timeout": 5.0,      # Capture timeout in seconds (None = none)
        "pii_scrubbing": True,       # Default: True
        "circuit_breaker": {
            "max_failures": 3,       # Failures before tripping (default: 3)
            "window_ms": 60000,      # Failure window in ms (default: 60s)
            "cooldown_ms": 300000,   # Auto-recovery after (default: 5min)
        },
    },
)

Java:

TracekitConfig config = TracekitConfig.builder()
    .apiKey(System.getenv("TRACEKIT_API_KEY"))
    .serviceName("order-service")
    .enableCodeMonitoring(true)
    .captureDepth(10)              // Max nesting depth (0 = unlimited)
    .maxPayload(65536)             // Max payload bytes (0 = unlimited)
    .captureTimeoutMs(5000)        // Capture timeout in ms (0 = none)
    .piiScrubbing(true)            // Default: true
    .circuitBreakerMaxFailures(3)  // Default: 3
    .circuitBreakerWindowMs(60000) // Default: 60s
    .circuitBreakerCooldownMs(300000) // Default: 5min
    .build();

.NET:

var sdk = TracekitSDK.CreateBuilder()
    .WithApiKey(Environment.GetEnvironmentVariable("TRACEKIT_API_KEY"))
    .WithServiceName("order-service")
    .WithEnableCodeMonitoring(true)
    .WithCaptureDepth(10)              // Max nesting depth (0 = unlimited)
    .WithMaxPayload(65536)             // Max payload bytes (0 = unlimited)
    .WithCaptureTimeoutMs(5000)        // Capture timeout in ms (0 = none)
    .WithPiiScrubbing(true)            // Default: true
    .WithCircuitBreakerMaxFailures(3)  // Default: 3
    .WithCircuitBreakerWindowMs(60000) // Default: 60s
    .WithCircuitBreakerCooldownMs(300000) // Default: 5min
    .Build();

Ruby:

Tracekit::SDK.configure do |c|
    c.api_key                = ENV['TRACEKIT_API_KEY']
    c.service_name           = "order-service"
    c.enable_code_monitoring = true
    c.capture_depth          = 10      # Max nesting depth (nil = unlimited)
    c.max_payload            = 65536   # Max payload bytes (nil = unlimited)
    c.capture_timeout        = 5.0     # Capture timeout in seconds (nil = none)
    c.pii_scrubbing          = true    # Default: true
    c.circuit_breaker_max_failures = 3       # Default: 3
    c.circuit_breaker_window_ms    = 60000   # Default: 60s
    c.circuit_breaker_cooldown_ms  = 300000  # Default: 5min
end

Troubleshooting

No snapshots captured?

Check that the breakpoint is enabled and not expired
Verify that the file path and line number match
Ensure the service name matches between the SDK and the breakpoint
Check that the kill switch is not active for the service
Verify that the circuit breaker hasn't tripped (check SDK logs for "circuit breaker open")
If using conditions, verify that the condition expression is valid

Performance concerns?

Use max_captures to limit total captures per breakpoint
Set capture_frequency for sampling
Set short expiration times on breakpoints
Use opt-in capture limits (captureDepth, maxPayload, captureTimeout)
The circuit breaker stops snapshot sending automatically after 3 failures within 60 seconds -- no manual action needed
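
How a capture cap and sampling can work together is sketched below. The semantics are assumptions made for illustration: `max_captures` is treated as a lifetime cap per breakpoint and `capture_frequency` as "capture 1 out of every N hits". This page does not define either option precisely, and the `BreakpointLimits` class is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class BreakpointLimits:
    max_captures: int = 100      # assumed: lifetime cap per breakpoint
    capture_frequency: int = 10  # assumed: capture 1 out of every N hits
    hits: int = 0
    captures: int = 0

    def should_capture(self) -> bool:
        """Decide whether the current hit produces a snapshot."""
        self.hits += 1
        if self.captures >= self.max_captures:
            return False  # capture budget exhausted
        if (self.hits - 1) % self.capture_frequency != 0:
            return False  # skipped by sampling
        self.captures += 1
        return True


limits = BreakpointLimits(max_captures=3, capture_frequency=5)
captured = sum(limits.should_capture() for _ in range(100))
print(f"{captured} captures out of {limits.hits} hits")
```

Under these assumptions, a hot code path pays for a counter increment on most hits and a full capture only on the sampled ones.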

Variables showing [REDACTED:type]?

PII scrubbing is enabled by default and redacts sensitive data before transmission
13 built-in patterns detect emails, SSNs, credit cards, API keys, JWTs, and more
To disable for a specific service, set piiScrubbing: false in your capture config
Custom patterns can be added via the piiPatterns config option
