Configuration Options Reference

Note

Complete reference for HoneyHive SDK configuration options

This document provides detailed specifications for all configuration options available in the HoneyHive SDK.

Important

🆕 NEW: Hybrid Configuration System

The HoneyHive SDK now supports a hybrid configuration approach that combines modern Pydantic config objects with full backwards compatibility. You can use either approach or mix them together.

The HoneyHive SDK supports multiple configuration approaches:

🎯 Recommended Approaches (Choose One):

Modern Pydantic Config Objects (Recommended for new code)
Traditional Parameter Passing (Backwards compatible)
Mixed Approach (Config objects + parameter overrides)

📚 Additional Configuration Sources:

Environment variables (HH_* prefixed)
Configuration files (YAML/JSON)
CLI options

Configuration Methods

Type-safe, validated configuration with IDE support:

from honeyhive import HoneyHiveTracer
from honeyhive.config.models import TracerConfig, SessionConfig

# Create configuration objects
config = TracerConfig(
    api_key="hh_1234567890abcdef",
    project="my-llm-project",
    source="production",
    verbose=True,
    disable_http_tracing=True,
    test_mode=False
)

session_config = SessionConfig(
    session_name="user-chat-session",
    inputs={"user_id": "123", "query": "Hello world"}
)

# Initialize with config objects
tracer = HoneyHiveTracer(
    config=config,
    session_config=session_config
)

Benefits: Type safety, IDE autocomplete, validation, reduced argument count

Existing code continues to work exactly as before:

from honeyhive import HoneyHiveTracer

# This continues to work exactly as before
tracer = HoneyHiveTracer(
    api_key="hh_1234567890abcdef",
    project="my-llm-project",
    session_name="user-chat-session",
    source="production",
    verbose=True,
    disable_http_tracing=True,
    test_mode=False
)

Benefits: No code changes required, familiar pattern

Config objects with parameter overrides (individual parameters take precedence):

from honeyhive import HoneyHiveTracer
from honeyhive.config.models import TracerConfig

# Base configuration
config = TracerConfig(
    api_key="hh_1234567890abcdef",
    project="my-llm-project",
    source="production"
)

# Individual parameters override config values
tracer = HoneyHiveTracer(
    config=config,
    verbose=True,  # Overrides config.verbose
    session_name="override-session"  # Additional parameter
)

Benefits: Flexible configuration with selective overrides

Configuration Precedence

The SDK follows this precedence order (highest to lowest):

Individual Parameters - Direct parameters to HoneyHiveTracer()
Config Object Values - Values from TracerConfig objects
Environment Variables - HH_* environment variables
Default Values - Built-in SDK defaults

Note

API Key Special Case: For backwards compatibility, HH_API_KEY environment variable takes precedence over both config objects and constructor parameters.

Configuration Classes

class honeyhive.config.models.TracerConfig

Primary configuration class for HoneyHive tracer initialization.

Inherits common fields from BaseHoneyHiveConfig and adds tracer-specific parameters.

Key Features:

Type-safe Pydantic validation
Environment variable loading via AliasChoices
Graceful degradation on invalid values
IDE autocomplete support

Example:

from honeyhive.config.models import TracerConfig

config = TracerConfig(
    api_key="hh_1234567890abcdef",
    project="my-llm-project",
    source="production",
    verbose=True
)

class honeyhive.config.models.BaseHoneyHiveConfig

Base configuration class with common fields shared across all HoneyHive components.

Common Fields: api_key, project, test_mode, verbose

class honeyhive.config.models.SessionConfig

Session-specific configuration for tracer initialization.

Key Fields: session_name, inputs, outputs, metadata

class honeyhive.config.models.APIClientConfig: Configuration for HoneyHive API client settings.

class honeyhive.config.models.HTTPClientConfig: HTTP client configuration including connection pooling and retry settings.

Core Configuration Options

The following options are available through both traditional parameters and config objects:

Authentication

api_key: str = None

Description: HoneyHive API key for authentication

Environment Variable: HH_API_KEY

Required: Yes

Format: String starting with hh_

Example: "hh_1234567890abcdef..."

Security: Keep this secure and never commit to code repositories

Usage Examples:

from honeyhive.config.models import TracerConfig

config = TracerConfig(api_key="hh_1234567890abcdef")
tracer = HoneyHiveTracer(config=config)

tracer = HoneyHiveTracer(api_key="hh_1234567890abcdef")

export HH_API_KEY="hh_1234567890abcdef"

# API key loaded automatically from environment
tracer = HoneyHiveTracer(project="my-project")

base_url: str = "https://api.honeyhive.ai"

Description: Base URL for HoneyHive API

Environment Variable: HH_BASE_URL

Default: "https://api.honeyhive.ai"

Examples: - "https://api.honeyhive.ai" (Production) - "https://api-staging.honeyhive.ai" (Staging) - "https://api-dev.honeyhive.ai" (Development)

Project Configuration

project: str = None

Description: Default project name for operations. Required field that must match your HoneyHive project.

Environment Variable: HH_PROJECT

Required: Yes

Format: Alphanumeric with hyphens and underscores

Example: "my-llm-application"

Validation: 1-100 characters, cannot start/end with special characters

source: str = None

Description: Source identifier for tracing

Environment Variable: HH_SOURCE

Default: Auto-detected from environment

Examples: - "chat-service" - "recommendation-engine" - "data-pipeline"

session_name: str = None

Description: Default session name for tracing

Environment Variable: HH_SESSION_NAME

Default: Auto-generated based on context

Format: Human-readable string

Example: "user-chat-session"

Operational Mode

test_mode: bool = False

Description: Enable test mode (no data sent to HoneyHive)

Environment Variable: HH_TEST_MODE

Default: False

Values: true, false

Use Cases: - Unit testing - Development environments - CI/CD pipelines

debug: bool = False

Description: Enable debug logging

Environment Variable: HH_DEBUG

Default: False

Values: true, false

Behavior: Enables verbose logging and debug information

Performance Configuration

HTTP Configuration

timeout: float = 30.0

Description: HTTP request timeout in seconds

Environment Variable: HH_TIMEOUT

Default: 30.0

Range: 1.0 - 300.0

Use Cases: Adjust based on network conditions and latency requirements

max_retries: int = 3

Description: Maximum number of retry attempts for failed requests

Environment Variable: HH_MAX_RETRIES

Default: 3

Range: 0 - 10

Behavior: Exponential backoff between retries

retry_delay: float = 1.0

Description: Initial retry delay in seconds

Environment Variable: HH_RETRY_DELAY

Default: 1.0

Range: 0.1 - 60.0

Behavior: Delay doubles with each retry (exponential backoff)

max_connections: int = 100

Description: Maximum number of HTTP connections in pool

Environment Variable: HH_MAX_CONNECTIONS

Default: 100

Range: 1 - 1000

Use Cases: Adjust based on concurrency requirements

connection_pool_size: int = 10

Description: HTTP connection pool size

Environment Variable: HH_CONNECTION_POOL_SIZE

Default: 10

Range: 1 - 100

OTLP Configuration

otlp_enabled: bool = True

Description: Enable OTLP export to HoneyHive backend

Environment Variable: HH_OTLP_ENABLED

Default: True

Usage: Set to False to disable OTLP export (useful for testing)

otlp_endpoint: str | None = None

Description: Custom OTLP endpoint URL

Environment Variable: HH_OTLP_ENDPOINT

Default: Auto-configured based on server_url

Example: "https://custom.honeyhive.ai/opentelemetry/v1/traces"

otlp_protocol: str = "http/protobuf"

Description: OTLP protocol format for span export

Environment Variables: HH_OTLP_PROTOCOL or OTEL_EXPORTER_OTLP_PROTOCOL

Valid Values: - "http/protobuf" (default) - Binary Protobuf format - "http/json" - JSON format for debugging and backend type conversion testing

Example: Set HH_OTLP_PROTOCOL=http/json to use JSON format

otlp_headers: Dict[str, Any] | None = None

Description: Additional HTTP headers for OTLP export requests

Environment Variable: HH_OTLP_HEADERS (JSON string)

Example: {"X-Custom-Header": "value"}

Tracing Configuration

disable_http_tracing: bool = True

Description: Disable automatic HTTP request tracing (opt-in feature)

Environment Variable: HH_DISABLE_HTTP_TRACING

Default: True (HTTP tracing disabled by default for performance)

Use Cases: - Lambda environments (performance optimization) - Reduce tracing overhead - Prevent recursive tracing

batch_size: int = 100

Description: Number of spans to batch before sending

Environment Variable: HH_BATCH_SIZE

Default: 100

Range: 1 - 1000

Trade-offs: - Larger batches: Better performance, higher memory usage - Smaller batches: Lower latency, more network calls

flush_interval: float = 5.0

Description: Automatic flush interval in seconds

Environment Variable: HH_FLUSH_INTERVAL

Default: 5.0

Range: 1.0 - 300.0

Behavior: Automatically flushes pending spans at this interval

max_queue_size: int = 2048

Description: Maximum number of spans in memory queue

Environment Variable: HH_MAX_QUEUE_SIZE

Default: 2048

Range: 100 - 10000

Behavior: Oldest spans are dropped when queue is full

OpenTelemetry Span Limits

Note

🆕 NEW in v1.0: Configurable span limits with automatic core attribute preservation

These settings control OpenTelemetry span size limits. The SDK defaults are optimized for 95% of use cases - only increase limits when you actually hit them, not preemptively.

max_attributes: int = 1024

Description: Maximum number of attributes (key-value pairs) per span

Environment Variable: HH_MAX_ATTRIBUTES

Default: 1024 (recommended - optimized for LLM workloads)

Backend Maximum: 10,000 (supported for edge cases only)

OpenTelemetry Default: 128 (SDK increases this 8x for LLM workloads)

Range: 128 - 10,000

⚠️ Important: The default of 1024 is intentionally set to handle 95% of use cases. Only increase this limit when you actually encounter “attribute limit exceeded” errors in production, not preemptively.

When You Might Need More: - Large embeddings (>1MB) with extensive metadata - High-resolution image processing with detailed annotations - Complex multi-step chains with per-step metadata - Debug/development scenarios requiring verbose attribute capture

Trade-offs: - Higher limits: Support larger payloads, more metadata - Lower limits: Reduced memory usage, faster serialization

Performance Impact: Minimal (<1ms overhead) with lazy core attribute preservation

Important: When limit is exceeded, OpenTelemetry uses FIFO eviction (oldest attributes dropped first). The SDK automatically preserves critical attributes (session_id, event_type, event_name, source) when spans approach the limit.

Example:

from honeyhive.config.models import TracerConfig
from honeyhive import HoneyHiveTracer

# Default: 1024 attributes (recommended)
tracer = HoneyHiveTracer.init(
    api_key="hh_...",
    project="my-project"
)

# Increased for large embeddings
config = TracerConfig(
    api_key="hh_...",
    project="my-project",
    max_attributes=5000  # Increase to 5000
)
tracer = HoneyHiveTracer(config=config)

# Or via environment variable
# export HH_MAX_ATTRIBUTES=5000

max_events: int = 1024

Description: Maximum number of events per span

Environment Variable: HH_MAX_EVENTS

Default: 1024 (conservative SDK default)

Backend Maximum: 10,000 (increase if needed)

OpenTelemetry Default: 128 (SDK increases this 8x)

Range: 128 - 10,000

Use Cases: - Default (1024): Most LLM applications with typical event counts - Increased (2000-5000): High-frequency logging, detailed trace events - Maximum (10,000): Debug scenarios, comprehensive event capture

Note: Events are flattened to pseudo-attributes (_event.0.*, _event.1.*, etc.) by the ingestion service, so they count toward effective attribute limit.

Trade-offs: - Higher limits: Capture more detailed execution flow - Lower limits: Reduced network payload size

Example:

# Increase for high-frequency event logging
config = TracerConfig(
    api_key="hh_...",
    project="my-project",
    max_events=3000
)

max_links: int = 128

Description: Maximum number of span links per span (for distributed tracing)

Environment Variable: HH_MAX_LINKS

Default: 128 (typically sufficient)

Backend Maximum: 10,000 (rarely needed)

OpenTelemetry Default: 128 (SDK uses standard default)

Range: 1 - 10,000

Use Cases: - Default (128): Standard distributed tracing scenarios - Increased (500+): Complex microservice architectures, fan-out patterns

Note: Span links are used for distributed tracing to link spans across service boundaries. Most applications don’t need more than the default.

Example:

# Increase for complex distributed systems
config = TracerConfig(
    api_key="hh_...",
    project="my-project",
    max_links=500
)

max_span_size: int = 10485760

Description: Maximum total span size in bytes (attributes + events + links combined)

Environment Variable: HH_MAX_SPAN_SIZE

Default: 10485760 (10 MB - recommended for most use cases)

Backend Maximum: 104857600 (100 MB - supported for edge cases only)

Range: 1,048,576 - 104,857,600 (1 MB - 100 MB)

⚠️ Important: The default of 10 MB is sufficient for 95% of applications including small-to-medium images, embeddings, and typical LLM metadata. Only increase when you actually encounter “span size exceeded” errors.

When You Might Need More: - High-resolution images (>10 MB each) - Audio/video file processing (>10 MB payloads) - Scientific computing with large matrices/tensors - Debug scenarios capturing extensive state

Important: This is a total span size limit enforced in-memory before serialization. OpenTelemetry doesn’t provide this natively, so the SDK implements custom size tracking.

Trade-offs: - Higher limits: Support larger payloads (images, audio, video) - Lower limits: Reduced memory usage, faster network transmission

Performance Impact: Size checking adds ~0.001ms overhead per span

Span Size Breakdown:

Attributes: Each key-value pair (~100-1000 bytes typical)
Events: Each event with data (~50-500 bytes typical)
Links: Each link reference (~100 bytes typical)
Large Data: Images (100KB-10MB), embeddings (1KB-100KB), audio (1MB-50MB)

Example:

# Default: 10 MB
tracer = HoneyHiveTracer.init(
    api_key="hh_...",
    project="my-project"
)

# Increased for image processing
config = TracerConfig(
    api_key="hh_...",
    project="my-project",
    max_span_size=52428800  # 50 MB
)

# Maximum for video/audio processing
config = TracerConfig(
    api_key="hh_...",
    project="my-project",
    max_span_size=104857600  # 100 MB (backend max)
)

preserve_core_attributes: bool = True

Description: Enable automatic preservation of critical attributes to prevent data loss

Environment Variable: HH_PRESERVE_CORE_ATTRIBUTES

Default: True (enabled - strongly recommended)

Behavior: When spans approach the attribute limit (95% threshold), the SDK automatically re-sets critical attributes just before span.end() to ensure they survive OpenTelemetry’s FIFO eviction policy.

Critical Attributes Protected:

session_id (CRITICAL - required for backend ingestion)
source (CRITICAL - required for backend routing)
event_type (HIGH - required for span classification)
event_name (HIGH - required for span identification)
project (NORMAL - required for project routing)
config (NORMAL - optional configuration name)

Why This Matters:

OpenTelemetry uses strict FIFO (First-In-First-Out) eviction when spans exceed attribute limits. Without preservation:

Critical attributes set early (like session_id) get evicted first
Backend rejects spans missing required attributes
Data loss occurs silently

With preservation enabled:

SDK monitors attribute count per span
When span reaches 95% of limit, preservation activates
Critical attributes are re-set LAST (become newest)
Critical attributes survive eviction, span is accepted

Performance Impact:

Normal spans (<95% of limit): Zero overhead
Large spans (>95% of limit): ~0.5ms overhead (lazy activation)
Memory: Negligible (only attributes checked, not copied)

When to Disable:

⚠️ Never in production - high risk of data loss
Debugging OpenTelemetry behavior
Performance profiling (measure raw OTel overhead)
Testing attribute eviction scenarios

Example:

# Default: Enabled (recommended)
tracer = HoneyHiveTracer.init(
    api_key="hh_...",
    project="my-project"
)

# Explicitly enable (redundant but clear)
config = TracerConfig(
    api_key="hh_...",
    project="my-project",
    preserve_core_attributes=True
)

# ⚠️ Disable only for debugging (NOT for production)
config = TracerConfig(
    api_key="hh_...",
    project="my-project",
    preserve_core_attributes=False  # RISKY: Can cause data loss
)

Important

Span Limit Configuration Best Practices

Use the defaults (1024 attrs, 10MB) - optimized for 95% of use cases
Don’t preemptively increase limits - only adjust when you hit actual errors
Monitor in production - use HoneyHive dashboard to track span sizes
Keep preservation enabled - prevents silent data loss from FIFO eviction
Increase incrementally - if needed, increase by 2-3x, not to maximum
Higher limits = higher costs - larger spans mean more memory, network, and storage

Common Configuration Scenarios:

# Scenario 1: Standard LLM application (RECOMMENDED - use defaults)
config = TracerConfig(
    api_key="hh_...",
    project="my-project"
    # Uses defaults: 1024 attrs, 10MB, preservation ON
    # This handles 95% of use cases
)

# Scenario 2: Image processing (only if hitting limits)
config = TracerConfig(
    api_key="hh_...",
    project="image-pipeline",
    max_attributes=2048,       # 2x increase (not 10x)
    max_span_size=20971520     # 20 MB (2x increase, not 100 MB)
)

# Scenario 3: High-resolution media (rare edge case)
config = TracerConfig(
    api_key="hh_...",
    project="media-pipeline",
    max_attributes=3000,       # 3x increase
    max_span_size=52428800     # 50 MB (5x increase)
)

# ⚠️ Scenario 4: Maximum limits (ONLY for extreme edge cases)
# WARNING: Higher memory usage, network costs, and processing time
config = TracerConfig(
    api_key="hh_...",
    project="scientific-computing",
    max_attributes=10000,      # Backend maximum (use sparingly)
    max_span_size=104857600,   # Backend maximum (100 MB)
    verbose=True
)
# Only use maximum limits if:
# - You've verified you actually need them
# - You've tested memory/network impact
# - You understand the cost implications

Evaluation Configuration

Evaluation Settings

evaluation_enabled: bool = True

Description: Enable automatic evaluations

Environment Variable: HH_EVALUATION_ENABLED

Default: True

Use Cases: Disable in high-performance scenarios

evaluation_timeout: float = 30.0

Description: Timeout for evaluation operations in seconds

Environment Variable: HH_EVALUATION_TIMEOUT

Default: 30.0

Range: 5.0 - 300.0

evaluation_parallel: bool = True

Description: Run evaluations in parallel

Environment Variable: HH_EVALUATION_PARALLEL

Default: True

Performance: Parallel execution improves throughput

evaluation_max_workers: int = 4

Description: Maximum parallel evaluation workers

Environment Variable: HH_EVALUATION_MAX_WORKERS

Default: 4

Range: 1 - 20

Default Evaluators

default_evaluators: List[str] = []

Description: Default evaluators to run automatically

Environment Variable: HH_DEFAULT_EVALUATORS (comma-separated)

Default: [] (no automatic evaluators)

Available Evaluators: - "quality" - Overall response quality - "factual_accuracy" - Factual correctness - "relevance" - Query relevance - "toxicity" - Content safety - "length" - Response length appropriateness

Example: "quality,factual_accuracy,relevance"

Logging Configuration

Log Settings

log_level: str = "INFO"

Description: Logging level for SDK operations

Environment Variable: HH_LOG_LEVEL

Default: "INFO"

Values: "DEBUG", "INFO", "WARNING", "ERROR", "CRITICAL"

Behavior: Controls verbosity of SDK logging

log_format: str = "%(asctime)s - %(name)s - %(levelname)s - %(message)s"

Description: Log message format

Environment Variable: HH_LOG_FORMAT

Default: Standard format with timestamp, logger name, level, and message

Format: Python logging format string

log_file: str = None

Description: Log file path (if file logging enabled)

Environment Variable: HH_LOG_FILE

Default: None (console logging only)

Example: "/var/log/honeyhive.log"

structured_logging: bool = False

Description: Enable structured JSON logging

Environment Variable: HH_STRUCTURED_LOGGING

Default: False

Use Cases: Production environments, log aggregation systems

Security Configuration

Data Privacy

mask_inputs: bool = False

Description: Automatically mask sensitive data in inputs

Environment Variable: HH_MASK_INPUTS

Default: False

Behavior: Replaces sensitive data with [MASKED]

mask_outputs: bool = False

Description: Automatically mask sensitive data in outputs

Environment Variable: HH_MASK_OUTPUTS

Default: False

sensitive_keys: List[str] = ["password", "token", "key", "secret"]

Description: Keys to automatically mask in data

Environment Variable: HH_SENSITIVE_KEYS (comma-separated)

Default: Common sensitive field names

Behavior: Case-insensitive matching

SSL/TLS Configuration

verify_ssl: bool = True

Description: Verify SSL certificates for HTTPS requests

Environment Variable: HH_VERIFY_SSL

Default: True

Security: Only disable for development/testing

ca_bundle: str = None

Description: Path to custom CA bundle for SSL verification

Environment Variable: HH_CA_BUNDLE

Default: None (use system CA bundle)

Use Cases: Corporate networks with custom certificates

Environment-Specific Configuration

Development Environment

# development.yaml
api_key: "hh_dev_key_123..."
base_url: "https://api-dev.honeyhive.ai"
project: "my-app-dev"
test_mode: false
debug: true
log_level: "DEBUG"

# Performance (relaxed for development)
timeout: 60.0
batch_size: 10
flush_interval: 1.0

# Evaluation (enabled for testing)
evaluation_enabled: true
default_evaluators: ["quality", "relevance"]

Staging Environment

# staging.yaml
api_key: "hh_staging_key_456..."
base_url: "https://api-staging.honeyhive.ai"
project: "my-app-staging"
test_mode: false
debug: false
log_level: "INFO"

# Performance (production-like)
timeout: 30.0
batch_size: 100
flush_interval: 5.0

# Security (moderate)
mask_inputs: false
mask_outputs: false

Production Environment

# production.yaml
api_key: "hh_prod_key_789..."
base_url: "https://api.honeyhive.ai"
project: "my-app-prod"
test_mode: false
debug: false
log_level: "WARNING"
structured_logging: true

# Performance (optimized)
timeout: 15.0
batch_size: 500
flush_interval: 10.0
max_queue_size: 5000

# Security (strict)
mask_inputs: true
mask_outputs: true
sensitive_keys: ["password", "token", "key", "secret", "api_key", "auth"]

# Evaluation (selective)
evaluation_enabled: true
evaluation_timeout: 10.0
default_evaluators: ["toxicity"]

Lambda/Serverless Environment

# lambda.yaml
api_key: "hh_lambda_key_abc..."
project: "my-lambda-app"
test_mode: false
log_level: "ERROR"

# Performance (optimized for cold starts)
disable_http_tracing: true
timeout: 5.0
batch_size: 1
flush_interval: 1.0
max_queue_size: 100

# Evaluation (disabled for performance)
evaluation_enabled: false

Configuration File Formats

YAML Configuration

# honeyhive.yaml
api_key: "hh_your_api_key_here"
base_url: "https://api.honeyhive.ai"
project: "my-project"
source: "my-service"

# Operational settings
test_mode: false
debug: false

# Performance settings
timeout: 30.0
max_retries: 3
batch_size: 100
flush_interval: 5.0

# Tracing settings
disable_http_tracing: false
max_queue_size: 2048

# Evaluation settings
evaluation_enabled: true
evaluation_parallel: true
evaluation_timeout: 30.0
default_evaluators:
  - "quality"
  - "relevance"

# Logging settings
log_level: "INFO"
structured_logging: false

# Security settings
mask_inputs: false
mask_outputs: false
sensitive_keys:
  - "password"
  - "token"
  - "key"
  - "secret"

JSON Configuration

{
  "api_key": "hh_your_api_key_here",
  "base_url": "https://api.honeyhive.ai",
  "project": "my-project",
  "source": "my-service",
  "test_mode": false,
  "debug": false,
  "timeout": 30.0,
  "max_retries": 3,
  "batch_size": 100,
  "flush_interval": 5.0,
  "disable_http_tracing": false,
  "max_queue_size": 2048,
  "evaluation_enabled": true,
  "evaluation_parallel": true,
  "evaluation_timeout": 30.0,
  "default_evaluators": ["quality", "relevance"],
  "log_level": "INFO",
  "structured_logging": false,
  "mask_inputs": false,
  "mask_outputs": false,
  "sensitive_keys": ["password", "token", "key", "secret"]
}

Configuration Loading

File Discovery:

The SDK searches for configuration files in this order:

./honeyhive.yaml (current directory)
./honeyhive.json (current directory)
~/.honeyhive/config.yaml (user home directory)
~/.honeyhive/config.json (user home directory)
/etc/honeyhive/config.yaml (system-wide)

Environment-Specific Files:

You can specify environment-specific configuration:

# Set environment
export HH_ENVIRONMENT=production

# SDK will look for:
# ./honeyhive.production.yaml
# ~/.honeyhive/config.production.yaml

Explicit Configuration File:

from honeyhive import HoneyHiveTracer

# Load specific config file
tracer = HoneyHiveTracer.init(config_file="./my-config.yaml")

Configuration Validation

Type Validation:

All configuration values are validated for correct types:

# These will raise validation errors:
timeout = "invalid"  # Must be float
batch_size = -1      # Must be positive integer
log_level = "INVALID" # Must be valid log level

Range Validation:

Numeric values are validated against acceptable ranges:

# These will raise validation errors:
timeout = 0.0        # Must be >= 1.0
batch_size = 10000   # Must be <= 1000
max_retries = -1     # Must be >= 0

Format Validation:

String values are validated for correct format:

# These will raise validation errors:
api_key = "invalid"         # Must start with "hh_"
log_level = "invalid"       # Must be valid log level
base_url = "not-a-url"      # Must be valid URL

Configuration Best Practices

Security:

Never commit API keys to version control
Use environment variables for secrets in production
Enable input/output masking for sensitive data
Use different API keys for different environments

Performance:

Tune batch size based on your traffic patterns
Adjust timeout based on your network conditions
Disable HTTP tracing in high-performance scenarios
Use appropriate queue sizes for your memory constraints

Reliability:

Set appropriate retry limits for your use case
Configure timeouts to prevent hanging operations
Enable debug logging during development
Use structured logging in production

Monitoring:

Enable appropriate log levels for your environment
Monitor queue sizes and flush intervals
Track configuration changes in your deployment pipeline
Use health checks to validate configuration

Configuration Examples

High-Performance Web Service:

# High-throughput configuration
batch_size: 1000
flush_interval: 10.0
max_queue_size: 10000
timeout: 5.0
max_retries: 1
disable_http_tracing: true
evaluation_enabled: false

Development Environment:

# Development-friendly configuration
debug: true
log_level: "DEBUG"
test_mode: true
batch_size: 1
flush_interval: 1.0
evaluation_enabled: true
default_evaluators: ["quality", "factual_accuracy"]

Security-Conscious Environment:

# Security-focused configuration
mask_inputs: true
mask_outputs: true
sensitive_keys:
  - "password"
  - "token"
  - "key"
  - "secret"
  - "api_key"
  - "auth"
  - "credential"
verify_ssl: true
structured_logging: true

Configuration Options Reference

Configuration Methods

Configuration Precedence

Configuration Classes

Core Configuration Options

Authentication

Project Configuration

Operational Mode

Performance Configuration

HTTP Configuration

OTLP Configuration

Tracing Configuration

OpenTelemetry Span Limits

Evaluation Configuration

Evaluation Settings

Default Evaluators

Logging Configuration

Log Settings

Security Configuration

Data Privacy

SSL/TLS Configuration

Environment-Specific Configuration

Development Environment

Staging Environment

Production Environment

Lambda/Serverless Environment

Configuration File Formats

YAML Configuration

JSON Configuration

Configuration Loading

Configuration Validation

Configuration Best Practices

Configuration Examples

See Also