Testing Strategy

September 6, 2025

Philosophy

"If you didn't run it, it doesn't work."

The testing strategy combines focused unit tests using go test with real-world validation using diverse wallpaper images. Tests provide immediate feedback and serve as living documentation of system behavior.

Core Testing Principles

Public Infrastructure Testing Standard

ALL PUBLIC PACKAGE FUNCTIONALITY MUST BE TESTED

The primary standard for gauging test coverage is comprehensive testing of public package infrastructure. This means:

Every public function and method has corresponding unit tests
Every public data structure and configuration has validation tests
Every public API endpoint has behavioral tests
Every public constant and setting has usage tests

This principle ensures:

API Contract Validation: All public interfaces work as documented
Backwards Compatibility: Changes to public APIs are caught immediately
Integration Readiness: All exposed functionality is verified to work correctly
Documentation Accuracy: Tests serve as executable examples of public APIs

Coverage Assessment Framework

Test coverage adequacy is measured by:

Public Function Coverage: 100% of exported functions tested
Public Struct Coverage: All exported types have construction and usage tests
Public Method Coverage: All exported methods tested with realistic parameters
Error Path Coverage: All public error conditions tested and documented

Private/internal functions are tested only when:

They contain complex algorithms requiring validation
They implement critical business logic
They handle edge cases that affect public behavior
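
One way to make the public-function criterion checkable is to list a package's exported functions and compare them against the names known to have tests. The sketch below is illustrative only: it parses a hypothetical source string with the standard go/parser package, and the tested set is filled in by hand. The helpers exportedFuncs and untested are assumptions for this example, not part of the project.

```go
package main

import (
	"fmt"
	"go/ast"
	"go/parser"
	"go/token"
	"sort"
)

// exportedFuncs returns the exported functions and methods declared in one
// Go source file. Methods are reported as Type.Method.
func exportedFuncs(src string) ([]string, error) {
	fset := token.NewFileSet()
	file, err := parser.ParseFile(fset, "src.go", src, 0)
	if err != nil {
		return nil, err
	}
	var names []string
	for _, decl := range file.Decls {
		fn, ok := decl.(*ast.FuncDecl)
		if !ok || !fn.Name.IsExported() {
			continue
		}
		name := fn.Name.Name
		if fn.Recv != nil && len(fn.Recv.List) > 0 {
			name = recvTypeName(fn.Recv.List[0].Type) + "." + name
		}
		names = append(names, name)
	}
	sort.Strings(names)
	return names, nil
}

// recvTypeName extracts the receiver type name from T or *T.
func recvTypeName(expr ast.Expr) string {
	switch t := expr.(type) {
	case *ast.StarExpr:
		return recvTypeName(t.X)
	case *ast.Ident:
		return t.Name
	}
	return "?"
}

// untested returns the exported names that have no entry in the tested set.
func untested(exported []string, tested map[string]bool) []string {
	var missing []string
	for _, name := range exported {
		if !tested[name] {
			missing = append(missing, name)
		}
	}
	return missing
}

func main() {
	// Hypothetical source; in practice this would be read from files under pkg/.
	src := `package formats

func RGBToHSL() {}
func ParseHex() {}
func clamp() {}
`
	exported, err := exportedFuncs(src)
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	tested := map[string]bool{"RGBToHSL": true} // names known to have tests
	fmt.Println("exported:", exported)
	fmt.Println("untested:", untested(exported, tested))
}
```

A check like this only shows that a test exists for each exported name. Whether the test uses realistic parameters and covers error paths still needs review.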

Current Testing Approach

Package-Level Unit Tests

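// tests/formats_test.go
package formats_test

import (
    "image/color"
    "math"
    "testing"

    "github.com/JaimeStill/omarchy-theme-generator/pkg/formats"
)

// hslTolerance absorbs floating-point and 8-bit quantization error.
// For example, gray 128 has lightness 128/255 (about 0.502), not exactly 0.5.
const hslTolerance = 0.01

func TestColorConversion(t *testing.T) {
    type hsl struct{ h, s, l float64 }

    testCases := []struct {
        name     string
        input    color.RGBA
        expected hsl
    }{
        {"red", color.RGBA{255, 0, 0, 255}, hsl{0.0, 1.0, 0.5}},
        {"gray", color.RGBA{128, 128, 128, 255}, hsl{0.0, 0.0, 0.5}},
    }

    for _, tc := range testCases {
        t.Run(tc.name, func(t *testing.T) {
            h, s, l := formats.RGBToHSL(tc.input)
            t.Logf("Input: %v, tolerance: %v, got HSL(%.3f, %.3f, %.3f)", tc.input, hslTolerance, h, s, l)

            // Compare within a tolerance; exact float equality is too strict here
            if math.Abs(h-tc.expected.h) > hslTolerance ||
                math.Abs(s-tc.expected.s) > hslTolerance ||
                math.Abs(l-tc.expected.l) > hslTolerance {
                t.Errorf("Expected HSL(%.3f, %.3f, %.3f), got HSL(%.3f, %.3f, %.3f)",
                    tc.expected.h, tc.expected.s, tc.expected.l, h, s, l)
            }
        })
    }
}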

Characteristics

Standard Go testing: Uses built-in testing package with *_test.go files
Layered testing: Each package tested in isolation with clear dependencies
Real-world validation: Integration tests with actual image samples
Comprehensive coverage: Unit tests for all public APIs and critical functions
Diagnostic logging: All tests include comprehensive t.Logf() output for debugging

Diagnostic Logging Requirements

ALL UNIT TESTS MUST INCLUDE COMPREHENSIVE DIAGNOSTIC OUTPUT

Every test function must use t.Logf() to output:

Input values and parameters
Expected vs actual results
Intermediate calculation values
Threshold values and settings
Any metrics used for decision making

Example Implementation

func TestThemeMode_Calculation(t *testing.T) {
    // ... test setup ...
    
    result, err := p.ProcessImage(img)
    if err != nil {
        t.Fatalf("Unexpected error: %v", err)
    }
    
    // Calculate diagnostic metrics
    bgLuminance := chromatic.Luminance(result.Background)
    avgInputLuminance := calculateAverageInputLuminance(inputColors)
    
    // LOG ALL RELEVANT METRICS
    t.Logf("Input colors average luminance: %v", avgInputLuminance)
    t.Logf("Threshold: %v", tc.threshold)
    t.Logf("Background luminance: %v", bgLuminance)
    t.Logf("Expected mode: %v, Detected mode: %v", expectedMode, actualMode)
    
    // Then perform assertions with full context
    if actualMode != expectedMode {
        t.Errorf("Expected theme mode %v, detected %v (input avg: %v, threshold: %v, bg luminance: %v)", 
            expectedMode, actualMode, avgInputLuminance, tc.threshold, bgLuminance)
    }
}

Why Diagnostic Logging is Critical

Debugging failures: When tests fail, logs show exactly what calculations produced the unexpected result
Understanding algorithm behavior: Logs reveal how input values flow through calculations
Validating test expectations: Sometimes the expectation itself is wrong, and the logs help identify this
Performance analysis: Logs can reveal performance characteristics of algorithms
Documentation: Test logs serve as living examples of algorithm behavior

Transparent Test Execution

All tests must provide complete visibility into their execution:

Initial State: Display starting values of all test variables
Transformations: Show operations being performed with parameters
Expected vs Actual: Display both values with exact measurements
Rationale: Explain WHY the test passes or fails

Example of poor test output:

AA compliance testing: ✗

Example of transparent test output:

AA compliance testing:
  Testing: RGB(119,119,119) on RGB(255,255,255) background
  Calculated contrast: 4.48:1
  Required for AA: 4.5:1
  Result: FAIL ✗ (4.48 < 4.5, difference: 0.02)

This principle ensures:

Tests are self-documenting and educational
Failures can be diagnosed without additional debugging
The test suite serves as living documentation of the system's behavior
Anyone running tests understands exactly what is being validated
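
A report in the transparent style shown above could be produced along these lines. This is a minimal sketch that assumes the WCAG 2.x relative luminance and contrast ratio formulas; the rgb type and the aaReport, contrastRatio, and relativeLuminance helpers are hypothetical names for this example, not functions from the project's packages.

```go
package main

import (
	"fmt"
	"math"
)

// aaThreshold is the WCAG AA minimum contrast ratio for normal-size text.
const aaThreshold = 4.5

// rgb is a hypothetical 8-bit color type used only in this sketch.
type rgb struct{ R, G, B uint8 }

// linearize converts an 8-bit sRGB channel to linear light (WCAG 2.x definition).
func linearize(c uint8) float64 {
	v := float64(c) / 255
	if v <= 0.03928 {
		return v / 12.92
	}
	return math.Pow((v+0.055)/1.055, 2.4)
}

// relativeLuminance returns the WCAG relative luminance in [0, 1].
func relativeLuminance(c rgb) float64 {
	return 0.2126*linearize(c.R) + 0.7152*linearize(c.G) + 0.0722*linearize(c.B)
}

// contrastRatio returns the WCAG contrast ratio, from 1 (none) to 21 (black on white).
func contrastRatio(a, b rgb) float64 {
	la, lb := relativeLuminance(a), relativeLuminance(b)
	if la < lb {
		la, lb = lb, la
	}
	return (la + 0.05) / (lb + 0.05)
}

// aaReport builds a report with inputs, calculated value, threshold,
// and the reason for the verdict.
func aaReport(fg, bg rgb) (string, bool) {
	ratio := contrastRatio(fg, bg)
	pass := ratio >= aaThreshold
	verdict, cmp := "PASS ✓", ">="
	if !pass {
		verdict, cmp = "FAIL ✗", "<"
	}
	report := fmt.Sprintf(
		"AA compliance testing:\n"+
			"  Testing: RGB(%d,%d,%d) on RGB(%d,%d,%d) background\n"+
			"  Calculated contrast: %.2f:1\n"+
			"  Required for AA: %.1f:1\n"+
			"  Result: %s (%.2f %s %.1f, difference: %.2f)",
		fg.R, fg.G, fg.B, bg.R, bg.G, bg.B,
		ratio, aaThreshold,
		verdict, ratio, cmp, aaThreshold, math.Abs(ratio-aaThreshold))
	return report, pass
}

func main() {
	report, _ := aaReport(rgb{119, 119, 119}, rgb{255, 255, 255})
	fmt.Println(report)
}
```

Inside a test, the report string would be passed to t.Log before the assertion, so the same context appears whether the test passes or fails.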

Test Categories

1. Algorithm Validation

Test core algorithms with known inputs and expected outputs.

func TestAlgorithmBehavior(t *testing.T) {
    testCases := []struct {
        name     string
        input    InputType
        expected OutputType
    }{
        // Test cases with predictable results
    }
    
    for _, tc := range testCases {
        t.Run(tc.name, func(t *testing.T) {
            result := Algorithm(tc.input)
            t.Logf("Input: %v, expected: %v, actual: %v", tc.input, tc.expected, result)
            if !reflect.DeepEqual(result, tc.expected) {
                t.Errorf("Expected %v, got %v", tc.expected, result)
            }
        })
    }
}

2. Performance Verification

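Measure execution time with benchmarks and compare the reported results against requirements.

func BenchmarkSystemPerformance(b *testing.B) {
    input := prepareTestInput()

    // Exclude setup time from the measurement
    b.ResetTimer()
    for i := 0; i < b.N; i++ {
        result, err := ProcessInput(input)
        if err != nil {
            b.Fatalf("Processing failed: %v", err)
        }
        if result == nil {
            b.Fatal("Unexpected nil result")
        }
    }
}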
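
A benchmark reports timings but does not fail when a requirement is missed. Where a hard limit matters, a single timed run can be compared against a budget. The sketch below assumes a hypothetical measure helper and uses a sleep as a stand-in for the real workload; the 2s budget is only an example value.

```go
package main

import (
	"fmt"
	"time"
)

// measure runs fn once and reports the elapsed time and whether it
// stayed within the budget.
func measure(fn func(), budget time.Duration) (time.Duration, bool) {
	start := time.Now()
	fn()
	elapsed := time.Since(start)
	return elapsed, elapsed <= budget
}

func main() {
	// Stand-in workload; a real check would call the code under test.
	work := func() { time.Sleep(10 * time.Millisecond) }

	budget := 2 * time.Second
	elapsed, ok := measure(work, budget)
	fmt.Printf("Time: %v, budget: %v, within budget: %v\n",
		elapsed.Round(time.Millisecond), budget, ok)
}
```

Wall-clock checks like this are sensitive to machine load, so a generous budget and a fixed input size help keep them stable.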

3. Behavioral Validation

Verify system behavior with real-world scenarios.

func TestSystemBehavior(t *testing.T) {
    // Test with diverse inputs that represent actual usage
    testInputs := prepareRealWorldInputs()
    
    for _, input := range testInputs {
        t.Run(input.Name, func(t *testing.T) {
            result, err := ProcessInput(input.Data)
            if err != nil {
                t.Fatalf("Processing failed: %v", err)
            }
            t.Logf("Input: %s, result: %v", input.Name, result)

            if !validateOutput(result) {
                t.Errorf("Output validation failed for %s: %v", input.Name, result)
            }
        })
    }
}

4. Integration Tests

Test interactions between major components.

func TestComponentIntegration(t *testing.T) {
    // Setup test environment
    system := SetupTestSystem()
    defer system.Cleanup()
    
    // Test full workflow
    input := prepareTestInput()
    result, err := system.ProcessComplete(input)
    
    if err != nil {
        t.Fatalf("Integration failed: %v", err)
    }
    t.Logf("Integration result: %v", result)

    if !validateIntegrationResult(result) {
        t.Errorf("Integration result validation failed: %v", result)
    }
}

Test Organization

Package-Specific Test Structure

Tests are organized by package in the tests/ directory:

tests/
├── formats/                     # Unit tests for pkg/formats (complete)
├── chromatic/                   # Unit tests for pkg/chromatic (complete)
├── settings/                    # Unit tests for pkg/settings (complete)
├── loader/                      # Unit tests for pkg/loader (complete)
├── processor/                   # Unit tests for pkg/processor (complete)
├── images/                      # Real-world wallpaper test images
│   ├── README.md                # Image analysis documentation
│   ├── grayscale.jpeg           # Pure grayscale test image
│   ├── nebula.jpeg              # Complex space image
│   ├── night-city.jpeg          # High-detail urban scene
│   ├── mountains.jpeg           # Natural landscape
│   ├── abstract.jpeg            # Abstract art
│   └── *.jpg, *.png             # Additional test wallpapers
└── analyze-images/              # Test analysis utility
    ├── main.go                  # Image characteristic analysis tool
    └── README.md                # Utility documentation

Running Tests

Standard Go Testing

# Run all unit tests
go test ./tests/... -v

# Run specific package tests
go test ./tests/formats -v
go test ./tests/processor -v

# Run specific test functions
go test ./tests/formats -run TestParseHex -v
go test ./tests/processor -run TestStrategySelection -v

# Run with race detection
go test ./tests/... -race -v

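# Run with coverage of the pkg/ packages
# (-coverpkg is needed because the tests live outside the packages they exercise)
go test ./tests/... -v -cover -coverpkg=./pkg/...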

Code Validation

# Check for compilation errors and vet issues
go vet ./...

# Format code consistently
go fmt ./...

Utility Tools

# Generate test documentation
go run tests/analyze-images/main.go

Expected Outputs

Color Operations

RGB: #ff8040
HSL: 20°, 100%, 63%
RGB (converted back): #ff8040
✓ Roundtrip successful
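
The roundtrip above can be reproduced with the standard RGB/HSL formulas. The sketch below is a self-contained illustration and not the pkg/formats implementation: rgbToHSL and hslToRGB are hypothetical names, and hue is assumed to be in degrees with saturation and lightness in [0, 1].

```go
package main

import (
	"fmt"
	"math"
)

// rgbToHSL converts 8-bit RGB to hue in degrees [0, 360) and
// saturation and lightness in [0, 1].
func rgbToHSL(r, g, b uint8) (h, s, l float64) {
	rf, gf, bf := float64(r)/255, float64(g)/255, float64(b)/255
	hi := math.Max(rf, math.Max(gf, bf))
	lo := math.Min(rf, math.Min(gf, bf))
	l = (hi + lo) / 2
	d := hi - lo
	if d == 0 {
		return 0, 0, l // achromatic: hue is undefined, report 0
	}
	s = d / (1 - math.Abs(2*l-1))
	switch hi {
	case rf:
		h = (gf - bf) / d
	case gf:
		h = (bf-rf)/d + 2
	default:
		h = (rf-gf)/d + 4
	}
	h *= 60
	if h < 0 {
		h += 360
	}
	return h, s, l
}

// hslToRGB is the inverse conversion, rounding each channel to 8 bits.
func hslToRGB(h, s, l float64) (uint8, uint8, uint8) {
	c := (1 - math.Abs(2*l-1)) * s // chroma
	x := c * (1 - math.Abs(math.Mod(h/60, 2)-1))
	m := l - c/2
	var r, g, b float64
	switch {
	case h < 60:
		r, g, b = c, x, 0
	case h < 120:
		r, g, b = x, c, 0
	case h < 180:
		r, g, b = 0, c, x
	case h < 240:
		r, g, b = 0, x, c
	case h < 300:
		r, g, b = x, 0, c
	default:
		r, g, b = c, 0, x
	}
	to8 := func(v float64) uint8 { return uint8(math.Round((v + m) * 255)) }
	return to8(r), to8(g), to8(b)
}

func main() {
	r, g, b := uint8(0xff), uint8(0x80), uint8(0x40)
	h, s, l := rgbToHSL(r, g, b)
	r2, g2, b2 := hslToRGB(h, s, l)

	fmt.Printf("RGB: #%02x%02x%02x\n", r, g, b)
	fmt.Printf("HSL: %.0f°, %.0f%%, %.0f%%\n", h, s*100, l*100)
	fmt.Printf("RGB (converted back): #%02x%02x%02x\n", r2, g2, b2)
	if r == r2 && g == g2 && b == b2 {
		fmt.Println("✓ Roundtrip successful")
	}
}
```

The roundtrip keeps the unrounded HSL values. Converting back from the rounded percentages shown in the output would not always recover the original color.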

Extraction Performance

Loaded: 3840x2160 image
Colors extracted: 1,847,293
Reduced to: 16
Time: 1.23s
✓ Meets <2s requirement
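
The "Colors extracted" figure is a count of distinct colors. A minimal way to compute such a count with the standard image package is sketched below. The countUniqueColors and gradient helpers are hypothetical, the synthetic gradient stands in for a loaded wallpaper, and the reduction step is not shown because its algorithm is not described here.

```go
package main

import (
	"fmt"
	"image"
	"image/color"
)

// countUniqueColors returns the number of distinct RGBA values in img.
func countUniqueColors(img image.Image) int {
	seen := make(map[color.RGBA]struct{})
	b := img.Bounds()
	for y := b.Min.Y; y < b.Max.Y; y++ {
		for x := b.Min.X; x < b.Max.X; x++ {
			c := color.RGBAModel.Convert(img.At(x, y)).(color.RGBA)
			seen[c] = struct{}{}
		}
	}
	return len(seen)
}

// gradient builds a synthetic image in which every pixel has a distinct
// color, as long as w and h are at most 256.
func gradient(w, h int) *image.RGBA {
	img := image.NewRGBA(image.Rect(0, 0, w, h))
	for y := 0; y < h; y++ {
		for x := 0; x < w; x++ {
			img.SetRGBA(x, y, color.RGBA{R: uint8(x), G: uint8(y), B: 0, A: 255})
		}
	}
	return img
}

func main() {
	img := gradient(64, 32)
	b := img.Bounds()
	fmt.Printf("Loaded: %dx%d image\n", b.Dx(), b.Dy())
	fmt.Printf("Colors extracted: %d\n", countUniqueColors(img))
}
```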

Palette Generation

Monochromatic Palette:
  0: #ff8040 (base)
  1: #ffb380 (lighter)
  2: #cc5020 (darker)
  ...
✓ Single hue maintained

Success Criteria

Each test should verify:

Correctness: Output matches expected values
Performance: Execution time within targets
Stability: No crashes or panics
Determinism: Consistent results across runs
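
Stability and determinism can be checked with small generic helpers. The sketch below is one possible shape, with hypothetical names (deterministic, survives, sortedKeys) and a stand-in workload. Map iteration order is a common source of nondeterminism in Go, which is why the stand-in sorts its keys before returning them.

```go
package main

import (
	"fmt"
	"reflect"
	"sort"
)

// deterministic calls fn the given number of times and reports whether
// every result is deeply equal to the first one.
func deterministic[T any](runs int, fn func() T) bool {
	first := fn()
	for i := 1; i < runs; i++ {
		if !reflect.DeepEqual(first, fn()) {
			return false
		}
	}
	return true
}

// survives reports whether fn returns without panicking.
func survives(fn func()) (ok bool) {
	defer func() {
		if r := recover(); r != nil {
			ok = false
		}
	}()
	fn()
	return true
}

// sortedKeys is a stand-in for a pipeline step that must return the same
// ordering on every run, regardless of map iteration order.
func sortedKeys(counts map[string]int) []string {
	keys := make([]string, 0, len(counts))
	for k := range counts {
		keys = append(keys, k)
	}
	sort.Strings(keys)
	return keys
}

func main() {
	counts := map[string]int{"#ff8040": 3, "#ffb380": 2, "#cc5020": 1}

	stable := survives(func() { sortedKeys(counts) })
	same := deterministic(100, func() []string { return sortedKeys(counts) })

	fmt.Printf("Stability: no panic = %v\n", stable)
	fmt.Printf("Determinism: 100 identical runs = %v\n", same)
}
```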

Test Coverage Goals

Unit Test Coverage (Public Infrastructure Standard)

pkg/formats: Color space conversions and utilities (100% public API coverage achieved)
pkg/chromatic: Color theory and harmony functions (100% public API coverage achieved)
pkg/settings: Configuration loading and management (100% public API coverage achieved)
pkg/loader: Image I/O and validation (100% public API coverage achieved)
pkg/processor: Unified image processing and analysis (100% public API coverage achieved)
pkg/palette: Complete theme palette generation (100% public API coverage required)
pkg/theme: Template processing (100% public API coverage required)

Public API Coverage Verification

Each package must demonstrate:

All exported functions tested with realistic inputs and edge cases
All exported methods tested on their respective structs/interfaces
All exported constants validated for correct values and usage
All exported types validated for construction, manipulation, and serialization
All error conditions tested that can be returned from public APIs

This standard ensures that any code depending on these packages can rely on thoroughly tested public interfaces.
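
Error-path coverage usually takes the form of a table that pairs each invalid input with the error it should produce. The sketch below illustrates the pattern with a hypothetical parseHex function and errInvalidHex sentinel. Neither is the pkg/formats API; they exist only to make the example runnable.

```go
package main

import (
	"errors"
	"fmt"
	"strconv"
	"strings"
)

// errInvalidHex is a hypothetical sentinel error for this sketch.
var errInvalidHex = errors.New("invalid hex color")

// rgb is a hypothetical 8-bit color type used only in this sketch.
type rgb struct{ R, G, B uint8 }

// parseHex parses "#rrggbb" or "rrggbb" into an rgb value.
func parseHex(s string) (rgb, error) {
	s = strings.TrimPrefix(s, "#")
	if len(s) != 6 {
		return rgb{}, fmt.Errorf("%w: want 6 hex digits, got %q", errInvalidHex, s)
	}
	v, err := strconv.ParseUint(s, 16, 24)
	if err != nil {
		return rgb{}, fmt.Errorf("%w: %q is not hexadecimal", errInvalidHex, s)
	}
	return rgb{uint8(v >> 16), uint8(v >> 8), uint8(v)}, nil
}

func main() {
	cases := []struct {
		name    string
		input   string
		wantErr bool
	}{
		{"valid with hash", "#ff8040", false},
		{"valid without hash", "ff8040", false},
		{"too short", "#ff80", true},
		{"non-hex digits", "#gg8040", true},
		{"empty", "", true},
	}

	for _, tc := range cases {
		got, err := parseHex(tc.input)
		gotErr := errors.Is(err, errInvalidHex)
		status := "ok"
		if gotErr != tc.wantErr {
			status = "MISMATCH"
		}
		fmt.Printf("%-20s input=%q result=%v err=%v wantErr=%v [%s]\n",
			tc.name, tc.input, got, err, tc.wantErr, status)
	}
}
```

In a real test file the loop body would sit inside t.Run, with t.Logf for the diagnostic line and t.Errorf on a mismatch.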

Integration Test Coverage

End-to-end extraction pipeline with real images
Complete theme generation workflow
Settings and configuration integration
Profile detection with diverse image types

Benchmark Coverage

Color extraction performance with 4K images
Color space conversion efficiency
Profile detection speed
Memory usage optimization

References

Development approach: docs/development-methodology.md
Architecture details: docs/architecture.md
Progress tracking: PROJECT.md