Awesome Performance Engineering [](https://awesome.re)

April 17, 2026 ยท View on GitHub

Awesome Performance Engineering

The discipline that ensures systems deliver fast, reliable, and cost-efficient experiences at any scale, combining observability and performance testing.

Contents

Indicators: โญ Widely adopted ยท ๐ŸŸข Active ยท ๐Ÿ”ต Cloud-native ยท ๐ŸŸ  Commercial ยท ๐Ÿš€ High performance

Observability

Metrics Collection & Time-Series Storage

  • Prometheus - โญ๐ŸŸข๐Ÿ”ต Pull-based cloud-native metrics platform with dimensional data model and PromQL query language.
  • VictoriaMetrics - โญ๐ŸŸข๐Ÿš€ High-performance, cost-efficient Prometheus-compatible TSDB with high-cardinality and long-retention support.
  • Thanos - โญ๐ŸŸข๐Ÿ”ต Long-term storage, global query view, and high availability layer for Prometheus via sidecar architecture.
  • Mimir - โญ๐ŸŸข๐Ÿ”ต๐Ÿš€ Horizontally scalable, multi-tenant Prometheus-compatible TSDB from Grafana Labs.
  • InfluxDB - ๐ŸŸข๐ŸŸ  Purpose-built time-series database with high write throughput and a Rust-based engine (v3).
  • Grafana Alloy - โญ๐ŸŸข๐Ÿ”ต OpenTelemetry-native telemetry collector supporting metrics, logs, traces, and profiles.
  • Telegraf - ๐ŸŸข Plugin-driven agent for collecting and reporting metrics with 300+ input plugins.
  • StatsD - Lightweight, UDP-based metrics aggregation daemon with broad application support.
  • Netdata - โญ๐ŸŸข๐Ÿš€ Real-time per-second monitoring with built-in anomaly detection and zero-configuration agent.

Distributed Tracing

  • OpenTelemetry - โญ๐ŸŸข๐Ÿ”ต Open standard for distributed tracing, metrics, and logs with language-specific SDKs and auto-instrumentation.
  • Jaeger - โญ๐ŸŸข๐Ÿ”ต CNCF graduated distributed tracing backend and UI, originally from Uber.
  • Grafana Tempo - โญ๐ŸŸข๐Ÿ”ต High-scale tracing backend requiring only object storage, with native Grafana integration.
  • Zipkin - ๐ŸŸข Pioneering distributed tracing system (Twitter, 2012) with a simple architecture.
  • Apache SkyWalking - โญ๐ŸŸข๐Ÿ”ต Observability platform with bytecode-injection-based tracing, popular in the Java ecosystem.
  • SigNoz - ๐ŸŸข๐Ÿ”ต Open-source OpenTelemetry-native observability platform with unified metrics, traces, and logs.
  • Pinpoint - Bytecode-instrumentation-based APM and tracing for Java and PHP with zero-code-change approach.

Log Management & Log Pipelines

  • Grafana Loki - โญ๐ŸŸข๐Ÿ”ต Label-based log aggregation that indexes metadata instead of content for cost-efficient storage at scale.
  • Fluent Bit - โญ๐ŸŸข๐Ÿ”ต๐Ÿš€ Lightweight, high-performance log processor and forwarder for edge and containerized environments.
  • Fluentd - ๐ŸŸข๐Ÿ”ต CNCF graduated unified logging layer with 1000+ plugins for complex routing.
  • Elasticsearch - โญ๐ŸŸข๐ŸŸ  Distributed search and analytics engine with powerful full-text search capabilities.
  • OpenSearch - ๐ŸŸข๐Ÿ”ต Community-driven, Apache-2.0-licensed fork of Elasticsearch, backed by AWS.
  • Logstash - Flexible log ingestion and transformation pipeline, part of the Elastic Stack.
  • Graylog - ๐ŸŸข๐ŸŸ  Centralized log management with built-in alerting and dashboards.
  • rsyslog - ๐ŸŸข๐Ÿš€ High-performance system logging daemon handling millions of messages per second.

Observability Pipelines and Telemetry Processing

  • OpenTelemetry Collector - โญ๐ŸŸข๐Ÿ”ต Standard telemetry processing pipeline with receivers, processors, and exporters for any signal.
  • Vector - ๐ŸŸข๐Ÿš€ End-to-end observability data routing and transformation with programmable VRL transforms.
  • Logstash - ETL-style processing for observability data with powerful filter plugins.
  • Cribl Stream - ๐ŸŸ ๐Ÿš€ Commercial observability pipeline for routing, reducing, and enriching telemetry data.

Visualization & Dashboards

  • Grafana - โญ๐ŸŸข Open-source observability dashboard platform supporting 100+ data sources with alerting and annotations.
  • Kibana - ๐ŸŸข๐ŸŸ  Visualization and log exploration for Elasticsearch and OpenSearch data.
  • OpenSearch Dashboards - ๐ŸŸข๐Ÿ”ต Open-source fork of Kibana for OpenSearch.
  • Apache Superset - ๐ŸŸข SQL-first analytics and dashboarding platform for ad-hoc data exploration.
  • Perses - ๐ŸŸข๐Ÿ”ต CNCF sandbox dashboards-as-code project with native PromQL and TraceQL support.

Profiling & Continuous Performance Analysis

  • Parca - โญ๐ŸŸข๐Ÿ”ต eBPF-based continuous profiling platform with zero-instrumentation and differential flame graphs (CNCF sandbox).
  • Grafana Pyroscope - โญ๐ŸŸข๐Ÿ”ต Continuous profiling with flame graph visualization and multi-language support.
  • async-profiler - ๐ŸŸข๐Ÿš€ Low-overhead JVM sampling profiler capturing CPU, allocation, and lock contention profiles.
  • perf - ๐Ÿš€ Linux kernel performance analysis tool with hardware counters, tracepoints, and sampling.
  • bpftrace - ๐ŸŸข๐Ÿš€ High-level tracing language for Linux eBPF with dynamic kernel and user-space tracing.
  • bcc (BPF Compiler Collection) - ๐ŸŸข๐Ÿš€ Toolkit for creating eBPF-based tracing programs with dozens of ready-to-use tools.
  • Grafana Beyla - ๐ŸŸข๐Ÿ”ต๐Ÿš€ eBPF-based zero-code auto-instrumentation generating RED metrics and distributed traces.
  • Perfetto - ๐ŸŸข System-wide tracing and profiling toolkit from Google for Android, Chrome, and general system analysis.

Alerting & Incident Response

  • Alertmanager - โญ๐ŸŸข Prometheus-native alert handling with grouping, silencing, inhibition, and routing.
  • Grafana OnCall - ๐ŸŸข๐Ÿ”ต Open-source on-call management and alert routing with native Grafana integration.
  • Keep - ๐ŸŸข๐Ÿ”ต Open-source alert management platform consolidating alerts from multiple sources.
  • Alerta - ๐ŸŸข Unified alert correlation and management across multiple monitoring systems.
  • PagerDuty - ๐ŸŸ  Industry-standard incident response and on-call management platform.
  • Opsgenie - ๐ŸŸ  Alerting and escalation platform, part of the Atlassian suite.
  • Rootly - ๐ŸŸ  AI-assisted incident management with automated timelines and postmortem generation.

Observability Platforms (Integrated)

  • Datadog - ๐ŸŸ  SaaS observability platform with AI-powered anomaly detection and root-cause analysis.
  • Dynatrace - ๐ŸŸ  AI-driven observability with automatic topology discovery and root-cause analysis (Davis AI).
  • New Relic - ๐ŸŸ  Developer-centric observability platform with NRQL query language and a generous free tier.
  • Splunk Observability - ๐ŸŸ  Observability built on Splunk's machine data analytics platform.
  • Elastic Observability - ๐ŸŸ  Observability solution built on the Elastic Stack with self-managed and cloud options.
  • Honeycomb - ๐ŸŸ  Observability platform for high-cardinality event data with BubbleUp automated correlation.
  • Grafana Cloud - ๐ŸŸ  Managed Grafana stack (Mimir, Loki, Tempo, Pyroscope) with a generous free tier.
  • Instana (IBM) - ๐ŸŸ  Automatic infrastructure and application discovery with real-time observability.
  • AppDynamics (Splunk/Cisco) - ๐ŸŸ  Enterprise APM with business transaction monitoring and code-level diagnostics.
  • Chronosphere - ๐ŸŸ  Cloud-native observability platform focused on metrics at scale with cost control.
  • Lightstep / ServiceNow Cloud Observability - ๐ŸŸ  OpenTelemetry-native observability platform, now part of ServiceNow.
  • Sematext - ๐ŸŸข๐ŸŸ  SaaS observability platform with OpenTelemetry-native support and topology discovery.

Monitoring Suites (Operations-Oriented)

  • Zabbix - ๐ŸŸข Enterprise-grade monitoring platform with agent-based and agentless monitoring.
  • Nagios - ๐ŸŸข Pioneering open-source check-based monitoring with an enormous plugin ecosystem.
  • Icinga - ๐ŸŸข Modern evolution of Nagios with improved APIs, configuration management, and scalability.
  • Checkmk - ๐ŸŸข๐ŸŸ  Infrastructure and application monitoring with auto-discovery for large environments.

Service Mesh Observability

  • Kiali - ๐ŸŸข๐Ÿ”ต Observability console for Istio with topology visualization and traffic flow analysis.
  • Linkerd Viz - ๐ŸŸข๐Ÿ”ต Built-in telemetry and dashboard for Linkerd service mesh.
  • Hubble - ๐ŸŸข๐Ÿ”ต๐Ÿš€ eBPF-powered network observability for Cilium with L3/L4/L7 flow visibility.

Database Observability

  • PMM (Percona Monitoring and Management) - ๐ŸŸข Open-source database performance monitoring for MySQL, PostgreSQL, and MongoDB.
  • pgwatch - ๐ŸŸข PostgreSQL-specific monitoring and metrics collection.
  • pg_stat_monitor - ๐ŸŸข PostgreSQL extension for enhanced query performance monitoring.
  • VividCortex / SolarWinds DPM - ๐ŸŸ  SaaS query-level database performance monitoring.
  • Datadog DBM - ๐ŸŸ  Database monitoring with query-level explain plans, wait event analysis, and trace correlation.

Real User Monitoring (RUM) & Frontend Observability

  • Sentry - ๐ŸŸข Error tracking and performance monitoring with session replay and Web Vitals.
  • Grafana Faro - ๐ŸŸข๐Ÿ”ต Open-source frontend observability SDK capturing errors, performance, and user events.
  • OpenTelemetry Browser SDK - ๐ŸŸข OTel instrumentation for web applications capturing page loads and resource timings.
  • LogRocket - ๐ŸŸ  Session replay combined with frontend performance monitoring.

AI-Augmented Observability

  • Dynatrace Davis AI - ๐ŸŸ  Deterministic and causal AI for topology-aware automatic root-cause analysis.
  • Datadog Watchdog - ๐ŸŸ  ML-driven anomaly detection across metrics, logs, and APM data.
  • Moogsoft - ๐ŸŸ  AIOps platform for alert correlation, noise reduction, and incident clustering.
  • New Relic AI - ๐ŸŸ  Applied intelligence with anomaly detection, incident correlation, and natural-language querying.
  • Honeycomb BubbleUp - ๐ŸŸ  Automated outlier correlation across high-cardinality dimensions.
  • Coroot - ๐ŸŸข๐Ÿ”ต Open-source eBPF-powered observability with automated service map discovery.

SLO Management

  • Sloth - ๐ŸŸข๐Ÿ”ต SLO generation for Prometheus with YAML definitions and multi-window multi-burn-rate alerts.
  • Pyrra - ๐ŸŸข๐Ÿ”ต Kubernetes-native SLO management generating Prometheus recording rules and alerts.
  • OpenSLO - ๐ŸŸข Open, vendor-neutral specification for defining SLOs as code.
  • Nobl9 - ๐ŸŸ  Enterprise SLO platform with unified tracking and error budget management.

Synthetic Monitoring

  • Checkly - ๐ŸŸข๐Ÿ”ต Monitoring as code for APIs and browsers with Playwright-based synthetic checks.
  • Grafana Synthetic Monitoring - ๐ŸŸข๐Ÿ”ต Probe-based multi-location synthetic monitoring integrated into Grafana Cloud.
  • Uptime Kuma - โญ๐ŸŸข Self-hosted monitoring tool with HTTP, TCP, DNS, and keyword checks.
  • Sematext - ๐ŸŸข๐ŸŸ  Playwright-based synthetic checks with CI/CD integration and SSL monitoring.

Legacy & Historical

  • Graphite - Pioneering time-series storage and graphing system with Whisper backend and Carbon collector.
  • Redash - SQL-first data visualization and collaboration connecting to many data sources.

Performance Testing

Load & Stress Testing

  • k6 - โญ๐ŸŸข๐Ÿ”ต Modern load testing tool with JavaScript ES6 scripting and native Prometheus/Grafana integration.
  • Gatling - โญ๐ŸŸข๐Ÿš€ High-performance load testing framework with Scala/Java/Kotlin DSL and detailed HTML reports.
  • Locust - โญ๐ŸŸข Python-based load testing framework defining user behavior in plain Python code.
  • Apache JMeter - โญ๐ŸŸข Load testing tool with GUI and extensive protocol support (HTTP, JDBC, JMS, LDAP, SOAP).
  • Artillery - ๐ŸŸข๐Ÿ”ต Node.js-based load testing toolkit with YAML scenarios supporting HTTP, WebSocket, and Socket.io.
  • NBomber - ๐ŸŸข Load testing framework for .NET with C#/F# scripting.
  • Tsung - ๐Ÿš€ Erlang-based distributed load testing tool handling massive concurrent connections across multiple protocols.
  • GoReplay (gor) - ๐ŸŸข๐Ÿš€ Capture and replay production HTTP traffic for load testing with real traffic patterns.
  • Anteon (formerly Ddosify) - ๐Ÿ”ต eBPF-based Kubernetes performance testing platform with distributed load generation.
  • Neoload - ๐ŸŸ  Enterprise performance testing platform with codeless and as-code options.
  • LoadRunner / OpenText - ๐ŸŸ  Enterprise performance testing platform with broad protocol support.

HTTP Benchmarking & Micro-Benchmarking

  • wrk2 - ๐Ÿš€ Constant-throughput HTTP benchmarking with accurate latency histograms that avoids coordinated omission.
  • wrk - ๐Ÿš€ HTTP benchmarking tool with Lua scripting for quick relative performance comparisons.
  • Vegeta - ๐ŸŸข๐Ÿš€ HTTP load testing tool with constant request rate mode and built-in plotting.
  • hey - ๐ŸŸข Simple HTTP load generator, successor to Apache Bench (ab).
  • oha - ๐ŸŸข๐Ÿš€ Rust-based HTTP load generator with real-time TUI.
  • bombardier - ๐ŸŸข๐Ÿš€ Fast, cross-platform HTTP benchmarking tool with detailed latency reporting.
  • hyperfoil - ๐ŸŸข๐Ÿ”ต๐Ÿš€ Distributed benchmarking framework designed to avoid coordinated omission.

API Testing & Contract Testing

  • Hurl - ๐ŸŸข Plain-text HTTP request runner for API testing in CI with assertions and chaining.
  • Postman - โญ๐ŸŸข๐ŸŸ  API development and testing platform with Newman CLI for CI/CD integration.
  • REST-assured - ๐ŸŸข Java DSL for testing REST APIs with fluent syntax and JUnit/TestNG integration.
  • Karate - ๐ŸŸข BDD-style API testing framework combining API testing, mocking, and performance testing.
  • Step CI - ๐ŸŸข Open-source YAML-based API testing and monitoring framework for CI/CD.
  • Pact - ๐ŸŸข Contract testing framework ensuring provider-consumer compatibility for HTTP APIs and messaging.
  • Dredd - API testing tool that validates implementations against OpenAPI and API Blueprint specifications.

gRPC & Protocol-Specific Testing

  • ghz - ๐ŸŸข๐Ÿš€ gRPC benchmarking and load testing tool supporting unary and streaming RPCs.
  • k6 + xk6-grpc - ๐ŸŸข๐Ÿ”ต k6 extension for scriptable gRPC load testing scenarios.
  • k6 + xk6-kafka - ๐ŸŸข๐Ÿ”ต k6 extension for Apache Kafka load testing at scale.
  • kafka-producer-perf-test / kafka-consumer-perf-test - ๐ŸŸข Built-in Kafka benchmarking tools for producer and consumer throughput.
  • RabbitMQ PerfTest - ๐ŸŸข Official RabbitMQ benchmarking tool for throughput and latency measurement.
  • k6 + xk6-websockets - ๐ŸŸข๐Ÿ”ต Built-in k6 WebSocket support for testing real-time and bidirectional protocols.

Browser & Frontend Performance

  • Lighthouse - โญ๐ŸŸข Google's auditing tool for performance, accessibility, and SEO with actionable scores.
  • WebPageTest - โญ๐ŸŸข Web performance analysis with filmstrip views, waterfall charts, and multi-location testing.
  • Playwright - โญ๐ŸŸข Browser automation framework with built-in performance timing APIs for Chromium, Firefox, and WebKit.
  • Sitespeed.io - ๐ŸŸข Open-source web performance monitoring integrating Lighthouse, WebPageTest, and Grafana dashboards.
  • Puppeteer - ๐ŸŸข Chrome DevTools Protocol API enabling programmatic access to performance traces and network interception.
  • Yellowlab Tools - ๐ŸŸข Frontend code quality and performance auditing for JavaScript, CSS, and rendering issues.
  • SpeedCurve - ๐ŸŸ  Continuous frontend performance monitoring with Core Web Vitals tracking and competitive benchmarking.

Service Virtualization and Mocking

  • WireMock - โญ๐ŸŸข๐Ÿ”ต HTTP mock server with request matching, stateful behavior, response templating, and fault injection.
  • Mountebank - ๐ŸŸข Multi-protocol service virtualization supporting HTTP, HTTPS, TCP, and SMTP.
  • Hoverfly - ๐ŸŸข๐Ÿ”ต Lightweight service virtualization with capture-and-replay mode for API simulation.
  • MockServer - ๐ŸŸข HTTP/HTTPS mock server with expectation-based matching and callback actions.
  • Microcks - ๐ŸŸข๐Ÿ”ต Kubernetes-native API mocking and testing importing OpenAPI, AsyncAPI, gRPC, and GraphQL contracts.

Synthetic Data Generation

  • Faker - โญ๐ŸŸข Realistic fake data generation for JavaScript/TypeScript with massive locale support.
  • DataFaker - ๐ŸŸข Modern Java data generation library with expression-based generation.
  • Mimesis - ๐ŸŸข๐Ÿš€ High-performance fake data generator for Python with strong locale support.
  • Neosync - ๐Ÿ”ต Open-source platform for anonymizing production data and generating synthetic datasets.

Database Performance Testing & Benchmarking

  • HammerDB - โญ๐ŸŸข Open-source database benchmarking tool supporting TPC-C and TPC-H workloads across major databases.
  • sysbench - โญ๐ŸŸข๐Ÿš€ Scriptable multi-threaded benchmark tool for OLTP, CPU, memory, and I/O tests.
  • pgbench - ๐ŸŸข PostgreSQL built-in benchmarking tool with custom scripts for workload simulation.
  • YCSB (Yahoo! Cloud Serving Benchmark) - โญ๐ŸŸข Framework for benchmarking NoSQL and NewSQL databases with standard workloads.
  • benchbase (formerly OLTPBench) - ๐ŸŸข Multi-DBMS benchmarking framework supporting TPC-C, TPC-H, and YCSB workloads.
  • mysqlslap - MySQL built-in load emulation client for quick benchmarks.

System & Infrastructure Benchmarking

  • fio - โญ๐ŸŸข๐Ÿš€ Reference I/O benchmarking tool with configurable workloads and multiple engines (libaio, io_uring).
  • stress-ng - ๐ŸŸข๐Ÿš€ System stress testing tool with 300+ methods covering CPU, memory, I/O, and network.
  • Phoronix Test Suite - ๐ŸŸข Comprehensive benchmarking platform with 500+ test profiles and result comparison.
  • iperf3 - โญ๐ŸŸข๐Ÿš€ Network bandwidth measurement tool for TCP/UDP throughput testing.

Chaos Engineering & Fault Injection

  • Litmus - โญ๐ŸŸข๐Ÿ”ต CNCF incubating Kubernetes chaos engineering platform with extensive experiment library.
  • Chaos Mesh - โญ๐ŸŸข๐Ÿ”ต CNCF incubating Kubernetes-native chaos platform with pod, network, and I/O fault injection.
  • Gremlin - ๐ŸŸ  Enterprise chaos engineering platform with managed experiments and safety controls.
  • Chaos Monkey - โญ๐ŸŸข Netflix's pioneering chaos tool that randomly terminates instances in production.
  • Pumba - ๐ŸŸข๐Ÿ”ต Chaos testing for Docker containers with network delay and packet loss injection.
  • Steadybit - ๐ŸŸ ๐Ÿ”ต Enterprise reliability platform combining chaos engineering with resilience validation.
  • AWS Fault Injection Service - ๐ŸŸ ๐Ÿ”ต Managed fault injection for AWS resources with native service integration.

Network Simulation & Traffic Shaping

  • tc (Traffic Control) - Linux kernel traffic shaping with netem qdisc for network emulation.
  • Comcast - CLI tool for simulating bad network conditions wrapping tc/pfctl.
  • Clumsy - ๐ŸŸข Windows network condition simulator for packet drop, lag, throttle, and reordering.

CI/CD Integration & Performance Gates

  • Gatling Enterprise - ๐ŸŸ  Managed Gatling execution with CI/CD integrations and historical comparison.
  • Lighthouse CI - ๐ŸŸข Run Lighthouse in CI with performance budgets, baseline comparison, and trend tracking.
  • Taurus - ๐ŸŸข YAML-based automation wrapper for JMeter, Gatling, Locust with unified reporting.

Results Analysis & Reporting

  • k6 HTML Report - ๐ŸŸข Standalone HTML report generator for k6 test results.
  • HdrHistogram - ๐ŸŸข๐Ÿš€ High Dynamic Range Histogram for accurate latency measurement capturing the full distribution.
  • Gatling Reports - ๐ŸŸข Built-in HTML reports with percentile distributions and response time series.
  • Apache JMeter Dashboard - ๐ŸŸข Built-in HTML dashboard generating APDEX scores and response time distributions.
  • Taurus Reporting - ๐ŸŸข Unified reporting across multiple load testing engines with BlazeMeter integration.

Cloud Provider Services

  • Azure App Testing - ๐ŸŸ ๐Ÿ”ต Microsoft's managed load testing service supporting JMeter and Locust with multi-region simulation.
  • AWS Distributed Load Testing - ๐ŸŸ ๐Ÿ”ต Distributed load testing architecture on AWS via CloudFormation supporting JMeter, k6, and Locust.

Developer-Centric Platforms

  • Grafana k6 Cloud - ๐ŸŸ  Managed k6 execution with multi-region load zones and real-time Grafana visualization.
  • Octoperf - ๐ŸŸ  SaaS performance testing platform built on JMeter with distributed load generation.

Enterprise Platforms

  • BlazeMeter - ๐ŸŸ  Cloud performance testing platform supporting JMeter, Gatling, Locust, Selenium, and Playwright.

Tools & Integrations