
Add benchmark overview doc #1528

Open
cijothomas wants to merge 30 commits into open-telemetry:main from cijothomas:cijothomas/benchoverviewdoc1

Conversation

@cijothomas (Member):

This doc is an attempt at the schema for our Phase 2 performance summary, to be published when Phase 2 is completed. It defines the key scenarios (Idle, 100k Load, Saturation) and the comparative analysis with OTLP/Collector. I've put TBD for the actual numbers, as this is just an attempt to finalize what we want to have in an easy-to-consume format. The actual numbers will be filled in later. This can also be used to see if there are gaps in the perf test suites that we want to add.

The existing pages like https://open-telemetry.github.io/otel-arrow/benchmarks/nightly/backpressure/ are still retained. This doc will have distilled information from them.


codecov bot commented Dec 4, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 87.71%. Comparing base (cd545fa) to head (d87fddc).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1528      +/-   ##
==========================================
- Coverage   87.73%   87.71%   -0.02%     
==========================================
  Files         578      578              
  Lines      198334   198413      +79     
==========================================
+ Hits       174012   174044      +32     
- Misses      23796    23843      +47     
  Partials      526      526              
| Component | Coverage | Δ |
| --- | --- | --- |
| otap-dataflow | 89.74% <ø> | (-0.03%) ⬇️ |
| query_abstraction | 80.61% <ø> | (ø) |
| query_engine | 90.61% <ø> | (ø) |
| syslog_cef_receivers | ∅ <ø> | (∅) |
| otel-arrow-go | 52.44% <ø> | (ø) |
| quiver | 91.91% <ø> | (ø) |

@lquerel (Contributor) left a comment:


I really like this document.

I think we should also include the OTLP to OTLP scenario in the different sections since it will be one of the most common scenarios, at least in the beginning.

I also think we should add the wait_for_result mode in the otel-arrow section, because it provides a true end-to-end unified ack/nack mechanism, which I believe is not fully supported by the Go collector.

github-merge-queue bot pushed a commit that referenced this pull request Jan 14, 2026
Fixed one TODO!

#1528 - Still working
on this separately, which will include actual numbers for key scenarios,
so readers don't have to go through the graphs themselves!
@cijothomas cijothomas marked this pull request as ready for review January 17, 2026 00:54
@cijothomas cijothomas requested a review from a team as a code owner January 17, 2026 00:54

github-actions bot commented Feb 4, 2026

This pull request has been marked as stale due to lack of recent activity. It will be closed in 30 days if no further activity occurs. If this PR is still relevant, please comment or push new commits to keep it active.

@github-actions github-actions bot added the stale Not actively pursued label Feb 4, 2026
@jmacd jmacd removed the stale Not actively pursued label Feb 4, 2026
@github-actions

This pull request has been marked as stale due to lack of recent activity. It will be closed in 30 days if no further activity occurs. If this PR is still relevant, please comment or push new commits to keep it active.

@github-actions github-actions bot added the stale Not actively pursued label Feb 26, 2026
@jmacd jmacd removed the stale Not actively pursued label Mar 5, 2026
performance characteristics and efficient resource utilization across varying
load conditions. The engine uses a [thread-per-core
architecture](#thread-per-core-design) where resource consumption scales with
the number of configured cores.
Member:

> resource consumption scales with the number of configured cores

I found this a bit hard to interpret. Do you mean "the throughput scales with the number of configured CPU cores, almost in a linear fashion?"

Member Author:

ya it reads weird.. I'll update with a better wording
(I meant throughput scales linearly, but so does memory consumption)

All performance tests are executed on bare-metal compute instance with the
following specifications:

- **CPU**: 64 physical cores / 128 logical cores (x86-64 architecture)
Member:

Consider calling out the number of NUMA groups?

Member Author:

ya. We just confirmed that the CNCF machine has 2 sockets:
https://github.com/open-telemetry/otel-arrow/actions/runs/23278418373#summary-67686308469

So far no tests were run with the engine on more than 32 cores (and those were all in the same NUMA node). I have to see if we can actually do that, given that load-gen and fake-backend also need cores to run on.


### Test Environment

All performance tests are executed on bare-metal compute instance with the
Member:

Suggested change:
- All performance tests are executed on bare-metal compute instance with the
+ All performance tests are executed on a dedicated bare-metal compute instance with the

Member:

Not sure if this is a shared resource or dedicated, consider calling it out.

Member Author:

it is dedicated. Will update

Comment on lines +55 to +57
*Note: CPU usage is normalized (percentage of total system capacity). Memory
usage scales with core count due to the [thread-per-core
architecture](#thread-per-core-design).*
Member:

"Memory usage" could be confusing (cached, shared, non-paged pool, virtual vs. physical memory, ...); consider aligning with, and pointing to, the OTel system metrics semantic conventions: https://github.com/open-telemetry/semantic-conventions/blob/main/docs/system/system-metrics.md
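As an aside on the normalization mentioned in the quoted note, "percentage of total system capacity" can be sketched as below. This is a minimal illustration, not from the doc: the function name and the sample inputs (CPU-seconds consumed over a measurement window on the 128-logical-core test host) are hypothetical.

```python
def normalized_cpu_percent(cpu_seconds: float, wall_seconds: float, logical_cores: int) -> float:
    """CPU usage as a percentage of total system capacity.

    cpu_seconds: CPU time consumed by the process during the window.
    wall_seconds: length of the measurement window.
    logical_cores: total logical cores on the host (128 on the test machine).
    """
    return 100.0 * cpu_seconds / (wall_seconds * logical_cores)

# A process that burned 64 CPU-seconds over a 10-second window on a
# 128-logical-core host used 5% of total system capacity.
print(normalized_cpu_percent(64.0, 10.0, 128))  # 5.0
```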

This represents the optimal scenario where the dataflow engine operates with its
native protocol end-to-end, eliminating protocol conversion overhead.

##### Standard Load - OTLP -> OTLP (Standard Protocol)
Member:

Which OTLP? (gRPC, proto via HTTP 1.1, JSON, TLS enabled vs. not)

engine and the OpenTelemetry Collector, we use **Syslog (UDP/TCP)** as the
ingress protocol for both systems.

#### Rationale for Syslog-Based Comparison
Member:

Suggested change:
- #### Rationale for Syslog-Based Comparison
+ #### Rationale for Syslog-based Comparison


Scaling Efficiency = (Throughput at N cores) / (N * Single-core throughput)

### Architecture
Member:

It is a bit weird to have an architecture section in the benchmark document (unless it is talking about the benchmarking environment's own architecture).
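The scaling-efficiency formula quoted in this thread is straightforward to compute; a quick sketch follows. The throughput figures used here are made-up placeholders, since the doc's actual numbers are still TBD.

```python
def scaling_efficiency(throughput_n_cores: float, n_cores: int, single_core_throughput: float) -> float:
    """Scaling Efficiency = (Throughput at N cores) / (N * Single-core throughput).

    1.0 means perfectly linear scaling; lower values indicate per-core
    overhead as cores are added.
    """
    return throughput_n_cores / (n_cores * single_core_throughput)

# Hypothetical example: 100k items/s on one core, 750k items/s on eight
# cores -> 93.75% scaling efficiency.
print(scaling_efficiency(750_000, 8, 100_000))  # 0.9375
```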
