[Tracing] Implement initial OTLP traces weblog tests#6363

Open
zacharycmontoya wants to merge 32 commits into main from zach.montoya/weblog-traces-otlp

Conversation

@zacharycmontoya
Contributor

Motivation

We are seeing increased demand for exporting traces as OTLP from our DD SDKs (rather than the Datadog-proprietary MessagePack format), so we are prototyping and establishing requirements for generating OTLP traces payloads. This is the first in a series of PRs to establish clear expectations for what the generated OTLP traces and trace stats will look like.

Changes

  • Scenario: Adds a new APM_TRACING_OTLP scenario to test the weblog application with the configuration needed for the DD SDK to export traces using OTLP. This also adds an include_opentelemetry property to the EndToEndScenario to set up the OpenTelemetry interface.
  • Tests: Adds tests/otel/test_tracing_otlp.py::Test_Otel_Tracing_OTLP::test_tracing, which sends a request to the weblog app using weblog.get("/") and asserts properties of the OTLP trace payload.
  • Interfaces: Updates the OpenTelemetry interface with methods get_otel_spans and get_trace_stats to retrieve the OTLP payloads for test assertions.
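To make the shape of these assertions concrete, here is a minimal sketch of the kind of check the new test could perform on a captured OTLP/JSON span. The function and sample data are illustrative only; the real assertions live in tests/otel/test_tracing_otlp.py and use the interface methods named above.

```python
def assert_otlp_span(span: dict) -> None:
    """Assert basic properties an exported OTLP/JSON span is expected to carry."""
    # OTLP/JSON encodes trace and span IDs as hex strings (32 and 16 nibbles)
    assert len(span["traceId"]) == 32
    assert len(span["spanId"]) == 16
    assert span["name"], "span must have a non-empty name"

# Example OTLP/JSON-style span, as the proxy might capture it (illustrative data)
sample_span = {
    "traceId": "0123456789abcdef0123456789abcdef",
    "spanId": "0123456789abcdef",
    "name": "GET /",
}
assert_otlp_span(sample_span)
```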

Workflow

  1. ⚠️ Create your PR as draft ⚠️
  2. Work on your PR until the CI passes
  3. Mark it as ready for review
    • Test logic is modified? -> Get a review from the RFC owner.
    • Framework is modified, or used in a non-obvious way? -> Get a review from the R&P team.

🚀 Once your PR is reviewed and the CI is green, you can merge it!

🛟 #apm-shared-testing 🛟

Reviewer checklist

  • Anything other than tests/ or manifests/ is modified? I have approval from the R&P team
  • A docker base image is modified?
    • the relevant build-XXX-image label is present
  • A scenario is added, removed or renamed?

…ple.

Notably, this creates a new scenario APM_TRACING_OTLP to enable the environment variables needed to configure the SDK to export traces as OTLP.
@github-actions
Contributor

github-actions bot commented Feb 20, 2026

CODEOWNERS have been resolved as:

tests/otel/test_tracing_otlp.py                                         @DataDog/system-tests-core
utils/proxy/traces/otlp_v1.py                                           @DataDog/system-tests-core
.github/workflows/run-end-to-end.yml                                    @DataDog/system-tests-core
manifests/cpp_nginx.yml                                                 @DataDog/dd-trace-cpp
manifests/dotnet.yml                                                    @DataDog/apm-dotnet @DataDog/asm-dotnet
manifests/golang.yml                                                    @DataDog/dd-trace-go-guild
manifests/java.yml                                                      @DataDog/asm-java @DataDog/apm-java
manifests/nodejs.yml                                                    @DataDog/dd-trace-js
manifests/php.yml                                                       @DataDog/apm-php @DataDog/asm-php
manifests/python.yml                                                    @DataDog/apm-python @DataDog/asm-python
manifests/ruby.yml                                                      @DataDog/ruby-guild @DataDog/asm-ruby
manifests/rust.yml                                                      @DataDog/apm-rust
utils/_context/_scenarios/__init__.py                                   @DataDog/system-tests-core
utils/_context/_scenarios/endtoend.py                                   @DataDog/system-tests-core
utils/dd_constants.py                                                   @DataDog/system-tests-core
utils/interfaces/_open_telemetry.py                                     @DataDog/system-tests-core
utils/proxy/_deserializer.py                                            @DataDog/system-tests-core
utils/scripts/ci_orchestrators/workflow_data.py                         @DataDog/system-tests-core

@datadog-official

datadog-official bot commented Feb 20, 2026

⚠️ Tests


⚠️ Warnings

🧪 18 Tests failed

tests.ai_guard.test_ai_guard_sdk.Test_ContentParts.test_content_parts[flask-poc] from system_tests_suite
assert 500 == 200
 +  where 500 = HttpResponse(status_code:500, headers:{'Server': 'gunicorn', 'Date': 'Thu, 26 Mar 2026 18:01:09 GMT', 'Connection': 'k...: '103'}, text:{"error":"Authentication credentials required: provide DD_API_KEY and DD_APP_KEY","type":"ValueError"}\n).status_code
 +    where HttpResponse(status_code:500, headers:{'Server': 'gunicorn', 'Date': 'Thu, 26 Mar 2026 18:01:09 GMT', 'Connection': 'k...: '103'}, text:{"error":"Authentication credentials required: provide DD_API_KEY and DD_APP_KEY","type":"ValueError"}\n) = <tests.ai_guard.test_ai_guard_sdk.Test_ContentParts object at 0x7f80f0799cd0>.r

self = <tests.ai_guard.test_ai_guard_sdk.Test_ContentParts object at 0x7f80f0799cd0>

    def test_content_parts(self):
        """Test AI Guard evaluation with multi-modal content parts.
    
        Validates that prompts with content part format (text + image_url) are:
...
tests.ai_guard.test_ai_guard_sdk.Test_ContentParts.test_content_parts[flask-poc] from system_tests_suite
assert 500 == 200
 +  where 500 = HttpResponse(status_code:500, headers:{'Server': 'gunicorn', 'Date': 'Thu, 26 Mar 2026 18:05:17 GMT', 'Connection': 'k...: '103'}, text:{"error":"Authentication credentials required: provide DD_API_KEY and DD_APP_KEY","type":"ValueError"}\n).status_code
 +    where HttpResponse(status_code:500, headers:{'Server': 'gunicorn', 'Date': 'Thu, 26 Mar 2026 18:05:17 GMT', 'Connection': 'k...: '103'}, text:{"error":"Authentication credentials required: provide DD_API_KEY and DD_APP_KEY","type":"ValueError"}\n) = <tests.ai_guard.test_ai_guard_sdk.Test_ContentParts object at 0x7fbfded2ef00>.r

self = <tests.ai_guard.test_ai_guard_sdk.Test_ContentParts object at 0x7fbfded2ef00>

    def test_content_parts(self):
        """Test AI Guard evaluation with multi-modal content parts.
    
        Validates that prompts with content part format (text + image_url) are:
...
tests.ai_guard.test_ai_guard_sdk.Test_Evaluation.test_abort[flask-poc] from system_tests_suite
assert False

self = <tests.ai_guard.test_ai_guard_sdk.Test_Evaluation object at 0x7fbfded2e810>

    def test_abort(self):
        """Test ABORT action for tool call attempting to read /etc/passwd.
        Expects 403 when blocking enabled, 200 when disabled.
        Span should have action="ABORT" and target="tool" with tool_name.
        """
        for block, request in self.r.items():
...

ℹ️ Info

No other issues found

❄️ No new flaky tests detected

🔗 Commit SHA: 470194c

…v, since users do not need this feature for OTLP export to work
…ing specifically:

- At runtime determine if the request is JSON
- If JSON, look up proto field names by their camelCase representation. Otherwise, look up field names by their snake_case representation
- If JSON, assert that the 'traceId' and 'spanId' fields are case-insensitive hexadecimal strings, rather than base64-encoded strings
- If JSON, assert that enums (e.g. span.kind and span.status.code) are encoded using an integer, not a string representation of the enum value name
- Regardless of protocol, get the time before and after the test HTTP request is issued, and assert that the span's reported 'start_time_unix_nano' and 'end_time_unix_nano' fall in this range
- Regardless of protocol, make the 'http.method' and 'http.status_code' span attribute assertions more flexible by also testing against their stable OpenTelemetry HTTP equivalents of 'http.request.method' and 'http.response.status_code', respectively
Since JSON field names must be expressed in lowerCamelCase (according to the OpenTelemetry spec), we can consolidate our parsing and assertions on that style of field name.
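The protocol-dependent rules above can be sketched as a few small helpers. This is illustrative only; the actual lookup and validation logic lives in the proxy deserializer, and the helper names here are hypothetical.

```python
import re

def camel_case(snake: str) -> str:
    """Convert a proto snake_case field name to its OTLP/JSON lowerCamelCase form."""
    head, *rest = snake.split("_")
    return head + "".join(part.capitalize() for part in rest)

def get_field(message: dict, snake_name: str, *, is_json: bool):
    """Look up a field by lowerCamelCase (JSON) or snake_case (protobuf) name."""
    return message.get(camel_case(snake_name) if is_json else snake_name)

HEX_ID = re.compile(r"^[0-9a-fA-F]+$")

def is_hex_id(value: str, nibbles: int) -> bool:
    """OTLP/JSON encodes trace_id/span_id as case-insensitive hex, not base64."""
    return len(value) == nibbles and bool(HEX_ID.match(value))

# Illustrative OTLP/JSON span fragment
span = {"traceId": "0af7651916cd43dd8448eb211c80319c", "kind": 2}
assert get_field(span, "trace_id", is_json=True) == span["traceId"]
assert is_hex_id(span["traceId"], 32)
assert isinstance(get_field(span, "kind", is_json=True), int)  # enums as integers
```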
….py to the proxy, in utils/proxy/traces/otlp_v1.py
… explain why other scenarios are facing issues with the "Library not ready" messages
@zacharycmontoya zacharycmontoya requested review from a team as code owners March 20, 2026 22:03

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5f73ad4c0b


Comment on lines +65 to +66
yield data.get("request"), content, span
break # Skip to next span


P2: Return all matching OTLP spans for a request

Dropping out of the span loop after the first RID match causes get_otel_spans() to undercount spans when a single OTLP payload contains multiple matching spans in the same scopeSpans block (for example, server + framework spans that both carry the request user-agent tag). This can make tests/otel/test_tracing_otlp.py pass even when extra spans are exported, weakening the regression signal for this new scenario.

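One possible shape for the fix is to keep scanning the span list instead of breaking after the first RID match. This is a sketch only; the real structure of get_otel_spans() and the attribute it matches on may differ.

```python
def iter_matching_spans(payloads, rid):
    """Yield (request, content, span) for every span whose user-agent attribute
    carries the request ID, rather than stopping at the first match."""
    for data in payloads:
        content = data["response"]["content"]
        for resource_spans in content.get("resourceSpans", []):
            for scope_spans in resource_spans.get("scopeSpans", []):
                for span in scope_spans.get("spans", []):
                    attrs = {a["key"]: a["value"] for a in span.get("attributes", [])}
                    user_agent = attrs.get("http.user_agent", {}).get("stringValue", "")
                    if rid in user_agent:
                        yield data.get("request"), content, span  # no break: keep scanning

# Illustrative payload with two matching spans in the same scopeSpans block
payloads = [{
    "request": {"rid": "abc123"},
    "response": {"content": {"resourceSpans": [{"scopeSpans": [{"spans": [
        {"name": "server",
         "attributes": [{"key": "http.user_agent", "value": {"stringValue": "rid/abc123"}}]},
        {"name": "framework",
         "attributes": [{"key": "http.user_agent", "value": {"stringValue": "rid/abc123"}}]},
    ]}]}]}},
}]
assert len(list(iter_matching_spans(payloads, "abc123"))) == 2  # both spans reported
```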

@zacharycmontoya
Contributor Author

As far as I can tell, it's only the AI_GUARD scenario that is failing, not my new APM_TRACING_OTLP scenario. @cbeauchesne what do you think of these failures?

# Assert that the span fields match the expected values
span_start_time_ns = int(span["startTimeUnixNano"])
span_end_time_ns = int(span["endTimeUnixNano"])
assert span_start_time_ns >= self.start_time_ns
Contributor

@ida613 ida613 Mar 23, 2026


this line is flaky for me.

Claude says "self.start_time_ns is captured using time.time_ns() on the test host machine, while span_start_time_ns comes from the weblog container. If the container's clock is behind the host's clock (even by a small amount due to clock skew or Docker clock drift), the span start time will appear earlier than self.start_time_ns, failing the assertion".

Does that make sense, or should I look into my code? :3

Contributor Author


I'm getting this issue too, I'll look into fixing this!

Contributor Author


OK I think I've fixed it in the latest set of commits by making a call to the weblog container to get the time via Unix's date utility. Let me know if you're still experiencing this!
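The fix described above takes the reference timestamps from the weblog container's own clock (via Unix's date utility) so that both sides of the comparison use the same clock. A minimal sketch of that idea follows; container_exec and the helper names are hypothetical, not the actual implementation.

```python
def parse_date_ns(output: str) -> int:
    """Parse the output of `date +%s%N` (seconds concatenated with nanoseconds)."""
    return int(output.strip())

def span_within_window(span: dict, start_ns: int, end_ns: int) -> bool:
    """Check the span's reported times against a window taken from the same
    clock (the weblog container), avoiding host/container clock skew."""
    span_start = int(span["startTimeUnixNano"])
    span_end = int(span["endTimeUnixNano"])
    return start_ns <= span_start and span_end <= end_ns

# In the test this might look like (container_exec is hypothetical):
#   start_ns = parse_date_ns(container_exec("date +%s%N"))
start_ns = parse_date_ns("1774548069000000000\n")  # sample container output
span = {"startTimeUnixNano": "1774548069500000000",
        "endTimeUnixNano": "1774548070000000000"}
assert span_within_window(span, start_ns, start_ns + 2_000_000_000)
```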

@cbeauchesne
Collaborator

There are some recent changes on this scenario (#6445), @obordeau or @smola , do we expect to have some instability here?

@zacharycmontoya, while waiting for a response, you can rebase. If that does not fix the issue, I'll look closely, and if needed, force-merge on your call.

@obordeau
Contributor

obordeau commented Mar 23, 2026

There are some recent changes on this scenario (#6445), @obordeau or @smola , do we expect to have some instability here?

@zacharycmontoya, while waiting for a response, you can rebase. If that does not fix the issue, I'll look closely, and if needed, force-merge on your call.

I think the SDS cassette broke (the JSON looks badly formatted) with this PR, which made the AI Guard scenario fail. Looking into a fix :) cc @cbeauchesne

@obordeau
Contributor

Gotta log off; trying to fix it here: Fix AI Guard SDS cassette

@obordeau
Contributor

Just merged the fix (Fix AI Guard SDS system test); you can update your branch. Sorry for the inconvenience 🥲

if: always() && steps.build.outcome == 'success' && contains(inputs.scenarios, '"APM_TRACING_OTLP"')
run: ./run.sh APM_TRACING_OTLP
env:
DD_API_KEY: ${{ secrets.DD_API_KEY }}
Collaborator


In theory, we don't need this; you can use a fake key and it'll work the same (and if not, ping me and I'll fix that).

Contributor Author


Yes I think you're right. I'll either remove this entirely or use a fake key, stay tuned for updates 😎
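For reference, substituting a placeholder for the real secret in the workflow step quoted above might look like this. This is a sketch only; the exact step layout and placeholder value are assumptions, not the final change.

```yaml
if: always() && steps.build.outcome == 'success' && contains(inputs.scenarios, '"APM_TRACING_OTLP"')
run: ./run.sh APM_TRACING_OTLP
env:
  DD_API_KEY: fake-api-key   # a real key is not required for this scenario
```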

- Assert that resource attribute telemetry.sdk.name=datadog
- Assert that span attribute span.type=web
- Assert that span attribute operation.name is present
…on't implement the latest required span attributes
@zacharycmontoya zacharycmontoya force-pushed the zach.montoya/weblog-traces-otlp branch from 2538f35 to 470194c Compare March 26, 2026 16:38