Clearer failure/errors in `failure_reason` & `failure_expanded` by pda · Pull Request #64 · buildkite/test-collector-python

pda · 2025-07-18T06:11:03Z

Currently, the entire failure from PyTest (PyTest calls it `longreprtext`) is set as `failure_reason`.

However, `failure_reason` is intended for a short one-line summary and gets truncated to ~1 KiB, which isn't great for `longreprtext`.

Buildkite also supports `failure_expanded`, for including detailed information and backtraces on errors.

So, derive both fields from the various shapes that `longrepr` can take, and include them in the uploaded JSON payload.

Before	After
Unclear `failure_reason` summaries:	Useful failure/error messages:
Truncated unclear expanded text which doesn't even mention the ZeroDivisionError:	Full expanded information and stack trace:

Copilot

Pull Request Overview

This PR improves the handling of test failure information in the Buildkite Test Analytics collector by providing clearer failure summaries and detailed expanded information. The changes separate the short failure reason (for UI display) from the detailed failure information (for expanded views).

Key changes:

Added a new failure_expanded field to TestResultFailed to store detailed failure information including stack traces
Implemented failure_reasons() function to parse different PyTest longrepr formats and extract appropriate failure summaries
Updated the plugin to use the new failure parsing logic instead of the raw longreprtext

Reviewed Changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 6 comments.

File	Description
`src/buildkite_test_collector/pytest_plugin/failure_reasons.py`	New module implementing failure reason parsing logic for different PyTest longrepr formats
`src/buildkite_test_collector/collector/payload.py`	Added failure_expanded field to TestResultFailed and updated JSON serialization
`src/buildkite_test_collector/pytest_plugin/buildkite_plugin.py`	Updated to use new failure parsing logic and added debug logging
`tests/buildkite_test_collector/pytest_plugin/test_plugin.py`	Added comprehensive tests for different failure scenarios
`tests/buildkite_test_collector/conftest.py`	Updated test fixture to include failure_expanded data
`tests/buildkite_test_collector/collector/test_payload.py`	Updated test assertions to verify failure_expanded field

Copilot · 2025-07-18T06:12:16Z

src/buildkite_test_collector/pytest_plugin/failure_reasons.py

+        case None:
+            return None, None
+
+        case str() as s:


[nitpick] The match case pattern 'str() as s' is redundant. Consider using 'case str(s):' or 'case s if isinstance(s, str):' for better readability.

Suggested change

-        case str() as s:
+        case str(s):

Copilot · 2025-07-18T06:12:16Z

src/buildkite_test_collector/pytest_plugin/failure_reasons.py

+        case str() as s:
+            lines = s.splitlines()
+            failure_reason = lines[0] if lines else s
+            return failure_reason, [{"expanded": lines[1:]}]


When there's only one line in the string, this will return an empty list for 'expanded'. Consider returning None for failure_expanded when there are no additional lines, similar to the single-line case at the end of the function.

Suggested change

-            return failure_reason, [{"expanded": lines[1:]}]
+            return failure_reason, [{"expanded": lines[1:]}] if len(lines) > 1 else (failure_reason, None)
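
A note on the suggested replacement line: a conditional expression binds more tightly than the tuple comma, so the line parses as `failure_reason, (A if cond else B)` and the single-line case yields a nested tuple rather than `(failure_reason, None)`. The sketch below shows an explicit form next to the suggestion as written. The names `split_str_longrepr` and `suggested_as_written` are hypothetical and are not taken from the PR.

```python
from typing import Dict, List, Optional, Tuple

Expanded = Optional[List[Dict[str, List[str]]]]


def split_str_longrepr(s: str) -> Tuple[str, Expanded]:
    """Split a string longrepr into (failure_reason, failure_expanded).

    Hypothetical helper: the first line becomes the short summary and the
    remaining lines, if any, become the expanded detail.
    """
    lines = s.splitlines()
    failure_reason = lines[0] if lines else s
    if len(lines) > 1:
        return failure_reason, [{"expanded": lines[1:]}]
    # No extra lines: report None instead of an empty "expanded" list.
    return failure_reason, None


def suggested_as_written(s: str):
    """The one-line suggestion, kept verbatim to show how it parses."""
    lines = s.splitlines()
    failure_reason = lines[0] if lines else s
    return failure_reason, [{"expanded": lines[1:]}] if len(lines) > 1 else (failure_reason, None)
```

With a single-line input, `suggested_as_written("boom")` returns `("boom", ("boom", None))`, while `split_str_longrepr("boom")` returns `("boom", None)`.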

gchan

LGTM, some minor questions

gchan · 2025-07-20T23:23:43Z

src/buildkite_test_collector/collector/payload.py

 class TestResultFailed:
    """Represents a failed test result"""
    failure_reason: Optional[str]
+    failure_expanded: Optional[Iterable[Mapping[str, Iterable[str]]]] = None


Not a huge fan of this data structure but understand it's because of the existing API :)
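
For readers unfamiliar with the shape being discussed, here is a minimal sketch of how a value of that type might serialize. It assumes only the two fields shown in the diff; the class name `FailedResultSketch` and its `as_json` method are hypothetical stand-ins, not the collector's actual `TestResultFailed` implementation.

```python
import json
from dataclasses import dataclass
from typing import Any, Dict, Iterable, Mapping, Optional


@dataclass(frozen=True)
class FailedResultSketch:
    """Hypothetical stand-in mirroring the two fields in the diff above."""
    failure_reason: Optional[str]
    failure_expanded: Optional[Iterable[Mapping[str, Iterable[str]]]] = None

    def as_json(self) -> Dict[str, Any]:
        """Build a JSON-ready dict, omitting fields that are None."""
        attrs: Dict[str, Any] = {}
        if self.failure_reason is not None:
            attrs["failure_reason"] = self.failure_reason
        if self.failure_expanded is not None:
            # Materialize the iterables so json.dumps can encode them.
            attrs["failure_expanded"] = [
                {key: list(lines) for key, lines in block.items()}
                for block in self.failure_expanded
            ]
        return attrs


example = FailedResultSketch(
    failure_reason="ZeroDivisionError: division by zero",
    failure_expanded=[{"expanded": ("first detail line", "second detail line")}],
)
encoded = json.dumps(example.as_json())
```

The outer list holds one mapping per block, and each mapping carries a list of lines under a key such as `"expanded"`, which matches the `[{"expanded": lines[1:]}]` value seen in the `failure_reasons.py` diff.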

gchan · 2025-07-20T23:30:09Z

src/buildkite_test_collector/pytest_plugin/failure_reasons.py

+    if isinstance(longrepr, ExceptionRepr) and longrepr.reprcrash is not None:
+        return _handle_exception_repr_longrepr(longrepr)
+
+    return _handle_default_longrepr(longrepr)


I'm trying to understand Pytest's longrepr.

Do we need to handle when it's a TerminalRepr?

That can be handled by the default _ handler, which just uses str(longrepr) and comes out okay.
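
A rough sketch of the dispatch being discussed, written without `match` so it also runs on older interpreters. Several things here are assumptions: the function name `failure_reasons_sketch` is hypothetical, the exception case is detected by duck typing on a `reprcrash` attribute with a `message` (the PR itself checks `isinstance(longrepr, ExceptionRepr)`), and what the default branch does beyond calling `str(longrepr)` is a guess.

```python
from typing import Any, Dict, List, Optional, Tuple

Expanded = Optional[List[Dict[str, List[str]]]]


def _expanded(lines: List[str]) -> Expanded:
    """Wrap detail lines in the expanded structure, or None when empty."""
    return [{"expanded": lines}] if lines else None


def failure_reasons_sketch(longrepr: Any) -> Tuple[Optional[str], Expanded]:
    """Map a longrepr-like value to (failure_reason, failure_expanded)."""
    if longrepr is None:
        return None, None

    if isinstance(longrepr, str):
        lines = longrepr.splitlines()
        return (lines[0] if lines else longrepr), _expanded(lines[1:])

    if isinstance(longrepr, tuple) and len(longrepr) == 3:
        # (path, line, msg) shape: use the message as the summary.
        _path, _lineno, msg = longrepr
        return msg, None

    # Assumption: duck-type the exception case instead of importing
    # pytest's ExceptionRepr, so this sketch has no pytest dependency.
    reprcrash = getattr(longrepr, "reprcrash", None)
    if reprcrash is not None:
        return reprcrash.message, _expanded(str(longrepr).splitlines())

    # Default: anything else (for example a TerminalRepr) goes through str().
    lines = str(longrepr).splitlines()
    return (lines[0] if lines else None), _expanded(lines[1:])
```

Under these assumptions, any object with a usable `__str__` falls through to the last branch, which is why a `TerminalRepr` needs no dedicated case.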

gchan · 2025-07-20T23:39:11Z

src/buildkite_test_collector/pytest_plugin/failure_reasons.py

+    msg: str
+) -> tuple[str | None, Iterable[Mapping[str, Iterable[str]]] | None]:
+    """Handle tuple longrepr case (path, line, msg)"""
+    failure_reason = msg


Is there any chance the tuple could have a very long message?

Possibly. We truncate it server side in that case.

I think this code path might only happen when the longrepr is for a skipped test, not a failed test, but I'm not entirely sure.

pda · 2025-07-21T01:24:56Z

src/buildkite_test_collector/pytest_plugin/failure_reasons.py

@@ -0,0 +1,113 @@
+"""Buildkite Test Engine PyTest failure reason mapping"""
+
+from __future__ import annotations


This is just for Python 3.9 which we'll drop pretty soon, it's EOL in a couple of months.
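
For context, the `str | None` union syntax in an annotation is evaluated when the function is defined, and on interpreters older than 3.10 that evaluation raises `TypeError`. The future import postpones evaluation (PEP 563), so the annotation is stored as a string instead. A small sketch of the effect follows; the function `handle` is hypothetical, and `exec` is used only so the future import sits at the top of its own source regardless of where the snippet is pasted.

```python
import textwrap

# The module source under test; the future import must be its first statement.
SOURCE = textwrap.dedent(
    '''
    from __future__ import annotations

    def handle(msg: str) -> tuple[str | None, list[str] | None]:
        return msg, None
    '''
)

namespace = {}
exec(SOURCE, namespace)  # defining handle() does not evaluate the annotations

handle = namespace["handle"]
# With postponed evaluation the annotation is kept as plain text.
return_annotation = handle.__annotations__["return"]
```

Because the annotation is never evaluated at definition time, the same source also loads on interpreters where `str | None` is not a valid runtime expression.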

pda requested review from a team and Copilot July 18, 2025 06:11

Copilot AI reviewed Jul 18, 2025


pda mentioned this pull request Jul 18, 2025

Handle failure in setup eg fixtures #65

Merged

pda added 3 commits July 18, 2025 21:05

Avoid Python 3.10+ match in failure_reasons.py

0f8b6ab

CI is still testing in Python down to 3.8

failure_reason.py: from __future__ import annotations for Python 3.9

0fdc666

failure_reasons: avoid empty expanded list

ee29a28

gchan approved these changes Jul 20, 2025


pda commented Jul 21, 2025


pda merged commit 99c7d05 into main Jul 21, 2025
11 checks passed

pda deleted the failure-expanded branch July 21, 2025 01:26

pda mentioned this pull request Jul 21, 2025

[release] v1.1.0 #70

Merged
