Coverage test sys.settrace & improve coverage #17538

jepler · 2025-06-20T18:30:36Z

Summary

I noticed that py/profile.c was not covered at all according to codecov, because it was never enabled in the unix port.

I enabled it, and then added a new test to cover some previously un-covered lines.
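
For readers unfamiliar with the feature being covered, here is a minimal sketch of a sys.settrace callback. It is not the test added in this PR; the names tracer, f, and events are illustrative, and the sketch is written against CPython's sys.settrace behavior.

```python
import sys

events = []


def tracer(frame, event, arg):
    # Record the event kind, the function name, and the current line number.
    events.append((event, frame.f_code.co_name, frame.f_lineno))
    # Returning the tracer keeps line-level tracing active inside this frame.
    return tracer


def f():
    x = 1
    return x + 1


# Remember any tracer that is already installed (gettrace may be absent on
# some implementations, so fall back to None).
previous = sys.gettrace() if hasattr(sys, "gettrace") else None
sys.settrace(tracer)
try:
    result = f()
finally:
    # Always restore the earlier state so later code is not traced.
    sys.settrace(previous)

for event, name, lineno in events:
    print(event, name, lineno)
```

The callback receives the frame object on every event, which is why printing a frame's repr from inside a tracer exercises the frame-printing code discussed below.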

Testing

I ran the tests locally and reviewed the coverage of profile.c.

Note: I suspect one or more Windows x64 builds will fail due to improved coverage in this PR, exposing a latent problem.

I wrote a rambling post in the discussions area about passing qstrs to mp_printf. The TL;DR is that there appears to be undefined behavior when a "small qstr" (uint16_t) argument such as type->name is printed via the %q format specifier, which retrieves a ssize_t value. The specific behavior I saw with x64 msvc on godbolt looked like it would trigger on one specific format string in the core: the repr of frame objects. That's what spurred me to ensure that this code was covered.

If the failure does occur, I'll try to write it up properly as an issue.

github-actions · 2025-06-20T18:42:09Z

Code size report:

   bare-arm:    +0 +0.000% 
minimal x86:    +0 +0.000% 
   unix x64:    +0 +0.000% standard
      stm32:    +0 +0.000% PYBV10
     mimxrt:    +0 +0.000% TEENSY40
        rp2:    +0 +0.000% RPI_PICO_W
       samd:    +0 +0.000% ADAFRUIT_ITSYBITSY_M4_EXPRESS
  qemu rv32:    +0 +0.000% VIRT_RV32

jepler · 2025-06-20T19:00:52Z

While there are some other errors going on that also leave me scratching my head, the "expected error" happens on Windows CI, specifically "build-vs (x64, Debug, dev, 2017)".

Run python run-tests.py --print-failures
--- D:/a/micropython/micropython/tests/results\misc_sys_settrace_cov.py.exp	2025-06-20 18:41:30.413152600 +0000
+++ D:/a/micropython/micropython/tests/results\misc_sys_settrace_cov.py.out	2025-06-20 18:41:30.413152600 +0000
@@ -1,2 +1,2 @@
-FRAME <frame at 0x\[0-9a-f\]\+, file 'misc/sys_settrace_repr\.py', line \\d\+, code f>
-LASTI \\d\+
+FRAME <frame at 0xe06df4a0, file 'D:/a/micropython/micropython/tests/misc/sys_settrace_cov.py', line 20, code Assertion failed: *q < pool->len, file d:\a\micropython\micropython\py\qstr.c, line 198
+CRASH
\ No newline at end of file

FAILURE D:/a/micropython/micropython/tests/results\misc_sys_settrace_cov.py

jepler · 2025-06-20T19:11:22Z

I can't come up with any good theory as to what's making those unix builds fail. It's also a surprise that it's only ONE of the msvc x64 builds.

Josverl · 2025-06-20T19:28:32Z

While working on extending sys.settrace() with additional capabilities, I also noticed that the debug builds are not tested across ports. I first noticed errors popping up when building for an ESP32 locally.

Is there a way to determine coverage locally?

codecov · 2025-06-21T07:21:37Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.23%. Comparing base (2ab06b6) to head (dbd87a9).
Report is 8 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #17538      +/-   ##
==========================================
- Coverage   98.56%   98.23%   -0.34%     
==========================================
  Files         169      171       +2     
  Lines       21948    22140     +192     
==========================================
+ Hits        21634    21749     +115     
- Misses        314      391      +77

jepler · 2025-06-21T07:24:18Z

Yes, you can go through the steps of the unix coverage build locally. In ports/unix, run `make VARIANT=coverage test_gcov`. The build output will finish with a summary of coverage, and a large number of ".gcov" files will be written.

gcov files show coverage on a line-by-line basis, with "#####" indicating no coverage of the line, "-" indicating no coverage possible, and a number indicating how many times the line was executed. The format is described in man gcov.
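
As a rough illustration of that format, the sketch below tallies the three kinds of lines in the text of one .gcov file. The function name summarize_gcov and the sample text are hypothetical; the handling of "=====" (lines reachable only through exceptional paths) is an assumption based on the gcov manual rather than something stated in this thread.

```python
def summarize_gcov(text):
    """Return (executed, unexecuted, non_executable) line counts."""
    executed = unexecuted = non_executable = 0
    for raw in text.splitlines():
        # Each record looks like "<count>:<line number>:<source text>".
        parts = raw.split(":", 2)
        if len(parts) < 3:
            continue  # branch/call summaries and other non-record lines
        count = parts[0].strip()
        if count == "-":
            non_executable += 1
        elif count in ("#####", "====="):
            unexecuted += 1
        else:
            executed += 1
    return executed, unexecuted, non_executable


# Hypothetical sample in the layout gcov writes.
sample = "\n".join(
    [
        "        -:    0:Source:example.c",
        "        -:    1:/* a comment line */",
        "        3:    2:int f(void) {",
        "    #####:    3:    return 1;",
        "        -:    4:",
    ]
)

print(summarize_gcov(sample))
```

Running something like this over the generated files is one way to rank files by uncovered lines, although the summary printed at the end of the build already gives the totals.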

jepler · 2025-06-21T08:13:47Z

The nanbox builds were failing like this:

-FRAME <frame at 0x\[0-9a-f\]\+, file '\.\*/sys_settrace_cov.py', line \\d\+, code f>
+FRAME <frame at 0xf4d622e0, file 'misc/sys_settrace_cov.py', line 16, code >

The reason is that an mp_uint_t is passed to a %d format specifier. But in nanbox, sizeof(mp_uint_t) > sizeof(int), so again there's the UB problem that arises when the promoted argument type does not match the va_arg type.
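
One plausible way this produces an empty function name can be modelled in a few lines. This is only a model: it assumes a calling convention where variadic arguments sit back to back in memory in little-endian order, and the values are hypothetical. Real undefined behavior is not required to act this way.

```python
import struct

# Hypothetical argument values, for illustration only.
LINE_NUMBER = 16        # pushed by the caller as a 64-bit unsigned integer
FUNC_NAME_ID = 0x1234   # pushed by the caller as a 32-bit identifier

# Model of the caller: arguments laid out consecutively, no padding.
arg_area = struct.pack("<QI", LINE_NUMBER, FUNC_NAME_ID)

# Model of a reader that wrongly assumes both arguments are 32 bits wide.
# It consumes the low half of the line number, then the high half.
line_read, name_read = struct.unpack_from("<iI", arg_area, 0)

# Model of a reader that uses the widths the caller actually pushed.
line_ok, name_ok = struct.unpack_from("<QI", arg_area, 0)

print(line_read, name_read)   # the name slot is really the upper 32 bits
print(line_ok, hex(name_ok))
```

Under these assumptions the line number still prints correctly, while the following specifier receives the zero upper half of the 64-bit value instead of the intended identifier, which is consistent with the "line 16, code >" output above.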

jepler · 2025-06-21T14:50:50Z

This PR is now failing only because it decreases the overall coverage percentage. In reality, it adds coverage to previously uncovered lines: the Codecov report above shows hits rising by 115 (from 21634 to 21749).
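
The apparent contradiction (more lines covered, lower percentage) follows from the arithmetic. A small sketch using the figures from the Codecov diff earlier in this thread, taken at face value; the variable names are illustrative:

```python
# Figures from the "Coverage Diff" table in the Codecov report.
base_hits, base_lines = 21634, 21948
head_hits, head_lines = 21749, 22140

base_pct = 100 * base_hits / base_lines
head_pct = 100 * head_hits / head_lines

newly_counted = head_lines - base_lines   # lines that now count toward coverage
newly_hit = head_hits - base_hits         # additional lines executed by tests
new_code_pct = 100 * newly_hit / newly_counted

print(f"base {base_pct:.3f}%  head {head_pct:.3f}%")
print(f"{newly_hit} of {newly_counted} newly counted lines hit ({new_code_pct:.1f}%)")
```

Because the newly counted lines are covered at a lower rate than the existing project average, the overall percentage drops even though the absolute number of covered lines goes up.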

jepler · 2025-06-22T06:45:54Z

I've updated the PR so that settrace is only enabled in the coverage build, not the standard build or other variants.

We should consider whether this makes the standard+settrace CI build redundant. I can remove that if desired. The standard+stackless+settrace CI job is probably still useful.

dpgeorge · 2025-07-01T05:30:30Z

> I've updated the PR so that settrace is only enabled in the coverage build, not the standard build or other variants.

Yes, that's a good idea. As mentioned, it really decreases performance.

OTOH, it's important to get coverage of the settrace code, so enabling it on the coverage build makes sense.

> We should consider whether this makes the standard+settrace CI build redundant. I can remove that if desired. The standard+stackless+settrace CI job is probably still useful.

Yes, we can now simplify those CI jobs.

These settrace jobs were added in 0b85b5b. Maybe we can replace those two with just a test for stackless mode? Or at least just remove the standard+settrace CI and keep standard+stackless+settrace (as you suggest).

Josverl · 2025-07-01T14:47:29Z

Regarding the work on micropython-lib/python-ecosys/debugpy, I have been contemplating a few changes as well.

Moving the misc/sys_settrace*.* tests to their own folder: tests/sys_settrace
There I have been updating and adding tests to cover the new functionality that @andrewleech and I have been working on.

That makes it simpler to run that subset during development.

Debug variant?
Also, if you have guidance on how to test the combination of a sys.settrace firmware together with micropython-lib/python-ecosys/debugpy, that would be appreciated. Creating a separate 'settrace/debug' variant seems the simplest approach to me.

dpgeorge · 2025-07-03T06:21:29Z

> moving the misc/sys_settrace*.* tests to their own folder: tests/sys_settrace

That sounds reasonable, although not in scope for this PR.

> Also, if you have guidance on how to test the combination of a sys.settrace firmware together with micropython-lib/python-ecosys/debugpy, that would be appreciated. Creating a separate 'settrace/debug' variant seems the simplest approach to me.

This PR will enable settrace on the unix coverage build, so once this PR is merged you could use that.

dpgeorge · 2025-07-03T06:23:12Z

@jepler there's a failing test here on the CI which I think is due to the code taking longer to run now that settrace is enabled. It looks like the combination of an already long-running test + settrace + ubsan means the test now takes longer than the 30s timeout.

I guess the simplest way to fix this is just increase TEST_TIMEOUT in tests/run-tests.py.

jepler · 2025-07-03T11:54:13Z

> I guess the simplest way to fix this is just increase TEST_TIMEOUT in tests/run-tests.py.

I'll increase the timeout globally if that's ok. Or I could pipe through a timeout multiplier used only for 1 build.

dpgeorge · 2025-07-03T11:59:09Z

> Or I could pipe through a timeout multiplier used only for 1 build.

Might be worth adding a command line argument for this timeout? And then increase it just for this build?

Because if it's increased globally, eg to 60 seconds, then that's a long time to wait to know that a small test failed, if it locks up.

jepler · 2025-07-03T19:14:05Z

I added an environment variable override and set it from ci.sh when running coverage tests. I tested that locally, by making sure that a timeout of 2 produced a handful of failures.
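
A minimal sketch of what such an override could look like. This is not the code from the PR: the variable name MICROPY_TEST_TIMEOUT and the helper get_test_timeout are assumptions made for illustration, and only the 30-second default comes from this thread.

```python
import os

DEFAULT_TEST_TIMEOUT = 30.0  # seconds; the default mentioned in this thread


def get_test_timeout(environ=os.environ, name="MICROPY_TEST_TIMEOUT"):
    """Return the per-test timeout, allowing an environment override.

    The variable name is hypothetical; adjust it to whatever the test
    runner actually reads.
    """
    raw = environ.get(name)
    if raw is None or raw == "":
        return DEFAULT_TEST_TIMEOUT
    timeout = float(raw)  # raises ValueError on non-numeric input
    if timeout <= 0:
        raise ValueError(f"{name} must be positive, got {raw!r}")
    return timeout


print(get_test_timeout({}))
print(get_test_timeout({"MICROPY_TEST_TIMEOUT": "60"}))
```

Keeping the default short and raising it only for the slow build preserves fast feedback when a small test locks up, which is the concern raised above.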

If the fields added for MICROPY_PY_SYS_SETTRACE are not initialized properly, their value in a thread is indeterminate. In particular, if the callback is not NULL, it will be invoked as a function.

Signed-off-by: Jeff Epler <jepler@gmail.com>

The argument corresponding to a `%q` specifier must be of type `qstr`, not a narrower type like `int16_t`. Not ensuring this caused an assertion error on one Windows x64 build.

The argument corresponding to a `%d` specifier must be of type `int`, not a potentially-wider type like `mp_uint_t`. Not ensuring this prevented the function name from being printed on the unix nanbox build.

Signed-off-by: Jeff Epler <jepler@gmail.com>

This removes the need for the `sys_settrace_features.py.exp` file. This means that people testing locally will also need to install Python 3.11 in some way, such as with pyenv or uv, and use it during `make VARIANT=coverage test`, or they will get failures.

When using python from GitHub actions/setup-python, pip3 can't be wrapped by sudo, because this invokes the operating system python instead.

Signed-off-by: Jeff Epler <jepler@gmail.com>

dpgeorge · 2025-07-04T14:03:38Z

Let me know when this is ready.

dpgeorge

Looking good now!

dpgeorge · 2025-07-05T14:11:52Z

Squashed the fix-up commits and merged in fcfed6a through a9801f9

jepler · 2025-07-05T15:27:37Z

Thank you!

jepler force-pushed the profile-coverage branch from 74fe1d6 to feb117d Compare June 20, 2025 18:33

jepler force-pushed the profile-coverage branch from feb117d to 986acd2 Compare June 20, 2025 18:57

jepler force-pushed the profile-coverage branch 2 times, most recently from d134cdb to c4d5f09 Compare June 21, 2025 06:33

jepler mentioned this pull request Jun 21, 2025

Undefined behavior with %q format specifier on LP64 and I16LP32 targets #17540

Open

jepler force-pushed the profile-coverage branch from c4d5f09 to 21996b2 Compare June 21, 2025 07:17

jepler force-pushed the profile-coverage branch from e10831b to 74279e3 Compare June 21, 2025 08:13

dlech reviewed Jun 21, 2025

ports/unix/mpconfigport.h (outdated, resolved)

jepler force-pushed the profile-coverage branch from 74279e3 to ec1036f Compare June 22, 2025 06:13

dpgeorge added the port-unix label Jul 1, 2025

dpgeorge reviewed Jul 4, 2025

.github/workflows/ports_unix.yml (resolved)

py/runtime.h (outdated, resolved)

py/objcode.h (outdated, resolved)

jepler mentioned this pull request Jul 4, 2025

[RFC] Add compile-time checking of mp_printf format strings #17556

Draft

jepler force-pushed the profile-coverage branch from 6c71538 to 81b9d03 Compare July 4, 2025 13:18

coverage: Enable sys.settrace.

989b0be

Signed-off-by: Jeff Epler <jepler@gmail.com>

jepler added 6 commits July 4, 2025 14:20

tests: Improve test coverage of py/profile.c.

cf94486

Signed-off-by: Jeff Epler <jepler@gmail.com>

sys_settrace_features: Add expected output.

f25c1cd

This side-steps the problem where the output does not match when the host python version is 3.12.x (as it is on unix ci today).

Signed-off-by: Jeff Epler <jepler@gmail.com>

coverage: Initialize more code_state fields.

73f03f2

When MICROPY_PY_SYS_SETTRACE was enabled, a crash was seen in the qemu_mips build. It seems likely that this was due to these added fields not being initialized.

Signed-off-by: Jeff Epler <jepler@gmail.com>

ci: Remove the "settrace" build.

1ab8fb0

This becomes redundant when the main coverage build includes settrace.

Signed-off-by: Jeff Epler <jepler@gmail.com>

jepler force-pushed the profile-coverage branch 2 times, most recently from 3fd7cea to b3d1321 Compare July 4, 2025 13:23

jepler added 4 commits July 4, 2025 14:33

tests: Increase test timeout in coverage build.

5b00f0f

The additional overhead of the settrace profiler means that the `aes_stress.py` test was running too slowly on GitHub CI. Double the timeout to 60 seconds.

Signed-off-by: Jeff Epler <jepler@gmail.com>

objcode: Fully parenthesize macro expansion.

8d1205c

As requested in code review.

Signed-off-by: Jeff Epler <jepler@gmail.com>

runtime: Initialize bool field idiomatically.

dbd87a9

Signed-off-by: Jeff Epler <jepler@gmail.com>

jepler force-pushed the profile-coverage branch from b3d1321 to dbd87a9 Compare July 4, 2025 13:33

jepler requested a review from dpgeorge July 4, 2025 14:16

dpgeorge approved these changes Jul 5, 2025

dpgeorge closed this Jul 5, 2025
