gh-118138: Adds test for multithreaded writes to files. #119107

elistevens · 2024-05-17T02:01:35Z

Multithreaded writes to files (including standard calls to print() from threads) can result in assertion failures, missed dropped output, and (in non-debug builds) writing uninitialized memory out due to race conditions in _io_TextIOWrapper_write_impl and _textiowrapper_writeflush.

S 8000 ee: #118138

I've provided a change that fixes the original issue for me on python 3.10.12, ~~but my attempt to turn the repro script into a self-contained test case fails with that patch on both 3.10.12 and 3.14.0a0 (main as of today).~~ That patch makes self->pending_bytes always be a list, and calls PyList_SetSlice(pending, 0, pending_size_at_start, NULL) to remove entries after they've been processed. I strongly suspect that the patch only narrows the time windows for race conditions, not actually eliminate them. I also think that it might introduce the possibility of dropped or repeated output should those race conditions trigger, but I am uncertain if that's actually the case (and they don't happen in practice, according to the test script).

I am out of my depth and would appreciate a maintainer taking a look.

Thanks to @sterwill for the original repro script, which the test case here is a descendent of.

Issue: SSL session content bleeds into stdout with lots of threads #118138

Multithreaded writes to files (including standard calls to print() from threads) can result in assertion failures and potentially missed output due to race conditions in _io_TextIOWrapper_write_impl and _textiowrapper_writeflush. See: python#118138

ghost · 2024-05-17T02:01:37Z

The following commit authors need to sign the Contributor License Agreement:

wickedgrey@gmail.com

Click the button to sign:

bedevere-app · 2024-05-17T02:01:38Z

Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool.

If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.

siddhu032d · 2024-05-17T17:03:18Z

Result: ACCESS

bedevere-app · 2024-05-17T18:50:47Z

Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool.

If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.

elistevens · 2024-05-17T18:52:14Z

I believe that the issue with my patch was due to closing the file accepting output before calling executor.shutdown(wait=True). The modified test now passes with my change.

Edit: I also feel like there's probably a cleaner way to handle pending_bytes_count but given that I'm not even certain that this is the right way to solve the problem, I'm not investing too much thought into it.

elistevens · 2024-05-24T17:36:52Z

After deploying this patch at scale internally, I can confirm that there are still race condition issues; the test fails about 14% of the time in our CI cluster (interestingly it always passed while running locally, so it might be a load issue; I haven't investigated the exact nature of the failures yet). 14% is better than the 100% failure rate we were seeing prior, but it's still not fully fixed.

bedevere-app · 2024-05-30T05:34:13Z

Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool.

If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.

elistevens · 2024-05-30T18:37:29Z

I think that the test provided here might be useful (after being adapted to meet cpython standards), but the actual code change likely is not. On #119507 a much simpler patch achieves the same practical "it doesn't assert any longer" effect for my use case.

The issue is also being discussed here: https://discuss.python.org/t/bug-with-multi-threaded-print-write-causing-dropped-output-output-of-uninitialized-memory-assertion-failures/54538 where @pitrou suggests using a mutex to protect the internal state.

Should we close this PR?

ericsnowcurrently · 2024-06-04T14:59:37Z

@elistevens, thanks for taking the time you did on this PR. While there wasn't much feedback here, that DPO thread definitely served as a proxy, which is probably okay. Regardless, you clearly put a lot of thought into this and demonstrated a healthy dose of determination and skill. I hope there will be more opportunities for you to participate in Python core development. If you have any feedback, please let us know.

elistevens · 2024-06-04T17:05:15Z

Thanks, Eric! I'm not worried about the relative quiet on this PR; #119507 had a lively discussion and ultimately resulted in a much cleaner fix than the one I proposed here. I also recognize that maintainers are quite busy and I think should focus on the PRs that are likely to merge.

Thank you for all the work that you all do!

bedevere-app bot added the awaiting review label May 17, 2024

bedevere-app bot mentioned this pull request May 17, 2024

SSL session content bleeds into stdout with lots of threads #118138

Closed

elistevens added 2 commits May 17, 2024 11:41

Calls executor.shutdown before closing file.

997dbc2

Fixes race conditions from multithreaded writes.

3f05fd9

gvanrossum requested review from colesbury and ericsnowcurrently May 29, 2024 16:18

elistevens mentioned this pull request May 30, 2024

gh-119506: fix _io.TextIOWrapper.write() write during flush #119507

Merged

Update test_io_threading.py to not suffer from interleaved prints

d156d3c

elistevens closed this Jun 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-118138: Adds test for multithreaded writes to files. #119107

gh-118138: Adds test for multithreaded writes to files. #119107

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

gh-118138: Adds test for multithreaded writes to files. #119107

gh-118138: Adds test for multithreaded writes to files. #119107

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants