Possible Async Bug in MultiPart Form Parser #2927
-
@Kludex sorry for the ping, but I have a few people telling me this could be a security issue, since it is a denial-of-service attack vector (it stops the server from accepting new connections). I think that's a bit premature but would appreciate your thoughts. FYI I also tested out a solution using multipart as discussed in this issue. I got a nice performance improvement but still had to use …
-
This doesn't seem like proof of improvement.
No, it doesn't make sense to run CPU-bound code that doesn't release the GIL in a threadpool.
-
Could you please expand on how that's not showing improvement? Being able to process ~2x the number of concurrent requests with the same resources seems significant to me.
Upon further investigation, I think it's less about running it in a threadpool and more about giving the scheduler a place to break in and handle other, more pressing work (like accepting new connections); the sketch below illustrates what I mean. How else would you explain the doubling in handled requests? I'm going to try continuing my replacement of …
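A self-contained toy of that scheduling break, using `asyncio.to_thread` as a stand-in for starlette's `run_in_threadpool` (the timings and names here are illustrative, not from my benchmark):

```python
import asyncio
import time

def cpu_work() -> None:
    time.sleep(0.5)  # stand-in for parser.write() on a large chunk

async def heartbeat() -> None:
    # Plays the role of the event loop accepting new connections.
    for _ in range(4):
        print("loop is alive")
        await asyncio.sleep(0.25)

async def main() -> None:
    hb = asyncio.create_task(heartbeat())
    await asyncio.sleep(0)             # let the heartbeat task start
    cpu_work()                         # synchronous: the heartbeat stalls here
    await asyncio.to_thread(cpu_work)  # awaited: heartbeats keep printing
    await hb

asyncio.run(main())
```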
-
Hello!
I think I might've discovered a possible issue with how starlette parses multi-part form data. It calls python-multipart's `parser.write()` on the main asyncio thread, which can cause issues in a high-concurrency, large-payload environment. I've constructed a minimal reproducible example and walked through what I think a possible solution is. Any feedback would be greatly appreciated, or let me know if I should move this to an issue.
Context
I am working on a data processing service which uses fastapi/starlette as its HTTP framework. The data comes in varying sizes but usually maxes out around 8-10 MB, so to get data into the service I use an `UploadFile` object supplied by the multi-part form parser.
Example
My code can be simplified to the following example:
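A minimal sketch of what that looks like; the route path and handler name here are illustrative rather than taken from my service:

```python
from fastapi import FastAPI, UploadFile

app = FastAPI()

@app.post("/process")
async def process(file: UploadFile):
    # By the time this handler runs, starlette's multipart parser has
    # already consumed the request body into a SpooledTemporaryFile.
    data = await file.read()
    return {"received_bytes": len(data)}
```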
Test Scenario
As a stress test on my HTTP server, I decided to send 1,000 concurrent requests, each with its own 10 MB file. I understand that 1k concurrent requests is way too high for a single fastapi process, and ~700 requests outright failed initially, but the rest of the results were interesting. The client side is sketched below.
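Roughly, the client looked like the following sketch (assuming httpx; the URL and payload are placeholders, not the original benchmark code):

```python
import asyncio
import httpx

PAYLOAD = b"x" * (10 * 1024 * 1024)  # one 10 MB body per request

async def upload(client: httpx.AsyncClient) -> httpx.Response:
    return await client.post(
        "http://localhost:8000/process",
        files={"file": ("data.bin", PAYLOAD)},
    )

async def main() -> None:
    # Raise the connection limit so all requests really run concurrently.
    limits = httpx.Limits(max_connections=1000)
    async with httpx.AsyncClient(timeout=None, limits=limits) as client:
        results = await asyncio.gather(
            *(upload(client) for _ in range(1000)),
            return_exceptions=True,
        )
    failures = sum(1 for r in results if isinstance(r, Exception))
    print(f"{failures} of {len(results)} requests failed")

asyncio.run(main())
```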
Here is a graph with the total request time in orange and the client connect time in blue. As expected, the total time continues to increase with the number of requests; however, I was surprised to see that the connect time was increasing as well.

After profiling the test with `viztracer`, I discovered that the reason client connects were slowing down was that the `MultiPartParser.parse()` function was holding onto the main thread:

*(viztracer profile screenshot)*

IMO this is incorrect and should not be happening. The main loop should be left free to respond to other requests while this parsing happens in the background. Starlette actually already uses `run_in_threadpool` to write multi-part file data into the spooled temporary file, but the core `parser.write()` happens on the main thread, roughly as in the sketch below.
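For reference, a heavily simplified sketch of what `MultiPartParser.parse()` does today; the names follow starlette/formparsers.py, but this is not the verbatim code:

```python
async for chunk in self.stream:
    # CPU-bound multipart parsing runs directly on the event loop:
    parser.write(chunk)
    # Only the resulting file I/O is deferred to a worker thread
    # (UploadFile.write uses run_in_threadpool once rolled to disk):
    for part, data in self._file_parts_to_write:
        await part.file.write(data)
    self._file_parts_to_write.clear()
```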
Proposed solution
My proposal is to call the entire `parser.write(chunk)` in a threadpool instead of just the file writing; this way the main thread is free to do other work. I tested out my changes in this draft PR and got significantly better results: we were able to get double the responses, and the average client connect time was significantly reduced. The only reason some requests still failed is that we ran out of threads for scheduling, though that can be controlled via the anyio thread pool count. The change amounts to something like the sketch below.
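Concretely, the change is on the order of this sketch (again simplified; the real diff lives in the draft PR):

```python
from starlette.concurrency import run_in_threadpool

async for chunk in self.stream:
    # Run the whole parser callback chain in a worker thread so the
    # event loop stays free to accept new connections:
    await run_in_threadpool(parser.write, chunk)
```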
Summary
I think we should merge the changes in this PR to allow starlette to process more multi-part forms in parallel. It will slightly slow down small-form parsing, but I think the scalability and predictability tradeoff is worth it. I'd love to hear any feedback or other possible solutions!