The strict server does streaming json decodes. The client does not, because it exposes a Body field, so it has to buffer the response. But this generates unnecessary allocations if (as I suspect is common) the Body field is unused.
I don't know what the design trade-offs are here, and how you'd want to expose it, but it'd be nice to be able to generate code optimized for being lightweight, which doesn't buffer all the reads.