Proposal: Transport-agnostic resumable streams #543

jonathanhefner · 2025-05-17T20:12:24Z

jonathanhefner
May 17, 2025
Collaborator

Pre-submission Checklist

I have verified this would not be more appropriate as a feature request in a specific repository
I have searched existing discussions to avoid duplicates

Your Idea

Motivation

Stream resumability as currently defined faces multiple issues:

Limited to the Streamable HTTP transport.
- This increases the implementation burden for new or custom transports.
- This makes it more difficult to develop MCP features without accounting for (or relying on) the nuances of a particular transport.
Requires at least one event in order to use Last-Event-ID.
- A server cannot close a connection (e.g. when operating asynchronously) before sending an event, because there is no way for the client to resume the stream without an event ID.
- If a connection is lost before an event is sent, there is no way for the client to resume the stream. This is especially problematic because the spec says that disconnection should not be interpreted as the client cancelling its request.
No provision for expiration / resource reclamation.
- The spec does not indicate whether a server can delete previously missed events once they have been replayed on a resumed stream. The spec could explicitly allow this, but resuming is done via HTTP GET, and HTTP GET requests should be read-only.
- There is no mechanism for a server to communicate that it will expire a stream after a certain duration of client inactivity.
No mechanism to check the status of a stream.
- The only way for a client to check whether a disconnected stream has a pending result is for the client to resume the stream.

Proposal

To address these issues, I'd like to propose a transport-agnostic resumable streams. The basic elements are:

stream/begin JSON-RPC notification
- When a client sends a request to a server, the server sends a stream/begin JSON-RPC notification with a requestId and streamId. requestId is the request's id property. streamId is a generated unique ID for the stream.
- stream/begin also includes a resumeInterval param with min and max properties.
  - resumeInterval.min indicates the minimum amount of time the client should wait before resuming the stream after a disconnect.
  - resumeInterval.max indicates the maximum amount of time the client may wait before resuming the stream after a disconnect. If this limit is exceeded, the stream is considered "abandoned", and the server may delete all data and cancel all work related to the stream. A value of 0 indicates that the stream is not resumable, and that the work will be cancelled upon disconnect.
- After stream/begin is sent, both the server and client may disconnect at will. (We will need to amend this part of the spec.)
stream/end JSON-RPC notification
- When a server has emitted all JSON-RPC messages for a stream, it emits a stream/end JSON-RPC notification with the streamId to mark the end of the stream.
stream/resume JSON-RPC notification (or request?)
- A client may send a stream/resume JSON-RPC notification with a streamId to resume that stream on the current connection. The server should then send all JSON-RPC messages for that stream that were not previously sent to the client.
  - Note: An alternative approach would be to make stream/resume a JSON-RPC request instead of a JSON-RPC notification. As such, the server would immediately send a JSON-RPC response either confirming the resume or reporting an error for an invalid streamId. However, this would inject an extra message into the stream, which might slightly complicate replay mechanisms.
- If the streamId is invalid, the server should immediately send a stream/end notification.
- Clients should not resume streams that are ongoing on other connections; doing so results in undefined behavior.
stream/poll JSON-RPC request
- A client may send a stream/poll JSON-RPC request with a streamId, and the server should respond with the status of the stream, including whether there are pending JSON-RPC messages and whether the stream has ended.
- Upon a stream/poll request, the server should reset the "abandoned" timer for the stream. However, the server may choose not to do so based on internal limits for stream age.

Server-to-client requests

Handling server-to-client requests (e.g. sampling requests) can be tricky, depending on the transport. There are two points that I think are important:

The spec should permit, and the SDKs should encourage, that servers not hold onto resources while waiting for a response from a client.
When a client sends a response, the stream should resume on that connection, though at-will disconnects are still allowed.

Following from those points, I think unidirectional transports such as HTTP should disconnect when the server sends a request to the client during a stream, including during a stream initiated by stream/resume.

Also, following from the 2nd point, I think server-to-client requests should embed the stream ID in the request ID. That way, if a response is received on a new connection, the stream ID can be extracted from the response (which will have the same ID as the request), and the stream can resume on that connection.

Though not part of this proposal, I have a suggestion for an SDK API that leverages resumable streams to improve scalability in the face of server-to-client requests.

Batching and multiplexing

Support for JSON-RPC batching was recently removed from the spec; however, when using a unidirectional transport such as HTTP, there are a couple of compelling use cases related to streams:

Batching would allow the client to send multiple stream/poll requests on a single connection.
Batching would allow the client to send multiple stream/resume notifications on a single connection to multiplex streams.

Webhooks

I see webhooks as more of a transport-level concern, so webhooks are not part of this proposal. However, webhook support could be based on resumable streams. For a detailed example, see #543 (comment).

Asynchronous long-running tasks

This proposal is a generalization of my proposed approach to asynchronous long-running tasks, and supersedes that proposal. I think the basic elements in this proposal provide a solid base for handling asynchronous long-running tasks and can be extended as necessary with features like batching and webhooks.

Scope

jonathanhefner · 2025-05-19T18:14:37Z

jonathanhefner
May 19, 2025
Collaborator Author

Universal subscription mechanism

Another thought I had is that transport-agnostic resumable streams provide an alternative way of modeling subscriptions.

For example, currently, in order to subscribe to resource updates, the client sends a resources/subscribe JSON-RPC request, and the server sends back an empty JSON-RPC response. Then the server starts sending notifications/resources/updated JSON-RPC notifications. To unsubscribe, the client sends a resources/unsubscribe JSON-RPC request, and the server sends back an empty JSON-RPC response.

The transport is left to handle the case where a client might disconnect while the server continues to emit notifications/resources/updated notifications. Currently, only the Streamable HTTP transport handles this case, and the client must be sure to begin the GET /mcp SSE stream before subscribing to the resource (otherwise, it could miss a notification).

An alternative way of modeling resource subscriptions would be for the client to send a resources/stream JSON-RPC request, and for the server to send a stream/begin JSON-RPC notification but no JSON-RPC response. The server could then send notifications/resources/updated notifications on that stream until either the client cancels the request (via notifications/cancelled), or the stream is abandoned.

The stream would be resumable regardless of transport, guaranteeing that the client sees all update notifications. There would also be a clear retention policy for update notifications, allowing servers to reclaim resources as appropriate.

2 replies

jonathanhefner Jun 1, 2025
Collaborator Author

Tools could also use this strategy to act as event publishers.

Related discussions:

jonathanhefner Jun 9, 2025
Collaborator Author

This strategy also allows the server to notify the client when a subscription is no longer valid.

For example, currently, if a client subscribes to a file resource, and that file is subsequently deleted, the server can either quietly do nothing or send a notifications/resources/updated notification. If the server sends a notifications/resources/updated notification, the client will likely try to read the deleted resource.

When using a transport-agnostic resumable stream for the subscription, the server could instead send a stream/end notification, which should not trigger the client to read the resource.

Relatedly, it might be worth adding a reason param to the stream/end notification.

jonathanhefner · 2025-05-25T18:41:20Z

jonathanhefner
May 25, 2025
Collaborator Author

Statefulness

During the last Hosting WG meeting, there were some questions about statefulness and how this proposal would work with the Streamable HTTP transport, so I wanted to share a high-level sketch. The sketch is not prescriptive; it's just one possible implementation. The steps are as follows:

A tool emits a JSON-RPC message as part of a transport-agnostic stream.
The message plus an increasing ID are added to a buffer.
- A convenient format for the ID would be ${streamId}/${timestamp}/${localCounter}. For example, "00112233-4455-6677-8899-aabbccddeeff/1798783200/001".
The transport layer iterates through the buffer, and sends each message to the client.
When the client ACKs (acknowledges) a message, it is removed from the buffer.
- In-order delivery via lower-level protocols is assumed.
If there is a disconnection, all items in the buffer are transferred to a persistent queue¹, and all future messages emitted by the tool are added to the queue instead of the buffer.
When the client resumes the transport-agnostic stream, the transport layer will resume sending messages from the buffer, but the buffer will be populated from the queue rather than directly from the tool.
- Future messages from the tool will continue to be added to the queue. If the queue is in an external store, servers can be horizontally scaled behind a load balancer, and the client can connect to any server instance to resume the stream.

Streaming HTTP transport compatibility

Because the Streamable HTTP transport is unidrectional, it does not use explicit ACKs. Instead, Last-Event-ID acts as an implicit ACK. Therefore, the Streamable HTTP transport handles steps 4-6 slightly differently:

Step 4 is technically the same, but no messages are removed from the buffer because there are no explicit ACKs.
Step 5 is technically the same — all items in the buffer are transferred to the queue.
In step 6, before resuming sending, the Streamable HTTP transport discards messages where the ID is less than or equal to Last-Event-ID.

Technically, a dequeue so that steps 5 and 6 can be repeated multiple times. ↩

0 replies

pantanurag555 · 2025-05-30T17:54:31Z

pantanurag555
May 30, 2025

I might have missed something but I didn't gather whether this is going to be an approach that applies to all tools (all tools stream regardless of whether the response is created at once or over time) or only to select tools where streaming responses comes more naturally.

If this idea allows for the latter, it could also benefit from tool annotations that allow the client to identify streaming vs non-streaming tools: #489

1 reply

jonathanhefner Jun 1, 2025
Collaborator Author

Ideally, this would apply to all tools, and the SDKs would transparently handle stream management on both the server and the client. That way tool authors don't have to make a guess about how long their tool will run for all inputs and environments. So, with regards to a tool annotation, I don't think it would make sense.

jonathanhefner · 2025-06-01T14:29:27Z

jonathanhefner
Jun 1, 2025
Collaborator Author

Webhooks example

Webhooks aren't part of this proposal, but during the last Hosting WG meeting, I mentioned an example of how this proposal could integrate with webhooks:

User installs client app on their phone, and connects the client app to a remote MCP server.
Client sends a tools/call request with a webhook URL in a _meta param.
Server sends a stream/begin message to start the resumable stream.
Due to user action, the client app suspends and disconnects from the server.
Server can continue to send messages, such as progress notifications, via the resumable stream.
Server sends the tool call result via the resumable stream.
Server sends a stream/end message via the resumable stream.
Server calls the webhook with the streamId of the resumable stream.
Webhook host sends a push notification with the streamId to the client app.
Client app resumes the stream via a stream/resume request with the streamId.
Client receives all previously undelivered messages and processes them normally, in the same way as if there had been no disconnection.

Steps 3 and 5-8 can be handled transparently by the SDKs, so the tool author does not need to do anything extra to enable this flow.

0 replies

jonathanhefner · 2025-06-09T17:54:40Z

jonathanhefner
Jun 9, 2025
Collaborator Author

Backward compatibility and opting in

This proposal does not require that tools be written differently than they are now. The SDKs should be able to handle beginning, resuming, and ending streams in a transparent way (and only when permitted by the negotiated protocol version).

The only required change to server code should be configuring resumeInterval. The SDKs should provide an API to configure resumeInterval on a per-tool basis, and likely an API to configure global default values as well.

If there are no configured values for resumeInterval, the SDK should not begin resumable streams. Thus, resumable streams will be opt-in for the server. (However, due to the unpredictable nature of distributed systems, I think we should recommend that all servers opt in to resumable streams.)

4 replies

davemssavage Jun 10, 2025

Hi @jonathanhefner we had an offline conversation, I'm bringing the remaining points here for visibility. I think my residual concerns with this proposal are that it is a breaking change for clients (I agree prototected by protocol negotation) and that clients only know AFTER they've sent a request that the stream is setup. I'll elaborate both points below and propose a variation of the above that I think solves the latter problem and potentially the former by making this opt in if so desired on the client side, whilst also keeping the essense of this proposal intact.

As per Proposal: Transport-agnostic resumable streams #543 (comment) I agree protocol negotiation makes this opt-in, so if the choice of the specification group is to introduce a breaking change then this allows for it, however this makes me wonder on a more general principle, SHOULD MCP be aiming for 'forwards compatability' for clients where ever possible? If so SHOULD it adopt semantic versioning to make this clear who is affected by a change? The current protocol version seems to be date based which leaves it up to the server/client implementors to do quite a lot of study to figure out whether one or other or both are going to be broken. I worry that people will just default to supporting latest version meaning clients will keep dropping out of the eco system if they fail to keep up. This might be it's own discussion if so I'm happy to drop this from this discussion and we can move onto the final concern I have (below)
I am worried with the proposal as is that the client only gets notified that a stream is active AFTER a call has been made via a notification. This means that in the case of non-idempotent calls then the client doesn't get any certainty it will receive a result which may have changed the server in some way the client can't know. In the case of calls that incur real world cost such as debitting a bank balance, I think this is far from ideal. As a variation I propose the following sequence of events:

sequenceDiagram
        actor Client
        actor Server
        Client->>Server: InitializeRequest(stream={resumeInterval={min=10, max=60}}}
        Server-->>Client: InitializeResult(capabilities(stream={streamId=1,resumeInterval={min=10, max=60}})
        Client->>Server: CallToolRequest()
        Server-)Client: ProgressNotification (1/4)
        Server-)Client: ProgressNotification (2/4)
        Note over Client,Server: Client disconnects
        Client->>Server: InitializeRequest(stream={streamId=1,resumeInterval={min=10, max=60}}}
        Server-->>Client: InitializeResult(capabilities(stream={streamId=1,resumeInterval={min=10, max=60}})
        Server-)Client: ProgressNotification (3/4)
        Server-)Client: ProgressNotification (4/4)
        Server-->>Client: CallToolResult

This changes the stream setup to be an active request as part of session initialisation from the client (possibly with a default to min=0, max=inf so the client defaults to requesting infinite stream durability) the server then responds with the actual negotiated stream durability and the client can then chose whether or not they can honour that reconnect time period and therefore whether to call the expensive or long running request.

Another variant of the above would be adopt the CallToolAsyncRequest from #617 which would allow the stream setup to be done per call rather than as a default for the session as a whole with the initialise approach above. This is a larger change though so if the consensus is go with this option I think this variant is a reasonable compromise that allows for the client to have some level of certainty BEFORE it submits the request.

jonathanhefner Jun 10, 2025
Collaborator Author

however this makes me wonder on a more general principle, SHOULD MCP be aiming for 'forwards compatability' for clients where ever possible? If so SHOULD it adopt semantic versioning to make this clear who is affected by a change? The current protocol version seems to be date based which leaves it up to the server/client implementors to do quite a lot of study to figure out whether one or other or both are going to be broken. I worry that people will just default to supporting latest version meaning clients will keep dropping out of the eco system if they fail to keep up.

I'm not sure I understand the concern. I think most clients and servers will be using an SDK, and the SDKs will likely maintain support for older protocol versions for a while. During < 8000 a href="https://modelcontextprotocol.io/specification/2025-03-26/basic/lifecycle#version-negotiation" rel="nofollow">version negotiation, the client SDK will send the latest protocol version that it supports. If the server SDK is newer, it will downgrade to that protocol version. If the server is older, it will respond with the latest protocol version that it supports. If, for some reason, the client does not support that version, it will disconnect. Otherwise, things proceed. It's analogous to HTTP servers supporting multiple versions of the HTTP protocol.

Also, a semantic version number might be worthwhile once the protocol is more mature, but, for now, you can think of the date as a major version number.

I am worried with the proposal as is that the client only gets notified that a stream is active AFTER a call has been made via a notification. This means that in the case of non-idempotent calls then the client doesn't get any certainty it will receive a result which may have changed the server in some way the client can't know. In the case of calls that incur real world cost such as debitting a bank balance, I think this is far from ideal.

There is no guarantee of a result regardless of whether there is a resumable stream. Paraphrasing from our previous conversation: if the client were able to request a resumeInterval.max of, say, 15 minutes, would that be enough for a guaranteed result, even in the face of a disconnect? Should the client demand 1 week instead, just to be safe? If the server rejects 1 week, what should the client do?

The real problem is that a client can't know whether it will be disconnected nor how long it will be disconnected for, so it is futile for a client to specify these parameters. Instead, the server should make a best effort to accommodate offline clients, within reasonable limits. Anything more than that should be handled outside of the protocol (via HA client proxies, service agreements, etc).

This changes the stream setup to be an active request as part of session initialisation from the client (possibly with a default to min=0, max=inf so the client defaults to requesting infinite stream durability) the server then responds with the actual negotiated stream durability and the client can then chose whether or not they can honour that reconnect time period and therefore whether to call the expensive or long running request.

In my opinion, breaking up the sequence into two separate requests (one for "begin", one for the actual tool call) would be a non-starter. It would be less efficient due to the extra round trip, and it would complicate horizontal scaling because the servers have to maintain continuity between requests.

And, again, how would the client know whether it can honor the reconnect period in order to make the decision? (Also, for that matter, how would the client know whether the tool call will actually be expensive or long-running?)

davemssavage Jun 11, 2025

I'm not sure I understand the concern.

Mainly that there is quite a burden on builders to keep up with changes and being intentional about breaking changes I think is positive for the community building and using this protocol, by using semantic versioning it's at least clear when something has changed and who is affected.

Also, a semantic version number might be worthwhile once the protocol is more mature, but, for now, you can think of the date as a major version number.

Rather than going round the loop on this in this point I'll raise a discussion point on protocol versioning and relationship to mcp protocol maturity. Note https://semver.org/ generally handles fast moving early iteration by using a 0.X.Y version number the 0 indicates to the parties involve the API is not yet stable and breaking changes might occur at any time. However after 1.0.0 this is not the case and clients/servers should be able to infer without having to read all the docs whether they support particular combinations of versions.

I am worried with the proposal as is that the client only gets notified that a stream is active AFTER a call has been made via a notification. This means that in the case of non-idempotent calls then the client doesn't get any certainty it will receive a result which may have changed the server in some way the client can't know. In the case of calls that incur real world cost such as debitting a bank balance, I think this is far from ideal.

There is no guarantee of a result regardless of whether there is a resumable stream. Paraphrasing from our previous conversation: if the client were able to request a resumeInterval.max of, say, 15 minutes, would that be enough for a guaranteed result, even in the face of a disconnect? Should the client demand 1 week instead, just to be safe? If the server rejects 1 week, what should the client do?

I agree this is a risk, the use case I'm focussing on is calling a long running task (the example I've been using is training a model) I envisage this to be done on a network that is generally available so I'm expecting the client to be able to reconnect within a matter of minutes at worst. I agree that in the case the client can't reconnect, i.e. the initialise(streamId=previousStreamId) fails then the client has some extra work to do to figure out what state the server is in, but at least it can do this knowing the previous stream is gone. This is not obviously possible if it relies on notifications with no return value.

The real problem is that a client can't know whether it will be disconnected nor how long it will be disconnected for, so it is futile for a client to specify these parameters. Instead, the server should make a best effort to accommodate offline clients, within reasonable limits. Anything more than that should be handled outside of the protocol (via HA client proxies, service agreements, etc).

I do agree these should all be best endevours behaviours, I agree other mechnanisms can be layered on top if full durability is required.

This changes the stream setup to be an active request as part of session initialisation from the client (possibly with a default to min=0, max=inf so the client defaults to requesting infinite stream durability) the server then responds with the actual negotiated stream durability and the client can then chose whether or not they can honour that reconnect time period and therefore whether to call the expensive or long running request.

In my opinion, breaking up the sequence into two separate requests (one for "begin", one for the actual tool call) would be a non-starter. It would be less efficient due to the extra round trip, and it would complicate horizontal scaling because the servers have to maintain continuity between requests.

The initialize step is already a part of the protocol so I /think/ this is simplifying the implementation of this, i.e. no new protocol messages but some extra optional params on the already existing requests. I agree horizontal scaling is non-trivial however I don't think this changes things as initialize is already a part of the protocol and happens once on the client irrespective of whether the backend server is scalled across multiple processes.

I suggest that a server should setup an external event store (pick a caching technology of choice, e.g. redis etal) and use the session id to respond to look up the current state if a request is sent to a process that was not the original receiver of the initialise request.

And, again, how would the client know whether it can honor the reconnect period in order to make the decision? (Also, for that matter, how would the client know whether the tool call will actually be expensive or long-running?)

I had envisage some other changes to the Tool along the lines of idempotentHint, in the simple case it might just state this as text in a tool description and an agent (either human or AI could then adjust its behaviour accordingly)

jonathanhefner Jun 11, 2025
Collaborator Author

I envisage this to be done on a network that is generally available so I'm expecting the client to be able to reconnect within a matter of minutes at worst.

I forgot to mention in my previous comment that if there is a significant need for clients to specify a resumeInterval.max, then we could support that in the future. We would simply add a tools/call meta param for the requisite resumeInterval.max. If the value is greater than what the server is willing to offer, then the server would respond with an error JSON-RPC response.

but at least it can do this knowing the previous stream is gone. This is not obviously possible if it relies on notifications with no return value.

The client will know the stream is gone because resumeInterval.max will have elapsed. But if you are referring to the stream/resume notification, my proposal states that when a stream/resume specifies an invalid stream ID, the server should immediately send stream/end. I am also considering that we add a reason param to stream/end.

The initialize step is already a part of the protocol so I /think/ this is simplifying the implementation of this, i.e. no new protocol messages but some extra optional params on the already existing requests. I agree horizontal scaling is non-trivial however I don't think this changes things as initialize is already a part of the protocol and happens once on the client irrespective of whether the backend server is scalled across multiple processes.

Ah, I was getting confused with our previous discussion. So are you suggesting that the client specifies a global resumeInterval.max during the initialization phase, akin to a capability? I think that could work. Though I'm still not sure how useful it would be, so I would prefer to leave it to a future proposal.

One thing I like about it, though, is that it jibes with the idea of sending capabilities on each tool call in order to support session-less servers. So if we had something like that, the client could still specify resumeInterval.max per tool without the tools/call meta param that I mentioned above.

And, again, how would the client know whether it can honor the reconnect period in order to make the decision? (Also, for that matter, how would the client know whether the tool call will actually be expensive or long-running?)

I had envisage some other changes to the Tool along the lines of idempotentHint, in the simple case it might just state this as text in a tool description and an agent (either human or AI could then adjust its behaviour accordingly)

Those might help a bit, but also consider that a tool itself may not know whether it will be expensive or long-running, especially without foreknowledge of its inputs or runtime environment.

jonathanhefner · 2025-06-10T19:29:32Z

jonathanhefner
Jun 10, 2025
Collaborator Author

SDK API for multi-turn tool interactions

The following SDK API suggestion is not part of this proposal; this proposal will work with the current SDK APIs. However, transport-agnostic resumable streams offer a way to scale multi-turn tool interactions: when the server sends a request to the client, such as a sampling request, the server can disconnect and stop execution while waiting for a response.

The current TypeScript SDK (and perhaps other SDKs) makes this difficult because server-to-client requests are handled as promises. The server makes a request and awaits the promise, holding server resources until the client responds.

Below is an example of a suggested API for handling server-to-client requests using discrete functions. To make a request, the server returns a request object from a function, allowing the server to stop execution while waiting for a response. The request object id will embed the stream ID, the tool name, and the function number. When the client sends a response, it will have the same id, so the server will have all the information it needs to resume execution. If the server puts its state in an external store, then the client can be routed to any server instance when resuming the tool call.

server.tool(
  "my_long_running_tool",
  inputSchema,
  streaming((inputParams) => {
    // When this function returns, the SDK sends a `stream/begin` notification,
    // then the SDK runs the `perform` callback further below.
    return { resumeInterval: { min: 10, max: 24 * 60 * 60 } };
  }).perform((inputParams, stream) => {
    // This function can close the connection at any time and continue to emit
    // JSON-RPC notifications.
    stream.closeConnection();
    stream.emitProgress(progress);

    // When a stream will be interrupted due to a server-to-client request, the
    // server should explicitly manage its state.
    myStore.set(stream.id, myState);

    // The server can make a request to the client by returning a request
    // object.  If the connection is still open, the request will be sent over
    // the connection, and the SDK could then close the connection so that
    // resources aren't consumed while waiting for a response.  If the
    // connection was already closed, the request will be stored along with
    // emitted notifications.
    return stream.samplingRequest(prompt);
  }).resume((samplingResult, stream) => {
    // Restore state from the previous callback:
    let previousState = myStore.get(stream.id);

    // Emit result:
    stream.emitResult(result);

    // This function does not return a value, so the SDK sends a `stream/end`
    // notification.
  }).finalize((stream) => {
    // This function is called after the stream ends or is abandoned.

    // Clear state from external store to free resources:
    myStore.delete(stream.id);
  })
);

0 replies

Proposal: Transport-agnostic resumable streams #543

Uh oh!

Uh oh!

jonathanhefner May 17, 2025 Collaborator

Pre-submission Checklist

Your Idea

Motivation

Proposal

Server-to-client requests

Batching and multiplexing

Webhooks

Asynchronous long-running tasks

Scope

Replies: 6 comments · 7 replies

Uh oh!

Uh oh!

jonathanhefner May 19, 2025 Collaborator Author

Universal subscription mechanism

Uh oh!

jonathanhefner Jun 1, 2025 Collaborator Author

Uh oh!

jonathanhefner Jun 9, 2025 Collaborator Author

Uh oh!

Uh oh!

jonathanhefner May 25, 2025 Collaborator Author

Statefulness

Streaming HTTP transport compatibility

Footnotes

Uh oh!

pantanurag555 May 30, 2025

Uh oh!

jonathanhefner Jun 1, 2025 Collaborator Author

Uh oh!

jonathanhefner Jun 1, 2025 Collaborator Author

Webhooks example

Uh oh!

jonathanhefner Jun 9, 2025 Collaborator Author

Backward compatibility and opting in

Uh oh!

Uh oh!

davemssavage Jun 10, 2025

Uh oh!

jonathanhefner Jun 10, 2025 Collaborator Author

Uh oh!

davemssavage Jun 11, 2025

Uh oh!

Uh oh!

jonathanhefner Jun 11, 2025 Collaborator Author

Uh oh!

Uh oh!

jonathanhefner Jun 10, 2025 Collaborator Author

SDK API for multi-turn tool interactions

jonathanhefner
May 17, 2025
Collaborator

Replies: 6 comments 7 replies

jonathanhefner
May 19, 2025
Collaborator Author

jonathanhefner Jun 1, 2025
Collaborator Author

jonathanhefner Jun 9, 2025
Collaborator Author

jonathanhefner
May 25, 2025
Collaborator Author

pantanurag555
May 30, 2025

jonathanhefner Jun 1, 2025
Collaborator Author

jonathanhefner
Jun 1, 2025
Collaborator Author

jonathanhefner
Jun 9, 2025
Collaborator Author

jonathanhefner Jun 10, 2025
Collaborator Author

jonathanhefner Jun 11, 2025
Collaborator Author

jonathanhefner
Jun 10, 2025
Collaborator Author