Clean up client response payloads on error to free up connections #2710

daschl · 2023-09-25T09:17:25Z

Motivation

When a client makes a request and during response processing an exception is thrown, the exception bubbles up to the client but the message payload body is then not available to be consumed.

In this case, the underlying pooled connection will not be freed up and a new connection is created.

Modifications

By tracking the message payloads when responses are bubbled up through the filter chain, we can detect if an error happens and if detected the message payload body is being proactively drained.

This will free up the connection again so it can be reused by a subsequent request.

Result

More correct handling of message payload bodies if an exception happens during response filter processing.

idelpivnitskiy · 2023-10-03T05:54:06Z

...ttp-netty/src/main/java/io/servicetalk/http/netty/DefaultSingleAddressHttpClientBuilder.java

+
+            // This filter cleans up tracked and discarded message payloads.
+            currClientFilterFactory = appendFilter(currClientFilterFactory,
+                    HttpMessageDiscardWatchdogClientFilter.INSTANCE);


The watching filter should be added as the last connection-level filter at connectionFilterFactory. That way it can intercept the original publisher before it's returned to any user-defined filter.

See my updated code - I tried that but the tests are failing .. maybe I put it in the wrong spot? 🤔

No, the initial spot is perfect. But it affected some other state. There are 2 stages that rely on full response being consumed:

connection state

LoadBalancedStreamingHttpClient state

After we moved "catch" logic to the connection filter, it intercepts original payload body publisher, but it doesn't see the result of liftSync operator applied by LoadBalancedStreamingHttpClient. Something similar to what we observed on the server-side when users have BeforeFinallyHttpOperator in the middle of the chain.

It seems like we need 2 "catch" points:

as the last connection filter for the initial catch

as the last client filter that will override the message body publisher inside the atomic reference if response reached client filter successfully.

That way we can at least guarantee that we release at both levels.

...tp-netty/src/main/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilter.java

idelpivnitskiy · 2023-10-03T06:22:03Z

...tp-netty/src/main/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilter.java

+                            if (message != null) {
+                                // No-one subscribed to the message (or there is none), so if there is a message
+                                // proactively clean it up.
+                                return message.ignoreElements().concat(Single.failed(originalThrowable));


The problem with concat is that if ignoreElements fails, we loose originalThrowable. Consider using a similar trick I used in proxy LB factory to preserve the originalThrowable

Can you help me understand what you mean here exactly? As in: dropping the error from ignoreElements or adding one as the cause from the other?

Recovering from any error, propagating the originalThrowable, and suppressing any new errors. Something similar to what I had here: https://github.com/apple/servicetalk/pull/2697/files#diff-740b3f38328018b08090bca154c032069df66494a187fc201ad50b5c9693a464R178

...tp-netty/src/main/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilter.java

...etty/src/test/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilterTest.java

idelpivnitskiy · 2023-10-04T04:54:15Z

...ttp-netty/src/main/java/io/servicetalk/http/netty/DefaultSingleAddressHttpClientBuilder.java

+
+            // This filter cleans up tracked and discarded message payloads.
+            currClientFilterFactory = appendFilter(currClientFilterFactory,
+                    HttpMessageDiscardWatchdogClientFilter.INSTANCE);


No, the initial spot is perfect. But it affected some other state. There are 2 stages that rely on full response being consumed:

connection state

LoadBalancedStreamingHttpClient state

After we moved "catch" logic to the connection filter, it intercepts original payload body publisher, but it doesn't see the result of liftSync operator applied by LoadBalancedStreamingHttpClient. Something similar to what we observed on the server-side when users have BeforeFinallyHttpOperator in the middle of the chain.

It seems like we need 2 "catch" points:

as the last connection filter for the initial catch

as the last client filter that will override the message body publisher inside the atomic reference if response reached client filter successfully.

That way we can at least guarantee that we release at both levels.

idelpivnitskiy · 2023-10-04T04:58:36Z

...tp-netty/src/main/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilter.java

+                            if (message != null) {
+                                // No-one subscribed to the message (or there is none), so if there is a message
+                                // proactively clean it up.
+                                return message


Consider logging a warning for users to clarify that they lost a reference to response payload body that had to be drained. Ideally, we want users to implement correct filters that clean up the state before propagating an error

Since this is happening on error, do they even have a chance to clean it up if an exception bubbles up and they never get a reference to the message?

In such cases, an exception is happening in a user filter. Their responsibility is to clean up the state before propagating an exception. For example, instead of

response.flatMap(r -> Single.failed(...));

they suppose to return:

response.flatMap(r -> r.messageBody().ignoreElements().concat(Single.failed(...)));

idelpivnitskiy

Thank you!

...tp-netty/src/main/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilter.java

...etty/src/test/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilterTest.java

Motivation ---------- When a client makes a request and during response processing an exception is thrown, the exception bubbles up to the client but the message payload body is then not available to be consumed. In this case, the underlying pooled connection will not be freed up and a new connection is created. Modifications ------------- By tracking the message payloads when responses are bubbled up through the filter chain, we can detect if an error happens and if detected the message payload body is being proactively drained. This will free up the connection again so it can be reused by a subsequent request. Result ------ More correct handling of message payload bodies if an exception happens during response filter processing.

daschl requested a review from idelpivnitskiy September 25, 2023 09:17

daschl force-pushed the client-discard branch 4 times, most recently from 725f101 to f742cf8 Compare September 26, 2023 07:22

idelpivnitskiy reviewed Oct 3, 2023

View reviewed changes

daschl force-pushed the client-discard branch from f742cf8 to 135653d Compare October 3, 2023 09:44

idelpivnitskiy reviewed Oct 4, 2023

View reviewed changes

daschl force-pushed the client-discard branch from 135653d to 34c75d4 Compare October 4, 2023 14:40

daschl requested a review from idelpivnitskiy October 19, 2023 13:46

daschl self-assigned this Oct 20, 2023

daschl force-pushed the client-discard branch from a481091 to 93a8015 Compare October 20, 2023 13:05

idelpivnitskiy approved these changes Oct 23, 2023

View reviewed changes

...tp-netty/src/main/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilter.java Outdated Show resolved Hide resolved

...etty/src/test/java/io/servicetalk/http/netty/HttpMessageDiscardWatchdogClientFilterTest.java Outdated Show resolved Hide resolved

daschl added 7 commits October 23, 2023 09:27

More idel feedback

4ca44dd

More rework (still not functional)

ed53b8c

More rework

ee4b57b

Checkstyle fixes

eed7439

More modifications

6a77c7e

One more polish round

3fcbc64

daschl force-pushed the client-discard branch from 8e977fb to 3fcbc64 Compare October 23, 2023 07:28

More docs polish

357504f

daschl merged commit 3649525 into apple:main Oct 23, 2023
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clean up client response payloads on error to free up connections #2710

Clean up client response payloads on error to free up connections #2710

daschl commented Sep 25, 2023

idelpivnitskiy Oct 3, 2023

daschl Oct 3, 2023

idelpivnitskiy Oct 4, 2023

idelpivnitskiy Oct 3, 2023

daschl Oct 3, 2023

idelpivnitskiy Oct 4, 2023

idelpivnitskiy Oct 4, 2023

idelpivnitskiy Oct 4, 2023

daschl Oct 4, 2023

idelpivnitskiy Oct 4, 2023

idelpivnitskiy left a comment

Clean up client response payloads on error to free up connections #2710

Clean up client response payloads on error to free up connections #2710

Conversation

daschl commented Sep 25, 2023

Motivation

Modifications

Result

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

idelpivnitskiy left a comment

Choose a reason for hiding this comment