KAFKA-14274: [1/7] basic refactoring by kirktrue · Pull Request #14305 · apache/kafka

kirktrue · 2023-08-29T01:41:03Z

This change introduces some basic clean up and refactoring for forthcoming commits related to the revised fetch code for the consumer threading refactor project.

See KAFKA-14274 for more background.

kirktrue · 2023-08-29T01:43:41Z

clients/src/main/java/org/apache/kafka/clients/NetworkClientUtils.java

Author’s note: The two new methods in this file were moved here from ConsumerNetworkClient as they will be used in other classes in future commits.

kirktrue · 2023-08-29T01:45:03Z

clients/src/main/java/org/apache/kafka/clients/consumer/ConsumerConfig.java

Author’s note: this method is used in the new Consumer’s constructor in a future commit.

kirktrue · 2023-08-29T01:47:35Z

clients/src/main/java/org/apache/kafka/common/utils/ExponentialBackoff.java

Author’s note: adding toString() implementations is helpful in debugging.

kirktrue · 2023-08-29T01:47:45Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

Author’s note: adding toString() implementations is helpful in debugging.

kirktrue · 2023-08-29T01:48:26Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

Author’s note: formatting changes and added debug logging.

kirktrue · 2023-08-29T01:50:25Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

Author’s note: this is the logic from the now-removed requestBackoffExpired method with a log line for debugging.

kirktrue · 2023-08-29T01:51:03Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

Author’s note: requestBackoffExpired is now inline in canSendRequest above.
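To illustrate the shape of that change, here is a minimal sketch of a `canSendRequest` check with the backoff test inlined. It is not the Kafka class: `RequestStateSketch`, its fixed `retryBackoffMs`, and the `java.util.logging` logger are assumptions made to keep the example self-contained (the real class uses an exponential backoff and Kafka's own logging).

```java
import java.util.logging.Logger;

// Hypothetical, simplified stand-in for RequestState; not the Kafka class.
final class RequestStateSketch {
    private static final Logger LOG = Logger.getLogger(RequestStateSketch.class.getName());

    private final long retryBackoffMs; // assumption: fixed backoff instead of exponential
    private long lastSentMs = -1;
    private long lastReceivedMs = -1;

    RequestStateSketch(long retryBackoffMs) {
        this.retryBackoffMs = retryBackoffMs;
    }

    /** Returns true only if no request is in flight and the backoff window has elapsed. */
    boolean canSendRequest(long currentTimeMs) {
        if (lastSentMs == -1) {
            return true; // nothing has been sent yet
        }
        if (lastReceivedMs == -1 || lastReceivedMs < lastSentMs) {
            LOG.fine("An inflight request already exists for " + this);
            return false;
        }
        // The check that would otherwise live in a separate requestBackoffExpired() helper.
        long remainingBackoffMs = retryBackoffMs - (currentTimeMs - lastReceivedMs);
        if (remainingBackoffMs <= 0) {
            return true;
        }
        LOG.fine(remainingBackoffMs + " ms remain before another request should be sent for " + this);
        return false;
    }

    void onSendAttempt(long currentTimeMs) {
        lastSentMs = currentTimeMs;
    }

    void onFailedAttempt(long currentTimeMs) {
        lastReceivedMs = currentTimeMs;
    }

    @Override
    public String toString() {
        return "RequestStateSketch{lastSentMs=" + lastSentMs + ", lastReceivedMs=" + lastReceivedMs + '}';
    }
}
```

Keeping both checks in one method means each `false` result can be logged with its own reason before the caller only sees a boolean.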

kirktrue · 2023-08-29T01:51:48Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

Author’s note: added a LogContext argument and formatting.

kirktrue · 2023-08-29T01:51:52Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

Author’s note: added a LogContext argument and formatting.

kirktrue · 2023-08-29T02:00:48Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/CompletedFetch.java

Author’s note: mostly encapsulating the instance variables for protection against accidental outside changes.
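As a rough illustration of this kind of encapsulation, the sketch below keeps its state private and only allows changes through methods that validate them. `CompletedFetchSketch`, its fields, and its method names are hypothetical and are not taken from the actual `CompletedFetch` class.

```java
// Hypothetical, simplified stand-in; not the Kafka CompletedFetch class.
final class CompletedFetchSketch {
    private long nextFetchOffset; // only changed through advanceTo()
    private boolean consumed;     // only changed through drain()

    CompletedFetchSketch(long fetchOffset) {
        this.nextFetchOffset = fetchOffset;
    }

    long nextFetchOffset() {
        return nextFetchOffset;
    }

    boolean isConsumed() {
        return consumed;
    }

    /** Advances the position; rejects moves that go backwards or happen after draining. */
    void advanceTo(long offset) {
        if (consumed) {
            throw new IllegalStateException("fetch already drained");
        }
        if (offset < nextFetchOffset) {
            throw new IllegalArgumentException("offset " + offset + " is behind " + nextFetchOffset);
        }
        nextFetchOffset = offset;
    }

    /** Marks the fetch as fully consumed. */
    void drain() {
        consumed = true;
    }
}
```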

kirktrue · 2023-08-29T02:02:00Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/ConsumerNetworkClient.java

Author’s note: As stated previously, the core logic from isUnavailable and maybeThrowAuthFailure was moved to NetworkClientUtils for reuse.
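A minimal sketch of what such a move can look like: the checks become static helpers that take the client as a parameter, so any class holding a client can call them. `ClientSketch` and `NetworkUtilsSketch` are hypothetical stand-ins using plain `String` node IDs, and the method bodies reflect my reading of the logic rather than the exact Kafka source.

```java
// Hypothetical stand-in for the subset of a network client used below.
interface ClientSketch {
    boolean connectionFailed(String nodeId);

    long connectionDelay(String nodeId, long nowMs);

    RuntimeException authenticationException(String nodeId); // null when there is no failure
}

// Hypothetical static utility holder; not the Kafka NetworkClientUtils class.
final class NetworkUtilsSketch {
    private NetworkUtilsSketch() {
    }

    /** A node counts as unavailable if its connection failed and a reconnect delay is still pending. */
    static boolean isUnavailable(ClientSketch client, String nodeId, long nowMs) {
        return client.connectionFailed(nodeId) && client.connectionDelay(nodeId, nowMs) > 0;
    }

    /** Rethrows the node's authentication failure, if one was recorded. */
    static void maybeThrowAuthFailure(ClientSketch client, String nodeId) {
        RuntimeException exception = client.authenticationException(nodeId);
        if (exception != null) {
            throw exception;
        }
    }
}
```

The original instance methods can then delegate to these helpers, which keeps behavior in one place while making it reusable.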

kirktrue · 2023-08-29T02:03:07Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/AbstractFetch.java

Author’s note: most of these changes are related to the encapsulation changes that were made in CompletedFetch.

kirktrue · 2023-08-29T02:03:59Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/AbstractFetch.java

Author’s note: requestMetadataUpdate was moved to FetchUtils as it will be reused elsewhere in future commits.

kirktrue · 2023-08-29T02:04:37Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/CommitRequestManager.java

Author’s note: renamed subscriptionState to subscriptions for consistency with the rest of the newly refactored code base.

clients/src/main/java/org/apache/kafka/clients/consumer/internals/FetchUtils.java

kirktrue · 2023-08-29T19:18:22Z

cc @lianetm @philipnee

philipnee · 2023-08-29T20:52:12Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

I actually wonder whether the debug log would be useful, given that it lacks information about which request it is. Maybe this log should stay at the request manager level?

Hi @philipnee!

That's a good point. This debug logging doesn't have context of the request.

The request manager calls canSendRequest() before creating the request, though. So in the cases where we return false, there won't be a request, so there isn't any additional request information that the request manager would be able to log.

Additionally, there are two different reasons why a request cannot be sent:

1. There's already an inflight request
2. The backoff hasn't expired

When canSendRequest() returns false back to the request manager, we'd lose the context to explain the reason why the request can't be sent.

I'll look into this a bit more to see what I can do.

Thanks!

I added a relevant toString() method to the RequestState sub-classes that includes details about the request. The logging now includes the toString()-ed RequestState for context.
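The reason this works is ordinary dynamic dispatch: a message built in the base class renders `this` with whatever `toString()` the subclass provides. A small sketch, with `BaseStateSketch`, `OffsetFetchStateSketch`, and `blockedMessage` all being hypothetical names for illustration:

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins; not the Kafka RequestState hierarchy.
class BaseStateSketch {
    protected final String owner;

    BaseStateSketch(String owner) {
        this.owner = owner;
    }

    /** Builds the debug message in the base class; `this` is rendered by the runtime type's toString(). */
    String blockedMessage(String reason) {
        return reason + " for " + this;
    }

    @Override
    public String toString() {
        return "BaseStateSketch{owner='" + owner + "'}";
    }
}

class OffsetFetchStateSketch extends BaseStateSketch {
    private final List<String> requestedPartitions;

    OffsetFetchStateSketch(String owner, List<String> requestedPartitions) {
        super(owner);
        this.requestedPartitions = requestedPartitions;
    }

    @Override
    public String toString() {
        return "OffsetFetchStateSketch{requestedPartitions=" + requestedPartitions
                + ", owner='" + owner + "'}";
    }
}

class ToStringContextDemo {
    public static void main(String[] args) {
        BaseStateSketch state = new OffsetFetchStateSketch("sketch", Arrays.asList("topic-0"));
        // The base-class message now carries the subclass's request details.
        System.out.println(state.blockedMessage("An inflight request already exists"));
    }
}
```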

philipnee · 2023-08-29T20:52:39Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

philipnee · 2023-08-29T20:53:11Z

Hey @kirktrue - Thanks for the PR. Left a couple of comments; otherwise it looks good.

kirktrue · 2023-08-31T00:41:50Z

Test failures are unrelated:

kafka.api.ConsumerBounceTest.testConsumptionWithBrokerFailures()
kafka.server.DynamicBrokerReconfigurationTest.testThreadPoolResize()
o.a.k.connect.mirror.integration.IdentityReplicationIntegrationTest.testReplicateSourceDefault()
o.a.k.controller.QuorumControllerTest.testBalancePartitionLeaders()
o.a.k.server.log.remote.metadata.storage.TopicBasedRemoteLogMetadataManagerTest.testNewPartitionUpdates()
o.a.k.streams.integration.EOSUncleanShutdownIntegrationTest.shouldWorkWithUncleanShutdownWipeOutStateStore()
o.a.k.streams.processor.internals.StreamsAssignmentScaleTest.testHighAvailabilityTaskAssignorLargeNumConsumers
o.a.k.trogdor.coordinator.CoordinatorTest.testTaskRequestWithOldStartMsGetsUpdated()

kirktrue · 2023-09-01T21:15:28Z

@junrao let me know your thoughts on this PR. Thanks!

junrao

@kirktrue : Thanks for the PR. Just one comment.

junrao · 2023-09-05T18:26:27Z

clients/src/main/java/org/apache/kafka/clients/consumer/internals/CommitRequestManager.java

Instead of duplicating the code in RequestState, could we pull out the common part as a util and reuse?

@junrao Are you referring to the toString() code specifically, or something else?

I can add an internal toStringBase() to RequestState that has the core instance variables concatenated as a string. Subclasses can then first append their own instance variables and then append the result of toStringBase(), or something along those lines. Does that make sense?

Yes, I was referring to the toString method. It seems that every subclass of RequestState directly gets every field in RequestState to build its own string. This creates duplicated code and can be a bit error prone.

@junrao I created toStringBase() in RequestState:

    /**
     * This method appends the instance variables together in a simple String of comma-separated key value pairs.
     * This allows subclasses to include these values and not have to duplicate each variable, helping to prevent
     * any variables from being omitted when new ones are added.
     *
     * @return String version of instance variables.
     */
    protected String toStringBase() {
        return "owner='" + owner + '\'' +
                ", exponentialBackoff=" + exponentialBackoff +
                ", lastSentMs=" + lastSentMs +
                ", lastReceivedMs=" + lastReceivedMs +
                ", numAttempts=" + numAttempts +
                ", backoffMs=" + backoffMs;
    }

    @Override
    public String toString() {
        return "RequestState{" + toStringBase() + '}';
    }

Subclasses look like this:

    @Override
    public String toString() {
        return "OffsetFetchRequestState{" +
                "requestedPartitions=" + requestedPartitions +
                ", requestedGeneration=" + requestedGeneration +
                ", future=" + future +
                ", " + toStringBase() +
                '}';
    }

Does that seem OK?

Introduces some basic clean up and refactoring for forthcoming commits.

junrao

@kirktrue : Thanks for the updated PR. LGTM. Are the test failures related?

clolov

Thanks for the change! I believe the test failures are unrelated

clolov · 2023-09-07T12:38:45Z

clients/src/main/java/org/apache/kafka/clients/consumer/ConsumerConfig.java

-    protected static Map<String, Object> appendDeserializerToConfig(Map<String, Object> configs,
-                                                                    Deserializer<?> keyDeserializer,
-                                                                    Deserializer<?> valueDeserializer) {
+    public static Map<String, Object> appendDeserializerToConfig(Map<String, Object> configs,


I assume this visibility change is needed for a subsequent commit?

kirktrue · 2023-09-07T22:08:08Z

Test failures in latest build are unrelated and about half already have Jiras:

integration.kafka.server.FetchFromFollowerIntegrationTest.testRackAwareRangeAssignor(): https://issues.apache.org/jira/browse/KAFKA-15020
kafka.api.DelegationTokenEndToEndAuthorizationWithOwnerTest.testNoGroupAcl(): https://issues.apache.org/jira/browse/KAFKA-15411
kafka.api.TransactionsTest.testBumpTransactionalEpoch(): https://issues.apache.org/jira/browse/KAFKA-15099
kafka.api.TransactionsTest.testCommitTransactionTimeout()
kafka.controller.ControllerIntegrationTest.testTopicIdPersistsThroughControllerRestart()
kafka.network.ConnectionQuotasTest.testListenerConnectionRateLimitWhenActualRateAboveLimit(): https://issues.apache.org/jira/browse/KAFKA-12319
kafka.server.DescribeClusterRequestTest.testDescribeClusterRequestExcludingClusterAuthorizedOperations()
kafka.server.ProduceRequestTest.testSimpleProduceRequest(): https://issues.apache.org/jira/browse/KAFKA-8076
o.a.k.clients.consumer.internals.AbstractCoordinatorTest.testWakeupAfterSyncGroupSentExternalCompletion()
o.a.k.connect.integration.ConnectorRestartApiIntegrationTest.testMultiWorkerRestartOnlyConnector()
o.a.k.controller.QuorumControllerTest.testBalancePartitionLeaders(): https://issues.apache.org/jira/browse/KAFKA-15052

divijvaidya · 2023-09-11T10:08:33Z

Hey @kirktrue
BaseAsyncConsumerTest has started becoming flaky [1] since this commit was introduced on Sept 8th. The flakiness may or may not be related, since I haven't investigated in detail, but can you please check to ensure that this commit didn't cause the flakiness?

Thanks.

[1] https://ge.apache.org/scans/tests?search.relativeStartTime=P28D&search.rootProjectNames=kafka&search.timeZoneId=Europe/Berlin&tests.container=kafka.api.BaseAsyncConsumerTest&tests.test=testCommitSync()

philipnee · 2023-09-11T15:00:44Z

@divijvaidya - thanks for reporting that.

philipnee · 2023-09-11T15:09:58Z

@divijvaidya - I was trying to cherry-pick some additional changes into trunk; however, I actually don't find it flaky. I think these two should be failing constantly (caused by completing the wrong CompletableFuture). It's strange that this doesn't fail all the time...

philipnee · 2023-09-11T16:44:41Z

Hey @divijvaidya - Looking at the PR: in fact there aren't any implementation-level changes, so I'm not sure why these two functions started to become flaky. Does this metric measure the trunk or all draft/pr? I did open a PR that has broken BaseAsyncConsumerTest.

divijvaidya · 2023-09-11T17:00:55Z

Does this metric measure the trunk or all draft/pr?

Thank you for looking into this @philipnee. You can add "trunk" as a tag in the above link to get data on trunk. Trunk seems all green, so it seems to be flaky only on PRs. In that case, this PR doesn't seem to be related. The timeline just happens to match up :)

I appreciate your effort towards this investigation. Thank you!

philipnee · 2023-09-11T17:08:31Z

@divijvaidya - thank you for looking into this proactively! 🤝

This change introduces some basic clean up and refactoring for forthcoming commits related to the revised fetch code for the consumer threading refactor project.

Reviewers: Christo Lolov, Jun Rao

philipnee added the labels consumer, KIP-848 (The Next Generation of the Consumer Rebalance Protocol), and ctr (Consumer Threading Refactor (KIP-848)) Aug 29, 2023

kirktrue closed this Aug 29, 2023

kirktrue reopened this Aug 29, 2023

kirktrue closed this Aug 29, 2023

kirktrue reopened this Aug 29, 2023

kirktrue changed the title ~~KAFKA-14274 #1: basic refactoring~~ KAFKA-14274: [1/7] basic refactoring Aug 29, 2023

philipnee reviewed Aug 29, 2023

clients/src/main/java/org/apache/kafka/clients/consumer/internals/RequestState.java

ditto ^^

kirktrue closed this Aug 30, 2023

kirktrue reopened this Aug 30, 2023

junrao reviewed Sep 5, 2023

KAFKA-14274 #1: basic refactoring

8b5880b

Introduces some basic clean up and refactoring for forthcoming commits.

junrao approved these changes Sep 7, 2023

clolov approved these changes Sep 7, 2023

junrao merged commit a2de7d3 into apache:trunk Sep 7, 2023

kirktrue deleted the KAFKA-14274-basic-refactoring-ak branch October 25, 2023 00:17
