
Conversation


@tpalfy commented Nov 17, 2025

Summary

NIFI-15226

Tracking

Please complete the following tracking steps prior to pull request creation.

Issue Tracking

Pull Request Tracking

  • Pull Request title starts with Apache NiFi Jira issue number, such as NIFI-00000
  • Pull Request commit message starts with Apache NiFi Jira issue number, such as NIFI-00000

Pull Request Formatting

  • Pull Request based on current revision of the main branch
  • Pull Request refers to a feature branch with one commit containing changes

Verification

Please indicate the verification steps performed prior to pull request creation.

Build

  • Build completed using ./mvnw clean install -P contrib-check
    • JDK 21
    • JDK 25

Licensing

  • New dependencies are compatible with the Apache License 2.0 according to the License Policy
  • New dependencies are documented in applicable LICENSE and NOTICE files

Documentation

  • Documentation formatting appears as expected in rendered files

```xml
</dependency>
<dependency>
    <groupId>org.apache.nifi</groupId>
    <artifactId>nifi-framework-nar-utils</artifactId>
```
Contributor


This dependency appears to be unused; can it be removed?

Contributor Author


org.apache.nifi.mock.MockComponentLogger is referenced in TestConsumerPartitionsUtil

Contributor


Thanks for clarifying. In that case, this dependency should be removed and MockComponentLogger should be replaced with MockComponentLog from nifi-mock to avoid referencing framework modules in extension modules.
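To make the suggested swap concrete, a sketch of what the change might look like in the test module's pom.xml. The scope and placement are assumptions for illustration; the actual module layout may differ:

```xml
<!-- Sketch: remove the framework dependency... -->
<!--
<dependency>
    <groupId>org.apache.nifi</groupId>
    <artifactId>nifi-framework-nar-utils</artifactId>
</dependency>
-->
<!-- ...and depend on nifi-mock instead (test scope assumed). -->
<dependency>
    <groupId>org.apache.nifi</groupId>
    <artifactId>nifi-mock</artifactId>
    <scope>test</scope>
</dependency>
```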

Contributor Author


I preserved this class from the older change, but actually we don't need either one. We can just mock the logger with Mockito.

Contributor

@exceptionfactory exceptionfactory left a comment


The initial build fails on the following tests:

```
Error:    ConsumeKafkaTest.testVerifyFailed:116 » NullPointer Cannot invoke "org.apache.nifi.kafka.service.api.consumer.PollingContext.getTopics()" because "this.pollingContext" is null
Error:    ConsumeKafkaTest.testVerifySuccessful:99 » NullPointer Cannot invoke "org.apache.nifi.kafka.service.api.consumer.PollingContext.getTopics()" because "this.pollingContext" is null
```

Contributor

@exceptionfactory exceptionfactory left a comment


Thanks for the work on this @tpalfy. I noted a handful of minor recommendations, and plan to take a closer look at some of the implementation details.

```java
private final AutoOffsetReset autoOffsetReset;

public Subscription(final String groupId, final Collection<String> topics, final AutoOffsetReset autoOffsetReset) {
public Subscription(final String groupId, final Integer partition, final Collection<String> topics, final AutoOffsetReset autoOffsetReset) {
```
Contributor


Very minor, but I recommend placing partition after topics to align with the general hierarchy:

Suggested change:

```diff
- public Subscription(final String groupId, final Integer partition, final Collection<String> topics, final AutoOffsetReset autoOffsetReset) {
+ public Subscription(final String groupId, final Collection<String> topics, final Integer partition, final AutoOffsetReset autoOffsetReset) {
```

```java
import static org.junit.jupiter.api.Assertions.assertTrue;
import static org.mockito.Mockito.mock;

public class TestConsumerPartitionsUtil {
```
Contributor


Minor note: the public modifiers on the class and method level are not necessary for JUnit 5. Since this is a new test class, I recommend removing them.

```java
}

@Test
public void testNoPartitionAssignments() throws UnknownHostException {
```
Contributor


UnknownHostException does not appear to be thrown in this and other methods.

Contributor Author


ConsumerPartitionsUtil.getPartitionsForHost throws it.
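As context for the checked exception: InetAddress.getLocalHost() declares UnknownHostException, so any utility that resolves the local hostname must propagate it. The following is an illustrative sketch only, not the actual ConsumerPartitionsUtil source; the `partitions.<hostname>` property naming is an assumption for this example:

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.Map;

// Illustrative sketch -- not the actual NiFi ConsumerPartitionsUtil.
class PartitionsForHostSketch {

    static int[] getPartitionsForHost(final Map<String, String> properties) throws UnknownHostException {
        // InetAddress.getLocalHost() declares UnknownHostException, which is
        // why test methods calling a utility like this declare it as well.
        final String hostname = InetAddress.getLocalHost().getHostName();
        final String value = properties.get("partitions." + hostname);
        return value == null ? new int[0] : parsePartitions(value);
    }

    // Pure parsing helper: comma-separated partition numbers, e.g. "0, 2,5".
    static int[] parsePartitions(final String value) {
        final String[] parts = value.split(",");
        final int[] partitions = new int[parts.length];
        for (int i = 0; i < parts.length; i++) {
            partitions[i] = Integer.parseInt(parts[i].trim());
        }
        return partitions;
    }
}
```

The parsing itself cannot fail with UnknownHostException; only the hostname lookup can, which is why the tests must declare it even when they exercise parsing logic.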

```java
    return !hostnameToPartitionMapping.isEmpty();
}

public static int getPartitionAssignmentCount(final Map<String, String> properties) {
```
Contributor


Recommend moving all public methods before all private methods in this class.

Contributor

@turcsanyip turcsanyip left a comment


@tpalfy Tested with different scenarios, including fewer or more concurrent tasks than partitions, and it works as expected.

Added one minor comment inline and one more thing here: There is some legacy code referencing partition properties in DynamicPropertyValidator. Please clean this up, as the partition dynamic properties are not applied to the Kafka controller service.

@tpalfy force-pushed the NIFI-15226-Kafka-static-partition-mapping branch from 6269ef2 to d1da7f6 on December 11, 2025 at 19:15
Contributor

@turcsanyip turcsanyip left a comment


Thanks for the latest changes @tpalfy.

+1 from my side.

@exceptionfactory
Contributor

Thanks for the updates @tpalfy and thanks for the review @turcsanyip, I will follow up and also review the latest changes soon.

Contributor

@exceptionfactory exceptionfactory left a comment


Thanks for making the adjustments @tpalfy, and thanks for the review @turcsanyip.

On a more detailed review, I am concerned about the level of complexity introduced with these changes. The ConsumerPartitionsUtil is very complex with implementation details based on current host and dynamic properties. The addition of partition-based dynamic property names, plus host names, makes the configuration non-portable between NiFi installations. This complexity is also evident in the handling of KafkaConsumerService instances.

For these reasons, I am not supportive of the current proposed set of changes.

The historical implementation grew very complex and very difficult to maintain, so considering other options would be helpful.

One option that comes to mind is a separate Processor, named something like ConsumeKafkaPartition. That would clearly communicate the purpose, allowing for more focused logic. I would not attempt to extend the current ConsumeKafka Processor, so copying some of the existing implementation would be reasonable. As much of the handling logic is in other classes, there should be some opportunities for code reuse without a subclass.

I'm open to other options that would avoid introducing this level of complexity to the current Processor.

@turcsanyip
Contributor

Thanks for sharing your thoughts and concerns @exceptionfactory.

I agree that ConsumerPartitionsUtil is complex. However, it is worth noting that it was reused directly from the NiFi 1.x implementation, where it was added in a single commit and worked properly, so I would not consider it legacy code that grew more and more complicated over time. I also reviewed it in this round and did not find a way to simplify or change it. This brings us to your second concern, portability. The partitions need to be bound to a NiFi node, and hostnames seem to be the only way to do that. So even if a separate processor were added, it would still have the portability issue.

Considering that the change does not affect existing users of the processor (no configuration change, and the portability issue does not arise if the feature is not used), I do not think it is worth adding a new processor that requires duplicated code, leading to additional maintenance effort and issues going forward.

As far as I can recall, the original plan of migrating to Kafka 3 processors was to implement the core functionality in the first round and add the extra features later. Based on this, and considering that a separate implementation would also introduce complexity and non-portability issues via ConsumerPartitionsUtil, I believe we could add this feature to the current ConsumeKafka processor.
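To make the portability point concrete, here is a hypothetical sketch (not the actual NiFi utility) of the kind of check a hostname-bound configuration implies. It assumes dynamic properties named `partitions.<hostname>` whose values are comma-separated partition numbers, and verifies that the union of all assignments covers each partition exactly once. Because the property names embed hostnames, such a property set cannot simply be copied to an installation with different hostnames, which is the portability concern discussed above:

```java
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical validation sketch for hostname-bound partition assignments.
class PartitionAssignmentCheck {

    // Returns true if the "partitions.<hostname>" properties assign each
    // partition 0..N-1 to exactly one host, with no gaps or duplicates.
    static boolean isCompleteAssignment(final Map<String, String> properties) {
        final Set<Integer> assigned = new TreeSet<>();
        int total = 0;
        for (final Map.Entry<String, String> entry : properties.entrySet()) {
            if (!entry.getKey().startsWith("partitions.")) {
                continue;
            }
            for (final String partition : entry.getValue().split(",")) {
                assigned.add(Integer.parseInt(partition.trim()));
                total++;
            }
        }
        // A duplicate assignment collapses in the set, so sizes would differ.
        if (assigned.size() != total) {
            return false;
        }
        // The sorted set must be the contiguous range 0..total-1.
        int expected = 0;
        for (final int partition : assigned) {
            if (partition != expected++) {
                return false;
            }
        }
        return true;
    }
}
```

For example, `partitions.node-1 = 0,1` together with `partitions.node-2 = 2,3` would pass, while overlapping or gapped assignments would fail validation.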

@exceptionfactory
Contributor

Thanks for the thoughtful reply @turcsanyip.

Although this capability and general approach were implemented in NiFi 1, part of the purpose of refactoring Kafka support was to provide better decoupling of features. The majority of that separation comes with the Kafka Connection Service, and it is good to see that these proposed changes have minimal impact on that Controller Service interface surface.

With that being said, the proposed changes do introduce significant logic into ConsumeKafka that is specific to this particular capability. Although it does not impact existing configuration, it does increase the maintenance of the ConsumeKafka implementation in general. For that reason, I would much rather see this particular use case decoupled into its own Processor.

A separate Processor also has benefits from a flow design perspective, as it provides a clearer distinction on the intended use case of static partition assignment to individual NiFi nodes. Given that such a use case is less common, keeping it distinct from the regular ConsumeKafka Processor is helpful from both a flow design and a component implementation perspective. It also provides the opportunity to define more of the implementation in the Processor itself, versus in a separate Util class. Of course these are tradeoffs, but I think the benefits outweigh the costs.
