Real-Time Domain Rankings with Kafka Streams

Kafka Summit SF 2017 | Stream Processing Track

The HITS algorithm assigns each document two scores: a “hubbiness” score and an “authority” score. It is usually run as a batch operation over all the data at once. With careful design, however, it can be implemented in a streaming architecture using KStreams and KTables, allowing efficient real-time sampling of rankings at a frequency appropriate to the specific use case.
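
For readers unfamiliar with HITS, the batch form mentioned above can be sketched in a few lines of Python. This is a minimal illustration of the standard power-iteration formulation, not the speaker's streaming implementation; the function names, the edge-list input format, and the fixed iteration count are all assumptions made for the example.

```python
from math import sqrt


def normalize(scores):
    """Scale scores to unit Euclidean length (leave all-zero input as is)."""
    norm = sqrt(sum(v * v for v in scores.values())) or 1.0
    return {node: v / norm for node, v in scores.items()}


def hits(edges, iterations=50):
    """Batch HITS over a list of (source, target) links.

    Hypothetical helper for illustration: returns (hub, authority) dicts.
    """
    nodes = {node for edge in edges for node in edge}
    hub = {node: 1.0 for node in nodes}
    auth = {node: 1.0 for node in nodes}

    for _ in range(iterations):
        # Authority of a node: sum of the hub scores of nodes linking to it.
        new_auth = {node: 0.0 for node in nodes}
        for src, dst in edges:
            new_auth[dst] += hub[src]

        # Hub score of a node: sum of the authority of the nodes it links to.
        new_hub = {node: 0.0 for node in nodes}
        for src, dst in edges:
            new_hub[src] += new_auth[dst]

        auth = normalize(new_auth)
        hub = normalize(new_hub)

    return hub, auth
```

Each pass touches every link, which is why the algorithm is normally run in batch. A streaming version would have to maintain these sums incrementally as links arrive, for example as aggregates held in KTables.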


Hunter Kelly