Kafka In the Cloud: Why It’s 10x Better With Confluent | Get free eBook

Real-Time Domain Rankings with Kafka Streams

Kafka Summit SF 2017 | Stream Processing Track

The HITS algorithm creates a score for documents; one is “hubbiness”, the other is “authority”. Usually this is done as a batch operation, working on all the data at once. However, with careful consideration, this can be implemented in a streaming architecture using KStreams and KTables, allowing efficient real time sampling of rankings at a frequency appropriate to the specific use case.


Hunter Kelly