
Real-Time Document Rankings with Kafka Streams

 

Kafka Summit SF 2017 | Stream Processing Track

The HITS algorithm assigns each document two scores: one for “hubbiness”, the other for “authority”. It is usually run as a batch operation over all the data at once. With careful design, however, it can be implemented in a streaming architecture using KStreams and KTables, so that rankings can be sampled efficiently in real time at a frequency appropriate to the specific use case.
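For reference, the batch form of HITS mentioned above can be sketched in a few lines. This is a minimal illustration of the standard iterative algorithm, not the streaming implementation presented in the talk; the function name `hits`, the edge-list input format, and the fixed iteration count are assumptions made for the example.

```python
import math


def hits(edges, iterations=50):
    """Batch HITS over a list of (source, target) link pairs.

    Hypothetical helper for illustration only; returns (hub, authority)
    dictionaries keyed by document id.
    """
    nodes = {node for edge in edges for node in edge}
    hub = {node: 1.0 for node in nodes}
    auth = {node: 1.0 for node in nodes}

    for _ in range(iterations):
        # Authority update: sum the hub scores of documents linking in.
        new_auth = {node: 0.0 for node in nodes}
        for src, dst in edges:
            new_auth[dst] += hub[src]
        norm = math.sqrt(sum(v * v for v in new_auth.values())) or 1.0
        auth = {node: v / norm for node, v in new_auth.items()}

        # Hub update: sum the authority scores of documents linked to.
        new_hub = {node: 0.0 for node in nodes}
        for src, dst in edges:
            new_hub[src] += auth[dst]
        norm = math.sqrt(sum(v * v for v in new_hub.values())) or 1.0
        hub = {node: v / norm for node, v in new_hub.items()}

    return hub, auth


# Tiny example graph: "a" links to "c" and "d", "b" links to "c".
links = [("a", "c"), ("b", "c"), ("a", "d")]
hub_scores, auth_scores = hits(links)
```

Each pass touches every link in the graph, which is why the algorithm is normally run in batch; a streaming version has to maintain these sums incrementally as links arrive.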

Hunter Kelly, Senior Software/Data Engineer, Zalando