MSc thesis work that enhances the clustering phase of graph partitioning to achieve a lower replication factor than the original approach (https://github.com/mayerrn/two_phase_streaming), while remaining independent of the stream order and requiring only a single pass over the edge stream.
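
For readers unfamiliar with the metric, the replication factor is commonly defined in edge partitioning as the average number of partitions in which a vertex appears, so 1.0 means no vertex is replicated. The snippet below is a minimal sketch of that standard definition, not code from this repository; the function name `replication_factor` and the `((u, v), partition_id)` input format are assumptions made for illustration.

```python
from collections import defaultdict


def replication_factor(edge_assignment):
    """Average number of partitions each vertex appears in.

    edge_assignment: iterable of ((u, v), partition_id) pairs
    (hypothetical input format, chosen for this example).
    """
    partitions_of = defaultdict(set)
    for (u, v), part in edge_assignment:
        # A vertex is replicated on every partition that holds one of its edges.
        partitions_of[u].add(part)
        partitions_of[v].add(part)
    if not partitions_of:
        raise ValueError("edge_assignment must contain at least one edge")
    total_replicas = sum(len(parts) for parts in partitions_of.values())
    return total_replicas / len(partitions_of)


# A 4-cycle split across two partitions: vertices 1 and 3 are cut.
assignment = [((1, 2), 0), ((2, 3), 0), ((3, 4), 1), ((4, 1), 1)]
print(replication_factor(assignment))
```

A lower value means fewer vertex copies have to be kept in sync across partitions, which is the quantity the improved clustering phase aims to reduce.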