_This developer preview was actually cut on July 21, 2014. The tag was not created at the time; it is being added now in preparation for the upcoming API-changing release, which supports pipelined communication and other new features._
Varys is an open-source network manager/scheduler that aims to improve the communication performance of Big Data applications. Its target applications/jobs include those written in Spark, Hadoop, YARN, BSP, and similar data-parallel frameworks.
Varys provides a simple API that allows data-parallel frameworks to express their communication requirements as coflows with minimal changes to the framework. Using coflows as the basic abstraction of network scheduling, Varys implements novel schedulers either to make applications faster or to make time-restricted applications complete within deadlines.
This initial release includes the following features:
- Support for in-memory and on-disk coflows,
- Efficient scheduling to minimize the average coflow completion time, and
- In the deadline-sensitive mode, support for soft deadlines.
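
To illustrate why the scheduling order matters for the average coflow completion time, here is a minimal Python sketch. It is not Varys's API or its actual scheduler: the `Flow` and `Coflow` classes, the function names, and the one-coflow-at-a-time model are all assumptions made for illustration. Each coflow is treated as a group of flows whose most heavily loaded sender or receiver determines how long it takes.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Flow:
    src: str     # sending machine (hypothetical name)
    dst: str     # receiving machine (hypothetical name)
    size: float  # bytes to transfer


@dataclass(frozen=True)
class Coflow:
    name: str
    flows: tuple  # all flows that must finish before the coflow is done


def bottleneck_time(coflow, bandwidth):
    """Time to finish the coflow alone: its most loaded port sets the pace."""
    sent = defaultdict(float)
    received = defaultdict(float)
    for f in coflow.flows:
        sent[f.src] += f.size
        received[f.dst] += f.size
    heaviest = max(list(sent.values()) + list(received.values()))
    return heaviest / bandwidth


def average_cct(coflows, bandwidth):
    """Average completion time when coflows run one after another, in order."""
    clock = 0.0
    total = 0.0
    for c in coflows:
        clock += bottleneck_time(c, bandwidth)
        total += clock
    return total / len(coflows)


def smallest_bottleneck_first(coflows, bandwidth):
    """Simplified ordering: run the coflow with the smallest bottleneck first."""
    return sorted(coflows, key=lambda c: bottleneck_time(c, bandwidth))


big = Coflow("shuffle-big", (Flow("m1", "r1", 800), Flow("m2", "r1", 400)))
small = Coflow("shuffle-small", (Flow("m1", "r2", 100), Flow("m2", "r2", 100)))
bandwidth = 100.0  # bytes per second on every port (assumed uniform)

fifo_avg = average_cct([big, small], bandwidth)
sorted_avg = average_cct(smallest_bottleneck_first([big, small], bandwidth), bandwidth)
print(fifo_avg, sorted_avg)
```

In this toy setup, running the coflows in arrival order gives an average completion time of 13 seconds, while running the smaller one first gives 8 seconds. The real system shares bandwidth among concurrent coflows, so treat this only as intuition for the objective.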
Here are some links if you want to check it out, contribute to make it better, or point it out to someone who can help us.
Project Website: http://varys.net
Git repository: https://github.com/coflow/varys
Relevant tools: https://github.com/coflow
Research papers with more details on coflows and coflow scheduling:
Support: