Road to Freedom in Big Data Analytics

  Provides an abstraction on top of Big Data platforms.

  Run data analytics over multiple data processing platforms.

  Users can focus entirely on the logic of their applications.


See what happens when a job is executed

How does Rheem work?

Write once, run on any platform

Express the logic of your jobs without worrying about which platform they will run on.
This separation is built on three layers of operators:
  • An application uses Rheem Operators, each an abstract UDF that acts as an application-specific unit of data processing.

  • The Core exposes a pool of Physical Operators, each a platform-independent implementation of a Rheem Operator.

  • Finally, the Execution Operator defines how a task is executed on the underlying processing platform.
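
The three layers above can be pictured with a small sketch. This is not Rheem's API: every class and function name below (`GroupByOperator`, `HashGroupBy`, `LocalHashGroupBy`, `ChunkedHashGroupBy`, `run`) is hypothetical, and the two "platforms" are toy stand-ins that run in a single Python process. It only illustrates the idea that one application-level operator can be bound to different execution operators without changing the application code.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Iterable, List


@dataclass(frozen=True)
class GroupByOperator:
    """Application layer (hypothetical): what to compute, expressed as a UDF."""
    key_udf: Callable[[Any], Any]


@dataclass(frozen=True)
class HashGroupBy:
    """Physical layer (hypothetical): a platform-independent algorithm choice."""
    logical: GroupByOperator


class LocalHashGroupBy:
    """Execution layer for a toy single-pass 'local' platform."""
    platform = "local"

    def execute(self, physical: HashGroupBy, records: Iterable[Any]) -> Dict[Any, List[Any]]:
        groups: Dict[Any, List[Any]] = {}
        for record in records:
            groups.setdefault(physical.logical.key_udf(record), []).append(record)
        return groups


class ChunkedHashGroupBy:
    """Execution layer for a toy 'partitioned' platform: group each chunk, then merge."""
    platform = "partitioned"

    def __init__(self, chunk_size: int = 2) -> None:
        self.chunk_size = chunk_size

    def execute(self, physical: HashGroupBy, records: Iterable[Any]) -> Dict[Any, List[Any]]:
        items = list(records)
        merged: Dict[Any, List[Any]] = {}
        for start in range(0, len(items), self.chunk_size):
            chunk = items[start:start + self.chunk_size]
            partial = LocalHashGroupBy().execute(physical, chunk)
            for key, values in partial.items():
                merged.setdefault(key, []).extend(values)
        return merged


def run(operator: GroupByOperator, records: Iterable[Any], platform: str) -> Dict[Any, List[Any]]:
    """Map the application operator to a physical one, then to an execution operator."""
    physical = HashGroupBy(operator)
    executors = {e.platform: e for e in (LocalHashGroupBy(), ChunkedHashGroupBy())}
    return executors[platform].execute(physical, records)


# The application is written once; only the target platform changes.
words = ["apple", "avocado", "banana", "blueberry", "cherry"]
by_first_letter = GroupByOperator(key_udf=lambda w: w[0])
local = run(by_first_letter, words, "local")
partitioned = run(by_first_letter, words, "partitioned")
```

In this sketch the application only ever touches `GroupByOperator`; swapping the `platform` argument changes how the work is carried out, not what is computed.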


Enjoy the advantages of Rheem


Run a single data analytics task on top of any set of data processing platforms.


Rheem selects the best available data processing platform for each incoming query.
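
One way to picture platform selection is a cost comparison: estimate what a query would cost on each available platform and pick the cheapest. The sketch below assumes a deliberately simple linear cost model (a fixed startup cost plus a per-record cost). The platform names and numbers are invented for illustration and are not measurements, and nothing here describes how Rheem's own optimizer is implemented.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class PlatformProfile:
    """Hypothetical cost profile of a processing platform."""
    name: str
    startup_cost: float      # fixed overhead per job (illustrative units)
    cost_per_record: float   # marginal cost per input record (illustrative units)

    def estimate(self, num_records: int) -> float:
        """Estimated cost of processing num_records on this platform."""
        return self.startup_cost + self.cost_per_record * num_records


# Invented numbers: a cheap-to-start single node and a cluster that scales better.
PLATFORMS = (
    PlatformProfile("single-node", startup_cost=0.1, cost_per_record=0.001),
    PlatformProfile("cluster", startup_cost=5.0, cost_per_record=0.00001),
)


def choose_platform(num_records: int,
                    platforms: Sequence[PlatformProfile] = PLATFORMS) -> PlatformProfile:
    """Return the platform with the lowest estimated cost for this input size."""
    return min(platforms, key=lambda p: p.estimate(num_records))


small_choice = choose_platform(1_000).name
large_choice = choose_platform(10_000_000).name
```

With these made-up profiles, a small input stays on the single node because the cluster's startup overhead dominates, while a large input goes to the cluster because its lower per-record cost wins.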


User defined functions (UDFs) as first-class citizens, enabling extensibility and adaptability.


A simple interface that lets developers focus only on the logic of their application.

Cost Saving

Fast development of data analytics applications.

Open Source

All code is on GitHub under the Apache License.