Distributed Computing Systems Group

Nebula Edge Cloud


Centralized cloud systems are the de-facto platform for large-scale data analysis in many domains. However, this model is often unfit for many kinds of data analysis whose data itself is produced in a distributed fashion around the globe. For geo-distributed data analysis, the centralized nature of traditional clouds requires data to be brought into a central location for processing over the wide area network (WAN) which is highly heterogeneous, slow, and often costly.

Nebula edge cloud project explores the use of distributed edge resources to mitigate the overhead faced by centralized clouds for geo-distributed data-intensive applications. Nebula utilizes distributed storage and compute (volunteer) resources that are connected over a wide-area network. Nebula provides lightweight architecture and implements a number of optimizations to enable efficient exploitation of edge resources for in-situ data-intensive computing, including location-aware data and computation placement, replication, and recovery.

Nebula has been designed with the following goals in mind:

Geo-distributed Edge Cloud

Nebula components

Nebula Architecture

Demo Video





We would like to acknowledge NSF Grant: NSFCSR 1162405, which supported this research.
National Science Foundation Logo
National Science Foundation