Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 3 additions & 6 deletions justification/max.txt
Original file line number Diff line number Diff line change
@@ -1,6 +1,3 @@
Organisations use distributed stream processing systems to run never-ending services to drive their business-critical
analytics. Therefore, it is crucial that the systems are fault-tolerant. The three papers that have been selected all take a
different approach to achieve fault-tolerance and they are as following: (1) lineage-based recovery; (2) global consistent
checkpointing; (3) distributed state store. The aim is to gain a deeper understanding of the tradeoffs between the different
techniques. But, also to get an insight on which may be most applicable for future systems (Arcon) when taking the growing
network speeds into consideration.
The dataflow processing model is at the core of modern distributed data processing frameworks. In this talk, we will look into
3 systems (MillWheel, Ray, and Naiad) that utilise the aforementioned model. The aim is to gain a deeper understanding of the
design choices (e.g., how the dataflow model is used, how the system approaches state) and their implications for different applications.
Binary file not shown.
Binary file not shown.
Binary file added papers/Naiad_A_Timely_Dataflow_System.pdf
Binary file not shown.
Binary file not shown.