This document describes Flink’s fault tolerance mechanism for streaming data flows.

Introduction
Checkpointing
Barriers
State
Exactly Once vs. At Least Once
Asynchronous State Snapshots
Recovery
Operator Snapshot Implementation

Introduction

Apache Flink offers a fault tolerance mechanism to consistently recover the state of data streaming applications. The mechanism ensures that even in the presen