  1. Application checkpointing - Wikipedia

    Checkpointing is a technique that provides fault tolerance for computing systems. It involves saving a snapshot of an application's state, so that it can restart from that point in case of failure.

  2. Both bisection and binomial checkpointing require knowing the number of steps taken in the primal code; these are known as “offline” checkpointing algorithms.

  3. What Does Checkpointing Mean | Dagster

    Checkpointing, in the context of data processing and pipeline execution, refers to the practice of saving the state of a process at certain points so that it can be restarted from that point in case …

  4. Checkpointing Jobs - NURC RTD - Northeastern University

    Checkpointing is a fault tolerance technique based on the Backward Error Recovery (BER) technique, designed to overcome “fail-stop” failures (interruptions during the execution of a job).

  5. Checkpointing | Springer Nature Link (formerly SpringerLink)

    The process of writing the computation’s state is referred to as Checkpointing, the data written as the Checkpoint, and the continuation of the application as Restart or Recovery. The execution …

  6. Checkpointing Jobs - CHTC

    Checkpointing is a technique that provides fault tolerance for a user’s analysis. It consists of saving snapshots of a job’s progress so the job can be restarted without losing its progress …

  7. A coordinated checkpointing and recovery technique that takes a consistent set of checkpoints and avoids the domino effect and livelock problems during recovery

  8. Checkpointing Overview | RCD Documentation

    Jul 17, 2025 · Checkpointing is when your job regularly saves its state so that, if it is prematurely ended, it can be restarted and resume from the checkpoint rather than from the beginning. This is a …

  9. A survey on checkpointing strategies: Should we always …

    Dec 1, 2024 · Without checkpointing, all the work executed for the application is lost. With checkpointing, the execution can resume from the last checkpoint, after some downtime (enroll …

  10. Checkpointing and Rollback Error Recovery - IEEE Xplore

    Checkpointing is an important technique for recovery by means of rollback and re‐execution of a process following error detection. There are two types of error recovery: forward error recovery …
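
The idea shared by these results (save a snapshot of state, then restart from it after a failure) can be illustrated with a minimal Python sketch. It is not taken from any of the listed sources: the function name `run_with_checkpoints`, the JSON file format, and the `fail_at` parameter used to simulate a fail-stop failure are all assumptions made for illustration.

```python
import json
import os


def run_with_checkpoints(total_steps, path, fail_at=None):
    """Sum the integers 0..total_steps-1, checkpointing after every step.

    `path` is a hypothetical checkpoint file; `fail_at` (hypothetical)
    simulates a fail-stop failure just before the given step runs.
    """
    state = {"step": 0, "total": 0}

    # Restart: if a checkpoint exists, resume from it instead of from step 0.
    if os.path.exists(path):
        with open(path) as f:
            state = json.load(f)

    while state["step"] < total_steps:
        if fail_at is not None and state["step"] == fail_at:
            raise RuntimeError("simulated fail-stop failure")

        # One unit of work.
        state["total"] += state["step"]
        state["step"] += 1

        # Checkpoint: write to a temporary file, then replace the old
        # checkpoint, so a crash mid-write does not corrupt the last good one.
        tmp = path + ".tmp"
        with open(tmp, "w") as f:
            json.dump(state, f)
        os.replace(tmp, path)

    return state["total"]
```

Calling the function once with `fail_at` set and then again with the same `path` resumes from the last saved step rather than from the beginning, which is the backward (rollback) recovery behaviour the snippets describe. Checkpointing after every step is chosen here only for simplicity; real jobs usually checkpoint less often to limit overhead.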