Monitoring Data Replication

Topics | How To | Related Topics


Overview

Job Phases

Job States

Job Details


Overview

All details of on-going replication activity are shown in the Data Replication Monitor in the CommCell Console, since this activity is continuous, and not job-based. All other activity, such as Recovery Point creation, is reflected in the Job Controller.

In the Data Replication Monitor, right-click any Replication Pair to view additional details, view any failed or orphan files, or select/create a filter to specify which clients' activity is displayed in the Data Replication Monitor.


Job Phases

CDR utilizes phases to perform three types of operations -- initial data transfer or baselining, smart synchronization, and continuous data replication. The sequence of these phases is listed below along with details of CDR activities during each phase, and the consequence of an interruption, such as a temporary loss of connectivity:

JOB PHASE ASSOCIATED ACTIVITY COMMENTS
Baseline Scan
  • For Windows only, start NTFS journaling on the source to track any file operations that occur during the entire Baseline phase.
  • Scan source path to obtain the number of files and bytes to transfer.
  • Generate Collect File.
The Replication Pair will show a Job State of "Preparing for Replication" in the Data Replication Monitor.

If this phase is interrupted:

  • for Windows, it can resume again at the same point
  • for UNIX, it will start over

A Full Re-Sync will start at this phase.

Baseline
  • Data is transferred from the Replication Pair source path to the destination path.
If this phase is interrupted, it can resume again at the same point.
SmartSync Scan
  • Create a non-persistent snapshot; for Windows, compare it to the change journal.
  • Scan snapshot and generate a new Collect File for any files or directories that were added or data that was modified since the beginning of the Baseline Scan phase.
For Windows:
  • During the SmartSync Scan phase, CDR requires a short period of no disk activity on the source to begin monitoring the source path, and if there is significant I/O on the source, this can fail, although CDR will continue making successive attempts to find a short period of inactivity. When you have multiple, active Replication Pairs on the same source computer, CDR requires this short period of no disk activity for all of them, even for Replication Pairs that use different drives as their source. For example, if you have a Replication Pair on the Client which is already replicating, using F:\ as its source, and you create a new Replication Pair on the same Client using G:\ as its source, too much I/O on F:\ can cause CDR to fail to begin monitoring G:\.

For UNIX:

  • To create the non-persistent snapshot on the source, CDR requires a short period of time, during which no files are deleted, created, or renamed. (File writes will not affect the snapshot.) If one of these three operations occurs while the snapshot is being taken, the process of creating the snapshot will begin again.

Note that while new files and directories will be copied in their entirety, modified files do not need to be; thus, for larger files, only the modified portion is re-copied, while smaller files with substantial change may be copied in their entirety. Modified files below a certain size threshold are copied again as complete files, while files above that size are broken into blocks with just the changed blocks copied to the destination computer.

  • Files smaller than 256KB will be copied in their entirety whether they match the destination or not. For files above 256KB in size, only the changed blocks will be transferred; the default block size for hashing is 64KB. The default values of the minimum file size and the block size for hashing, can be configured using the Advanced tab of the Replication Set Properties dialog box.

If this phase is interrupted:

  • for Windows, it can resume again at the same point
  • for UNIX, it will start over

A Smart Re-Sync will start at this phase.

Processing Orphan Files
  • Compare the Collect File to the Destination to identify orphan files, and apply orphan file settings.
Any data that was deleted on the replication source during the Baselining phases are treated according to your settings for Orphan Files.

If this phase is interrupted:

  • for Windows, it will resume again from the beginning of this phase; however, if the snapshot is no longer available, it will return to the SmartSync Scan phase.
  • for UNIX, it will return to the SmartSync Scan phase.
Checksum Calculation

(CDR on Windows only)

  • Calculate checksums on the source and destination to identify files that have changed since Baseline Scan.
If this phase is interrupted, it will resume again from the beginning of this phase; however, if the snapshot is no longer available, it will return to the SmartSync Scan phase.
SmartSync
  • Transfer all changed files to destination from the new Collect File.
If this phase is interrupted:
  • for Windows, it will resume again from the beginning of this phase; however, if the snapshot is no longer available, it will return to the SmartSync Scan phase.
  • for UNIX, it will return to the SmartSync Scan phase.
Updating Smart Sync

(CDR on Windows only)

  • Compare time stamps on source and destination and update.
  • Temporary snapshot is deleted.
If this phase is interrupted, it will resume again from the beginning of this phase.
Replication
  • Data is continuously replicated from the source to destination.
Log Transfer & Log Replay activity is on-going. For more information, refer to Replication Logs.

The Replication Pair will show a Job State of "Replicating" in the Data Replication Monitor.

If the Replication phase is interrupted, when restarted, if it is possible, replication will begin again from the last log replayed on the destination; if this is not possible, the Replication Pair will return to the Baseline Scan phase (Full Re-Sync) or to the SmartSync Scan phase (a Smart Re-Sync) depending on the nature and duration of the interruption. Note that if a user manually restarts Replication by choosing "Start Full Resync", the Replication Pair will return to the Baseline Scan phase.


Job States

The Data Replication Monitor shows the state of each Replication Pair. These states are briefly described:

To see more information about a particular Replication Pair, see View details of data replication activities.

You can change the state of a Replication Pair, or several at the same time. See Start/Suspend/Resume/Abort Data Replication Activity.

Considerations


Job Details

The following information is available in the Data Replication Monitor:

General

Initial Sync Information

Replicating State Information

Configuration


Attempts

The following information is available in the Attempts window:

Back to Top