SIDB Simulator

Scope

Intended Audience
All X
CommVault Internal  
CE/System Test  
Development  

Release

8.0.0

Description

This tool estimates the performance of the disk on which you plan to create the deduplication store, so that you can determine whether that disk is performing optimally. The tool simulates the deduplication operation by querying and inserting a large number of records into the Deduplication Database. During the simulation, when the program detects slowness in the system, informational events are printed.

Use this tool to get the following estimates about the performance of the disk:

  • The average query and insert time for a given application data size.
  • The maximum size of application data that can be backed up using the deduplication database for a given time limit.
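The measurement idea described above can be illustrated with a minimal sketch. It is not the tool's implementation: an in-memory Python dict stands in for the Deduplication Database (so no disk is exercised), and the function name simulate and the parameters total_records, sample_every, and unique_every are hypothetical names chosen for this illustration.

```python
import time

def simulate(total_records, sample_every, unique_every=2):
    """Time query-then-insert operations against a dict standing in for the database."""
    store = {}
    samples = []  # (records processed, average microseconds per operation)
    window_start = time.perf_counter()
    for i in range(1, total_records + 1):
        # Repeat signatures so that some lookups find an existing record.
        signature = i // unique_every
        if signature not in store:   # query
            store[signature] = i     # insert only when the query misses
        if i % sample_every == 0:
            elapsed = time.perf_counter() - window_start
            samples.append((i, elapsed / sample_every * 1e6))
            window_start = time.perf_counter()
    return samples

samples = simulate(total_records=100_000, sample_every=10_000)
for processed, avg_us in samples:
    print(f"{processed:>7} records: {avg_us:.3f} microseconds per query/insert")
```

Each printed line corresponds to one averaged sample, which is the kind of value the tool reports as the average query and insert time.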

Usage

Running the Utility

To start the utility, perform the following:

  1. From the Start Menu, right-click the Command Prompt and then click Run as administrator.

    If the User Account Control dialog box appears, click Yes or Continue, depending on the operating system version.

  2. From the Command Prompt, navigate to the following location:

    <software_installation_path>/Base

  3. Execute the following command:

    sidb_sim.jar

The SIDB Simulator dialog box will be displayed with the following options:

  • Determine max data handled by the dedup database for given threshold query response time
  • Determine the average query response time for given application data size

 

Determining the maximum application data size 

You can find out the maximum size of the application data that can be backed up using the deduplication database for a given access time limit for each record.

To find out the maximum size of the application data:

  1. On the SIDB Simulator dialog box, select Determine max data handled by the dedup database for given threshold query response time.
  2. In the Directory to use for Simulation, specify the directory and click Continue.
  3. Specify the following parameters:
    • Time Limit - The access time limit for each record, in microseconds.
    • Dedup Ratio - The expected deduplication ratio.
    • Block Size - The deduplication data block size, in KB.
    • Iterations for Sampling Query Time - The number of iterations performed to determine the query and insert time at a particular application data size. The default value is 10000 iterations.
    • Continue Simulation upto - The percentage of the threshold time limit by which the simulation can continue after the threshold has been reached. For example, when you specify 10000 microseconds as the threshold time limit and set the Continue Simulation upto value to 50%, the simulation continues up to 15000 microseconds.
  4. If you want to estimate the disk performance statistics, select the Measure Disk Performance check box.
  5. If you want to clean the simulation data after the simulation is completed, select the clean db files after simulation check box.
  6. Click Next.
  7. The SIDB Simulator dialog box displays the parameters and their values. The tool uses these parameters to run the simulation. Click Start to initiate the simulation process.
  8. The Simulation Report tab of the SIDB Simulator dialog box displays the progress of the simulation process, along with a graph of Query Time against Data Size. Click Maximize to see the details in the graph.

    Note

    Each data point in the graph is the average of the last n iterations, where n is the value of Iterations for Sampling Query Time.

  9. The Results tab of the SIDB Simulator dialog box displays the results of the simulation. The results display the maximum size of the application data that can be backed up using the deduplication database for the given access time limit.

    You can also view the results of the simulation in the cmd_out.log file. This file will be available in the SIDB folder on your disk.
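The arithmetic behind the Continue Simulation upto example and the graph note can be sketched as follows. This is an illustration under assumptions, not the tool's code: the function names stop_limit and window_averages are hypothetical, and grouping the timings into consecutive windows of n iterations is one plausible reading of "the average of the last n iterations".

```python
def stop_limit(threshold, continue_percent):
    """Value at which the simulation stops: the threshold plus the extra percentage."""
    return threshold * (1 + continue_percent / 100)

def window_averages(times_us, n):
    """Average each consecutive group of n per-operation times into one data point."""
    return [sum(times_us[i:i + n]) / n
            for i in range(0, len(times_us) - n + 1, n)]

# Example from the parameter description: 10000 microseconds with 50% extra.
print(stop_limit(10000, 50))  # 15000.0

# Six made-up timings averaged in windows of three give two data points.
print(window_averages([100, 200, 300, 500, 700, 900], 3))  # [200.0, 700.0]
```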

 

Determining the average query response time

You can find out the average query response time for each record for a given data size.

To find out the average query response time:

  1. On the SIDB Simulator dialog box, select Determine the average query response time for given application data size.
  2. In the Directory to use for Simulation, specify the directory and click Continue.
  3. Specify the following parameters:
    • Data Size - The application data size, in GB.
    • Dedup Ratio - The expected deduplication ratio.
    • Block Size - The deduplication data block size, in KB.
    • Iterations for Sampling Query Time - The number of iterations performed to determine the query and insert time at a particular application data size. The default value is 10000 iterations.
    • Continue Simulation upto - The percentage of the threshold data size by which the simulation can continue after the threshold has been reached. For example, when you specify 10 GB as the threshold data size and set the Continue Simulation upto value to 50%, the simulation continues up to 15 GB.
  4. If you want to estimate the disk performance statistics, select the Measure Disk Performance check box.
  5. If you want to clean the simulation data after the simulation is completed, select the clean db files after simulation check box.
  6. Click Next.
  7. The SIDB Simulator dialog box displays the parameters and their values. The tool uses these parameters to run the simulation. Click Start to initiate the simulation process.
  8. The Simulation Report tab of the SIDB Simulator dialog box displays the progress of the simulation process, along with a graph of Query Time against Data Size. Click Maximize to see the details in the graph.

    Note

    Each data point in the graph is the average of the last n iterations, where n is the value of Iterations for Sampling Query Time.

  9. The Results tab of the SIDB Simulator dialog box displays the results of the simulation. The results contain the estimated average transaction time for an insert/query in the deduplication database based on the size of the application data that is backed up.

    You can also view the results of the simulation in the cmd_out.log file. This file will be available in the SIDB folder on your disk.
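One way to put the reported average transaction time to use is to estimate how much time the database lookups add for a whole backup. The sketch below is a rough illustration only. It assumes one query/insert per data block and 1 GB = 1024 × 1024 KB; the function names, the 128 KB block size, and the 200 microsecond average are hypothetical values, not figures from the tool.

```python
def block_count(data_size_gb, block_size_kb):
    """Number of data blocks in the application data (1 GB = 1024 * 1024 KB)."""
    return data_size_gb * 1024 * 1024 // block_size_kb

def total_lookup_seconds(data_size_gb, block_size_kb, avg_time_us):
    """Total time spent in the database, assuming one query/insert per block."""
    return block_count(data_size_gb, block_size_kb) * avg_time_us / 1_000_000

# Hypothetical inputs: 10 GB of data, 128 KB blocks, 200 microseconds per transaction.
print(block_count(10, 128))                # 81920
print(total_lookup_seconds(10, 128, 200))  # 16.384
```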

History

Revision Date Description
Revision 1 March 2010  

Disclaimer of Warranties

CommVault provides software tools and utilities on an "as is" basis, as a courtesy and convenience to its users, and they may be changed from time to time without notice. These tools and utilities are provided without express, implied, or statutory warranties of any kind. You assume full risk and responsibility for the selection, possession, performance and proper installation and use of the software tools and utilities, and for verifying the results obtained therefrom. Your use of CommVault software tools and utilities indicates your unconditional acceptance of the above disclaimer.