Configuration - Content Indexing and Search

Topics | How To | Related Topics


Content Indexing Engine

Offline Content Indexing

Online Content Indexing

Security

Search Console

User Administration

Agent-Specific Configuration

Upgrade Considerations


Content Indexing Engine

Once you have installed the Content Indexing Engine, you can set the following options in the CI Engine Properties dialog box from the CommCell Console to improve the performance of the content indexing operation: (In the case of multi-node installation, you can set these options for the Content Indexing Engine in the Admin node.)

Maximum Number of Batch Slots - You can set this option to determine the maximum number of batch slots to be sent at a time to the Content Indexing Server for content indexing. By default, the value is set to 40. It is recommended to set this value to 80.

Maximum Number of Documents Per Batch - You can set this option to determine the maximum number of documents to be included in a batch for content indexing. By default, the value is set to 100. It is always recommended to include 20 documents in a batch.

In addition to the above configurations, you can use the following registry keys to set the maximum time taken for the Content Indexing Engine to process a batch.

For step-by-step instructions on setting these options, see Configure Content Indexing Engine Options.

DocProcessingMaxTime - Use this registry key to set the timeout value for the "document processed" (first acknowledgement) notification from the Content Indexing Engine.

DocReceivedByIndexingMaxTime - Use this registry key to set the timeout value for the "document received by the indexer" (second acknowledgement) notification from the Content Indexing Engine.

DocPersistedByIndexingMaxTime - Use this registry key to set the timeout value for the "document persisted by the indexer" (third acknowledgement) notification from the Content Indexing Engine.

Configuring the Staging Location

When content indexing large files, you will need to place the files to be content indexed in a temporary staging location prior to content indexing.

The staging path can either be on the local drive or on a network share. When specifying the staging location, ensure that the size (in KB) of the staging directory is twice greater than the number of batches * number of files per batch * the average file size in KB.

Configuring a Local Drive as the Staging Location

When you specify a local staging path, make sure that the staging path is not on the same drive as that of the content index.

  1. From the CommCell Browser, navigate to Storage Resources | Content Indexing Engines.
  2. Right-click the <Content Indexing Engine> and select Properties.
  3. Click Index Large Files (files greater than 50 MB).
  4. In the Staging Path box, type the path where the files will be staged. Alternatively, click the Browse button to select the staging path.
  5. Click Browse and select the location where the files will be staged.
  6. Click OK.
  7. Restart the Content Indexing Services.

Configuring a Network Share as the Staging Location

On a multi-node Content Indexing Engine, you can use a network share to stage the files to be content indexed. When using a network share, you will also need to set the user authentication to access the network share. Note the following when specifying the network share:

  1. From the CommCell Browser, navigate to Storage Resources | Content Indexing Engines.
  2. Right-click the <Content Indexing Engine> and select Properties.
  3. Click Index Large Files (files greater than 50 MB).
  4. Click Use Network Share.
  5. In the Staging Path box, type the network path where the files will be staged. Alternatively, click the Browse button to select the staging path.
  6. Click Change.
  7. Type the user account credentials to access the network share and then click OK.
  8. Click OK.
  9. Restart the Content Indexing Services.
  10. If the indexing nodes are on Windows 2008 R2, disable the Server Message Block (SMB2). Refer the Microsoft KB Article 950836 for more details.

Offline Content Indexing

After installing the Content Indexing Engine, you can configure the Offline Content Index before running or scheduling content indexing operations. This includes the following tasks: (If you have not already installed the software, see Deployment - Content Indexing and Search for more information on how to install the Content Indexing Engine.)

Content Indexing RMS Protected Files

Rights Management Service (RMS) is a technology that works with RMS enabled applications (such as, Microsoft Office applications, Microsoft Exchange Server, and Microsoft SharePoint) to set usage rights on the documents or emails. This is basically used by content authors to set permissions on their documents/emails so as to limit access to other users. For more information on Rights Management Service, refer Microsoft documentation.

Offline content indexing is supported for RMS protected documents/emails of the following applications:

Use the following steps to enable content indexing for RMS protected files:

  1. Install the RMS client and RMS SDK (Software Development Kit) on the MediaAgent used for offline content indexing. On Windows 2008 platforms, the RMS client is installed by default.
  2. Add the Certificate URL of the RMS web site to the trusted sites in your web browser.
  1. From the Browser window, click Tools | Internet Options.
  2. Click the Security tab.
  3. Select Trusted Sites and click Custom Level.
  4. Select Automatic logon with current user name and password.
  5. Click OK.
  1. From the CommCell Browser, right-click the <Client> for which you wish to enable content indexing and select Properties.
  2. Click the Content Indexing tab.
  3. Select the Decrypt RMS document during content indexing checkbox.
  4. In the SMTP Address of the RMS super user, type the SMTP address of the super user. Example: user1@xyz.com
  5. In the User Name box, type the user account of the super user. For example: user1
  6. In the Password box, type the password for the user account.
  7. In the Confirm Password box, re-type the password.
  8. Click OK.

Other Considerations

Once you have configured offline content indexing, you can start to run or schedule the Content Indexing Operations. See Operations - Content Indexing and Search for more information.


Other Considerations

General


Online Content Indexing

Once you have installed an Online Content Indexing agent, you need to configure it before running or scheduling any online content indexing operations. (If you have not already installed the agent, see Deployment - Content Indexing and Search for information on installing the Online Content Indexing agents.)

When installed, the software by default creates a content index set with a default subclient. However, you can also create user-defined subclients based on your content indexing needs. Prior to performing content indexing operations, configure the following:

If necessary, you can also configure the following:

The other configurable properties available for the Online Content Indexing Agents are User Administration and Security and Activity Control.

Other Considerations

General

Make sure that the Windows Operating system is configured for best performance and priority is given for background processes.

Online Content Indexing for File System Agent

Once the Online Content Indexing subclient is configured you can start to run or schedule the Content Indexing Operations on the subclient. See Operations - Content Indexing and Search for more information.


Security

Security must be configured in the CommCell to grant permissions for users and user groups to search data before searches can be performed. This includes the following tasks for each search tool:

Search Console

Perform the following security configuration tasks for the Search Console as appropriate for your implementation:

CommCell Console

Perform the following security configuration tasks for the CommCell Console as appropriate for your implementation:

Outlook Add-In

Perform the following security configuration tasks for the Outlook Add-In as appropriate for your implementation:

Single Sign On

The Single Sign On (SSO) feature enables users to login to the CommServe using their user-account credentials from the Active Directory service provider, inheriting capabilities on the CommServe based on their Active Directory group membership mapping on the CommServe user groups, which must include the Browse capabilities. Single sign on works only for Intranet based sites and will be working for the domain where the WebClient is installed. To support SSO on other domains you will need to install a web client package on a machine which belongs to the domain.

Prior to enabling Single Sign On on a Name Server, note the following:

For step-by-step instructions on enabling Single Sign On, See Enable/disable Single Sign On.

You can configure the Single sign On from the Command line interface.

Follow the procedure given below to configure Single sign On from command prompt.

  1. Open the command prompt.
  2. Run the following command located at the base folder to add a Service Principal Name.

    cvspn.bat -A domainName\userName

    Here the userName must match with the Name Server Registration done in the CommCell GUI.

Once SSO is Enabled on the NameServer in the Commcell Console, you need to restart the Webserver(IIS) since domain Information is cached in the server.

 
  • If the user profile used for the Name server registration is modified in the GUI, you have to delete the previous registration associated with the username and password that was used for SSO.

Run the following command from the command prompt to delete the Service Principal Name.

cvspn.bat -D domainName\userName

Here the userName must match with the NameServer Registration done in the CommCell GUI.

  • Make sure that you perform the registration procedure to add new Service Principal Name  after deleting the old Service Principal Name.

Run the following command to add a new Service Principal Name.

cvspn.bat -A domainName\userName

  • You cannot add a Service Principal Name twice with two different users, for example:

    prompt>setspn.exe -A domainName\userName1

    prompt>setspn.exe -A domainName\userName2


Search Console

Once you have installed the Web Search Server and the Web Search Client, you need to perform certain configuration tasks before you begin searching for data from the Search Console. This includes the following tasks: (If you have not already installed the software, see Deployment - Content Indexing and Search for more information on how to install the Web Search Server and Web Search Client.)

Once the configuration tasks have been completed, and content indexing operations have been performed, you can begin conducting searches on the data. For more information, see Data Discovery and Search.

You can control the disk space utilization and search result display for each user from the User Administration page of the Search Console. To do this, you need to be a user with CommCell wide administrative rights. For more information, see User Administration.

To change the location of the URLs for accessing the Search Console or User Administration page, see Configure the Search Server URLs.

You can also view the name and URL of the Web Search Server associated with the Web Search Client. Note that, the Web Search Server association cannot be changed without re-installing the Web Search Client. Also note that, the Web Search Server URL specified in the Client Properties (Search Server URLs) tab is not directly accessible by the user from any Web browsers.

To change the language preferences for the Search Console, see Select Language Preferences for Search Console

To view the supported languages, see Languages - Support.

Enabling Delegated Search

Delegated search allows end-users to search for Exchange emails on delegated mailboxes configured through Outlook. When you enable Delegated Search, the CommServe collects the Outlook-configured delegate information of mailboxes from all the associated Exchange Servers, every 24 hours. This information is later used to search delegated mailboxes from the Web Console. For information on delegating mailbox folders to other users from Outlook, refer Microsoft documentation.

When you perform a delegated search on Exchange mailbox backups/archives from a previous version, the search results may not be accurate since the emails would have not been optimized prior to the backup/archive operation.

Make sure one of the following agents is installed to collect the Delegated Information: Mailbox, Mailbox Archiver, Public Folder, Mailbox Public Folder Archiver.

Use the following steps to enable delegated search:

  1. From the ComCell Browser, right-click the <CommServe> and then select Control Panel.
  2. Double-click Browse/Search/Recovery.
  3. Select the Enable Exchange Delegate Search check box.
  4. Click OK.

 

Once enabled, you can perform search on delegated mailboxes from the Web Console. See Searching Emails in Delegated Mailboxes for step-by-step instructions.

Enabling Secured Access for Web Search Client

Use the following steps to enable secured access on the Web search client. This will allow you to access the Search Console using https instead of http.

Installing Java with all the updates

  1. Stop the Tomcat services
  2. If UAC is enabled, disable it.
  3. Download and install the latest version of JAVA with all the updates on the Web client computer.
  4. Open the command prompt window on the Web client computer, and execute the following command to verify if JAVA has been properly installed on the Web client computer:

    C:\java -version

    If you find that JAVA with the latest updates has not been installed, uninstall JAVA that you have installed and reinstall it again with all the updates. Navigate to the command prompt on the Web client computer, and run the command specified in step 4 to verify if JAVA has been installed successfully.

    Skip this step if the updates are installed successfully.

  5. Start Tomcat services. If the Tomcat fails to start, point the JVM manually to Tomcat using the following steps:

Configuring SSL on the Tomcat Server

Use the following steps for configuring SSL (Secure Socket layer) on the Tomcat Server:

  1. Navigate to command prompt and run the following command:

    C:\Program Files\Java\jre6\bin>keytool -genkey -alias cvtomcat -keyalg RSA -keystore "C:\Program Files\company\product\Apache\cert\keystore"

  2. Backup the server.xml file located in <product_install_path>\Apache\conf before making any changes to it.
  3. In order to setup a JAVA JSSE connector to support SSL, search for the following entry in the server.xml:

    <Listener className="org.apache.catalina.core.AprLifecycleListener" SSLEngine="on" />

    Modify the above entry as following:

    <Listener className="org.apache.catalina.core.AprLifecycleListener" SSLEngine="off" />

  4. Add the following entry to the server.xml file:

    <Connector port="443" protocol="org.apache.coyote.http11.Http11NioProtocol" SSLEnabled="true" maxThreads="150" scheme="https" secure="true" clientAuth="false" keystoreFile="C:/Program Files/CommVault/Simpana/Apache/cert/keystore" keystorePass="mnoettomcat" sslProtocol="TLS" />

  5. Save the server.xml file, and restart the Tomcat services.

Configure the Tomcat Server to use SSL from a Certificate Authority (CA)

Prior to configuring SSL on the compliance search client running Apache Tomcat Server, note the following:

Use the following steps to enable SSL on the Tomcat server:

  1. Create the key store (with key-pair/certificate to be signed)

    keytool -genkey -alias tomcat -keyalg RSA -keystore "C:\mykeystore.jks"

    This command requires the following parameters:

    Parameter Description
    Alias  In this case, we used alias tomcat and is used for reference purposes while importing or installing the certificate. This can be chosen as any simple name used for cross reference. After certificate signing is done by certificate authority and returned back to the customer, then we’d need to use exact same alias to import the certificate. Importing / installing certificate step is explained later.
    Password Default is changeit and it is recommended to use a strong password
    First and Last name Site (fully qualified domain) Name such as www.testsite.com, someName.somecompany.com which has to run using https When requesting for wildcard certificate, it can be *.someportal.com.

    If the value given for this parameter is not the starting part of the URL for the web site (server) you are requesting the certificate, then note that browser may treat it as an untrusted site. An error or warning message like this would be shown in such cases: The security certificate presented by this website was issued for a different website's address.

    Organizational Unit Optional. If applicable, you can enter the DBA (Doing Business As) name in this field.
    Organization Name Full legal name of your organization. This listed organization must be legal registrant of the domain name in the certificate request. If you are enrolling as an individual, please enter the certificate requestor’s name.
    City / Locality Name of the city (do not abbreviate) in which your organization is located.
    State / Province Name of state or province (do not abbreviate) where your organization is located.
    Country Code The two letter international organization for standardization (ISO) format country code where your organization is legally registered.
  2. Generate CSR (Certificate Signing Request)

    keytool -certreq -keyalg RSA -alias tomcat -file C:\somename.csr -keystore C:\mykeystore.jks

    Parameter Description
    Alias Should be same as the one used when generating keystore.
    File Path including file for CSR creation
    Keystore Should be the path including file name of keystore we just created

    You do not need to change the following parameters: -certreq -keyalg RSA

  3. Upload the certificate signing request to CA web site and indicate the type of server (Tomcat) and submit for signing.
  4. Install / Import the signed certificates issued by CA:
    Note that this may be different based on the certificate authority and it is recommended to follow the KB article or guide-lines provided by the CA.

    The following certificates need to be downloaded and installed:

  5. Configure the Tomcat server to use the signed certificate.

User Administration

The User Administration page enables you to set user preferences, such as disk space utilization and search result display, etc., for each user for performing searches from the Search Console. You can also use this page to upload customized logos and view the search analysis on the Content Indexing Engine.

In order to access the User Administration page, you need to be a CommCell administrator. See Security Configuration for Search Console for information on creating a user with administrative rights.

User Preferences

You can use the Preferences page to view and set the user preferences for End-User and Compliance User Search Consoles.

The following user preferences can be added or modified for all the end users/compliance users or for specific individual users.

Preferences

Description

Possible Values

Default Value

Hide File Advanced Search Tab Use this preference to hide/show the advanced search window for the user. True or False False
Max Number of Queries Use this preference to specify the number of queries that can be saved by each user. Once the specified number is reached, you are not allowed to create any more queries, unless some of the existing queries are removed. 10, 20, 30, 50, 100 100
Enable Document Highlighting Use this preference to highlight the search keyword in the search result items. On or Off On
Results Polling Interval Use this preference to set the time interval (in seconds) for viewing the status of the operations performed on the review sets. 30, 60, 120, 180 30
Disable Browse Window Use this preference to disable/enable the Browse window for the user. True or False False
Default Search tab to show  Use this preference to specify the default search tab to be displayed. You can choose one of the following options;
  • All - Display all the search tabs.
  • File - Display the search tab for files only.
  • Email - Display the search tab for emails only.

You can also set this user preference from the end-user or compliance user search page. The value set for this preference in the end-user/compliance user search page overrides the value set in the User Administration page.

All_File_Email, Email, File All_File_Email
End User Based Security Specifies that the emails will be searched for the end-users based on mailbox ownership or recipient list. Based on Recipient Based on Recipient
Export Emails to PST Use this preference to enable/disable export of emails to a PST file for the user. You can set this preference for the end-user or compliance-user view level and cannot be assigned for individual users. On or Off OFF
Search Timeout Specifies the maximum time allowed for the search operation, after which a timeout error will be displayed. 30, 60, 120, 180 60
Switch to Review Set Upon adding Items Specifies that once you add the search result items to a review set from the search result page, the review set page is opened and the added list will be displayed. On or Off On
Disable My Contents Window Use this preference to enable/disable the My Contents window for the user. True or False False
Max number of Review Set Specifies the maximum number of Review Sets that can be created by the specific user. any positive integer 500
Disable Search Window Use this preference to disable the Search functionality for the user. If this preference is set to True, the user will not be allowed to perform any search operations. True or False False
Hide Email Advanced Search Tab Use this preference to hide/unhide the Email options in the Advanced tab for the user. True or False False
Enable lemmatization Use this preference to enable or disable lemmatization by default Yes or No Yes
Disable My Contents Window Use this preference to enable or disable the My Contents tab in the Search Console True or False True
Display Email Field Original Value Use this preference to enable or disable the display of email address of the sender. When disabled, the From field of the email messages will display only the sender's display name. When enabled, the address of the sender is also displayed. True or False False
Search Engine Use this preference to set the default content indexing engine to be used for the search operations.

You can also set this user preference from the end-user or compliance user search page. The value set for this preference in the end-user/compliance user search page overrides the value set in the User Administration page.

Content Indexing Engine name DefaultCIEngine
Query Language Use this preference to set the default query language to be used for the search operation.

You can also set this user preference from the end-user or compliance user search page. The value set for this preference in the end-user/compliance user search page overrides the value set in the User Administration page.

All languages supported by the Content Indexing Engine English

Adding a User Preference

When adding a user preference, note that you can configure the user details for only those users who have performed a search operation using the Search Console.

Use the following steps to add a user preference:

  1. From the left navigation pane, click Preferences. This window is displayed by default once you login to the User Administration page.
  2. Click Add User Preferences.
  3. In the User Preferences drop-down list, select the desired user preference.
  4. Click Select View Names and Users.
  5. In the View Names group, select the following:
    • End User - to apply the user preference for End-User searches.
    • Compliance User - to apply the user preference for Compliance User searches.
  6. From the Apply to Users group, select one of the following:
    • All - To apply the user preference for all the users.
    • Limited - To apply the user preference for selected users. On selecting this option, the list of user names will be displayed. Select the desired users to which the preference should be applied.
  7. Click Save.
  8. Depending upon the user preference selected, enter the value setting for the preference.
  9. Click Save.

Modifying a User Preference

User the following steps to modify the user preference. Note that, in order to modify the user preference for an individual user, you need to add the user preference for the user with the modified value.

  1. From the left navigation pane, click Preferences. This window is displayed by default once you login to the User Administration page.
  2. Right-click the user preference to be modified and select Edit.
  3. From the Edit User Preferences dialog, modify the value for the selected user preference.
  4. Click OK.

Deleting a User Preference

User the following steps to delete a user preference:

  1. From the left navigation pane, click Preferences. This window is displayed by default once you login to the User Administration page.
  2. Right-click the user preference to be deleted and select Delete.
  3. A confirmation message is displayed. Click Yes.

Users

You can use the Users window in the User Administration page, to view the users who had logged into the Search Console. You can also view the preferences that were set for each user. In order to add or modify the preferences, you need to navigate to the Preferences window. Note that, you can configure and view the user details for only those users who have performed a search operation using the Search Console.

The Users window also displays the Last login time and Last logged in system details for each user.

Customize Logo

You can use the Customize Logo window to upload the logo image for Compliance user and End-user Search Consoles. These logos will appear in the main search window of the Search Console. For step-by-step instructions, see Upload Customized Logos.

Search Analytics

The Search Analytics window in the User Administration page, enables you to view the statistics of search operations performed on a Content Indexing Engine. This is basically used by administrators to monitor the usage and improve the performance of the Content Indexing Engines. For step-by-step instructions, see View Search Analytics.


Agent-Specific Configuration

The following agent-specific configuration tasks are required for content indexing and search operations.

Domino Mailbox Archiver Agent

If you wish to perform content indexing operations for the Domino Server's journaling mailbox, you must configure the following administrative settings for the Domino Server's journaling mailbox:

  1. The Method must be set to Send to Mail-In Database.
  2. The option to Encrypt Incoming Mail must be set to NO.

Note the following:

In order to perform an end-user search for the Domino Mailbox Archiver agent, make sure that the Collect User Identity check box is selected in the Subclient Properties (General) dialog box from the CommCell Console.

Outlook Add-In

In order to take advantage of Search Console capabilities from the Outlook Add-In, perform the following configuration tasks:

  1. On the client where Outlook Add-In is installed, edit the UIOptions registry key to add 128 to the existing value. This will enable the Search Console toolbar button in Outlook with the default capability of performing End-User Searches.
  2. After editing the UIOptions registry key, if you want to change the default capability to be Compliance Searches instead of End-User Searches, then you will need to create the SearchPageURLOption registry key with a value of 1 on the Outlook Add-In client.
  3. Stop and re-start the Outlook session for the change to take effect.

The Outlook toolbar buttons to launch the Search Console will appear as follows:
Compliance Search
End-User Search

NAS iDataAgent

Only compliance search can be performed on NAS iDataAgent offline content indexed data. (End-user search is not supported for NAS iDataAgent offline content indexed data)


Upgrade Considerations

General Considerations for Content Indexing

To take advantage of the new features, it is recommended to upgrade the Web Search Server and the Web Search Client along with the CommServe upgrade.

  • For tagging and delegate search, ensure that to have a latest version of Content Indexing Engine profile.
  • Do not upgrade to a current version of Content Indexing Engine if currently you are using a previous version of Content Indexing Engine.

Back to Top