BlueXP classification

Manage your private data

Contributors netapp-tonacki amgrissino netapp-bcammett

BlueXP classification provides many ways for you to manage your private data. Some functionality makes it easier to prepare for migrating your data, while other functionality allows you to make changes to the data.

  • You can copy files to a destination NFS share if you want to make a copy of certain data and move it to a different NFS location.

  • You can clone an ONTAP volume to a new volume, while including only selected files from the source volume in the new cloned volume. This is useful for situations where you're migrating data and you want to exclude certain files from the original volume.

  • You can copy and synchronize files from a source repository to a directory in a specific destination location. This is useful for situations where you're migrating data from one source system to another while there is still some final activity on the source files.

  • You can move source files that BlueXP classification is scanning to any NFS share.

  • You can delete files that seem insecure or too risky to leave in your storage system, or that you have identified as duplicate.

Note
  • The capabilities described in this section are available only if you have chosen to perform a full classification scan on your data sources. Data sources that have had a mapping-only scan do not show file-level details.

  • These capabilities are not currently available for data from Google Drive accounts.

Copy source files

You can copy any source files that BlueXP classification is scanning. There are three types of copy operations depending on what you're trying to accomplish:

  • Copy files from the same, or different, volumes or data sources to a destination NFS share.

    This is useful if you want to make a copy of certain data and move it to a different NFS location.

  • Clone an ONTAP volume to a new volume in the same aggregate, but include only selected files from the source volume in the new cloned volume.

    This is useful for situations where you're migrating data and you want to exclude certain files from the original volume. This action uses the NetApp FlexClone functionality to quickly duplicate the volume and then remove the files that you didn't select.

  • Copy and synchronize files from a single source repository (ONTAP volume, S3 bucket, NFS share, etc.) to a directory in a specific destination (target) location.

    This is useful for situations where you're migrating data from one source system to another. After the initial copy, the service syncs any changed data based on the schedule that you set. This action uses the NetApp BlueXP copy and sync functionality to copy and sync data from a source to a target.

Copy source files to an NFS share

You can copy source files that BlueXP classification is scanning to any NFS share. The NFS share doesn't need to be integrated with BlueXP classification; you only need to know the name of the NFS share where all selected files will be copied, in the format <host_name>:/<share_path>.

Tip: You can't copy files that reside in databases.
Requirements
  • You must have the Account Admin or Workspace Admin role to copy files.

  • Copying files requires that the destination NFS share allows access from the BlueXP classification instance.

  • You can copy between 1 and 100,000 files at a time.
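
The share-name format and the file-count limit above can be checked before you start a copy. The following is a minimal Python sketch, not part of BlueXP classification: the names `parse_nfs_target` and `check_copy_count` are hypothetical, and the pattern assumes host names made of letters, digits, dots, and hyphens (so it rejects, for example, IPv6 literals).

```python
import re

# Hypothetical helper, not a BlueXP classification API.
# Assumption: host names contain only letters, digits, dots, and hyphens.
_NFS_TARGET = re.compile(r"^(?P<host>[A-Za-z0-9.-]+):(?P<path>/\S*)$")

MAX_FILES_PER_COPY = 100_000  # limit stated in the requirements above


def parse_nfs_target(target: str) -> tuple[str, str]:
    """Split '<host_name>:/<share_path>' into (host, path) or raise ValueError."""
    match = _NFS_TARGET.match(target.strip())
    if match is None:
        raise ValueError(f"expected <host_name>:/<share_path>, got {target!r}")
    return match.group("host"), match.group("path")


def check_copy_count(file_count: int) -> None:
    """Raise ValueError unless the selection holds 1 to 100,000 files."""
    if not 1 <= file_count <= MAX_FILES_PER_COPY:
        raise ValueError(f"select between 1 and {MAX_FILES_PER_COPY} files")
```

For example, `parse_nfs_target("nfs01.example.com:/exports/archive")` returns the host and the share path as a pair, while a value without the `:/` separator raises `ValueError`.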

Steps
  1. In the Data Investigation results pane, select the file, or files, that you want to copy, and click Copy.

    A screenshot showing how to select the files to copy, and the Copy button, from the Data Investigation page.

    • To select individual files, check the box for each file.

    • To select all files on the current page, check the box in the title row.

    • To select all files on all pages, check the box in the title row, and then in the pop-up message about selecting all files, click Select all items in list (xxx items).

  2. In the Copy Files dialog, select the Regular Copy tab.

    A screenshot showing the Copy Files dialog so you can specify the name of the NFS share where all selected files will be copied.

  3. Enter the name of the NFS share where all selected files will be copied in the format <host_name>:/<share_path>, and click Copy.

    A dialog appears with the status of the copy operation.

You can view the progress of the copy operation in the Actions Status pane.

Note that you can also copy an individual file when viewing the metadata details for a file. Just click Copy File.

A screenshot showing selection of the Copy File button from the metadata details for a file in the Data Investigation page.

Clone volume data to a new volume

You can clone an existing ONTAP volume that BlueXP classification is scanning using NetApp FlexClone functionality. This allows you to quickly duplicate the volume while including only those files you selected. This is useful if you're migrating data and you want to exclude certain files from the original volume, or if you want to create a copy of a volume for testing.

The new volume is created in the same aggregate as the source volume. Ensure that you have enough space for this new volume in the aggregate before you start this task. Contact your storage administrator if necessary.

Note: FlexGroup volumes can't be cloned because they're not supported by FlexClone.

Requirements
  • You must have the Account Admin or Workspace Admin role to copy files.

  • You must select a minimum of 20 files.

  • All selected files must be from the same volume, and the volume must be online.

  • The volume must be from a Cloud Volumes ONTAP or on-premises ONTAP system. No other data sources are currently supported.

  • The FlexClone license must be installed on the cluster. This license is installed by default on Cloud Volumes ONTAP systems.
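
The clone requirements above amount to a short checklist. The sketch below expresses that checklist in Python for illustration only: `SelectedFile`, its fields, and `clone_precheck` are hypothetical names, and the online and license states are passed in as plain booleans because how you obtain them is outside the scope of this page.

```python
from dataclasses import dataclass

MIN_FILES_FOR_CLONE = 20  # minimum stated in the requirements above


@dataclass(frozen=True)
class SelectedFile:
    # Hypothetical record; the field names are illustrative only.
    path: str
    volume: str


def clone_precheck(files: list[SelectedFile], volume_online: bool,
                   flexclone_licensed: bool) -> list[str]:
    """Return the unmet requirements; an empty list means every check passed."""
    problems = []
    if len(files) < MIN_FILES_FOR_CLONE:
        problems.append(f"select at least {MIN_FILES_FOR_CLONE} files")
    if len({f.volume for f in files}) > 1:
        problems.append("all files must come from the same volume")
    if not volume_online:
        problems.append("the source volume must be online")
    if not flexclone_licensed:
        problems.append("the FlexClone license must be installed on the cluster")
    return problems
```

Returning a list of problems rather than stopping at the first failure lets you report every unmet requirement at once.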

Steps
  1. In the Data Investigation pane, create a filter by selecting a single Working Environment and a single Storage Repository to make sure all the files are from the same ONTAP volume.

    A screenshot of creating a filter that includes files from a single storage repository in a single working environment.

    Apply any other filters so that you're seeing only the files that you want to clone to the new volume.

  2. In the Investigation results pane, select the files that you want to clone and click Copy.

    A screenshot showing how to select the files to copy, and the Copy button, from the Data Investigation page.

    • To select individual files, check the box for each file.

    • To select all files on the current page, check the box in the title row.

    • To select all files on all pages, check the box in the title row, and then in the pop-up message about selecting all files, click Select all items in list (xxx items).

  3. In the Copy Files dialog, select the FlexClone tab. This page shows the total number of files that will be cloned from the volume (the files you selected), and the number of files that will be excluded from the cloned volume (the files you didn't select, which are deleted from the clone).

    A screenshot showing the Copy Files dialog so you can specify the name of the new volume that will be cloned from the source volume.

  4. Enter the name of the new volume, and click FlexClone.

    A dialog appears with the status of the clone operation.

Result

The new, cloned volume is created in the same aggregate as the source volume.

You can view the progress of the clone operation in the Actions Status pane.

If you selected Map all volumes or Map & Classify all volumes when you enabled BlueXP classification for the working environment where the source volume resides, BlueXP classification scans the new cloned volume automatically. If you didn't use either of these selections, you'll need to enable scanning on the new volume manually if you want it scanned.

Copy and synchronize source files to a target system

You can copy source files that BlueXP classification is scanning from any supported unstructured data source to a directory in a specific target location (any target location that is supported by BlueXP copy and sync). After the initial copy, any changed data in the files is synchronized based on the schedule that you configure.

This is useful for situations where you're migrating data from one source system to another. This action uses the NetApp BlueXP copy and sync functionality to copy and sync data from a source to a target.

Tip: You can't copy and sync files that reside in databases, OneDrive accounts, or SharePoint accounts.
Requirements
  • You must have the Account Admin or Workspace Admin role to copy and sync files.

  • You must select a minimum of 20 files.

  • All selected files must be from the same source repository (ONTAP volume, S3 bucket, NFS or CIFS share, etc.).

  • You'll need to activate the BlueXP copy and sync service and configure a minimum of one data broker that can be used to transfer files between the source and target systems. Review the BlueXP copy and sync requirements beginning with the Quick Start description.

    Note that the BlueXP copy and sync service has separate service charges for your sync relationships, and will incur resource charges if you deploy the data broker in the cloud.

Steps
  1. In the Data Investigation pane, create a filter by selecting a single Working Environment and a single Storage Repository to make sure all the files are from the same repository.

    A screenshot of creating a filter that includes files from a single storage repository in a single working environment.

    Apply any other filters so that you're seeing only the files that you want to copy and sync to the destination system.

  2. In the Investigation results pane, select all files on all pages: check the box in the title row, click Select all items in list (xxx items) in the pop-up message about selecting all files, and then click Copy.

    A screenshot showing how to select the files to copy, and the Copy button, from the Data Investigation page.

  3. In the Copy Files dialog, select the Sync tab.

    A screenshot showing the Copy Files dialog so you can select the Sync option.

  4. If you are sure that you want to sync the selected files to a destination location, click OK.

    The BlueXP copy and sync UI is opened in BlueXP.

    You are prompted to define the sync relationship. The Source system is pre-populated based on the repository and files you already selected in BlueXP classification.

  5. Select the Target system and then select (or create) the Data Broker you plan to use. Review the BlueXP copy and sync requirements beginning with the Quick Start description.

Result

The files are copied to the target system and are synchronized based on the schedule you define. If you select a one-time sync, the files are copied and synchronized one time only. If you choose a periodic sync, the files are synchronized based on the schedule. Note that if the source system adds new files that match the query you created using filters, those new files will be copied to the destination and synchronized in the future.

Note that some of the usual BlueXP copy and sync operations are disabled when it is invoked from BlueXP classification:

  • You can't use the Delete Files on Source or Delete Files on Target buttons.

  • Running a report is disabled.

Move source files to an NFS share

You can move source files that BlueXP classification is scanning to any NFS share. The NFS share doesn't need to be integrated with BlueXP classification.

Optionally, you can leave a breadcrumb file in the location of the moved file. A breadcrumb file helps your users understand why a file was moved from its original location. For each moved file, the system creates a breadcrumb file in the source location named <filename>-breadcrumb-<date>.txt. You can add text in the dialog box that will be added to the breadcrumb file to indicate the location where the file was moved and the user who moved the file.
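
To make the breadcrumb naming pattern concrete, here is a small Python sketch. It is an illustration under stated assumptions, not the product's implementation: the source doesn't specify the date format, so ISO 8601 (YYYY-MM-DD) is assumed, `<filename>` is assumed to be the full file name including its extension, and the helper names and the layout of the breadcrumb text are hypothetical.

```python
from datetime import date


def breadcrumb_name(filename: str, moved_on: date) -> str:
    """Build '<filename>-breadcrumb-<date>.txt'.

    Assumption: the date is written in ISO 8601 form; the real format
    used by BlueXP classification is not stated in this page.
    """
    return f"{filename}-breadcrumb-{moved_on.isoformat()}.txt"


def breadcrumb_text(destination: str, moved_by: str, reason: str = "") -> str:
    """Compose example text for the dialog box (hypothetical layout)."""
    lines = [f"Moved to: {destination}", f"Moved by: {moved_by}"]
    if reason:
        lines.append(f"Reason: {reason}")
    return "\n".join(lines)
```

Under these assumptions, a file named `report.docx` moved on 15 January 2024 would leave a breadcrumb named `report.docx-breadcrumb-2024-01-15.txt`.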

Note that the subdirectory structure from the source file is recreated on the destination share when the file is moved so it is easier to understand where the file was moved from. If a file with the same name exists in the destination location, the file will not be moved.
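
The two rules in the note above (the source subdirectory structure is recreated, and a file is skipped when the same name already exists at the destination) can be sketched as follows. This is a simplified model, not the product's logic: `plan_move` is a hypothetical name, the choice of `source_root` as the point from which the structure is recreated is an assumption, and existing destination files are represented as a plain set of paths.

```python
from pathlib import PurePosixPath
from typing import Optional


def plan_move(source_root: str, source_file: str, share_path: str,
              existing: set) -> Optional[str]:
    """Return the destination path, or None if the move would be skipped.

    Assumption: the directory structure below source_root is recreated
    under share_path on the destination NFS share.
    """
    relative = PurePosixPath(source_file).relative_to(source_root)
    destination = str(PurePosixPath(share_path) / relative)
    # A file with the same name at the destination is never overwritten.
    return None if destination in existing else destination
```

With `source_root` set to `/vol1` and `share_path` set to `/archive`, the file `/vol1/hr/2023/a.txt` maps to `/archive/hr/2023/a.txt`; if that path is already in `existing`, the function returns `None` to indicate that the file stays where it is.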

Tip: You can't move files that reside in databases.
Requirements
  • You must have the Account Admin or Workspace Admin role to move files.

  • The source files can be located in the following data sources: On-premises ONTAP, Cloud Volumes ONTAP, Azure NetApp Files, File Shares, and SharePoint Online.

  • You can move a maximum of 15 million files at a time.

  • Only files that are 50 MB or smaller are moved.

  • The destination NFS share must allow access from the BlueXP classification instance IP address.
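
If you want to estimate in advance which files a move would skip because of the size limit, a sketch like the following may help. It is illustrative only: `split_by_size` is a hypothetical name, file sizes are supplied by you as a mapping of path to bytes, and 1 MB is assumed to mean 1024 × 1024 bytes, which this page doesn't confirm.

```python
# Assumption: 1 MB = 1024 * 1024 bytes; the page doesn't say which unit applies.
MAX_MOVE_BYTES = 50 * 1024 * 1024
MAX_FILES_PER_MOVE = 15_000_000  # limit stated in the requirements above


def split_by_size(sizes: dict) -> tuple:
    """Split {path: size_in_bytes} into (movable, skipped) lists of paths."""
    movable = [path for path, size in sizes.items() if size <= MAX_MOVE_BYTES]
    skipped = [path for path, size in sizes.items() if size > MAX_MOVE_BYTES]
    return movable, skipped


def within_move_limit(file_count: int) -> bool:
    """True if the selection does not exceed the per-operation maximum."""
    return file_count <= MAX_FILES_PER_MOVE
```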

Steps
  1. In the Data Investigation results pane, select the file, or files, that you want to move.

    A screenshot showing how to select the files to move, and the Move button, from the Data Investigation page.

    • To select individual files, check the box for each file.

    • To select all files on the current page, check the box in the title row.

    • To select all files on all pages, check the box in the title row, and then in the pop-up message about selecting all files, click Select all items in list (xxx items).

  2. From the button bar, click Move.

    A screenshot showing the Move Files dialog so you can specify the name of the NFS share where all selected files will be moved.

  3. In the Move Files dialog, enter the name of the NFS share where all selected files will be moved in the format <host_name>:/<share_path>.

  4. If you want to leave a breadcrumb file, check the Leave breadcrumb box. You can enter text in the dialog box to indicate the location where the file was moved and the user who moved the file, and any other information, such as the reason the file was moved.

  5. Click Move Files.

Note that you can also move an individual file when viewing the metadata details for a file. Just click Move File.

A screenshot showing selection of the Move File button from the metadata details for a file in the Data Investigation page.

Delete source files

You can permanently remove source files that seem insecure or too risky to leave in your storage system, or that you've identified as a duplicate. This action is permanent and there is no undo or restore.

You can delete files manually from the Investigation pane, or automatically using Policies.

Tip: You can't delete files that reside in databases. All other data sources are supported.

Deleting files requires the following permissions:

  • For NFS data, the export policy needs to be defined with write permissions.

  • For CIFS data, the CIFS credentials need to have write permissions.

  • For S3 data, the IAM role must include the following permission: s3:DeleteObject.
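
For the S3 case, the permission is granted through an IAM policy statement attached to the role. The Python sketch below builds a minimal example of such a statement as JSON. The bucket name `example-bucket` and the function name `delete_object_policy` are placeholders, and the statement covers only `s3:DeleteObject`; any other permissions that the role needs for scanning are not shown here.

```python
import json


def delete_object_policy(bucket_name: str) -> str:
    """Return an IAM policy document (JSON) that allows s3:DeleteObject.

    bucket_name is a placeholder; substitute the bucket that
    BlueXP classification scans.
    """
    policy = {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": ["s3:DeleteObject"],
                # Object-level actions apply to objects, hence the trailing /*.
                "Resource": f"arn:aws:s3:::{bucket_name}/*",
            }
        ],
    }
    return json.dumps(policy, indent=2)
```

Calling `delete_object_policy("example-bucket")` returns a JSON document that you could adapt and merge into the role's existing policy.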

Delete source files manually

Requirements
  • You must have the Account Admin or Workspace Admin role to delete files.

  • You can delete a maximum of 100,000 files at a time.

Steps
  1. In the Data Investigation results pane, select the file, or files, that you want to delete.

    A screenshot showing how to select the files to delete, and the Delete button, from the Data Investigation page.

    • To select individual files, check the box for each file.

    • To select all files on the current page, check the box in the title row.

    • To select all files on all pages, check the box in the title row, and then in the pop-up message about selecting all files, click Select all items in list (xxx items).

  2. From the button bar, click Delete.

  3. Because the delete operation is permanent, you must type "permanently delete" in the subsequent Delete File dialog and click Delete File.

You can view the progress of the delete operation in the Actions Status pane.

Note that you can also delete an individual file when viewing the metadata details for a file. Just click Delete File.

A screenshot showing selection of the Delete File button from the metadata details for a file in the Data Investigation page.