DLM Administration

Replicating HDFS data from on-premises to Google Cloud Storage

Before you replicate data from on-premises to Google Cloud, you must have a cluster registered with the DLM app and you must register your cloud credentials. For more information, see Register cloud credentials.

To replicate data from on-premises to a Google Cloud account, you must create a new replication policy. You can replicate on-premises data to Google Cloud Storage using a single cluster. You must have the Infra Admin or DLM Admin role to perform these tasks.
  1. Select Policies and click Add Policy.
    By default, HDFS is selected as the service in the Create Replication Policy page.
  2. Enter the replication policy name and description.
  3. Click SELECT SOURCE, then select the source type and the source cluster from the drop-down.
  4. Provide the path of the folder to replicate and click SELECT DESTINATION.
  5. Select GCS as the destination type, then select the cloud credential from the drop-down.
  6. Provide the destination folder path in the form bucket_name/path and click VALIDATE.
  7. Once the validation is successful, click SCHEDULE.
  8. Configure the job settings for the replication policy.
  9. Click ADVANCED SETTINGS to set up the policy queue.
  10. Click CREATE POLICY.
    The data replication process is enabled.

    View the job status on the Policies page and verify that the job starts and runs as expected.
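
The destination in step 6 is entered as a single bucket_name/path string. If you prepare destination paths in a script before typing them into the page, a small check can catch malformed values early. The following is a minimal sketch only: the function name split_destination and the GcsDestination type are hypothetical and are not part of DLM, and the check assumes nothing beyond the bucket_name/path form shown in step 6. It does not reproduce whatever validation the VALIDATE button performs.

```python
from typing import NamedTuple


class GcsDestination(NamedTuple):
    """Hypothetical container for the two parts of a destination path."""
    bucket: str
    path: str


def split_destination(folder_path: str) -> GcsDestination:
    """Split a 'bucket_name/path' string into bucket and path.

    Illustrative helper, not a DLM API. Surrounding whitespace and
    leading or trailing slashes are ignored; the first remaining
    slash separates the bucket name from the folder path.
    """
    cleaned = folder_path.strip().strip("/")
    bucket, sep, path = cleaned.partition("/")
    if not bucket or not sep or not path:
        raise ValueError(f"expected 'bucket_name/path', got {folder_path!r}")
    return GcsDestination(bucket=bucket, path=path)
```

For example, split_destination("my-bucket/data/logs") yields the bucket "my-bucket" and the path "data/logs", while a value with no path component, such as "my-bucket", raises ValueError.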