Replicating data on-premise to cloud
The process for creating a replication job from on-premise to the cloud is similar to creating one for on-premise to on-premise. The primary difference is that you must register your cloud credentials with DLM, so DLM can access your cloud storage.
Replication of HDFS data from on-premise to cloud is a Limited GA feature in DPS 1.1. The HDFS data that you replicate to cloud requires security policies outside the Hadoop system, so you should work with Hortonworks support to ensure proper configuration of your environment. This does not apply to Hive replication to cloud.
See the tasks linked below for considerations and tips on performing each one.
Register cloud credentials with DLM.
Enter the credentials for the bucket you want to replicate to, so DLM can access the bucket.
Create a replication policy.
Choose which cluster is source and which is destination, then set the schedule and other rules for replication jobs.
View job status.
Verify that the job starts and runs as expected.
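The three tasks above can be read as a pre-flight checklist. The sketch below is purely illustrative: `ReplicationPlan` and all of its fields are hypothetical names invented for this example and are not part of DLM or any Hortonworks API. It only models the order of the steps in plain Python, assuming that a policy needs a source cluster, a destination bucket, and a schedule.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ReplicationPlan:
    """Hypothetical checklist for an on-premise-to-cloud job (not a DLM API)."""

    credential_registered: bool = False  # step 1: cloud credentials registered
    source_cluster: str = ""             # step 2: on-premise source cluster
    destination_bucket: str = ""         # step 2: cloud bucket to replicate to
    schedule_minutes: int = 0            # step 2: how often the job should run

    def missing_steps(self) -> List[str]:
        """Return the tasks that still need to be completed, in order."""
        steps = []
        if not self.credential_registered:
            steps.append("register cloud credentials")
        policy_complete = (
            bool(self.source_cluster)
            and bool(self.destination_bucket)
            and self.schedule_minutes > 0
        )
        if not policy_complete:
            steps.append("create a replication policy")
        return steps


if __name__ == "__main__":
    # All values below are placeholders for illustration only.
    plan = ReplicationPlan(
        credential_registered=True,
        source_cluster="onprem-cluster",
        destination_bucket="example-bucket",
        schedule_minutes=60,
    )
    remaining = plan.missing_steps()
    if remaining:
        print("Still to do:", ", ".join(remaining))
    else:
        print("Ready: view job status to verify the job runs as expected.")
```

An empty result from `missing_steps()` corresponds to the point at which you would move on to viewing job status.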