Pivotal Greenplum Database
EMC Data Domain Boost for Pivotal Greenplum Database provides database administrators with complete control of backup and disaster recovery as well as faster, more efficient big data backup and recovery.
Faster, More Efficient Backup and Recovery
Data Domain Boost for Greenplum meets the challenge of backing up big data by leveraging the massively parallel processing architecture of Greenplum to back up directly to the EMC Data Domain system. Data Domain Boost for Greenplum distributes parts of the deduplication process to the Greenplum Database server, enabling client-side deduplication so only unique data is sent from the database to the Data Domain system. Overall, this increases performance by 50 percent, reduces impact on the server by 20 to 40 percent, and reduces the required local area network (LAN) bandwidth by up to 80 to 99 percent.
Simplified Disaster Recovery
With Data Domain Boost for Greenplum, database administrators can control replication between multiple Data Domain systems. This allows them to efficiently create disaster recovery copies over a wide area network (WAN) using EMC Data Domain Replicator as well as keep track of all the copies within the Greenplum backup catalog for simplified disaster recovery.
Data Domain Boost for Greenplum is configured using native Greenplum Database gpcrondump and gpdbrestore backup and restore utilities. No additional backup application is required, which enables Greenplum database administrators to control backup and recovery of their environment.
Advanced Load-Balancing and Failover
Data Domain Boost aggregates network links on the Data Domain system into a single group so multiple links appear as one to the Greenplum backup utility. The Data Domain system then transparently balances the backup load between links in the group. In addition, the automatic link failover mechanism keeps backup systems operational in case of temporary network glitches.