Conclusion
This section summarizes the use cases and solutions that NetApp offers to meet various Hadoop data protection requirements. By using the data fabric powered by NetApp, customers can:
- Have the flexibility to choose the right data protection solutions by leveraging NetApp’s rich data management capabilities and integration with Hadoop native workflows.
- Reduce their Hadoop cluster backup window time by almost 70%.
- Eliminate any performance effect resulting from Hadoop cluster backups.
- Provide multicloud data protection and data access from different cloud providers simultaneously to a single source of analytics data.
- Create fast and space-efficient Hadoop cluster copies by using FlexClone technology.
Where to find additional information
To learn more about the information described in this document, see the following documents and/or websites:
- NetApp Big Data Analytics Solutions
- Apache Spark Workload with NetApp Storage
- NetApp Storage Solutions for Apache Spark
- Apache Hadoop on data fabric enabled by NetApp
Acknowledgements
- Paul Burland, Sales Rep, ANZ Victoria District Sales, NetApp
- Hoseb Dermanilian, Business Development Manager, NetApp
- Lee Dorrier, Director MPSG, NetApp
- David Thiessen, Systems Engineer, ANZ Victoria District SE, NetApp
Version history
Version | Date | Document version history |
---|---|---|
Version 1.0 | January 2018 | Initial release |
Version 2.0 | October 2021 | Updated with use case #5: Accelerate analytic workload |
Version 3.0 | November 2023 | Removed NIPAM details |