• Product and Solutions
  • Support and Training
  • Cloud Central
  • Community
  • Blog
  • Customer Stories
  • Contact
  • Contact
Menu bars
netapp-mark netapp-logo
English
Select Your Language
    • English

Current Language: English

  • Home
  • Documentation
  • NetApp Solutions
  • Overview
PDFs
    Site
  • NetApp Solutions
  • Sections
  • Artificial Intelligence
  • MLRun Pipeline
  • Setup
  • Deploying the Application
  • Hybrid Cloud AI Operating System with Data Caching
  • Solution Deployment and Validation Details
  • Conversational AI using NVIDIA
  • Build a Virtual Assistant Using Jarvis, Cloud Sync, and NeMo
  • NetApp Orchestration Solution with Run:AI
  • Optimal Cluster and GPU Utilization with Run AI
  • AI Inferencing on the Edge Data Center with H615c and NVIDIA T4
  • This page
  • Overview
  • Sections
  • Modern Data Analytics
  • Sections
  • Private / Hybrid Cloud
  • Workload Performance
  • Sections
  • Virtual Desktop
  • Hybrid Cloud VDI with NetApp Virtual Desktop Service
  • HCI for Citrix Virtual Apps
  • Sections
  • Containers
  • NetApp HCI with Anthos
  • NetApp HCI for Red Hat OpenShift on Red Hat Virtualization
  • Deploying NetApp HCI for Red Hat OpenShift on RHV
  • Sections
  • Business Applications
  • Sections
  • Enterprise Database
  • SAP HANA
  • Oracle Database
  • Microsoft SQL Server
  • Open Source Databases
  • NoSQL Databases
  • Sections
  • Data Protection and Security
  • Data Protection
  • NetApp HCI Disaster Recovery with Cleondris
  • Security
  • Sections
  • Infrastructure
  • NetApp HCI with Red Hat Virtualization
  • Deployment Procedures
  • Best Practices for Production Deployments
  • NetApp HCI with Cisco ACI
  • Deploying NetApp HCI with Cisco ACI
  • NetApp Solutions Documentation
  • Artificial Intelligence
    • NetApp ONTAP AI with NVIDIA DGX A100 Systems Design Guide
    • NetApp ONTAP AI with NVIDIA DGX A100 Systems Deployment Guide
    • NetApp ONTAP AI with NVIDIA DGX A100 Systems and Mellanox Spectrum Ethernet Switches Design Guide
    • NetApp ONTAP AI with NVIDIA DGX A100 Systems and Mellanox Spectrum Ethernet Switches Deployment Guide
    • Moving Data from a Big Data Environment to an AI Environment
    • NetApp ONTAP and Lenovo ThinkSystem SR670 for AI and ML Model Training Workloads
    • NetApp AFF A800 and Fujitsu Server PRIMERGY GX2570 M5 for AI and ML Model Training Workloads
    • NetApp AI Control Plane
    • MLRun Pipeline
      • Technology Overview
      • Software and Hardware Requirements
      • Network Device Failure Prediction Use Case Summary
      • Setup
        • Configuring Kubernetes Cluster
        • Define Persistent Volume Claim
      • Deploying the Application
        • Get Code from GitHub
        • Configure Working Environment
        • Deploy Grafana Dashboard
      • Conclusion
      • Where to Find Additional Information
    • Hybrid Cloud AI Operating System with Data Caching
      • Use Case Overview and Problem Statement
      • Solution Overview
      • Concepts and Components
      • Hardware and Software Requirements
      • Solution Deployment and Validation Details
        • ONTAP AI Deployment
        • Kubernetes Deployment
        • cnvrg.io Deployment
      • Conclusion
      • Acknowledgments
      • Where to Find Additional Information
    • Conversational AI using NVIDIA
      • Solution Overview
      • Solution Technology
      • Build a Virtual Assistant Using Jarvis, Cloud Sync, and NeMo
        • Jarvis Deployment
        • Customize States and Flows for Retail Use Case
        • Connect to Third-Party APIs as Fulfillment Engine
        • NetApp Retail Assistant Demonstration
        • Use NetApp Cloud Sync to Archive Conversation History
        • Expand Intent Models Using NeMo Training
      • Conclusion
      • Acknowledgments
      • Where to Find Additional Information
    • NetApp Orchestration Solution with Run:AI
      • Solution Overview
      • Solution Technology
      • Optimal Cluster and GPU Utilization with Run AI
        • Run AI Installation
        • Run AI Dashboards and Views
        • Creating Projects for Data Science Teams and Allocating GPUs
        • Submitting Jobs in Run AI CLI
        • Achieving High Cluster Utilization
        • Fractional GPU Allocation for Less Demanding or Interactive Workloads
        • Achieving High Cluster Utilization with Over-uota GPU Allocation
        • Basic Resource Allocation Fairness
        • Over-Quota Fairness
        • Saving Data to a Trident-Provisioned PersistentVolume
      • Conclusion
      • Testing Details for Section 4.8
      • Testing Details for Section 4.9
      • Testing Details for Section 4.10
      • Where to Find Additional Information
    • AI Inferencing on the Edge Data Center with H615c and NVIDIA T4
      • Use Cases
      • Architecture
      • Design Considerations
      • Deploying NetApp HCI – AI Inferencing at the Edge
      • Validation Results
      • Additional Information
    • NetApp Data Science Toolkit
  • Modern Data Analytics
    • NetApp StorageGRID with Splunk SmartStore
  • Private / Hybrid Cloud
    • VMware Private Cloud (Design Guide)
    • VMware Private Cloud (Deployment Guide)
    • VMware Validated Design
    • Private Cloud with Red Hat (Design Guide)
    • Private Cloud with Red Hat (Deployment Guide)
    • Workload Performance
      • Guaranteeing Mixed-workload Performance and NetApp HCI
  • Virtual Desktop
    • Hybrid Cloud VDI with NetApp Virtual Desktop Service
      • Use Cases
      • NetApp Virtual Desktop Service Overview
      • NetApp HCI Overview
      • NVIDIA Licensing
      • Deployment
      • Hybrid Cloud Environment
      • Single Server Load Test with Login VSI
      • Management Portal
      • User Management
      • Workspace Management
      • Application Management
      • Data Management
      • Operation Management
      • Tools and Logs
      • Conclusion
      • Where to Find Additional Information
    • EUC with VMware Horizon
    • Citrix Virtual Apps and Desktops with VMware vSphere (Design Guide)
    • NetApp HCI for Dassault Systèmes CATIA
    • HCI for Citrix Virtual Apps
      • Solution Overview
      • Physical Infrastructure
      • Citrix Hypervisor
      • Resource Layer
      • Control Layer
      • Access Layer
      • User Layer
      • NetApp Value
      • Appendix - iSCSI Device Configuration
      • Where to Find Additional Information
  • Containers
    • Infrastructure as Code with Red Hat Ansible
    • Red Hat Openshift Container Platform (Design Guide)
    • Red Hat Openshift Container Platform (Deployment Guide)
    • NetApp HCI with Anthos
      • Solution Components
      • Design Considerations
      • Hardware and Software Requirements
      • Deployment Steps
      • Video Demos
      • Additional Information
    • NetApp HCI for Red Hat OpenShift on Red Hat Virtualization
      • Architectural Overview: NetApp HCI for Red Hat OpenShift on RHV
      • Design Considerations: NetApp HCI for Red Hat OpenShift on RHV
      • Deploying NetApp HCI for Red Hat OpenShift on RHV
        • 1. Create Storage Network VLAN
        • 2. Download OpenShift Installation Files
        • 3. Download CA Certificate from RHV
        • 4. Register API/Apps in DNS
        • 5. Generate and Add SSH Private Key
        • 6. Install OpenShift Container Platform
        • 7. Access Console/Web Console
        • 8. Configure Worker Nodes to Run Storage Services
        • 9. Download and Install NetApp Trident
      • Validation Results: NetApp HCI for Red Hat OpenShift on RHV
      • Best Practices for Production Deployments
      • Videos and Demos: NetApp HCI for Red Hat OpenShift on Red Hat Virtualization
      • Additional Information: NetApp HCI for Red Hat OpenShift on Red Hat Virtualization
    • Anthos on Bare Metal with NetApp
  • Business Applications
    • SAP
    • Microsoft
    • Oracle
    • Salesforce
  • Enterprise Database
    • SAP HANA
      • SAP HANA Backup and Recovery with SnapCenter
      • SAP HANA Lifecycle Management
    • Oracle Database
      • Deploy Oracle Database on NetApp ONTAP
    • Microsoft SQL Server
      • Modernizing Microsoft SQL Server
      • Deploying MSSQL Database Workloads on NetApp HCI
      • ESG Technical Validation: Assuring Database Performance and Availability with NetApp HCI
      • eSDS NetApp SolidFire Enterprise SDS Running Microsoft SQL Server and Virtualized Infrastructure
    • Open Source Databases
      • MySQL
      • PostgreSQL
      • DB2
    • NoSQL Databases
      • MongoDB
      • Cassandra
      • Elasticsearch
  • Data Protection and Security
    • Data Protection
      • NetApp HCI Data Protection Overview
      • Multicloud Data Protection with Cloud Volumes ONTAP
      • Disaster Recovery and Replication with VMware SRM
      • Splunk Enterprise with Arrow Appliance
      • Veeam Backup and Replication 9.5 v4
      • NetApp Scale-Out Data Protection with Commvault
      • NetApp HCI Disaster Recovery with Cleondris
        • Installing Cleondris
        • Configuring Cleondris
        • Disaster Recovery Pairing
        • Recovery Organization
        • Failover
        • Best Practices
        • Additional Information
    • Security
      • HCI Verified Architecture PCI DSS 3.2.1
      • NIST Security Controls for FISMA with HyTrust for Multitenant Infrastructure
  • Infrastructure
    • NetApp HCI with Red Hat Virtualization
      • Architecture Overview
      • Design Considerations
      • Deployment Procedures
        • 1. Configure Management Switches
        • 2. Configure Data Switches
        • 3. Deploy the Element Storage System on the HCI Storage Nodes
        • 4. Deploy the RHV-H Hypervisor on the HCI Compute Nodes
        • 5. Deploy the RHV Manager as a Self-Hosted Engine
        • 6. Configure RHV-M Infrastructure
        • 7. Deploy the NetApp mNode
      • Best Practices for Production Deployments
        • Updating RHV Manager and RHV-H Hosts
        • Enabling Fencing for RHV-H Hosts
        • Optimizing Memory for Red Hat Virtualization
      • Where to Find Additional Information NetApp HCI with RHV
    • NetApp HCI with Cisco ACI
      • Use Cases
      • Architecture
      • Design Considerations
      • Deploying NetApp HCI with Cisco ACI
        • VMware vSphere
        • Red Hat Virtualization
        • KVM on RHEL
        • ONTAP on AFF
        • ONTAP Select with VMware vSphere
        • StorageGRID with VMware vSphere
      • Validation Results
      • Where to Find Additional Information
    • eSDS NetApp SolidFire Enterprise SDS Running Microsoft SQL Server and Virtualized Infrastructure
    • Guaranteeing Mixed-workload Performance and NetApp HCI
  • Solution Automation
  • Change Log

Overview

01/13/2021 Contributors netapp-dorianh kevin-hoke Download PDF of this page

This section describes the steps required to deploy the AI inferencing platform using NetApp HCI. The following list provides the high-level tasks involved in the setup:

  1. Configure network switches

  2. Deploy the VMware virtual infrastructure on NetApp HCI using NDE

  3. Configure the H615c compute nodes to be used as K8 worker nodes

  4. Set up the deployment jump VM and K8 master VMs

  5. Deploy a Kubernetes cluster with NVIDIA DeepOps

  6. Deploy ONTAP Select within the virtual infrastructure

  7. Deploy NetApp Trident

  8. Deploy NVIDIA Triton inference Server

  9. Deploy the client for the Triton inference server

  10. Collect inference metrics from the Triton inference server

CONTRIBUTE
Edit on GitHub Request doc changes Contributor's Guide
ON THIS PAGE
    • © 2021 NetApp, Inc.
    • netapp-globe English
    • blog blog@
    • community community@
    • twitter twitter@
    • facebook facebook@
    • linkedin linkedin@
    • youtube youtube@
    • slideshare slideshare@

    Have feedback for our website?Let us know Announcements