Skip to main content
AI Data Engine

Data-to-RAG quick start for AI Data Engine

Contributors netapp-dbagwell

Go from a newly deployed AI Data Engine (AIDE) system to a working retrieval-augmented generation (RAG) endpoint using this workflow. Understand how storage administrators, data engineers, and data scientists collaborate using ONTAP System Manager and AIDE Console.

Before you begin
  • You've installed and added Data compute nodes (DCNs) to the ONTAP cluster.

  • You've installed and licensed AI Data Engine software for vectorization and guardrails.

  • You've configured OpenID Connect (OIDC) and mapped roles for admin, data engineer, and data scientist roles.

One Define data scope and governance

As a storage administrator or security administrator, you want to prepare the environment in AIDE Console and ONTAP System Manager:

Two Explore workspace metadata

As a data engineer or data scientist, you want to explore the workspace metadata using AIDE Console:

  • Explore workspace metadata to understand available content.

  • Define one or more logical subsets of data that should feed RAG (for example, support articles, product manuals, or anonymized clinical notes).

Three Create and publish a data collection

As a data engineer or data scientist, you want to turn the chosen subset into a RAG-ready collection: