google.cloud.gcp_vertexai_rag_engine_config module – Creates a GCP VertexAI.RagEngineConfig resource

Note

This module is part of the google.cloud collection (version 1.12.0).

You might already have this collection installed if you are using the ansible package. It is not included in ansible-core. To check whether it is installed, run ansible-galaxy collection list.

To install it, use: ansible-galaxy collection install google.cloud. This module has further requirements; see Requirements for details.

To use it in a playbook, specify: google.cloud.gcp_vertexai_rag_engine_config.

Synopsis

  • Vertex AI RAG Engine lets you scale your RagManagedDb instance to match your usage and performance requirements by choosing between two tiers, and optionally lets you delete your Vertex AI RAG Engine data using a third tier. The tier is a project-level setting in the RagEngineConfig resource and affects all RAG corpora that use RagManagedDb. The following tiers are available in RagEngineConfig: Basic, Scaled, and Unprovisioned.

Requirements

The below requirements are needed on the host that executes this module.

  • python >= 3.8

  • requests >= 2.18.4

  • google-auth >= 2.25.1

Parameters

Parameter

Comments

access_token

string

The access token used to authenticate.

auth_kind

string / required

The type of credential used.

Choices:

  • "accesstoken"

  • "application"

  • "machineaccount"

  • "serviceaccount"

env_type

string

Specifies which Ansible environment you’re running this module within.

This should not be set unless you know what you’re doing.

This only alters the User Agent string for any API requests.

project

string

The Google Cloud Platform project to use.

rag_managed_db_config

string

The config of the RagManagedDb used by RagEngine.

Basic tier is a cost-effective, low-compute tier suitable for the following cases: experimenting with RagManagedDb, small data sizes, latency-insensitive workloads, and using RAG Engine only with external vector DBs.

NOTE: This is the default tier if not explicitly chosen.

Scaled tier offers production-grade performance along with autoscaling functionality.

It is suitable for customers with large amounts of data or performance-sensitive workloads.

Unprovisioned tier disables the RAG Engine service and deletes all your data held within this service.

This will halt the billing of the service.

NOTE: Once deleted the data cannot be recovered.

To start using RAG Engine again, you will need to update the tier by calling the UpdateRagEngineConfig API.

NOTE: Setting to unprovisioned is the same as state=absent.

Choices:

  • "scaled"

  • "basic" ← (default)

  • "unprovisioned"

region

string

The region of the RagEngineConfig.

For example, us-central1.

scopes

list / elements=string

Array of scopes to be used.

service_account_contents

jsonarg

The contents of a Service Account JSON file, either as a dictionary or as a JSON string that represents it.

service_account_email

string

An optional service account email address if machineaccount is selected and the user does not wish to use the default email.

service_account_file

path

The path of a Service Account JSON file if serviceaccount is selected as type.

state

string

Whether the resource should exist in GCP.

Choices:

  • "present" ← (default)

  • "absent"

Examples

- name: Create Basic RAG Engine Config
  google.cloud.gcp_vertexai_rag_engine_config:
    state: present
    rag_managed_db_config: basic
    region: us-central1
    project: "{{ gcp_project }}"
    auth_kind: "{{ gcp_cred_kind }}"
    service_account_file: "{{ gcp_cred_file }}"
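
################################################################################

# A minimal sketch, not one of the module's shipped examples: the same Basic
# tier request, authenticated with an access token instead of a service account
# file. The variable gcp_access_token is hypothetical and is assumed to hold a
# valid access token obtained elsewhere.
- name: Create Basic RAG Engine Config using an access token
  google.cloud.gcp_vertexai_rag_engine_config:
    state: present
    rag_managed_db_config: basic
    region: us-central1
    project: "{{ gcp_project }}"
    auth_kind: accesstoken
    access_token: "{{ gcp_access_token }}"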

################################################################################

- name: Create Scaled RAG Engine Config
  google.cloud.gcp_vertexai_rag_engine_config:
    state: present
    rag_managed_db_config: scaled
    region: us-central1
    project: "{{ gcp_project }}"
    auth_kind: "{{ gcp_cred_kind }}"
    service_account_file: "{{ gcp_cred_file }}"
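
################################################################################

# A hedged sketch of the machineaccount auth kind with the optional
# service_account_email parameter. It assumes the task runs on a Google Cloud
# machine that has a service account attached; that is an assumption about how
# machineaccount obtains credentials, not something stated on this page. The
# variable gcp_service_account_email is hypothetical.
- name: Create Scaled RAG Engine Config using the machine's service account
  google.cloud.gcp_vertexai_rag_engine_config:
    state: present
    rag_managed_db_config: scaled
    region: us-central1
    project: "{{ gcp_project }}"
    auth_kind: machineaccount
    service_account_email: "{{ gcp_service_account_email }}"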

################################################################################

- name: Delete RAG Engine Config (switch to Unprovisioned tier)
  google.cloud.gcp_vertexai_rag_engine_config:
    state: absent
    rag_managed_db_config: unprovisioned
    region: us-central1
    project: "{{ gcp_project }}"
    auth_kind: "{{ gcp_cred_kind }}"
    service_account_file: "{{ gcp_cred_file }}"
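
################################################################################

# A minimal sketch of reading the documented return values (changed, name,
# state). The registered variable name rag_engine_config_result is an arbitrary
# choice for illustration; name is only returned on success, so a fallback
# value is supplied for it.
- name: Ensure the Basic tier and capture the result
  google.cloud.gcp_vertexai_rag_engine_config:
    state: present
    rag_managed_db_config: basic
    region: us-central1
    project: "{{ gcp_project }}"
    auth_kind: "{{ gcp_cred_kind }}"
    service_account_file: "{{ gcp_cred_file }}"
  register: rag_engine_config_result

- name: Show the fields returned by the module
  ansible.builtin.debug:
    msg:
      - "changed: {{ rag_engine_config_result.changed }}"
      - "name: {{ rag_engine_config_result.name | default('not returned') }}"
      - "state: {{ rag_engine_config_result.state }}"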

Return Values

Common return values are documented separately; the following fields are unique to this module:

Key

Description

changed

boolean

Whether the resource was changed.

Returned: always

name

string

The resource name of the RagEngineConfig.

This value is set by Google.

Returned: success

state

string

The current state of the resource.

Returned: always

Authors

  • Google Inc. (@googlecloudplatform)