oai-reverse-proxy/docs/gcp-configuration.md

2.1 KiB

Configuring the proxy for Vertex AI (GCP)

The proxy supports GCP models via the /proxy/gcp/claude endpoint. There are a few extra steps necessary to use GCP compared to the other supported APIs.

Setting keys

Use the GCP_CREDENTIALS environment variable to set the GCP API keys.

Like other APIs, you can provide multiple keys separated by commas. Each GCP key, however, is a set of credentials including the project id, client email, region and private key. These are separated by a colon (:).

For example:

GCP_CREDENTIALS=my-first-project:xxx@yyy.com:us-east5:-----BEGIN PRIVATE KEY-----xxx-----END PRIVATE KEY-----,my-first-project2:xxx2@yyy.com:us-east5:-----BEGIN PRIVATE KEY-----xxx-----END PRIVATE KEY-----

Setup Vertex AI

  1. Go to https://cloud.google.com/vertex-ai and sign up for a GCP account. ($150 free credits without credit card or $300 free credits with credit card, credits expire in 90 days)
  2. Go to https://console.cloud.google.com/marketplace/product/google/aiplatform.googleapis.com to enable Vertex AI API.
  3. Go to https://console.cloud.google.com/vertex-ai and navigate to Model Garden to apply for access to the Claude models.
  4. Create a Service Account , and make sure to grant the role of "Vertex AI User" or "Vertex AI Administrator".
  5. On the service account page you just created, create a new key and select "JSON". The JSON file will be downloaded automatically.
  6. The required credential is in the JSON file you just downloaded.

Supported model IDs

Users can send these model IDs to the proxy to invoke the corresponding models.

  • Claude
    • claude-3-haiku@20240307
    • claude-3-sonnet@20240229
    • claude-3-opus@20240229
    • claude-3-5-sonnet@20240620