Caching
Model Caching stores the responses returned by models so they can be reused for identical or similar requests. Reusing a stored response instead of calling the model again reduces latency, saves resources, and improves throughput.
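To make the idea concrete, here is a minimal sketch of exact-match response caching with a time-to-live (TTL). It is an illustration only and not the product's implementation: the names `ResponseCache`, `cached_call`, and `call_model` are hypothetical, and matching of "similar" requests is not covered.

```python
import hashlib
import json
import time


class ResponseCache:
    """Hypothetical exact-match response cache with one TTL for all entries."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self.ttl_seconds = ttl_seconds
        self._clock = clock      # injectable so expiry can be tested
        self._entries = {}       # key -> (expires_at, response)

    @staticmethod
    def make_key(model, prompt):
        # Hash a canonical JSON form so identical requests map to one key.
        payload = json.dumps({"model": model, "prompt": prompt}, sort_keys=True)
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def get(self, model, prompt):
        key = self.make_key(model, prompt)
        entry = self._entries.get(key)
        if entry is None:
            return None
        expires_at, response = entry
        if self._clock() >= expires_at:
            del self._entries[key]   # expired: drop it and report a miss
            return None
        return response

    def put(self, model, prompt, response):
        key = self.make_key(model, prompt)
        self._entries[key] = (self._clock() + self.ttl_seconds, response)


def cached_call(cache, model, prompt, call_model):
    """Return a cached response if present, else call the model and store it."""
    hit = cache.get(model, prompt)
    if hit is not None:
        return hit
    response = call_model(model, prompt)   # call_model is an assumed callable
    cache.put(model, prompt, response)
    return response
```

In this sketch a second identical request within the TTL is served from memory without calling the model. Once the TTL has elapsed, the entry is discarded and the next request goes to the model again.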
1. Access the Model Caching Page
2. Create a Caching Configuration

3. Manage Caching (Edit / Manage Models / Delete)
Manage Models (Assign Gateways and Models)
Steps to Assign Gateways and Models


View Caching Configurations Assigned to a Gateway

Edit Configuration (Modify TTL / Name / Cache Type)
Delete Configuration