Agent Skill · Replicate

run-models

Run AI models on Replicate via predictions, webhooks, and streaming.

Provider: Replicate
Path in repo: skills/run-models/SKILL.md

Skill body

Docs

Workflow

  1. Choose the right model - Search with the API or ask the user.
  2. Get model metadata - Fetch input and output schema via API.
  3. Create prediction - POST to /v1/predictions.
  4. Poll for results - GET the prediction until its status is “succeeded”, and stop early if it ends as “failed” or “canceled”.
  5. Return output - Usually URLs to generated content.
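
Steps 3 to 5 of the workflow can be sketched as below. This is a minimal sketch, not the skill's own implementation: the helper names `api_call` and `run_prediction`, the polling interval, and the poll limit are hypothetical, and the request body shape (`version` plus `input`) and Bearer-token header are assumptions that should be checked against the Replicate HTTP API reference. The HTTP helper is passed in as a parameter so the polling logic can be exercised without network access.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict, Optional

API_BASE = "https://api.replicate.com/v1"

# Statuses after which a prediction is assumed not to change again.
TERMINAL_STATUSES = {"succeeded", "failed", "canceled"}


def api_call(method: str, path: str, token: str,
             body: Optional[dict] = None) -> Dict[str, Any]:
    """Send one JSON request to the API and decode the JSON reply."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    request = urllib.request.Request(
        API_BASE + path,
        data=data,
        method=method,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def run_prediction(version: str, model_input: dict, token: str,
                   call: Callable[..., Dict[str, Any]] = api_call,
                   interval: float = 1.0, max_polls: int = 120,
                   sleep: Callable[[float], None] = time.sleep) -> Any:
    """Create a prediction, poll it to a terminal status, return its output."""
    # Step 3: create the prediction.
    prediction = call("POST", "/predictions", token,
                      {"version": version, "input": model_input})

    # Step 4: poll by id until the status is terminal or we give up.
    polls = 0
    while prediction["status"] not in TERMINAL_STATUSES:
        if polls >= max_polls:
            raise TimeoutError(f"prediction {prediction['id']} still running")
        sleep(interval)
        prediction = call("GET", f"/predictions/{prediction['id']}", token)
        polls += 1

    # Step 5: return the output, or surface the failure.
    if prediction["status"] != "succeeded":
        raise RuntimeError(f"prediction ended as {prediction['status']}: "
                           f"{prediction.get('error')}")
    return prediction["output"]
```

A caller would typically read the token from an environment variable and pass a version id taken from the model metadata fetched in step 2, for example `run_prediction(version_id, {"prompt": "a watercolor fox"}, token)`, where the input keys depend on that model's schema.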

Three ways to get output

  1. Create a prediction, store its id from the response, and poll until completion.
  2. Set a Prefer: wait header when creating a prediction to get a blocking, synchronous response. Recommended only for very fast models, because the request waits at most 60 seconds.
  3. Set an HTTPS webhook URL when creating a prediction, and Replicate will POST to that URL when the prediction completes.
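
The three options differ only in how the create request is built, which the following sketch tries to make concrete. `build_create_request` and its `mode` argument are hypothetical names introduced for illustration. The `webhook` and `webhook_events_filter` body fields are assumptions about the request schema and should be verified against the Replicate HTTP API reference.

```python
from typing import Any, Dict, Optional, Tuple


def build_create_request(version: str, model_input: dict, mode: str = "poll",
                         webhook_url: Optional[str] = None
                         ) -> Tuple[Dict[str, str], Dict[str, Any]]:
    """Return (headers, body) for a create-prediction request in one of three modes."""
    headers = {"Content-Type": "application/json"}
    body: Dict[str, Any] = {"version": version, "input": model_input}

    if mode == "poll":
        # Option 1: plain request; the caller stores the returned id and polls.
        pass
    elif mode == "wait":
        # Option 2: ask the server to hold the connection open until the
        # prediction finishes or the wait limit is reached.
        headers["Prefer"] = "wait"
    elif mode == "webhook":
        # Option 3: the server POSTs to this URL on completion; it must be HTTPS.
        if not webhook_url or not webhook_url.startswith("https://"):
            raise ValueError("webhook mode needs an https:// URL")
        body["webhook"] = webhook_url
        # Assumed field: limit deliveries to the final "completed" event.
        body["webhook_events_filter"] = ["completed"]
    else:
        raise ValueError(f"unknown mode: {mode!r}")

    return headers, body
```

An authorization header still has to be added before sending. In "wait" mode the response may still describe an unfinished prediction if the model is slow, so falling back to polling on a non-terminal status is a reasonable design choice.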

Guidelines

Predictions

Webhooks

Prediction lifetime

Streaming

File handling

Multi-model workflows