Agent Skill · Replicate

prompt-images

Prompting techniques for AI image generation and editing models on Replicate. Use when writing prompts for image models or building image generation features.

Provider: Replicate Path in repo: skills/prompt-images/SKILL.md

Skill body

Prompting image models on Replicate

Distilled from Replicate’s blog posts on prompting image models (2024-2026). Techniques are model-agnostic and focus on transferable principles.

Choose a model with the API, not from memory

This skill describes general prompting techniques. To choose a model, use the find-models skill and query the Replicate API. The image model landscape changes weekly. Don’t assume specific models exist or are still state-of-the-art based on names you’ve seen before. Always search the API for current options, then read the schema before running anything.

For pricing and feature comparison, see the compare-models skill.

Writing prompts

Use natural language, not keyword lists

Write full sentences describing what you want. Modern image models understand grammar and context far better than keyword-stuffed prompts.

Good: “A woman standing in a Tokyo alleyway at dusk, neon signs reflecting off wet pavement” Bad: “woman, Tokyo, alleyway, dusk, neon, wet pavement”

Be specific and unambiguous

Name exact colors, materials, lighting setups, camera equipment, and spatial relationships. Vague terms like “make it better” or “artistic” give unpredictable results.

Good: “A brutalist concrete building reflected in a perfectly still puddle after rain. A single figure with a red umbrella walks along the edge, the only color in an otherwise monochrome scene. Overcast sky, flat diffused light, tilt-shift lens effect on the edges.” Bad: “Cool building with a person near it, rainy day”

Name subjects directly

Use descriptive phrases like “the woman with short black hair” or “the red car.” Avoid pronouns, which are often too ambiguous for image models.

Use long, detailed prompts

Most modern models accept thousands of tokens. Long descriptive prompts with clear structure outperform short ones. A prompt with 12+ specific requirements (text on objects, labeled diagrams, color-coded elements, specific materials) can work if each requirement is stated clearly. But be aware: the longer and more complex the prompt, the more likely something will be missed.

Start simple, then iterate

Begin with basic changes. Test small edits first, then build on what works. Most editing models support iterative editing, so take advantage of that.

Photographic language

Modern image models understand camera and photography terminology deeply. Using this vocabulary gives you precise control over the look.

Camera and lens

Lighting setups

Composition

Text rendering

Rendering text in images is a common task. These techniques improve accuracy across models.

Style transfer

Character consistency

Maintaining the same character across multiple generations is one of the hardest challenges in image generation.

Image editing

General principles

Object removal

Background editing

Perspective and angle changes

Inpainting and outpainting

Multi-image and storyboard generation

Some models can generate multiple related images in a single prompt.

Product photography and commercial work

Fine-tuning and LoRAs

Common pitfalls

  1. Keyword-stuffed prompts: Modern models respond better to natural language sentences than comma-separated keyword lists. Write like you’re describing a scene, not tagging a photo.

  2. Using “transform” when you want a small edit: “Transform the person into a Viking” may swap the entire identity. Use targeted language: “change her outfit to Viking armor, keeping her face and expression unchanged.”

  3. Not specifying what to keep: When editing, always say what should stay the same. Without explicit instructions, models may change anything.

  4. Negative prompts on models not trained for them: Some models were not trained with negative prompts. Using them on these models introduces noise rather than removing unwanted elements. Check the model’s documentation.

  5. Too-high guidance scale (CFG): If images look “burnt” with excessive contrast, lower the guidance scale. Each model has a recommended range.

  6. Expecting real-time knowledge: No image model has internet access. Some have strong world knowledge baked in from training data, but it’s not live.

  7. Short prompts for complex scenes: Modern models accept thousands of tokens. For complex compositions with many specific requirements, use that capacity.

  8. Ignoring aspect ratio: Most models have specific resolutions they work best at (commonly ~1 megapixel). Going too large produces edge artifacts. Going too small produces harsh crops. Use the model’s recommended aspect ratios.

  9. Wrong model for the task: Not every model is good at every task. Some excel at text rendering but struggle with object removal. Some are great at style transfer but poor at background editing. If a model struggles with a specific edit type, try a different one rather than fighting the prompt. See the compare-models skill for guidance.

  10. Not iterating: The best results come from iterative workflows. Make a small change, evaluate, refine, repeat. Don’t try to get everything right in a single generation.

Background reading

The techniques above are distilled from Replicate’s blog posts on prompting image models. The posts are anchored to specific models that were current at the time, but the techniques generalize across modern image models. Use these for additional context, then use the find-models skill to pick a model that fits your task today.