Generates and edits images using AI models. Create illustrations, modify photos, generate art, and produce visual assets from text descriptions.
Setup required
A Gemini API key is required. If one isn't configured, your assistant will prompt you to set it up on first use
Permissions
No macOS permissions needed for generation
File system permission needed if saving images to your machine
Common prompts
You say...
What happens
“Generate an image of a sunset over mountains”
Creates an AI-generated image from your description
“Make me a logo for a coffee shop called 'Brew'”
Generates a logo design
“Edit this image to remove the background”
Modifies an existing image
“Create a cartoon version of this photo”
Transforms an image style
“Make a banner image for my blog post about AI”
Generates sized-for-purpose assets
“Generate 4 variations of this design”
Multiple options to choose from
Configuration
Models: Gemini 2.5 Flash (default, fast) or Gemini 3 Pro (higher quality, slower)
Variants: Generate 1–4 variations per prompt to pick the best result
Modes: Text-to-image (generate from a prompt) or edit mode (modify an existing image)
Tips & gotchas
Be descriptive: The more detail in your prompt, the better the result. “A watercolor painting of a golden retriever sitting in a field of lavender at sunset” beats “a dog.”
Iterate: First generation not perfect? Say “make it more colorful” or “try a different angle.” Your assistant refines based on feedback.
Formats: Images are generated as PNG or JPEG. Specify if you have a preference.
File delivery: Generated images are delivered as attachments in chat. You can also ask your assistant to save them to a specific folder on your machine (requires file access permission).