Class 01.
Image generation.
GPT Image 2 for the modern creator.
What we will learn.
- What GPT Image 2 can do, and where it still fails
- Prompt structures from one word up to JSON
- Twenty-plus prompt techniques across text, image, and multi-image work
- Six categories of text-driven deliverables in a single prompt
- Business use cases tied to real revenue
- Layered workflows for professional brand systems
- Higgsfield to access many models in one place
- A hands-on assignment to build your own artifacts
GPT Image 2 against Nano Banana Pro.
The two leading image models in 2026. We will lean on GPT Image 2 as the primary, Nano Banana Pro as the comparison.
Token-based pricing.
- Image input: $8.00 per 1M tokens
- Cached image input: $2.00 per 1M tokens
- Image output: $30.00 per 1M tokens
- Text input: $5.00 per 1M tokens
Every reference image you upload is billed at the high-fidelity input rate, regardless of your output quality setting, so edit-heavy workflows cost more than generation-only workflows. Pricing accurate as of May 8, 2026.
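To make the pricing concrete, here is a minimal sketch of the arithmetic. The rates are the ones listed above; the token counts are hypothetical placeholders chosen only to illustrate the calculation, not measured values for any real request.

```python
# Rates in USD per 1M tokens, copied from the pricing list above.
RATES_PER_MILLION = {
    "image_input": 8.00,
    "cached_image_input": 2.00,
    "image_output": 30.00,
    "text_input": 5.00,
}


def estimate_cost(tokens: dict[str, int]) -> float:
    """Return the estimated USD cost for token counts keyed by rate name."""
    total = 0.0
    for kind, count in tokens.items():
        if kind not in RATES_PER_MILLION:
            raise KeyError(f"unknown token kind: {kind}")
        total += count * RATES_PER_MILLION[kind] / 1_000_000
    return total


# Hypothetical token counts (assumptions for illustration only).
generation_only = {"text_input": 200, "image_output": 4_000}
edit_with_reference = {"text_input": 200, "image_input": 6_000, "image_output": 4_000}

if __name__ == "__main__":
    print(f"generation only:     ${estimate_cost(generation_only):.3f}")
    print(f"edit with reference: ${estimate_cost(edit_with_reference):.3f}")
```

With these assumed counts, the edit request costs more only because of the extra image input line, which is the point of the note above.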
Mr. Grateful.
Meta. Adobe. OpenAI. Now in San Francisco building my own AI startup. Three years teaching with Mindvalley.
Content. Examples. Use cases.
Techniques. Workflow.
Then an assignment at the end to make your own artifacts.
I am not glazing the model the whole time. This is not a demo. The goal is to give you the permission and confidence to put these techniques to your own use.
I am also showing you what others already know how to do, so your eye is trained as you scroll the internet.
New capabilities.
Examples with transparent prompts.
Web search before generating.
The model can pull current visual references before it draws. Ask for a product as it actually looks today, a person in a recent outfit, an event from this week.
Thinking mode and reasoning.
The model plans before it draws. Useful for complex compositions, layered scenes, anything where the order of operations matters.
Accurate detailed text.
The marquee capability of GPT Image 2. Quote the text you want, name the typography, place it in the layout. Whole deliverable categories that needed a designer are now in scope for one prompt.
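The three moves above (quote the text, name the typography, place it in the layout) can be sketched as a small prompt builder. This is an illustrative helper, not an official prompt format: the function name, its parameters, and the sentence template are all assumptions.

```python
def build_text_prompt(subject: str, text: str, typeface: str, placement: str) -> str:
    """Assemble a prompt that quotes the text, names the type, and places it.

    Hypothetical helper: the wording of the template is an assumption,
    not a format documented by any model provider.
    """
    if '"' in text:
        # Keep the quoted span unambiguous for the model.
        raise ValueError("text must not contain double quotes")
    return (
        f"{subject}. "
        f'Render the exact text "{text}" '
        f"in {typeface}, {placement}."
    )


if __name__ == "__main__":
    prompt = build_text_prompt(
        subject="A poster for a neighborhood bakery",
        text="Grand Opening",
        typeface="a bold condensed serif",
        placement="centered in the top third of the layout",
    )
    print(prompt)
```

Keeping the three pieces as separate arguments makes it easy to vary one (for example the typeface) while holding the others fixed across a batch of generations.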
Multi-lingual.
Just ask. Render text in Mandarin, Arabic, Hindi (Devanagari script), Hebrew, or languages written in Cyrillic. The model handles these scripts as well as it handles English.
Many slides in one generation.
A storyboard, a comic, a 10-page recipe, a five-frame animation. Generate a sheet of related images that share style, character, and continuity.
Non-standard aspect ratios.
Beyond 16:9, 1:1, 9:16. Cinematic 21:9, ultra-tall 9:21, panoramic 3:1, magazine 8.5:11, billboard 4:1. Just specify the ratio in the prompt.
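When planning a layout around one of these ratios, it can help to know roughly what pixel dimensions to expect. The sketch below converts a ratio string into a width and height. The 2048-pixel long edge and the rounding to multiples of 16 are assumptions for illustration; the sizes the model actually returns may differ.

```python
def dimensions_for_ratio(ratio: str, long_edge: int = 2048, multiple: int = 16) -> tuple[int, int]:
    """Convert a ratio like "21:9" into (width, height) in pixels.

    Assumptions: the longer side equals `long_edge`, and both sides are
    snapped to the nearest multiple of `multiple`.
    """
    w_part, h_part = (float(p) for p in ratio.split(":"))
    if w_part <= 0 or h_part <= 0:
        raise ValueError("ratio parts must be positive")

    if w_part >= h_part:
        width, height = float(long_edge), long_edge * h_part / w_part
    else:
        width, height = long_edge * w_part / h_part, float(long_edge)

    def snap(value: float) -> int:
        # Round to the nearest multiple, never below one multiple.
        return max(multiple, round(value / multiple) * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    for r in ["21:9", "9:21", "3:1", "8.5:11", "4:1"]:
        print(r, dimensions_for_ratio(r))
```

Decimal ratios such as 8.5:11 are handled because the parts are parsed as floats rather than integers.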
Business use cases.
Real revenue. Real production.
PoolSend.
Hyper-personalized AI image generation as a sales mechanic. Satellite imagery of the prospect's actual yard, plus an AI-rendered pool. The dream visualized in their own backyard before they say yes.
My example here is the landscaping render.
The content creator.
Reference selfies become every angle, every environment, every campaign. One photo shoot's worth of effort, infinite output. My example is from selfies of me.