Model comparison — flyer generation on the real product prompts

branch feature/inline-svgs-with-gemini-3.5-flash · 4 real prompts × 4 configs (3 models; GLM-5.2 in two OpenRouter routes) · cells are the live HTML (zoom freely, stays crisp) · open↗ = full size

promptGemini 3.1 Prothinking: medium · multimodalGemini 3.5 Flashthinking: medium · multimodalGLM-5.2reasoning: medium · Z.AI · text-onlyGLM-5.2reasoning: medium · nitro (throughput) · text-only
Garden Soiree$0.170100s · 6 svg$0.080646s · 5 svg$0.0564127s · 13 svg$0.032043s · 10 svg
Birthday Bash$0.052425s · 8 svg$0.10356s · 4 svg$0.114323s · 9 svg$0.024734s · 10 svg
The Crusty Loaf$0.12981s · 10 svg$0.12164s · 4 svg$0.037686s · 9 svg$0.025341s · 9 svg
Bright Smile$0.13077s · 6 svg$0.077046s · 8 svg$0.0577161s · 10 svg$0.026842s · 11 svg
TOTAL / column$0.481tot · avg 71s · 30 svg$0.381tot · avg 53s · 21 svg$0.266tot · avg 174s · 41 svg$0.109tot · avg 40s · 40 svg

Prompts are the exact ones from the product's own dev runs. Each cell shows latency · cost · inline-svg count · output tokens · image kind. All runs: product flyer, A5 Portrait, temp 0.3, font web. Gemini = thinking_level medium; GLM-5.2 = OpenRouter reasoning medium, capped 32768 out, run two ways: Z.AI (first-party endpoint, pinned) and nitro (OpenRouter throughput route → fastest provider). Same model & prompt — only the upstream provider differs. GLM cost: Z.AI @ $1.40/M in · $4.40/M out; nitro @ ~$0.95/M in · $3.00/M out (throughput route, typically DeepInfra; approximate). Gemini cost via genai-prices.

On where the images come from: raster photos are real user-supplied cut-out assets (birthday-cake.png, 02_bakery.png) embedded inline as base64 — exactly as the live product does. The two Gemini models are multimodal and are shown the asset (-img); they place it and the product embeds it. GLM-5.2 is text-only — it never sees the asset, so on the image prompts it can only emit a slot (filled afterward with the same provided asset) or draw SVG. Garden & dental had no asset, so all models there are pure HTML/CSS+SVG (dental's QR is a locally-generated PNG). Nothing here is an invented/AI-generated image.

Garden Soiree — spring garden party invite flyer · A5 · Portrait · no image provided → pure SVG/CSS

Gemini 3.1 Pro thinking: medium · multimodal 100s $0.170 6 svg 13200 out-tok inline SVG only open ↗
Gemini 3.5 Flash thinking: medium · multimodal 46s $0.0806 5 svg 8565 out-tok unfilled slot open ↗
GLM-5.2 reasoning: medium · Z.AI · text-only 127s $0.0564 13 svg 11026 out-tok inline SVG only open ↗
GLM-5.2 reasoning: medium · nitro (throughput) · text-only 43s $0.0320 10 svg 8868 out-tok inline SVG only open ↗

Birthday Bash — kids party invite flyer · A5 · Portrait · hero asset provided: birthday-cake.png (transparent cut-out)

Gemini 3.1 Pro thinking: medium · multimodal 25s $0.0524 8 svg 3177 out-tok embedded photo open ↗
Gemini 3.5 Flash thinking: medium · multimodal 56s $0.103 4 svg 10236 out-tok embedded photo open ↗
GLM-5.2 reasoning: medium · Z.AI · text-only 323s $0.114 9 svg 24104 out-tok unfilled slot open ↗
GLM-5.2 reasoning: medium · nitro (throughput) · text-only 34s $0.0247 10 svg 6431 out-tok unfilled slot open ↗

The Crusty Loaf — artisan bakery opening flyer · A5 · Portrait · hero asset provided: 02_bakery.png (transparent cut-out)

Gemini 3.1 Pro thinking: medium · multimodal 81s $0.129 10 svg 9495 out-tok embedded photo open ↗
Gemini 3.5 Flash thinking: medium · multimodal 64s $0.121 4 svg 12178 out-tok embedded photo open ↗
GLM-5.2 reasoning: medium · Z.AI · text-only 86s $0.0376 9 svg 6695 out-tok unfilled slot open ↗
GLM-5.2 reasoning: medium · nitro (throughput) · text-only 41s $0.0253 9 svg 6618 out-tok unfilled slot open ↗

Bright Smile — dental clinic, online booking flyer · A5 · Portrait · no image; QR code (zocdoc) localized to PNG

Gemini 3.1 Pro thinking: medium · multimodal 77s $0.130 6 svg 9672 out-tok inline SVG only open ↗
Gemini 3.5 Flash thinking: medium · multimodal 46s $0.0770 8 svg 8012 out-tok inline SVG only open ↗
GLM-5.2 reasoning: medium · Z.AI · text-only 161s $0.0577 10 svg 11062 out-tok inline SVG only open ↗
GLM-5.2 reasoning: medium · nitro (throughput) · text-only 42s $0.0268 11 svg 6877 out-tok inline SVG only open ↗