Loading / 加载中

Developer Building Next.js Whiteboard App Seeks Free Vision API to Generate Image Prompts from Drawings | thinkgap

SocialSource: REDDIT OPENAIJune 10, 2026Importance: 1/5

Developer Building Next.js Whiteboard App Seeks Free Vision API to Generate Image Prompts from Drawings

English summary

A developer is building a Next.js whiteboard app using tldraw. Users draw on a canvas, type a prompt, and click Enhance. The app exports the canvas as base64 PNG and sends it to an AI vision model to generate a detailed image prompt. That prompt is then used with Pollinations.ai to generate a refined image. The developer needs a free vision API that accepts base64 image and text prompt, returns text, and works without a credit card. They tried OpenRouter but encountered model routing issues and are seeking alternative reliable free models such as Gemini, LLaVA, or others.

Chinese summary

一名开发者正在构建一个使用tldraw的Next.js白板应用。用户可在画布上绘制，输入提示并点击增强按钮。应用将画布导出为base64 PNG，发送给AI视觉模型，以生成详细的图像提示。该提示随后用于Pollinations.ai生成精修图像。开发者需要免费的视觉API，能够接受base64图像和文本提示并返回文本，且无需信用卡即可使用。他们尝试了OpenRouter但遇到模型路由问题，现寻求替代的免费可靠模型，如Gemini、LLaVA等。

Key points

The developer is creating a Next.js app with tldraw canvas for drawing and generating images via AI.
开发者正在创建一个使用tldraw画布的Next.js应用，用于绘画并通过AI生成图像。
The current pipeline: canvas base64 → vision model for prompt generation → Pollinations.ai for image generation.
当前流程：画布base64 → 视觉模型生成提示 → Pollinations.ai生成图像。
They need a free vision API that accepts image+text, outputs text, and works without credit card.
他们需要一个免费的视觉API，接受图像和文本输入，输出文本，且无需信用卡。
OpenRouter was tried but routed to wrong models, so they seek alternatives like Gemini or LLaVA.
曾尝试OpenRouter但路由到错误的模型，因此寻求像Gemini或LLaVA这样的替代方案。

Open original