xAI's latest AI image generator with real-time web search for accurate visual references, multi-turn conversational editing, and batch generation. Create images from text, refine with natural language, and iterate through dialogue.
Grok 4.2 Image is xAI's advanced image generation model, part of the Grok 4.2 family featuring a 256K context window and real-time web search integration. It supports text-to-image generation with full prompt understanding and multi-turn image editing through natural conversation. The model can generate multiple images in batch, control aspect ratios and resolutions, and leverage Grok's real-time knowledge for up-to-date visual references. Unlike isolated image models, Grok 4.2 Image benefits from xAI's deep integration with live web data.

Grok 4.2 Image searches the live web during generation, incorporating current events, trending topics, and up-to-date visual references into your images โ no stale training data limitations.
Edit and refine images through natural back-and-forth conversation. Change colors, adjust compositions, swap elements, or evolve a concept across multiple dialogue turns without starting over.
A massive 256K token context window allows the model to maintain conversation history, reference multiple uploaded images, and keep complex creative directions consistent across long sessions.
Generate multiple images in a single API call with consistent style and quality. Perfect for A/B testing creative concepts, producing variations, or scaling content production.
Full control over aspect ratio and resolution for every generation. Create anything from social media squares to wide banners and tall posters without cropping or upscaling.
Upload existing photos and modify them through natural language. Change backgrounds, add elements, adjust colors, or completely transform the style while preserving subject integrity.
Grok 4.2 Image FAQ
Grok 4.2 Image is xAI's image generation model, part of the Grok 4.2 family. It features real-time web search integration, a 256K context window, multi-turn conversational editing, and batch generation capabilities. It generates images from text prompts and allows iterative refinement through natural dialogue.
Grok 4.2 Image is unique in its integration with xAI's real-time web search. While other models depend solely on training data, Grok can search the web during generation for current references. Its 256K context window also enables long conversational editing sessions without losing context.
Yes. You can upload existing photos and edit them through natural language instructions. Change backgrounds, add or remove elements, adjust colors and lighting, or transform the style. The model supports multi-turn editing, so you can refine progressively through conversation.
The 256K context window means Grok 4.2 Image can process and remember a large amount of information in a single session โ including full conversation history, multiple reference images, and complex creative briefs. This enables more coherent multi-turn editing and better consistency across a series of generations.
Yes. The API supports batch generation of multiple images in a single request, allowing efficient handling of large-scale creative tasks. This is ideal for A/B testing, content production, and generating multiple variations of a concept.
Grok 4.2 Image offers flexible control over aspect ratio and resolution. You can specify the exact dimensions needed for your use case, from standard social media formats to custom aspect ratios for specific platforms.
During generation, Grok 4.2 Image can search the live web to find current references, trending visual styles, and up-to-date cultural references. This means the model can generate images about recent events, current product designs, and latest trends that post-date its training data.
Grok 4.2 Image is ideal for content creators who need real-time visual references, marketers running A/B tests on creative concepts, designers who prefer conversational editing workflows, social media managers creating timely content, and anyone who values the combination of real-time knowledge with image generation.
โThe real-time web search is incredible for trend-based content. I can generate visuals about current events that actually look accurate.โ
โThe real-time web search is incredible for trend-based content. I can generate visuals about current events that actually look accurate.โ
โThe real-time web search is incredible for trend-based content. I can generate visuals about current events that actually look accurate.โ
โThe real-time web search is incredible for trend-based content. I can generate visuals about current events that actually look accurate.โ
Try Grok 4.2 Image โ xAI's most advanced image generation model, free on Nano Banana
Drag & drop reference images or browse files
Supported Formats: JPG, PNG, WEBP โข MAX 10MB