Real-time AI assistant with X-integrated search, coding, and multimodal generation
Visit Grok →Grok is a real-time AI assistant developed by xAI. It combines chat, web search, X search, reasoning, coding, voice, image understanding, image generation, and video generation into one product family. Available via grok.com, mobile apps, X integration, API, and the Grok Build CLI.
As of May 2026, xAI positions Grok 4.3 as its primary API model. It supports 1M-token context, text + image input, configurable reasoning, function calling, structured outputs, and Remote MCP Tools. For coding, Grok Build 0.1 powers a terminal-native agentic coding experience. For creative work, Grok Imagine API provides a separate image and video generation layer.
Grok’s biggest differentiator is its real-time information layer connected to X and its attempt to unify chat, search, voice, image, video, and coding under one API.
| Model | Context | Input | Output |
|---|---|---|---|
| Grok 4.3 | 1M tokens | $1.25/M | $2.50/M |
| Grok Build 0.1 | 256K tokens | $1/M | $2/M |
| Grok 4.1 Fast | 2M tokens | $0.20/M | $0.50/M |
Grok 4.3 is xAI’s current flagship model, focused on strong agentic tool calling, low-hallucination performance, configurable reasoning, and vision. Cached input is priced at $0.20/M across all current models.
On the API side, Grok 4.3 is priced at $1.25/M input and $2.50/M output. Image and video generation use image- or second-based pricing rather than token pricing.
Grok’s ecosystem has three main parts: Grok Build, Remote MCP Tools, and Grok Imagine.
Grok Build is a terminal-native coding agent. It supports skills, plugins, hooks, AGENTS.md, MCP servers, subagents, plan mode, code review, sandboxed execution, and headless mode. This gives Grok a developer experience closer to Claude Code and Cursor.
Remote MCP Tools let Grok connect to external MCP servers. This allows Grok to work with third-party tools or custom internal MCP servers for external system integration.
Grok Imagine is xAI’s image and video generation/editing layer. Priced separately from chat models, it expands Grok from a chat assistant into a multimodal creation platform.