xAI’s Grok chatbot can now answer questions about what’s in view of your smartphone’s camera, similar to the real-time vision features available in Google’s Gemini and ChatGPT.
On Tuesday, xAI announced the launch of Grok Vision, which lets users point their phone at objects like products, signs, and documents and ask questions about them. Grok Vision is accessible from the Grok app for iOS, but not the Grok Android app just yet.
Grok can see what you see – literally
Grok’s Voice Mode now has camera access, letting users point their phone at something and ask, “What am I looking at?”
The Vision feature on iOS allows the chatbot to analyze real-world objects, text, and environments through your… https://t.co/cmtinp8p6 pic.twitter.com/n1b6pcyzoi
– Mario Nawfal (@marionawfal) April 20, 2025
Other new capabilities launching for Grok today include multilingual audio and real-time search in Grok’s Voice Mode. Grok users on Android can tap those, but only if they are subscribed to xAI’s $30-per-month SuperGrok plan.
Introducing Grok Vision, multilingual audio, and realtime search in Voice Mode. Available now.
Grok habla español
Grok parle français
Grok türkçe konuşuyor
グロクは日本語を話す
Grok Hindi pic.twitter.com/lcasyty2N5
– Ebby Amir (@bbyamir) April 22, 2025
Grok has been gaining new features at a steady clip. Earlier this month, xAI added a “memory” component to Grok that lets the bot draw on details from past conversations. Grok also got a canvas-like tool for creating docs and apps.

