Google's AI Learns to Analyze the World Through a Smartphone Camera

At MWC in Barcelona, Google introduced major new features for its AI assistant, Gemini. Starting in March, subscribers to the Google One AI Premium plan will be able to turn their smartphones into AI-powered "eyes" through two key additions: Live Video Analysis and Smart Screenshare.

Live Video Analysis lets the assistant process the camera feed in real time. Users can point their camera at a piece of clothing for styling advice or scan a room to receive interior design suggestions. Smart Screenshare extends the same capability to on-screen content: Gemini doesn't just "see" what's displayed, it actively engages in dialogue about it. For instance, users can ask it to optimize a navigation route or explain a complex chart in a presentation, receiving answers in a dynamic, conversational format.
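The consumer feature itself has no public API, but developers can approximate this kind of single-frame visual Q&A with Google's publicly available Gemini API. Below is a minimal sketch in Python, assuming the google-generativeai SDK and an API key from Google AI Studio; the file name and prompt are illustrative, and the continuous live-video streaming of the new feature is not shown.

```python
# A minimal sketch: sending one camera frame to the Gemini API for analysis.
# This approximates, in request/response form, what the consumer Live Video
# Analysis feature does continuously over a live camera feed.
import google.generativeai as genai
from PIL import Image

genai.configure(api_key="YOUR_API_KEY")  # assumption: key from Google AI Studio

model = genai.GenerativeModel("gemini-1.5-flash")
frame = Image.open("room_photo.jpg")  # stand-in for a captured camera frame

# Multimodal prompt: an image plus a text question in a single request.
response = model.generate_content(
    [frame, "Suggest interior design improvements for this room."]
)
print(response.text)
```

A real-time assistant would repeat this loop over sampled frames while carrying the conversation history forward; the sketch shows only a single round trip.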

For now, these features are available only on Android devices, with support for multiple languages. At the Google booth, the company showcased Gemini running on Samsung, Xiaomi, and other partner devices, emphasizing cross-brand compatibility. There's no word yet on when iOS users will get access.

The announced updates are just one step toward Google's ambitious Project Astra. By 2025, the company aims to deliver a universal multimodal assistant capable of:

  • Analyzing video, audio, and text data simultaneously;
  • Maintaining conversation context for up to 10 minutes;
  • Integrating data from Search, Lens, and Maps for comprehensive solutions.

Although Google has not officially announced Astra as a standalone product, experts speculate that its features will gradually be integrated into Gemini, intensifying competition with ChatGPT. Notably, OpenAI has offered its Advanced Voice Mode with screen analysis since December 2024, but Google is betting on deep integration with its own ecosystem.

The ability of AI to process visual information in real time is blurring the line between the digital and physical worlds. Users are no longer just interacting with a "talking assistant" but engaging with an active participant in their daily tasks, from shopping to learning. With these new visual capabilities, AI assistants are entering an era of hyper-contextual interaction, where the key question shifts from "How do I ask?" to "What do I show?"

One major question remains: privacy. How will Google protect data transmitted through the camera and screen? The company assures that all analysis is conducted under strict security standards, but the full details will only be revealed once the features are officially released.
