Apple’s Visual Intelligence: Promising, But Still Lags Behind competitors
Table of Contents
While practical, the latest iteration of Visual Intelligence doesn’t offer the live interactivity of google’s Gemini Live or Microsoft’s Copilot Vision.
By [Invented Reporter] | WASHINGTON – 2025/06/14 14:20:21
Expectations where high when Apple’s Senior Vice President of Software Engineering,CRAIG FEDERIGHI,introduced the Visual Intelligence feature in iOS 26 at WWDC 2025.Many hoped for meaningful advancements beyond its current ability to identify places and objects through the iPhone camera. However, the announcement that Visual Intelligence options would be integrated directly into the iOS screenshot interface was somewhat underwhelming.
While these capabilities are undoubtedly practical, Visual Intelligence still falls short of the real-time conversational abilities offered by Google’s Gemini Live and Microsoft’s Copilot Vision. The ability to verbally interact wiht the AI about what it “sees” creates a more engaging and intuitive user experience. Although live interactivity isn’t essential, it adds a layer of excitement and naturalness that is currently missing from Apple’s offering. The foundation of Visual Intelligence is strong, but further development is needed to align with Apple’s signature approach to AI.
What Does Visual Intelligence Do Well?
Visual Intelligence, like many of iOS’s standout features, is deeply integrated into the operating system and works seamlessly with its default applications. This eliminates the need to open a separate app and upload an image for AI analysis.
The new integration with the screenshot interface enhances its utility.After taking a screenshot, users are presented with options such as “Ask,” which sends the image to ChatGPT for analysis, or “Search,” which performs on-device scanning. With the latter, Visual Intelligence can identify data about an event and automatically create a calendar entry with the relevant details.
The ability to verbally interact with the AI about what it “sees” creates a more engaging and intuitive user experience.
Frequently Asked Questions
- What is Visual Intelligence?
- Visual Intelligence refers to the ability of a device to analyze images and videos, providing users with relevant information or actions based on the visual content.
- How does Visual Intelligence work?
- Visual Intelligence uses computer vision and artificial intelligence to identify objects, scenes, and text within images and videos.
- What are the benefits of Visual Intelligence?
- Visual Intelligence can provide quick access to information, automate tasks, and enhance user experiences by understanding and interacting with the visual world.
