Google Releases Cloud Vision API for Understanding Image Content -- ADTmag

By David Ramel
December 4, 2015

Google released its homegrown Cloud Vision API to help developers create applications with the sophisticated image recognition technology that powers Google Photos.

"Developers can now build powerful applications that can see, and more importantly understand, the content of images," according to a post Wednesday on the Google Cloud Platform Blog. "The uses of Cloud Vision API are game changing to developers of all types of applications and we are very excited to see what happens next!"

With the Cloud Vision API, now in a limited, invitation-only preview, developers can apply several features to images, alone or in combination. These include the detection of dominant entities (the major "things" that appear in an image), inappropriate content, human faces, landmarks and logos, along with optical character recognition that extracts text from an image.
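As a rough illustration of combining features in one call, the sketch below builds a JSON request body for a single image. It assumes the request shape and feature type names of the later public v1 REST API (`images:annotate`); the invitation-only preview described here may have differed. The helper name `build_annotate_request` and the keys of `FEATURE_TYPES` are hypothetical, and no network call is made.

```python
import base64
import json

# Feature type names as used by the later public v1 REST API.
# Assumption: the 2015 preview may have used different names.
FEATURE_TYPES = {
    "entities": "LABEL_DETECTION",
    "inappropriate_content": "SAFE_SEARCH_DETECTION",
    "faces": "FACE_DETECTION",
    "landmarks": "LANDMARK_DETECTION",
    "logos": "LOGO_DETECTION",
    "text": "TEXT_DETECTION",
}


def build_annotate_request(image_bytes, wanted, max_results=5):
    """Build a request body asking for several features on one image.

    `wanted` is a list of keys from FEATURE_TYPES (hypothetical names
    chosen to mirror the feature list in the article).
    """
    features = [
        {"type": FEATURE_TYPES[name], "maxResults": max_results}
        for name in wanted
    ]
    return {
        "requests": [
            {
                # Image bytes are sent inline as base64 text.
                "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
                "features": features,
            }
        ]
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real image file.
    body = build_annotate_request(b"fake image bytes", ["entities", "faces", "text"])
    print(json.dumps(body, indent=2))
```

In a real application, the resulting body would be sent as an authenticated POST to the service; credentials and transport are left out of this sketch.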

"Cloud Vision provides powerful image analytics capabilities as easy-to-use APIs," said a Google spokesperson in a video accompanying the blog post. The video depicts a Raspberry Pi-based robot that can move around and identify objects such as faces. Programmed with just a few hundred lines of Python code, the robot can even detect which emotions a face is exhibiting, such as joy, anger or surprise, and, for example, move toward joyful faces and away from angry ones.
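The robot's actual code is not shown in the video or the blog post, but the approach-or-retreat behavior can be sketched in a few lines. The example below assumes a face result shaped like the later public v1 API's face annotation, with `joyLikelihood` and `angerLikelihood` fields holding labels such as `LIKELY`; the function names, the `"approach"`/`"retreat"`/`"stay"` actions, and the threshold are hypothetical choices for illustration.

```python
# Likelihood labels ordered from least to most likely, following the
# public v1 API's enum. Assumption: the preview used the same labels.
LIKELIHOOD_ORDER = [
    "UNKNOWN",
    "VERY_UNLIKELY",
    "UNLIKELY",
    "POSSIBLE",
    "LIKELY",
    "VERY_LIKELY",
]


def likelihood_rank(label):
    """Map a likelihood label to a number; unrecognized labels rank lowest."""
    if label in LIKELIHOOD_ORDER:
        return LIKELIHOOD_ORDER.index(label)
    return 0


def choose_motion(face, threshold="LIKELY"):
    """Pick a (hypothetical) robot action from one face annotation dict."""
    joy = likelihood_rank(face.get("joyLikelihood", "UNKNOWN"))
    anger = likelihood_rank(face.get("angerLikelihood", "UNKNOWN"))
    cutoff = likelihood_rank(threshold)

    # Anger wins ties so the robot errs on the side of backing away.
    if anger >= cutoff and anger >= joy:
        return "retreat"
    if joy >= cutoff:
        return "approach"
    return "stay"


if __name__ == "__main__":
    happy = {"joyLikelihood": "VERY_LIKELY", "angerLikelihood": "VERY_UNLIKELY"}
    print(choose_motion(happy))
```

Letting anger win ties is a design choice made for this sketch, not something the source describes.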

The Cloud Vision API in Action *(source: Google)*

With entity detection, the robot can identify any "thing" that a developer is interested in. For example, it detected and announced aloud that the video spokesperson was wearing glasses and, separately, holding a banana, a car and money.

"Cloud Vision lets developers take advantage of Google's latest machine learning technologies quite easily," the spokesperson said. Those machine learning technologies include TensorFlow, which was just recently open sourced by Google.

"With Cloud Vision API, you can build metadata on your image catalog, moderate offensive content, or enable new marketing scenarios through image sentiment analysis," Google said. It invited interested developers to sign up to participate in the limited preview of the API.

About the Author

David Ramel is an editor and writer at Converge 360.
