Google Expands Access to Big Data Tool
- By David Ramel
- April 16, 2015
Google Inc. today announced its Cloud Dataflow tool for Big Data analytics is being moved to beta status, making it available to more developers.
The tool, which had been in a private alpha program, enables large-scale event-time-based stream processing or batch processing of data. Google said it fits use cases such as extract, transform, load (ETL), analytics, real-time computing and orchestration of processes.
"Now, consistently processing streaming data at large scale doesn't have to be a complex and brittle endeavor that's reserved for the most critical scenarios," said exec William Vambenepe in a blog post today.
Developers can use the service with unified programming primitives that can be used in use cases such as session analysis, detection of anomalies and funnel analysis.
To help with that, an open source SDK -- using Java -- is available.
Google said the service runs on ordinary Google Compute Engine instances and integrates with other company solutions such as Cloud Storage, Cloud Pub/Sub and BigQuery.
Speaking of those services, Google also announced BigQuery -- a cloud-native, API-driven service for using SQL queries -- has received security and performance enhancements and new European zone availability.
The company said the Cloud Pub/Sub service provides many-to-many, asynchronous messaging running on its cloud infrastructure.
Google made the announcements at the Hadoop Summit in Brussels.
David Ramel is an editor and writer for Converge360.