Cask Puts Big Data Sandbox on AWS Cloud

Cask is now offering a sandbox on the Amazon Web Services Inc. (AWS) cloud to let users more easily and quickly evaluate the company's flagship Big Data platform.

The new Cloud Sandbox for AWS, available through the AWS Marketplace, provides a single-node instance of the company's Cask Data Application Platform (CDAP) with normal functionality except for a scaling limitation. It's based on CDAP 4.2, released earlier this month with support for Apache Spark 2.x and easier reuse of existing Spark code in new applications, among other updates.

Cask said the new sandbox simplifies evaluation of the platform because users don't have to set up a Hadoop cluster to use it.

The AWS offering is functionally equivalent to the company's CDAP Local Sandbox, which runs on individual on-premises machines. It bundles the CDAP SDK, runtime, CDAP UI and CDAP CLI (command-line interface) along with tools and examples.

Figure: CDAP 4.2 (source: Cask)

If work in the sandbox results in a production-ready application, Cask said that application can easily be moved to a distributed CDAP environment, such as the Amazon EMR (Amazon Elastic MapReduce) service.

"Users can also access Cask Market from the instance that has a lot of readily available, out of the box capabilities to experience CDAP," the company's Derek Wood said in a blog post. "We have added a number of AWS specific integrations and made them available in Cask Market -- plugins to read from S3 buckets, plugins read/write to Redshift and a plugin read from Snowflake."

Company CEO Jonathan Gray also weighed in, saying, "It is no secret that the path to extracting value from Big Data investments has been a long and rocky one for many companies. Onboarding even skilled engineers new to Hadoop, setting up a proper distributed computing environment, and dealing with the integration of multiple Big Data technologies and tools have taken a big toll on the pace with which users ramp up their Big Data projects. With the new Cloud Sandbox, CDAP users can be productive developing Big Data solutions within minutes rather than being concerned about installation and configuration questions."

About the Author

David Ramel is an editor and writer for Converge360.