Cloudera's 'Self-Service Tool for Data Scientists' Generally Available Today -- ADTmag

Cloudera's 'Self-Service Tool for Data Scientists' Generally Available Today

By David Ramel
May 1, 2017

Newly public company Cloudera Inc. today announced its "self-service tool for data scientists" is now generally available after a preview program.

The company in March announced a private beta for Cloudera Data Science Workbench, a Web-based tool that allows for Big Data analytics -- with a focus on machine learning (ML) -- using the Python, R and Scala programming languages in customizable project environments that can leverage different libraries and frameworks as needed.

The company said security and compliance functionality comes with support for Hadoop authentication, authorization, encryption and governance. Operating on-premises or in the cloud, it lets data scientists use popular Big Data tools Apache Spark and Apache Impala to directly access data in those secure Hadoop clusters.

"We are entering the golden age of machine learning and it's all about the data. However, data scientists continue to struggle to build and test new analytics projects as fast as they would like, particularly in large scale environments," said exec Charles Zedlewski in a news release.

**[Click on image for larger view.]** Using the Workbench with Python *(source: Cloudera)*

"The Data Science Workbench is a self-service tool that accelerates the ability to build, scale and deploy machine learning solutions using the most powerful technologies. This means that data scientists now have the freedom to share, collaborate and manage their data in a way that best suits them and their enterprise, resulting in an easier and faster path to production."

According to the product's Web site, data scientists can use the workbench to set up analytics pipelines as they wish, using functionality such as built-in scheduling, monitoring and e-mail alerting. Such customized pipelines allow for the development and prototyping of new ML projects before they go into to production, Cloudera said.

In citing the tool's ability to integrate with various deep learning frameworks, the company has focused on Intel's BigDL, a library for deep learning using Spark that was open sourced by Intel.

"The benefits of BigDL integration into Data Science Workbench include the ability to leverage deep learning libraries and tactics on CPU architecture without any additional hardware considerations or separate environments," the company said." The combination provides a convenient way to create Spark data science pipelines natively and integrate them with deep learning library (BigDL) and other Spark/Hadoop components on the Cloudera Data Science Workbench." More on the BigDL integration can be found in the blog post, "BigDL on CDH and Cloudera Data Science Workbench," published Friday.

About the Author

David Ramel is an editor and writer at Converge 360.

Featured

AppTrends

Email Address*Country*

Please type the letters/numbers you see above.

Upcoming Training Events

0 AM

Live! 360 2-Day Hands-On Seminar: From Traction to Production: Building Generative AI Applications with Azure AI Studio
March 25-26, 2025

VSLive! 4-Day Hands-On Training Seminar: Hands-on with Blazor
May 5-8, 2025

Cybersecurity & Ransomware Live! VirtCon 2025
May 13-15, 2025

VSLive! 3-Day Hands-On Training Seminar: Master Modern JavaScript: Unlock the Full Potential of Your Code
June 2-4, 2025

VSLive! 2-Day Hands-On Training Seminar: Asynchronous and Parallel Programming in C#
June 24-25, 2025

VSLive! 4-Day Hands-On Training Seminar: Immersive .NET Full Stack Training: 4-Day Hands-On Experience
July 15-18, 2025

Visual Studio Live! @ Microsoft HQ
August 4-8, 2025

Visual Studio Live! San Diego
September 8-12, 2025

Live! 360 2-Day Hands-On Seminar: Swimming in the Lakes of Microsoft Fabric and AI – A Hands-on Experience
September 18-19, 2025

Live! 360 Orlando
November 16-21, 2025

Artificial Intelligence Live! Orlando
November 16-21, 2025

Cloud & Containers Live! Orlando
November 16-21, 2025

Cybersecurity & Ransomware Live! Orlando
November 16-21, 2025

Data Platform Live! Orlando
November 16-21, 2025

Visual Studio Live! Orlando
November 16-21, 2025

VSLive! 4-Day Hands-On Training Seminar: Immersive .NET Full Stack Training: 4-Day Hands-On Experience
December 16-19, 2025

Free White Papers

More Tech Library