Data Science Tool Adds Apache Spark Support -- ADTmag

Data Science Tool Adds Apache Spark Support

By David Ramel
October 16, 2015

The latest update of the data science tool from startup Dataiku includes support for Apache Spark, the open source data processing engine rapidly becoming one of the most popular technologies in use for Big Data analytics.

Version 2.1 of the company's Data Science Studio (DSS) is integrated with Spark and also provides better graphs and charts, notebooks with iPython, SparkR, R, Hive and Impala code samples, and certified plugins that let developers connect to varied data sources, Dataiku said.

Spark is ascending rapidly in the data analytics ecosystem, commonly described as the most active and popular big Data-related open source project and one of the most popular open source products of any kind.

"Pairing the capabilities of Spark with the advanced analytics features of DSS creates significant opportunities for those looking to leverage very large Hadoop data sets, often ranging into the terabytes, and it also allows users to process that information much more quickly," the company said in a statement.

The core components of DSS, called Visual Recipes, can now be executed on the Spark framework, letting developers take advantage of the SparkSQL programming language.

"Apache Spark integration also gives DSS the ability to work with Spark R, SparkSQL and PySpark, which brings R, SQL, and Python-based programing to the Spark environment," Dataiku said. "Much like the other components of Spark, PySpark and Spark R eases and speeds the native capabilities found in DSS and makes Spark a viable alternative to the traditional Hadoop/Hive stack, while also allowing analysts to share data engineering recipes and limit the need to recode or redevelop algorithms."

Having this year raised $3.7 million, the Paris-based startup now claims more than 50 customers of its " all-in-one predictive analytics development platform."

About the Author

David Ramel is an editor and writer at Converge 360.

Featured

AppTrends

Email Address*Country*

Please type the letters/numbers you see above.

Upcoming Training Events

0 AM

VSLive! 2-Day Hands-On Training Seminar: Asynchronous and Parallel Programming in C#
June 24-25, 2025

VSLive! 4-Day Hands-On Training Seminar: Immersive .NET Full Stack Training: 4-Day Hands-On Experience
July 15-18, 2025

Securing IT in the AI Era
July 23, 2025

VSLive! 4-Hour In-Depth Workshop: Immersive .NET Full Stack Training: C# Interfaces: Effective Usage while Avoiding Pitfalls
July 29, 2025

Visual Studio Live! @ Microsoft HQ
August 4-8, 2025

4-Hour VSLive! Workshop: Testability in .NET
August 27, 2025

Visual Studio Live! San Diego
September 8-12, 2025

Live! 360 2-Day Hands-On Seminar: Swimming in the Lakes of Microsoft Fabric and AI – A Hands-on Experience
September 18-19, 2025

VSLive! 2-Day Hands-On Training Seminar: Hands-On with .NET Web Development in 2025
October 7-8, 2025

Live! 360 Orlando
November 16-21, 2025

Artificial Intelligence Live! Orlando
November 16-21, 2025

Cloud & Containers Live! Orlando
November 16-21, 2025

Cybersecurity & Ransomware Live! Orlando
November 16-21, 2025

Data Platform Live! Orlando
November 16-21, 2025

Visual Studio Live! Orlando
November 16-21, 2025

VSLive! 4-Day Hands-On Training Seminar: Immersive .NET Full Stack Training: 4-Day Hands-On Experience
December 16-19, 2025

Visual Studio Live! Las Vegas
March 16-20, 2026

Free White Papers

More Tech Library