Big Data News & Articles


Spark Poised To Break from Hadoop, Move to Cloud, Survey Says

The popular Apache Spark project is poised to break from the Hadoop ecosystem as an independent data processing tool, and it may shift from on-premises installations to the cloud, according to new research.

Freelancing Site Says Machine Learning Is Hottest Skill

Freelancing site Upwork identified machine learning as the hottest skill in demand by employers in its latest skills index.

Databricks Spark Platform Gets Deep Learning Boost

Databricks said it's wedding Big Data with deep learning in the latest update to its Apache Spark-based platform.

IBM, Microsoft Vie for Lead in New-Age Advanced Computing

Whether it's machine learning, artificial intelligence, cognitive computing or whatever, new-age software development is opening up huge new opportunities, with IBM and Microsoft vying for the lead in this new technological space.

BI-on-Hadoop Benchmark Reveals Analytical Engine 'Sweet Spots'

BI-on-Hadoop specialist AtScale's recent analytical engine benchmark study concludes that organizations will probably need to use multiple such engines for a successful implementation able to handle varied workloads.

Periscope Data Research Touts Amazon Redshift over Snowflake, BigQuery

Periscope Data named Amazon Redshift as the best cloud-based data warehouse offering in recent testing conducted to better advise its customers on which technology to choose.

Pentaho Platform Visualizes Big Data at All Stages

Pentaho will soon launch a new platform that provides Big Data visualization during all stages of the analytics pipeline, including early-on data preparation.

Devs Await Open Source Word After Commercial RethinkDB Effort Fails

With the company behind the RethinkDB project having failed and its engineering team scooped up by Stripe, Big Data developers are awaiting further word on plans to continue it as fully open source.

Qlik Playground Provides Online Big Data Visualization

Big Data visualization specialist QlikTech International AB announced a new playground to help Web developers try out its Qlik Analytics Platform.

What's Driving Apache Spark Growth? SQL, Streaming and Machine Learning

Databricks, the primary commercial steward behind the popular open source Apache Spark project, published a new report indicating the technology is still red-hot, driven by more use of SQL, streaming analytics and machine learning.

Big Data Product Watch 9/30/16: Apache Spark 2.0, Microservices, HDInsight, More

With an industry conference just concluded in New York, here's a roundup of this week's Big Data news, featuring new products and services from Cloudera, MapR, Hortonworks, Pentaho, Cask, Zoomdata, Blue Talon, Alation, Splice Machine and ODPi.

Apache Kafka Grabs Big Data Spotlight

Apache Kafka is increasingly generating buzz in the industry with a Spark-like climb into the Big Data spotlight.

IBM's New Project DataWorks Provides AI-Powered Decision-Making

IBM is applying the advanced cognitive computing capabilities of its Watson technology to a new cloud-based data and analytics platform called Project DataWorks.

Oracle Announces Cloud Products, Challenges Amazon

Oracle announced more than 20 new products and services for the Oracle Cloud Platform at its annual OpenWorld conference last week and took aim at its chief competitor in the cloud infrastructure market.

Data Inflexibility Wrecks Analytics Projects, Survey Indicates

A new survey of data analytics frontline pros and executives finds many have experienced project failures, with data inflexibility topping the list of challenges faced by enterprise teams.

Cloudera Teams with CenturyLink for Big Data-as-a-Service

Cloudera Inc. has teamed up with communications company CenturyLink to provide Cloudera's Apache Hadoop-based Big Data analytics software as a service.

Talend Updates Big Data Sandbox with Docker

Talend, which for years has provided a sandbox to explore the world of Big Data for free, is now powering its offering with Docker container technology.

'Developer-Ready' Aruba Mobile Platform Improves Network Programmability

Taking a page from the software-defined networking (SDN) playbook, Aruba today introduced a new 'developer-ready' mobile platform that provides more network programmability via northbound APIs.

Open Source InfluxDB 1.0 Time-Series Database Released

InfluxData Inc. said its new open source InfluxDB time-series database -- just moved to version 1.0 -- was almost three years in the making.

BigQuery Tackles 1 Billion GitHub Files To Reveal Spaces vs. Tabs Developer Preference

Google developer advocate Felipe Hoffa showed off the capabilities of the company's cloud-based BigQuery data warehouse by analyzing some 1 billion files across 400,000 GitHub repositories to see if developers prefer tabs or spaces to indent their code.