Big Data News & Articles


Open Source Big Data Tool Kafka Advances in Enterprise

Big Data is still a hot topic, and industry buzz points to Apache Kafka as one of the hottest projects in that space, a notion supported by a new survey showing its increasing use in enterprise analytics.

SQL Strides in Open Source Big Data

SQL continues to make inroads in the open source Big Data space this week, with Qubole open sourcing Quark for SQL virtualization, while MapR Technologies is converging SQL and JSON in the latest Apache Drill update.

Splunk Update Aims To Lower Cost of Big Data Storage

Splunk Inc., an "operational intelligence" specialist in Big Data analytics, today announced updates to its enterprise offering, with a focus on lowering the cost of storing historical data. The company also updated its cloud-based counterpart to its enterprise software.

Big Data Product Watch 4/1/16: Free SQL Server Dev Edition, Stream Processing, Hybrid Queries, More

No fooling here, just a roundup of news from various vendors including Microsoft, MapR Technologies, AtScale, MemSQL, Looker, Ryft, Dataiku and more who announced Big Data-related products this week.

Google Hosts Public Datasets for BigQuery Analytics

So it turns out I have the seventh most popular first name in the U.S. but a terrible author ranking on Hacker News. You can find where you rank in these areas in the public datasets Google has made available via BigQuery, its fully managed interactive analytics service.

IBM, Microsoft Lead New Spark-Based Offerings

With the Strata + Hadoop World conference underway, IBM, Microsoft and other companies are announcing new solutions based on the popular open source Big Data analytics framework, Apache Spark.

Data Scientists Love Jobs, Dislike What They Do Most: Clean Data

Paradoxically, data scientists love their jobs overall but dislike what they do most, cleaning and organizing data, according to a new survey of those lucky enough to have the "sexiest job of the 21st century."

Google Cloud Targets Machine Learning Developers

At its GCP NEXT 2016 conference, the Google Cloud Platform team announced a new offering designed to mainstream the development of machine learning apps.

Microsoft Beefs Up Power BI Data Tool

Shortly after announcing the first release candidate of SQL Server 2016, Microsoft today made more data-driven news by unveiling new capabilities for Power BI, its cloud-based analytics service targeting non-technical business users.

Altiscale Debuts Cloud for Self-Service Big Data Analytics

Altiscale wants to simplify this Big Data thing, bypassing trained developers, data scientists and expensive, proprietary systems to connect ordinary business users with Hadoop in the cloud and glean their own analytics insights with familiar tools like Tableau and Microsoft Excel.

High Demand and Salaries for Apache Cassandra, Other NoSQL Skills

The years-long Big Data skills shortage still persists despite numerous attempts to alleviate it, resulting in high demand and high salaries for developers with NoSQL skills, especially Apache Cassandra.

HP Service Brings Machine Learning to Mainstream Developers

Yet another effort to democratize notoriously difficult Big Data analytics was announced yesterday by Hewlett Packard Enterprise, which is providing a machine learning (ML) service to bring that advanced technology to mainstream developers.

MapR Upgrades Big Data Platform with Container Features

MapR Technologies today announced its Big Data platform has been upgraded with new features such as persistent storage and integrated resource management for containers.

Big Data Product Watch, 03/03/2016: GraphFrames, DataFlow, Data Funnel, More

Here's a roundup of this week's Big Data news, featuring: an updated platform and new cadence cycle from Hortonworks; GraphFrames, a graph processing library for Apache Spark, from Databricks; the open sourcing of LinkedIn's WhereHows project that provides a repository for metadata; and DMX Data Funnel from Syncsort, for data ingestion.

Cloudera Points Hadoop to Cybersecurity

Cloudera, a commercial vendor of Apache Hadoop-based software for Big Data analytics, has teamed up with other companies to tackle cybersecurity with the technology.

3 Things that Excite Spark's Creator Coming in Spark 2.0

Matei Zaharia, the creator of Apache Spark, recently detailed three "exciting" improvements to the open source Big Data analytics project coming soon in version 2.0.

Firms Provide Freebies for Big Data Students

That pesky Big Data skills shortage apparently isn't going away soon, judging from a rash of new free training resources for students offered up by vendors including MapR Technologies, Databricks and Quoble.

IBM's Open Source Quarks Pushes IoT Analytics to the Edge

IBM has open sourced new technology called Quarks to push Internet of Things analytics from centralized systems out to the actual edge devices that are collecting and spewing out vast amounts of data.

Big Data Product Watch, 2/12/16: RethinkDB on Windows, Spark-Based BI, Self-Service Data Pipelines, More

Here's a round-up of this week's Big Data news from Looker, RethinkDB, Talend and others, featuring self-service data preparation, RethinkDB on Windows, Spark- and Presto-based BI, a turnkey data pipeline creator and more.

Spark Joins Storm in StreamAnalytix Big Data Tool

Impetus Technologies today said it added Apache Spark support to its StreamAnalytix tool to complement existing functionality provided by the Apache Storm project for distributed, real-time streaming analytics.