Big Data is still a hot topic, and industry buzz points to Apache Kafka as one of the hottest projects in that space, a notion supported by a new survey showing its increasing use in enterprise analytics.
SQL continues to make inroads in the open source Big Data space this week, with Qubole open sourcing Quark for SQL virtualization, while MapR Technologies is converging SQL and JSON in the latest Apache Drill update.
Splunk Inc., an "operational intelligence" specialist in Big Data analytics, today announced updates to its enterprise offering, with a focus on lowering the cost of storing historical data. The company also updated its cloud-based counterpart to its enterprise software.
No fooling here, just a roundup of news from various vendors including Microsoft, MapR Technologies, AtScale, MemSQL, Looker, Ryft, Dataiku and more who announced Big Data-related products this week.
So it turns out I have the seventh most popular first name in the U.S. but a terrible author ranking on Hacker News. You can find where you rank in these areas in the public datasets Google has made available via BigQuery, its fully managed interactive analytics service.
With the Strata + Hadoop World conference underway, IBM, Microsoft and other companies are announcing new solutions based on the popular open source Big Data analytics framework, Apache Spark.
Paradoxically, data scientists love their jobs overall but dislike what they do most, cleaning and organizing data, according to a new survey of those lucky enough to have the "sexiest job of the 21st century."
At its GCP NEXT 2016 conference, the Google Cloud Platform team announced a new offering designed to mainstream the development of machine learning apps.
Shortly after announcing the first release candidate of SQL Server 2016, Microsoft today made more data-driven news by unveiling new capabilities for Power BI, its cloud-based analytics service targeting non-technical business users.
Altiscale wants to simplify this Big Data thing, bypassing trained developers, data scientists and expensive, proprietary systems to connect ordinary business users with Hadoop in the cloud and glean their own analytics insights with familiar tools like Tableau and Microsoft Excel.
The years-long Big Data skills shortage still persists despite numerous attempts to alleviate it, resulting in high demand and high salaries for developers with NoSQL skills, especially Apache Cassandra.
Yet another effort to democratize notoriously difficult Big Data analytics was announced yesterday by Hewlett Packard Enterprise, which is providing a machine learning (ML) service to bring that advanced technology to mainstream developers.
MapR Technologies today announced its Big Data platform has been upgraded with new features such as persistent storage and integrated resource management for containers.
Here's a roundup of this week's Big Data news, featuring: an updated platform and new cadence cycle from Hortonworks; GraphFrames, a graph processing library for Apache Spark, from Databricks; the open sourcing of LinkedIn's WhereHows project that provides a repository for metadata; and DMX Data Funnel from Syncsort, for data ingestion.
Cloudera, a commercial vendor of Apache Hadoop-based software for Big Data analytics, has teamed up with other companies to tackle cybersecurity with the technology.
Matei Zaharia, the creator of Apache Spark, recently detailed three "exciting" improvements to the open source Big Data analytics project coming soon in version 2.0.
That pesky Big Data skills shortage apparently isn't going away soon, judging from a rash of new free training resources for students offered up by vendors including MapR Technologies, Databricks and Quoble.
IBM has open sourced new technology called Quarks to push Internet of Things analytics from centralized systems out to the actual edge devices that are collecting and spewing out vast amounts of data.
Here's a round-up of this week's Big Data news from Looker, RethinkDB, Talend and others, featuring self-service data preparation, RethinkDB on Windows, Spark- and Presto-based BI, a turnkey data pipeline creator and more.
Impetus Technologies today said it added Apache Spark support to its StreamAnalytix tool to complement existing functionality provided by the Apache Storm project for distributed, real-time streaming analytics.