A data lake can be an asset to business intelligence systems. But in developing a data lake it's important to avoid pitfalls that can end up creating a data swamp.
Customers of the Amazon Web Services cloud can now more easily create, set up and manage data lakes with the general availability of AWS Lake Formation, which has graduated from preview and promises to relieve some of the associated drudgery.
MongoDB Inc., known for its popular namesake document-oriented NoSQL database that also comes in an open source version, announced it will merge its mobile-centric Realm offering with its serverless Stitch platform.
The TIOBE Index, which tracks programming language popularity based on Web searches, is out with a new report that shows Python reaching new heights, driven by its simplicity that attracts a new wave of software engineers.
The future Silicon Valley-based Big Data unicorn MapR appears to be in crisis.
Databricks has announced the release of Delta Lake, an open source solution designed to provide "reliability for both batch and streaming data" for data lakes.
LinkedIn open sourced a tool it developed to convert Apache Spark-based Big Data into a format consumable by TensorFlow, the popular open source platform for machine learning.
The Linux Foundation has added a new project to foster and advance best practices for data teamwork, borrowing from the Agile approach to software development.
Amid growing enterprise initiatives in cutting-edge technologies like artificial intelligence, Internet of Things, blockchain and others, it's old-fashioned Hadoop that was identified as the fastest-growing freelancer skill in a new report.
A new developer survey about Python reveals the programming language is now used primarily for data analysis, supplanting the previous No. 1 use case, Web development.
MapR Ecosystem Pack 6.1 is a new release of the program that provides for independent updates of modular, open source data projects included in the firm's data analytics platform.
Taking into account earnings potential, job satisfaction and number of openings, careers site Glassdoor has named data scientist the best job in America for 2019 -- for the fourth time running.
While perhaps not offering new conclusions about the hottest technologies in the software development arena, a new Developer Economics survey from SlashData provides more hard evidence that data science and machine learning are the top skills developers want to learn.
Data science and advanced analytics specialist Dataiku announced version 5 of its flagship platform, with new capabilities to enable scalable enterprise AI.
Hortonworks, IBM and Red Hat announced the Open Hybrid Architecture Initiative, seeking to provide a common enterprise deployment model to enable Big Data workloads to run across hybrid on-premises, multi-cloud and edge architectures.
The latest update from MapR, which now describes itself as offering "the industry's leading data platform for AI and analytics," reflects a broader Big Data industry shift from Hadoop, Spark and so on to machine learning and other artificial intelligence technologies.
Big Data specialist MapR Technologies has added support for the Amazon Elastic Container Service for Kubernetes (Amazon EKS) to its flagship data platform.
Cask Data, known for its open source Cask Data Application Platform (CDAP), recently announced that it's being acquired by Google's cloud division.
Many database developers were thrilled with Apple's decision to open source FoundationDB, a distributed datastore it acquired three years ago.
Mode Analytics, seeking to make the power of data analysis "accessible to everyone," announced a toolkit that it said will be free forever, offering some core capabilities of its business data platform, including new support for R Notebooks.