Big Data Product Watch 8/4/14: Data Lakes, Visibility, Project Jumpstarts and More

Here's a roundup of recent Big Data product announcements from Solix Technologies Inc., MongoHQ Inc. and GoGrid, Appfluent Technology Inc., and Actian Corp.

  • Solix announced its Big Data suite with enterprise data lake and archiving functionality.

    The enterprise data management company said the Solix Big Data Suite is the first to provide an Information Lifecycle Management (ILM) framework for enterprise data analytics governance on a petabyte scale.

    The suite comprises Enterprise Data Lake and Enterprise Archiving components.

    The data lake copies incoming production data and stores it "as is" in the Hadoop Distributed File System (HDFS) for more massaging -- such as data description or transformation for analysis -- later, rather than immediately performing intensive extract transform load (ETL) processing.

    Further handling of the data, such as archiving and retirement, is done with the enterprise archiving technology that provides a "nearline" HDFS repository for less-frequently accessed data. The archiving function moves the data and then purges it from its original source based on company ILM policies.

    Solix said using the open source Apache Hadoop for a nearline repository reduces costs and boosts application performance. Universal access to data is provided through structured reports and full-text searching.

    "The Solix Big Data Suite makes the high-volume extraction and retirement of machine data simple," the company quoted customer Giri Chodavarapu at Finisar as saying. "With these larger data sets, we are able to perform meaningful analysis and leverage existing tools to determine root cause analysis and find defects."

  • MongoHQ and GoGrid announced a partnership combining the former's Database-as-a-Service (DaaS) with the latter's 1-Button Deploy solution.

    GoGrid, which describes itself as an Open Data Services company, helps companies run multiple, on-demand Big Data projects, automatically orchestrating the deployment of multi-server Big Data clusters across networks of virtual servers.

    MongoHQ provides the popular NoSQL MongoDB database instantly in the same data center with related applications. It features a real-time dashboard for monitoring, multi-redundant infrastructure and a Web-based console-like experience.

    "MongoHQ's elastic deployments on GoGrid deliver f5 hardware load balancing for application servers, cross-network redundancy and SSD performance through an easy-to-use UI," the companies said in a statement. "This will offer organizations the ability to leverage the advantages of MongoHQ's managed, scalable MongoDB DBaaS, while meeting the demands of systems that require balanced performance and high availability. Additionally, it will offer the ability to take applications to production simply and securely with elastic pricing, which offers the most cost-effective way to deploy MongoDB databases."

    Instead of asking users upfront to estimate how much storage they'll require, MongoHQ's "elastic deployments" charge only for the data used, starting at $18 per gigabyte per month.

    "The partnership will enable developers to run multiple big data evaluations on demand, fast and with a minimum investment of time or resources, and make it simple for developers, systems administrators and IT pros to retain control of their data in the cloud," the companies said.

  • Appfluent announced an upgrade of its Big Data analytic solution, Appfluent Visibility 6.1. It provides usage and performance information across multiple Big Data platforms and data warehouse and BI systems.

    Featuring a new UI, the product can monitor and analyze data usage with various systems such as Teradata, Oracle/Exadata, IBM DB2 and IBM PureData for Analytics, based on Netezza or Hadoop. It provides visibility into every analytic or ETL action on Big Data systems so organizations can see where compute or storage resources are being wasted.

    "Appfluent data usage technology combines features and processes that work together to reveal exactly how your data is used or not used, show how well BI is performing from the users' perspective, and identify the root-cause of database performance issues that are impacting the business," the company said.

  • Actian, which offers the Actian Analytics Platform, announced the Big Data 2.0 Clear Path Program to help organizations quickly get started with analytics projects or move existing projects into production.

    The Clear Path program features analytic blueprints to provide customers with a roadmap to develop solutions to address specific Big Data business challenges. These blueprints consist of a downloadable diagram that illustrates the methodology to be used to address analytics challenges; a shareable workflow that can be used with customer data; and access to an in-person workshop with expert data scientists provided through the company's Analytics Center of Excellence to help customize implementations for specific use cases and data sets.

    Customer Analytics is the first set of blueprints provided, allowing for customer profile analysis for a total view of customers, segmentation of customers down to the level of just one customer, customer prediction and influence of customer decisions, customer retention and discovery of high-value customers, among others. Other blueprints will be rolled out by the company and its partners.

    "Developed by data scientists in Actian's Analytics Center of Excellence, the [program] applies breakthrough Big Data concepts for faster time to value, increased accuracy and deeper discovery leveraging the sophisticated algorithms and lightning-fast performance of the Actian Analytics Platform," the company said.

About the Author

David Ramel is an editor and writer for Converge360.