News

Startups Address Big Data Failures, Redesign the Data Warehouse

The familiar pattern is becoming a cliché: Staffers learn cutting-edge technologies at huge companies, jump ship, form a startup, get funding, emerge from stealth and tell the world they have new ways of doing things better.

Today in the data space it's the former head of IBM Watson Labs, Matt Sanchez, announcing a company called Cognitive Scale Inc. with "the industry's first technology independent cognitive cloud platform" to address the 55 percent failure rate for Big Data projects.

Yesterday it was Bob Muglia, a former longtime Microsoft exec, emerging as the head of Snowflake Computing Inc. with "the first data warehouse built from the ground up for the cloud" based on a patent-pending new architecture.

Cognitive computing is a term popularized at IBM with its supercomputer named Watson that beat all humans in the TV game show "Jeopardy!" and went on to provide its new-age computing style as a service in the cloud. That style somewhat mimics how the human brain works, adapting to ever-changing conditions and goals, learning as it goes, understanding context and obscure relationships, and so on.

Sanchez is founder and CTO of Cognitive Scale, which today announced its Insights Fabric described as a portable cognitive cloud platform based on open technology and standards, promising to help companies get better value from Big Data investments.

Analytics climbing to the cloud.
[Click on image for larger view.] Analytics climbing to the cloud. (source: Cognitive Scale Inc.)

"It does this by extracting patterns and insights from dark data sources that are almost totally opaque today," the company said. "This includes information found in databases, devices, blogs, reviews, e-mails, social media, images and other unstructured data sources."

That opacity might contribute to the problems with current Big Data analytics implementations.

There are three jaw-dropping facts that limit the capabilities of Big Data," Sanchez said in a news release. "First is 55 percent of Big Data initiatives fail. Second is 70 to 80 percent of the world's data is trapped in silos within and outside company walls with no secure and reliable way to access it. Third, valuable insights are lost because 80 percent of data is not machine readable; this is commonly referred to as 'dark data.' We address these gaps through cognitive computing to help customers improve decision making, personalize consumer experiences and create more profitable relationships."

The company didn't source its claim of a 55 percent project failure rate for Big Data, but it might be referencing an Infochimps survey from last year that stated "55 percent of Big Data projects don't get completed, and many others fall short of their objectives."

Cognitive Scale also announced vertical industry clouds now available for use in the travel, health care, retail and financial services arena. It also said it has formed numerous partnerships to grow the cognitive computing ecosystem with companies such as Amazon Web Services Inc. (AWS), Deloitte Development LLC, Avention (formerly OneSource) and several IBM initiatives, including Watson.

The Austin, Texas-based company said it has been working on its technology for 18 months with numerous partners and is expected to close the year with "multimillion dollars" in revenue as it seeks a slice of the cognitive computing pie forecast by Deloitte to grow from $1 billion this year to more than $50 billion by 2018.

Snowflake Computing, meanwhile, yesterday announced its Elastic Data Warehouse, described as a reinvention of the data warehouse, built from the ground up to capitalize on the capabilities of cloud computing.

Snowflake architecture physically separates and logically integrates compute and storage.
[Click on image for larger view.] Snowflake architecture physically separates and logically integrates compute and storage.
(source: Snowflake Computing Inc.)

"Its patent-pending architecture decouples data storage from compute, making it uniquely able to take advantage of the elasticity, scalability and flexibility of the cloud," the company said. "As a native relational database with full support for standard SQL, Snowflake empowers any analyst with self-service access to data, which enables organizations to take advantage of the tools and skills that they already have."

The emphasis on SQL is no surprise, as CEO Bob Muglia is a former 23-year veteran at Microsoft who was involved in that company's SQL Server relational database management system as far back as the late '80s and left in 2011 as a senior exec in charge of the Server and Tools Business Unit.

"As a native relational database with full support for standard SQL, Snowflake empowers any analyst with self-service access to data, which enables organizations to take advantage of the tools and skills that they already have," the company said.

The San Mateo, Calif.-based Snowflake promises to eliminate the hassles of managing and tuning on-premises data warehouses by offering a data warehouse as a service and capitalizing on the unique on-demand benefits of the cloud, such as being able to elastically scale users, data and workloads. The company said every user and workload simultaneously use the exact amount of resources needed without contending with others.

The company's technology also provides for the native storage of semi-structured data in a relational database so it can be queried along with structured data in the same system.

Snowflake's new way of warehousing and querying data is based on 120 technology patents it holds and decades of experience on the part of staffers from companies such as Actian Corp., Cloudera Inc., Google Inc., Microsoft, Oracle Corp. and Teradata.

"Data warehousing is a market ripe for innovation," Muglia said in a statement. "Today's solutions are based on architectures that date back almost 30 years. While data analytics can transform organizations, the infrastructure requirements are prohibitive. The cloud enables a different approach, but that requires the vision and the technical ability to start from scratch. From day one, Snowflake focused on creating a software service that brings together both transactional and machine-generated data for deeper insight and business understanding."

The Snowflake Elastic Data Warehouse is now in beta, and the company invited interested parties to contact it via e-mail for more information.

About the Author

David Ramel is an editor and writer for Converge360.