Amazon Simplifies Big Data Queries of Cloud Data with SQL -- ADTmag

By David Ramel
November 30, 2016

The new Amazon Athena tool from Amazon Web Services Inc. (AWS) enables serverless queries of large amounts of data stored in Amazon Simple Storage Service (Amazon S3), obviating the need to spin up Hadoop clusters or set up data warehouses.

Amazon Athena was unveiled today at the company's re:Invent 2016 conference.

In a news release issued today, the company said, "With a few clicks in the AWS Management Console, customers can point Amazon Athena at their data stored in Amazon S3 and begin using standard SQL to run queries and get results in seconds. With Amazon Athena there are no clusters to manage and tune, no infrastructure to setup or manage, and customers pay only for the queries they run."

Athena promises to simplify the process of querying petabyte-scale data stored in standard data formats such as CSV, log files, JSON, Apache ORC and Apache Parquet.

Although it eliminates the need for standard Big Data tools such as Hadoop, Spark, Hive and Pig, which come primarily from the open source Apache Software Foundation ecosystem, its underlying architecture is based on another open source project, the Presto distributed SQL engine, which originated at Facebook.

"Athena includes an interactive query editor to help get you going as quickly as possible," AWS spokesperson Jeff Barr said in a blog post today. "Your queries are expressed in standard ANSI SQL and can use JOINs, window functions, and other advanced features.

"You can run your queries from the AWS Management Console or from a SQL client such as SQL Workbench, and you can use Amazon QuickSight to visualize your data. You can also download and use the Athena JDBC driver and run queries from your favorite Business Intelligence tool."
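To give a sense of the kind of query Barr describes, here is a minimal sketch that combines a JOIN with a window function. It runs against an in-memory SQLite database from Python's standard library as a local stand-in, not against Athena itself, and the table names, column names and sample rows are all hypothetical. The SQL is standard enough that a similar statement should carry over to other engines, though dialect details may differ.

```python
import sqlite3

# Local stand-in only: an in-memory SQLite database, NOT Amazon Athena.
# Window functions need SQLite 3.25 or later (bundled with recent Python builds).
conn = sqlite3.connect(":memory:")

# Hypothetical tables and sample rows, invented for illustration.
conn.executescript("""
    CREATE TABLE users (user_id INTEGER, region TEXT);
    CREATE TABLE requests (user_id INTEGER, bytes_sent INTEGER);
    INSERT INTO users VALUES (1, 'us-east'), (2, 'us-east'), (3, 'us-west');
    INSERT INTO requests VALUES (1, 500), (1, 300), (2, 1200), (3, 700);
""")

query = """
    WITH totals AS (
        -- JOIN the two tables and total the bytes sent per user.
        SELECT u.region, u.user_id, SUM(r.bytes_sent) AS total_bytes
        FROM requests AS r
        JOIN users AS u ON u.user_id = r.user_id
        GROUP BY u.region, u.user_id
    )
    -- Window function: rank users within each region by total bytes.
    SELECT region,
           user_id,
           total_bytes,
           RANK() OVER (PARTITION BY region ORDER BY total_bytes DESC) AS rank_in_region
    FROM totals
    ORDER BY region, rank_in_region
"""

rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

Each printed row holds the region, the user ID, that user's total bytes and the user's rank within the region.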

An AWS FAQ explains more about the product, including what use cases are best suited for Athena as opposed to other Big Data services such as the Amazon Redshift data warehouse or more sophisticated data processing frameworks such as Amazon EMR.

"The announcement of AWS Athena provides further validation that the demand for data processing in the cloud has been exploding," said Bob Muglia, CEO of Snowflake Computing. "In fact, data processing and analytics has become one of the most important workloads in the cloud. Customers are demanding fast, flexible, and easy ways to store, access and analyze data in the cloud to drive business results."

Barr said Athena is currently available only in the US East (Northern Virginia) and US West (Oregon) regions, but will become available in other regions in the coming months.

Furthermore, he said, "You pay only for the queries that you run; you are charged based on the amount of data scanned by each query (the console will display this information after each query). This means that you can realize significant cost savings by compressing, partitioning, or converting your data to a columnar format."
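The arithmetic behind that claim can be sketched in a few lines. The example below is only an illustration: the per-terabyte rate, the 4:1 compression ratio and the assumption that a query reads one of eight equally sized columns are all hypothetical placeholders, not figures from AWS, and the function name is invented here.

```python
TB = 1024 ** 4  # bytes per terabyte; assumes a binary terabyte for simplicity


def estimate_query_cost(bytes_scanned: int, usd_per_tb: float) -> float:
    """Return the estimated charge for one query, billed on bytes scanned."""
    return bytes_scanned / TB * usd_per_tb


# Hypothetical rate for illustration only; check current AWS pricing.
ASSUMED_USD_PER_TB = 5.0

# Hypothetical data set: 2 TB of uncompressed CSV scanned in full.
raw_bytes = 2 * TB

# Assumption: compression shrinks the data 4:1.
compressed_bytes = raw_bytes // 4

# Assumption: in a columnar format the query reads 1 of 8 equally sized columns.
columnar_bytes = compressed_bytes // 8

scenarios = [
    ("raw CSV", raw_bytes),
    ("compressed", compressed_bytes),
    ("compressed + columnar", columnar_bytes),
]
for label, scanned in scenarios:
    cost = estimate_query_cost(scanned, ASSUMED_USD_PER_TB)
    print(f"{label}: ${cost:.4f}")
```

Because the charge scales linearly with bytes scanned, any technique that lets a query skip data, including partitioning so that only the relevant partitions are read, lowers the bill by the same proportion.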

About the Author

David Ramel is an editor and writer at Converge 360.
