Spark 101: Online Approximate OLAP in SparkSQL

October 18, 2015 by Daniel Gutierrez

The Hadoop Summit 2015 talk below introduces G-OLA, a parallel approximate query engine built on top of BlinkDB and SparkSQL, that provides a radically different “online execution” paradigm to incrementally process massive amounts of data on clusters of hundreds or thousands of machine while returning approximate answers. G-OLA presents the user with a meaningful approximate result (with error bars) that is continuously refined at a speed comfortable to the user and enables them to control the query execution on the fly. The slides for this presentation are available HERE.

Sign up for the free insideAI News newsletter.

Filed Under: Main Feature, Uncategorized Tagged With: Weekly Newsletter Articles

From Legacy to Leading Edge: How Mainframe Data Can Transform AI and Analytics

From Legacy to Leading Edge: How Mainframe Data Can Transform AI and Analytics explores how enterprise data leaders can unlock untapped value by integrating mainframe data into modern AI and analytics ecosystems. Despite powering critical operations, mainframe data remains largely siloed. This whitepaper outlines strategic insights, common roadblocks, and a practical roadmap to help data leaders […]

Download

Comments

Revathy Hari says

October 21, 2015 at 12:31 am

Can spark be used for generate sequential patterns from dynamic streams of big data( especially considering dna sequences as data set)?

Spark 101: Online Approximate OLAP in SparkSQL

Sponsored Guest Articles

Re-Engineering Ethernet for AI Fabric

White Papers

From Legacy to Leading Edge: How Mainframe Data Can Transform AI and Analytics

Comments

Featured RSS Feed

More News from insideHPC

Spark 101: Online Approximate OLAP in SparkSQL

Sponsored Guest Articles

Re-Engineering Ethernet for AI Fabric

White Papers

From Legacy to Leading Edge: How Mainframe Data Can Transform AI and Analytics

Join Us On Social Media

Comments

Featured RSS Feed

More News from insideHPC