Showing posts with label PRESTO. Show all posts
Showing posts with label PRESTO. Show all posts

Friday, 28 February 2020

Presto-powered S3 data warehouse on Kubernetes by @joshua_robinson via @Medium

Joshua Robinson offers up a tutorial on how to set up a Presto data warehouse using Docker that could query data on a FlashBlade S3 object store, and a follow-up tutorial that explains how to move everything, including the Hive Metastore, to run in Kubernetes.

This is very useful to read and might help you to achieve something quicker than you have planned.

Saturday, 5 May 2018

Presto for Data Scientists – SQL on anything by Kamil Bajda-Pawlikowski via @kdnuggets

Presto enables data scientists to run interactive SQL across multiple data sources. This open source engine supports querying anything, anywhere, and at large scale.

I have to agree with Kamil - download a free version of it and try it - I think you will be pleasantly surprised.

Wednesday, 18 November 2015

WEBINAR: Best Fit Engineering for SQL on Hadoop - 24 November 2015


Overview
Title: Best Fit Engineering for SQL on Hadoop
Date: Tuesday, November 24, 2015
Time: 09:00 AM Pacific Standard Time
Duration: 1 hour
Summary

Best Fit Engineering for SQL on Hadoop
Join us for our latest DSC Webinar series as we discuss how enterprises have increasingly large volumes of structured and semi-structured data generated by all sorts of applications.  Much of that data is increasingly finding its way into Hadoop clusters for analytics because of its versatility and the economical, linear scalability of both data storage and compute.  And SQL is still the best option for querying it:
  • SQL is the universal connector to many BI tools and technologies
  • Prevalent SQL skills overcome the Hadoop skills gap
  • Hadooponomics enables more analytics on more data at a much lower cost
Forrester recently concluded that organizations need to choose more than one SQL-on-Hadoop tool to satisfy all requirements. Hortonworks and Teradata agree in this “best fit engineering” approach designed to match the benefits of each tool set to map to actual workload requirements, while remaining true to 100% open source innovation. 
You will learn about SQL on Hadoop best practices, including:
  • A brief history of SQL on Hadoop
  • Architecture and use cases for Hive and Presto
  • Technical deep dive and futures for Hive and Presto 
Speakers: 
Mark Shainman, Program Manager -- Teradata
Mark Lochbihler, Director, Partner Engineering -- Hortonworks
Hosted by: Bill Vorhies, Editorial Director -- Data Science Central

Teradata Hortonworks logo

Register here