Joshua Robinson offers up a tutorial on how to set up a Presto data warehouse using Docker that could query data on a FlashBlade S3 object store, and a follow-up tutorial that explains how to move everything, including the Hive Metastore, to run in Kubernetes.
This is very useful to read and might help you to achieve something quicker than you have planned.
This is a blog containing data related news and information that I find interesting or relevant. Links are given to original sites containing source information for which I can take no responsibility. Any opinion expressed is my own.
Friday, 28 February 2020
Thursday, 27 February 2020
WEBINAR: Developing and Testing Shiny Apps - 12 March 2020
Data Science Central Webinar Series Event | |||||||||||||||||||
|
Wednesday, 26 February 2020
Google has released a tool to spot faked and doctored images by Karen Hao via @techreview
Google has released a tool to spot faked and doctored images.
Very much needed - I really hope it works and is adopted widely.
Very much needed - I really hope it works and is adopted widely.
Monday, 24 February 2020
Deep learning isn’t hard anymore by Caleb Kaiser via @TDataScience
Deep learning used to require large amounts of data, deep pockets, and a novel, usually custom-built, architecture. But with transfer learning (which takes a pre-trained model and retrains the last layers of the model to focus on a new task), a single engineer can deploy a model in a new domain in a matter of days
There is a great link in the article to a primer on Transfer Learning which is well worth the time investment in reading and learning so you can take advantage of that technique.
There is a great link in the article to a primer on Transfer Learning which is well worth the time investment in reading and learning so you can take advantage of that technique.
Friday, 21 February 2020
All Machine Learning Models Explained in 6 Minutes by Terence Shin in @TDataScience
Intuitive explanations of the most popular machine learning models.
This deserves applause, a bookmark and sharing.
This deserves applause, a bookmark and sharing.
Thursday, 20 February 2020
WEBINAR: Forecasting: Prophet & Time Series Database - 25 February 2020
Data Science Central Webinar Series Event | |||||||||||||||||||
|
Wednesday, 19 February 2020
Interested in machine learning? Better learn PyTorch by Matt Asay via @infoworld
Don’t look now, but easy, straightforward PyTorch has become the hottest product in data science.
As it rivals Tensorflow I would probably suggest you get a grounding in both (if you can).
As it rivals Tensorflow I would probably suggest you get a grounding in both (if you can).
Monday, 17 February 2020
How AI is battling the coronavirus outbreak by Rebecca Heilweil via @voxdotcom
AI helped spot an early warning about the outbreak and now researchers are using flight data to predict where the coronavirus could pop up next.
AI can be hugely helpful in anything connecting with medicine as it can work out patterns and spot changes that standard methods would either miss or take so long to find that the information was out of date.
AI can be hugely helpful in anything connecting with medicine as it can work out patterns and spot changes that standard methods would either miss or take so long to find that the information was out of date.
Friday, 14 February 2020
Demand for big data-as-a-service growing at 25% annually by Bob Violino via @infomgmt
With big data-as-a-service, tools such as analytics software and storage are delivered via the cloud by a service provider.
Definitely, a cheaper way to have big data.
Definitely, a cheaper way to have big data.
Wednesday, 12 February 2020
Blockchain in 5 industries by Alison McCauley via @oreillymedia
Tuesday, 11 February 2020
WEBINAR: How a Physics-Driven Analytics Platform Detects Reliability Threats - 26 February 2020
|
Monday, 10 February 2020
AI Can Do Great Things—if It Doesn't Burn the Planet by/via @wired
OpenAI created an algorithm that successfully manipulates the pieces of a Rubik’s Cube using a robotic hand. But this accomplishment cost more than research time and effort—one estimate says it may have consumed about 2.8 gigawatt-hours of electricity, roughly equal to the output of three nuclear power plants for an hour. The computing power required for AI breakthroughs increased 300,000-fold from 2012 to 2018, creating an environmental impact that needs to be considered.
Something that we often don't consider but really should if we are serious about saving the planet.
Something that we often don't consider but really should if we are serious about saving the planet.
Friday, 7 February 2020
Google just published 25 million free datasets by @tjwaterman99 via @TDataScience
What you need to know about the largest data repository in the world.
Useful for practice if nothing else.
Useful for practice if nothing else.
Wednesday, 5 February 2020
WEBINAR: Organize a Winning AI Team from Design to Deployment - 19 February 2020
Data Science Central Webinar Series Event | |||||||||||||||||||
|
Scale the value of analytics by Rita Sallam and Carlie Idoine.via @Gartner_inc
Manually identifying patterns in the quantities of data most organizations have is not just tedious; it’s inefficient. It’s also much too easy to find only what you were looking for—missing valuable, but unanticipated, insights. Augmented analytics uses ML and AI techniques to identify actionable insights.
Brief if you are not a subscriber but interesting to read.
Brief if you are not a subscriber but interesting to read.
Monday, 3 February 2020
5 Ways Julia Is Better Than Python by Emmett Boudreau via @TDataScience
Why Julia is better than Python for DS/ML
A nice short piece that should make you consider using Julia - even if it is not in your plans please at least install it and have a play.
A nice short piece that should make you consider using Julia - even if it is not in your plans please at least install it and have a play.
Subscribe to:
Posts (Atom)