Showing posts with label DATA LIFECYCLE. Show all posts
Showing posts with label DATA LIFECYCLE. Show all posts

Sunday, 30 April 2017

Data Lineage Demystified: The What, Why, and How by Michelle Knight via @Dataversity

Trusting Big Data requires understanding its Data Lineage. Without Data Lineage, Big Data becomes synonymous with the last phrase in a game of telephone.

Michelle is right - you have to know the system of record and the data flow for the data that you are using, and you need to know that for ALL data that you use.  You need to also understand the quality of that data and what to do when values are missing (preferably that should never happen but we all live in the real world with legacy systems).

Thursday, 29 September 2016

SLIDESHOW: 7 Key Considerations When Choosing a Data Pipeline Service via @infomgmt

Picking a service that manages your data isn’t something to be taken lightly. You’ll want to research a few different services before choosing what is right for your company. Here is advice on how to look at different services to make sure you really understand the value each brings to the table.

I find slide 9 quite important even though the sideshow says it is a bonus.  In the past data has been put into a data warehouse or whatever is driving your reporting and just sits there forever.  All data has a shelf life and needs to be moved, updated or archived off in a timely manner else you run the risk of your reports and analyses being incorrect.