An introduction to Hadoop, a framework that enables you to store and process large data sets in parallel and distributed fashion.
A nice little overview of Hadoop, although I do agree with the first comment by Randy about relational databases.
This is a blog containing data related news and information that I find interesting or relevant. Links are given to original sites containing source information for which I can take no responsibility. Any opinion expressed is my own.
Thursday, 27 September 2018
Wednesday, 26 September 2018
Why organisations should regularly assess the KPIs they track by Kayla Matthews via @infomgmt
KPIs alone are not enough. Instead, it’s necessary to regularly re-evaluate all applicable KPIs to ensure they’re still providing information that’s relevant to the business at large and in line with its data governance strategies.
I definitely agree that it is important to adjust KPIs so that they stay relevant to your organisation and actually achieve what you want them to. Think of all the time and money that could be wasted by not adjusting them.
Tuesday, 25 September 2018
Artificial general intelligence: Dream goal, nightmare scenario or fantasy? by Herb Roitblat via @infomgmt
The quest for artificial general intelligence is the holy grail of artificial intelligence research, and, arguably, just as difficult to find. It may be a myth.
This is a really interesting piece by Herb that makes you stop and think about what is and is not possible. Definitely worth reading when you have time to stop and think about it.
Monday, 24 September 2018
Getting to ROI with AI in the enterprise by Tom Wilde via @infomgmt
Despite its promise, and its growing adoption, there is still too little we can point to in terms of real business results from artificial intelligence.
Some great thoughts from Tom, although I'm certainly not convinced that many organisations have actually achieved a tangible return on their investment in AI.
Saturday, 22 September 2018
Essential Math for Data Science: ‘Why’ and ‘How’ by Tirthajyoti Sarkar via @kdnuggets
It always pays to know the machinery under the hood (even at a high level) than being just the guy behind the wheel with no knowledge about the car.
This is really useful - you can teach yourself statistics if your own skills are not up to scratch.
Friday, 21 September 2018
5 top strategies to make development cycles more efficient by Charles Dearing via @infomgmt
Software development is fraught with all sorts of pitfalls. Adopting the principles of Agile software development is one way to combat these inevitable pitfalls.
Some useful advice.
Thursday, 20 September 2018
WEBINAR: 4 Ways to Tackle Common Data Prep Issues - 25 September 2018
Register here
New open challenge seeks to promote ethics in the use of AI and the news by David Weldon via @infomgmt
Toward that goal, a new open call is offering $750,000 for ideas that will shape the impact artificial intelligence has on the field of news and information.
Something to think about entering - I'm sure we all have thoughts on this.
Tuesday, 18 September 2018
Key steps to ensure data protection amidst the growth of mobile apps by Nathan Sykes via @infomgmt
As data protection regulations grow and the laws become more stringent, it has also become much more difficult to follow them because of widespread mobile adoption.
Some useful pointers to steer you in the right direction.
Friday, 14 September 2018
WEBINAR: Columnar Databases: Best Choice for Real-Time Analytics - 19th September 2018
Register here
Thursday, 13 September 2018
AI Knowledge Map: How To Classify AI Technologies by Francesco Corea via @kdnuggets
What follows is then an effort to draw an architecture to access knowledge on AI and follow emergent dynamics, a gateway of pre-existing knowledge on the topic that will allow you to scout around for additional information and eventually create new knowledge on AI.
I love the diagram and explanations in this article - it is worth printing and keeping to hand.
Wednesday, 12 September 2018
How blockchain technology could aid key data challenges by Kevin Peek via @infomgmt
A variety of healthcare provider organisations and health insurers are just beginning to deploy distributed information to solve vexing data issues.
Yes, blockchain gives definite benefits, and I think that organisations should be looking at it seriously to see if it solves some of the problems they are experiencing.
Tuesday, 11 September 2018
Master data management is not the answer to GDPR compliance by Aaron Zornes via @infomgmt
By themselves, neither data governance nor MDM offer sufficient capabilities to meet GDPR requirements. Together, we are much more empowered as an organisation.
This article contains some really interesting points that I had not realised. Worth reading and thinking about - maybe they are true in your own organisation and you are not aware and may need to make some changes.
Monday, 10 September 2018
Digital 'fixation' causing firms to throw good money at bad projects by Bob Violino via @infomgmt
Organisations risk wasting millions of dollars in the next 12 months, as they rush into flawed digital projects, according to a new study.
I have to agree - you must find a benefit from each project, and it is also important to look afterwards to ensure that the project actually DID give the benefits that were suggested. It might mean you have a few failures, but in the long term it can only improve the process of providing evidence of potential benefits for proposed projects.
Sunday, 9 September 2018
Machine Learning Cheatsheet via @readthedocs
Brief visual explanations of machine learning concepts with diagrams, code examples and links to resources for learning more.
Definitely something to be bookmarked.
Saturday, 8 September 2018
Data Visualisation Cheat Sheet by @jschwabish via @kdnuggets
Core principles for successful data visualisation, including tips on how to reduce clutter, preattentive processing and how to integrate text within the graph.
This is so very useful and is worthy of a bookmark for sure.
Friday, 7 September 2018
How advanced OCR found new life in big data systems by Anna Johansson via @infomgmt
Today, optical character recognition, in combination with natural language processing, allows businesses to perform complex data extraction tasks.
A great idea - use OCR to scan in old paper documents to fill the gaps in your online data - you will never get accurate results on analytics if you are missing data.
Thursday, 6 September 2018
To succeed at digital transformation, do a better job of data governance by Darren Cooper via @infomgmt
To set the stage for initiatives like AI and machine learning, companies need a rock-solid governance framework.
Great suggestions by Darren in this article.
Wednesday, 5 September 2018
GDPR compliance the perfect opportunity to modernise data architecture by Amandeep Khurana via @infomgmt
Compliance with the data privacy and security mandate enables organisations to become more agile in their product and service development and rollouts, and more efficient and effective in their ability to respond to market trends and competitive threats.
Yes, this is exactly right - everything has to turn on its head and be data centric, not application centric. I think we need to concentrate on:
WHERE is the data created?
WHERE is it also stored (so where is it interfaced to)?
HOW is it updated?
WHAT changes when it is updated?
HOW do you delete the data in ALL systems?
I would suggest you do something like a data flow diagram so you can document all of this for every piece of data.
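As a rough illustration of my own (not something from the article), the answers to those questions could be captured as one small record per piece of data. This is only a sketch: the class name, field names and system names such as "crm" and "billing" are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataFlowRecord:
    """One entry per piece of data; every name here is a hypothetical example."""
    name: str
    created_in: str                                              # WHERE is the data created
    also_stored_in: List[str] = field(default_factory=list)      # WHERE is it also stored
    update_method: str = ""                                      # HOW is it updated
    changes_on_update: List[str] = field(default_factory=list)   # WHAT changes when it is updated

    def systems_to_delete_from(self) -> List[str]:
        """Every system holding the data, i.e. everywhere a deletion must reach."""
        others = [s for s in self.also_stored_in if s != self.created_in]
        return [self.created_in] + others


record = DataFlowRecord(
    name="customer_email",
    created_in="crm",
    also_stored_in=["billing", "warehouse"],
    update_method="nightly batch sync",
    changes_on_update=["billing.contact_email"],
)
print(record.systems_to_delete_from())
```

Even a simple list of records like this gives you somewhere to start when a deletion request arrives, because the last question (delete it in ALL systems) can be answered from the record itself.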
Tuesday, 4 September 2018
The bias problem with artificial intelligence, and how to solve it by Sanjay Srivastava via @infomgmt
AI bias may come from incomplete datasets or incorrect values. Bias may also emerge through interactions over time, skewing the machine’s learning. Moreover, a sudden business change, such as a new law or business rule, or ineffective training algorithms can also cause bias.
I agree - you need good quality and representative training data if you want to get good results from any AI and ML you want to use. My advice would be:
1. Take your time - rushing always leads to mistakes so be realistic with plans.
2. Be careful with the methodology you use to create and split your data into Training and Test sets.
3. Try to use separate teams to test the same piece of code - the hope being that it will help to avoid the bias. Think of it as a human version of a small parallel ML solution.
4. Check, check and check again.
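To make point 2 a little more concrete, here is a minimal sketch of one possible splitting method, a stratified split, which keeps each label's share roughly the same in both sets. This is my own illustration rather than anything from the article; the function name, the 20% test fraction and the fixed seed are all assumptions.

```python
import random
from collections import defaultdict


def stratified_split(rows, label_of, test_fraction=0.2, seed=42):
    """Split rows into (train, test), keeping each label's share similar in both.

    label_of is a function returning the (sortable) label of a row.
    """
    rng = random.Random(seed)  # fixed seed so the split is repeatable
    by_label = defaultdict(list)
    for row in rows:
        by_label[label_of(row)].append(row)

    train, test = [], []
    for label in sorted(by_label):
        group = list(by_label[label])
        rng.shuffle(group)
        n_test = round(len(group) * test_fraction)
        test.extend(group[:n_test])
        train.extend(group[n_test:])
    return train, test


# Hypothetical data: 80 rows labelled "a" and 20 rows labelled "b".
rows = [("a", i) for i in range(80)] + [("b", i) for i in range(20)]
train, test = stratified_split(rows, label_of=lambda row: row[0])
print(len(train), len(test))
```

A plain random split can leave a rare label almost absent from one of the sets, which is one way that unrepresentative data creeps in.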
Monday, 3 September 2018
Community lenders tell big tech vendors to get up to speed by Nathan DiCamillo via @infomgmt
Small banks and credit unions say slow responses and outdated products from the establishment tech vendor can become a drag on their innovation efforts.
I partially agree with him - yes, large organisations move slowly (particularly when you are a small customer and therefore your business is not a big loss to them if you move on), but small ones are less stable, and sometimes that can be an unacceptable risk to the business (particularly in the financial sector, where you just cannot afford an issue). So do really careful risk management and have SLAs to protect yourself.