A number of advancements have now decreased data preparation time while increasing the time available for exploration and applications.
I think there are several possible tools that enable users to do their own querying and this article talks about the one that the author is most familiar with. Some organisations use Tableau, Pentaho or Power BI. I'm sure there are others I have not listed.
This is a blog containing data related news and information that I find interesting or relevant. Links are given to original sites containing source information for which I can take no responsibility. Any opinion expressed is my own.
Monday, 30 April 2018
Sunday, 29 April 2018
Security pros complain the cloud obscures compliance issues by Bob Violino via @infomgmt
Half of the organisations surveyed said existing tools aren’t effective in the cloud and an overabundance of tools makes it almost impossible to prioritise IT and security investments.
I can relate to their concerns - I'm not convinced that cloud is mature enough and all the tools are in place to manage all the areas that should be. I think we need more centralisation of security tools and especially those in the cloud area - yes there are some tools that exist as part of cloud management suites but they do not interface or fit in with the other sides of data security and we need something to make sure it can become more central.
I can relate to their concerns - I'm not convinced that cloud is mature enough and all the tools are in place to manage all the areas that should be. I think we need more centralisation of security tools and especially those in the cloud area - yes there are some tools that exist as part of cloud management suites but they do not interface or fit in with the other sides of data security and we need something to make sure it can become more central.
Understanding Feature Engineering - 4 part article by Dipanjan Sarkar via @TDataScience
Great 4 part series that you really need to set some time aside so you can sit and read these:
1 - Strategies for working with continuous, numerical data
2 - Strategies for working with discrete, categorical data
3 - Traditional strategies for taming unstructured, textual data
4 - Newer, advanced strategies for taming unstructured, textual data
1 - Strategies for working with continuous, numerical data
2 - Strategies for working with discrete, categorical data
3 - Traditional strategies for taming unstructured, textual data
4 - Newer, advanced strategies for taming unstructured, textual data
Saturday, 28 April 2018
7 Books to Grasp Mathematical Foundations of Data Science and Machine Learning by Ajit Jaokar via @kdnuggets
It is vital to have a good understanding of the mathematical foundations to be proficient with data science. With that in mind, here are seven books that can help
A great list of books to help with this area. Certainly I know I need to be better at this.
A great list of books to help with this area. Certainly I know I need to be better at this.
Friday, 27 April 2018
Understanding fast data and its importance in an IoT-driven world by Kayla Matthews via @infomgmt
Processing high volumes and continuous streams of information in real-time with low to medium latency, it is scalable, has a high uptime and can quickly recover from failure situations.
I think for me you have to have :
- Clean Data - it has to be good data that is not rubbish.
- Data Management - you have to understand exactly what you have and what it means. It has to have consistent definition and there must be some sort of validation to make sure it is correct.
- Process Consistency - your processes have to be consistent too so everyone works off the same thing.
I think for me you have to have :
- Clean Data - it has to be good data that is not rubbish.
- Data Management - you have to understand exactly what you have and what it means. It has to have consistent definition and there must be some sort of validation to make sure it is correct.
- Process Consistency - your processes have to be consistent too so everyone works off the same thing.
Thursday, 26 April 2018
9 key mistakes organizations make when analyzing data by Larry Alton via @infomgmt
The accessibility and ubiquity of information has led to an increased number of amateur mistakes in analysis. Here are some of the most common, and how to overcome them.
I think this list needs to be bookmarked, printed out and more importantly referred to in order to try and check for all of these in order to improve the standard of your analytics.
I think this list needs to be bookmarked, printed out and more importantly referred to in order to try and check for all of these in order to improve the standard of your analytics.
Wednesday, 25 April 2018
Overcoming hidden data risks when managing third parties by Baan Alsinawi and Adriaen Morse via @infomgmt
Here are steps that will extend a risk management program to include outside vendors and reduce the likelihood of a breach due to factors outside an organization’s control.
I'd like to think that none of these are new or surprises but recent breaches and legislation (like GDPR) turn a much higher focus on this kind of thing. It's almost a master list for anything that is put out to an external organisation to complete for you.
I'd like to think that none of these are new or surprises but recent breaches and legislation (like GDPR) turn a much higher focus on this kind of thing. It's almost a master list for anything that is put out to an external organisation to complete for you.
Tuesday, 24 April 2018
Understanding the relationship between agile software development and Kanban by Anthony Coggine via @infomgmt
Kanban is not a paradigm in and of itself, but a way to manage projects, tasks and team planning. Projects are done just in time, on a rolling basis.
I'm going to repeat the very last paragraph in this article because it is so important and so clear - "Constant communication, assessment, and reorganization is necessary to complete high quality software projects. Increasing transparency, visibility, and cooperation among team members is the best way to facilitate software development."
I really think we should all print that section out in a large font and put it up where we work in order to make sure we remember it and try to stick to it.
I'm going to repeat the very last paragraph in this article because it is so important and so clear - "Constant communication, assessment, and reorganization is necessary to complete high quality software projects. Increasing transparency, visibility, and cooperation among team members is the best way to facilitate software development."
I really think we should all print that section out in a large font and put it up where we work in order to make sure we remember it and try to stick to it.
Monday, 23 April 2018
3 steps to help ensure your digital transformation efforts succeed by Justin Rodenbostel via @infomgmt
These projects are risky and 90 percent fail. But the payoff could be market-leading products and services that beat out a close competitor or prevent potential disruptors from stealing market share.
These three are the absolute minimum requirements to try any kind of digital transformation. I think there is a lot of pre work that should be done too. I think money is crucial to tie down first as far too many projects like this end in a spiral as the cost goes up and up - financial and therefore results must be locked and controlled with a rod of iron to avoid a huge failure.
These three are the absolute minimum requirements to try any kind of digital transformation. I think there is a lot of pre work that should be done too. I think money is crucial to tie down first as far too many projects like this end in a spiral as the cost goes up and up - financial and therefore results must be locked and controlled with a rod of iron to avoid a huge failure.
Friday, 20 April 2018
WEBINAR: How data catalogs can drive real results with business intelligence and analytics - 26 April 2018
Web Seminar How data catalogs can drive real results with business intelligence and analytics
Apr. 26, 2018 | 2 PM ET/11 AM PT
Hosted by Information Management
Hosted by Information Management
Across all industries, and in organizations large and small, everyone wants to do more with data.
More specifically, everyone wants to use data to get closer to customers, to create new efficiencies,
to drive smart business decisions, and to boost the bottom line. But that’s a tall order when
organizations continue to create and collect virtual mountains of information. The challenge is to
know which data has real value. But too often discussions about data and its value between
information technology professionals and business stakeholders turns overly technical.
Business users simply don’t understand what data they possess and where to find it.
This webinar will look at how data catalogs can ease that process and present data
assets in ways that everyone in the organization can understand and take advantage of.
George Yuhasz Director, US Business Intelligence & Data Services Keystone Foods (Speaker) | Jay Zaidi Managing Partner AlyData (Speaker) | David Weldon Editor-In-Chief Information Management (Moderator) |
Sponsored By:
Register here
If Your Data Is Bad, Your Machine Learning Tools Are Useless by Thomas C. Redman via @HarvardBiz
Poor data quality is enemy number one to the widespread, profitable use of machine learning. While the caustic observation, “garbage-in, garbage-out” has plagued analytics and decision-making for generations, it carries a special warning for machine learning.
Very good advice in this article that I think should be bookmarked or notes should be taken because you really need to be following these steps if you want to be successful with machine learning.
Very good advice in this article that I think should be bookmarked or notes should be taken because you really need to be following these steps if you want to be successful with machine learning.
Thursday, 19 April 2018
Choose the right AI method for the job by Stephan Jou via @VentureBeat
It’s hard to remember the days when artificial intelligence seemed like an intangible, futuristic concept. Today, AI is everywhere. This has been decades in the making, however, and the past 90 years have seen both renaissances and winters for the field of study.
Some great comments from Stephan.
Some great comments from Stephan.
Wednesday, 18 April 2018
6 ways to attain top benefits from artificial intelligence & machine learning by Maxim Lukichev via @infomgmt
It can seem overwhelming to choose the right implementation approaches to these hot technologies. Here are six effective ways to attain quantifiable results from AI and ML.
I definitely agree with the 6 ways he has suggested. I would add a few of my own to his list.
7 Check the data definitions are consistent. For example I have in the past worked at an organisation that called the same data different names in different systems, used different formats for that common data item, or even different values for the same thing in different systems. You need to sort out the MDM of the data first so you always compare apples to apples and pears to pears.
8 Handle the same fields, same data, etc in different physical formats - for example is it text, number, decimal, number. What happens when you convert to a common format - does it change the value or format.
I definitely agree with the 6 ways he has suggested. I would add a few of my own to his list.
7 Check the data definitions are consistent. For example I have in the past worked at an organisation that called the same data different names in different systems, used different formats for that common data item, or even different values for the same thing in different systems. You need to sort out the MDM of the data first so you always compare apples to apples and pears to pears.
8 Handle the same fields, same data, etc in different physical formats - for example is it text, number, decimal, number. What happens when you convert to a common format - does it change the value or format.
Tuesday, 17 April 2018
Graph databases and machine learning will revolutionise MDM strategies by Aaron Zornes via @infomgmt
These technologies will become widely adopted in 2018 and 2019, and will augment master data management and data governance to provide increased agility and scalability.
Anything that enhances and improves the understanding of data gets my vote. It is crucial to the production of correct results in any system or reporting that the data and relationships between that data are understood so that any results can be trusted 100%
Anything that enhances and improves the understanding of data gets my vote. It is crucial to the production of correct results in any system or reporting that the data and relationships between that data are understood so that any results can be trusted 100%
Monday, 16 April 2018
WEBINAR: Build for the Future of AI and Machine Learning - 19 April 2018
|
Data scientists that produce data-driven products rule the market by Adam Keene via @infomgmt
These professionals are core to the success of software companies, and this role can quickly lead to leadership opportunities and top salary potential.
This gives you some ideas on what is possible in the Data Scientist job area. I would sound a huge caution on some of this - there are many people doing these kinds of role that do not and are not likely to command the salaries suggested in this article. The best salaries go to those that are in the rock star level of data scientist.
This gives you some ideas on what is possible in the Data Scientist job area. I would sound a huge caution on some of this - there are many people doing these kinds of role that do not and are not likely to command the salaries suggested in this article. The best salaries go to those that are in the rock star level of data scientist.
Sunday, 15 April 2018
Tired of Social Network Platforms that are affected by bots and trolls? There is an alternative
Tired of Social Network Platforms that are affected by bots and trolls? There is an alternative
We've all seen the news headlines and reports about mischievous or hostile nations using trolls, bots or data analytics to influence opinions or even elections in other countries and despaired. I'm sure like myself many of you will have thought about recent elections and wondered if the result was correct or even how we got there. Examples are the US Presidential election in 2017 or the UK In or Out of the European Union vote in 2017. No matter the colour of your political beliefs do we really believe all elections have been correct even if we agreed with the result?But there is an alternative. Hacktivist The Jester has set up a completely new Social Network Platform that blocks contact from any IP space originating in Russia, China, North Korea, Iran, Pakistan or Syria, along with a list of over 100,000 VPN and proxy services. Those abilities combined ensure that the platform is safe and free from any other those negative influences. It is also an environment where discussion is encouraged and all points of view are respected. New users are welcomed and given suggested hashtags to use to find help (although there will always be someone around to help if you are lost). It is such a safe haven that many users close their other accounts and focus on this new one - even committing to help towards the cost via Patreon or Bitcoin.
So if you want to try something new that encourages respect for the individual and different opinions come and join the discussion on Counter Social here.
Find out more:
Counter Social page
The Jester Wikipedia page
JΞSŦΞR ✪ ΔCŦUΔL³³Âº¹ on Twitter
Friday, 13 April 2018
WEBINAR: Minimizing Model Risk with Automated Data Preparation & Machine Learning - 19 April 2018
|
|
To protect artificial intelligence from attacks, show it fake data by Jackie Snow via @techreview
Google Brain’s Ian Goodfellow explains how AI systems defend themselves, onstage at EmTech Digital.
A good point that deserves serious thought and consideration.
A good point that deserves serious thought and consideration.
Thursday, 12 April 2018
Who Needs A Data Model Anyway? by @BarryDevlin via @TDWI
Will AI eliminate the need for data models?
I come from a data modelling background so I'm biased but I still think there is a home for a data model. An Enterprise Data Model would be very useful as a tool to help non-technical people understand the business and how the data within it relates.
I come from a data modelling background so I'm biased but I still think there is a home for a data model. An Enterprise Data Model would be very useful as a tool to help non-technical people understand the business and how the data within it relates.
Wednesday, 11 April 2018
The rise of blockchain and blockchain-as-a-service by Antonis Papatsaras via @InfoMgmt
The technology creates the perfect conditions for organisations to provide vendors, customers and employees with visibility into their operations.
The potential of this technology is huge but we also need to be really careful not to get blinded by the hype surrounding this technology. You need to both understand it and test it properly so you have clear benefits before implementing.
The potential of this technology is huge but we also need to be really careful not to get blinded by the hype surrounding this technology. You need to both understand it and test it properly so you have clear benefits before implementing.
Tuesday, 10 April 2018
How artificial intelligence and machine learning can revolutionise ecommerce by Bud Goswami via @InfoMgmt
With the latest advancements in AI, retailers are beginning to really hone in on creating a custom, brand-focused experience for each visitor.
I have to agree with Bud - I think both AI and ML are going to make major differences for all retailers and especially those with large eCommerce bases.
I have to agree with Bud - I think both AI and ML are going to make major differences for all retailers and especially those with large eCommerce bases.
Monday, 9 April 2018
Thanks to Facebook, expect GDPR to spread beyond the EU by Lisa Loftis via @InfoMgmt
The strong and immediate reaction to this data misuse incident should serve as a warning shot for all companies collecting and using consumer personal data.
I agree with this article - the GDPR legislation doesn't seem that difficult now if you want to preserve your reputation and survive going forward. This Facebook disaster should serve as a lesson to other companies as to what could happen to them if they are not careful.
I agree with this article - the GDPR legislation doesn't seem that difficult now if you want to preserve your reputation and survive going forward. This Facebook disaster should serve as a lesson to other companies as to what could happen to them if they are not careful.
Sunday, 8 April 2018
SLIDESHOW: 5 technologies that will reshape business and society by David Weldon via @InfoMgmt
Within the next five years, blockchain, AI, lattice cryptography and quantum computing will revolutionise software development and its impact, says IBM Research’s global labs.
There are some really exciting developments in this slideshow - I'm particularly looking forward to lattice cryptography.
There are some really exciting developments in this slideshow - I'm particularly looking forward to lattice cryptography.
Friday, 6 April 2018
Learning AI if You Suck at Math by @Dan_Jeffries1 via @hackernoon
"Maybe you'd love to dig deeper and get an image recognition program running in TensorFlow or Theano? Perhaps you're a kick-ass developer or systems architect and you know computers incredibly well but there's just one little problem: You suck at math."
Good suggestions. There are also maths courses on Coursera and other MOOCs. Of course many tools have functions that you can use to help you get over the maths problem too.
Good suggestions. There are also maths courses on Coursera and other MOOCs. Of course many tools have functions that you can use to help you get over the maths problem too.
Thursday, 5 April 2018
Why AI Cannot Survive Without Big Data by Philip Piletic via @SmartDataCo
Data scientists are struggling to create structure out of the jumble of big data out there – structure that is essential for AI to function properly.
Yes AI needs lots of data, but it also needs to have some kind of known structure to that data as well as an understanding of the meaning of that data.
Yes AI needs lots of data, but it also needs to have some kind of known structure to that data as well as an understanding of the meaning of that data.
Wednesday, 4 April 2018
8 Common Pitfalls That Can Ruin Your Prediction by Norbert Obsuszt by @kdnuggets
A good prediction can help your work and make it easier. But how can you be sure that your prediction is good? Here are some common pitfalls that you should avoid.
This is good and deserves a bookmark so you can refer back to it.
This is good and deserves a bookmark so you can refer back to it.
Tuesday, 3 April 2018
Will GDPR Make Machine Learning Illegal? by Gregory Piatetsky via @kdnuggets
Does GDPR require Machine Learning algorithms to explain their output? Probably not, but experts disagree and there is enough ambiguity to keep lawyers busy.
It sounds like it could be a job creation scheme for lawyers and legislators in the future. I think it might also mean that organisations need to change their privacy policies in order to cover the use of data in ML and get customers to agree to it in order to protect the use going forward.
It sounds like it could be a job creation scheme for lawyers and legislators in the future. I think it might also mean that organisations need to change their privacy policies in order to cover the use of data in ML and get customers to agree to it in order to protect the use going forward.
Subscribe to:
Posts (Atom)