| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
This is a blog containing data related news and information that I find interesting or relevant. Links are given to original sites containing source information for which I can take no responsibility. Any opinion expressed is my own.
Showing posts with label UNSTRUCTURED DATA. Show all posts
Showing posts with label UNSTRUCTURED DATA. Show all posts
Tuesday, 26 July 2022
WEBINAR: Extract Data from PDFs at Scale - 4 August 2022
Tuesday, 7 September 2021
WEBINAR: AI vs unstructured data: Best practices for scaling video AI - 15 September 2021
|
Sunday, 29 April 2018
Understanding Feature Engineering - 4 part article by Dipanjan Sarkar via @TDataScience
Great 4 part series that you really need to set some time aside so you can sit and read these:
1 - Strategies for working with continuous, numerical data
2 - Strategies for working with discrete, categorical data
3 - Traditional strategies for taming unstructured, textual data
4 - Newer, advanced strategies for taming unstructured, textual data
1 - Strategies for working with continuous, numerical data
2 - Strategies for working with discrete, categorical data
3 - Traditional strategies for taming unstructured, textual data
4 - Newer, advanced strategies for taming unstructured, textual data
Sunday, 28 January 2018
Unstructured content: An untapped fuel source for AI and machine learning by Alex Welsh via @sdtimes
Advancements in AI (Artificial Intelligence) and machine learning now make it possible and affordable to sift through and find meaning in vast amounts of unstructured data obtained from video and audio files, emails, logs, social media posts and even notifications from Internet of Things (IoT) devices.
Nice to know that there is hope and something useful for unstructured data.
Nice to know that there is hope and something useful for unstructured data.
Saturday, 25 November 2017
Dirty Data Is OK, How You Cleanse It Matters by Chirag Shivalker via @DZone
It has been an unsolved mystery for companies if they should get their data cleansed first to opt for data analytics or if they should opt for data analytics to conclude whether their data is dirty.
There are some really good points in this article. I cannot emphasise enough the single source of truth point. We must all have worked for organisations where department A's figures don't match department B's. You cannot run an organisation if the numbers in your reporting don't match, and even worse you have no idea why they don't match. You need data management, agreed definitions for data, and just the one source of the truth across the entire company.
There are some really good points in this article. I cannot emphasise enough the single source of truth point. We must all have worked for organisations where department A's figures don't match department B's. You cannot run an organisation if the numbers in your reporting don't match, and even worse you have no idea why they don't match. You need data management, agreed definitions for data, and just the one source of the truth across the entire company.
Subscribe to:
Posts (Atom)