This is a blog containing data related news and information that I find interesting or relevant. Links are given to original sites containing source information for which I can take no responsibility. Any opinion expressed is my own.
Wednesday, 31 August 2016
Manulife Continues Exploration of AI in Innovation Lab by Danni Santana via @infomgmt
Indico’s platform will enable the Canadian insurer to evaluate data from news articles and analyst reports and recommend investment decisions to portfolio managers.
A Recipe for Cooking with the Hadoop Ecosystem by David Menninger via @infomgmt
The open source model has had a major impact on the big data market, yet in some ways, the open source approach has succeeded despite its shortcomings.
Open source is definitely here to stay.
Open source is definitely here to stay.
Tuesday, 30 August 2016
The Biggest Barrier for Companies That Want to Leverage Their Own Data by Simon Owens via @infomgmt
When leveraged effectively, the data that companies have collected can allow them to gain customer insights, increase efficiency, and reduce costs.
I agree - mindset is a major barrier that is not easy to overcome.
I agree - mindset is a major barrier that is not easy to overcome.
Personalizing travel search: How data leads to happier travels by @MargaretAdy via BigDataMadeSimple
An overdue evolution is happening in travel search, especially in the hotel sector. Doesn’t searching for a “hotel in downtown Miami” seem archaic when the travel industry has so much more information to work with?
Surely the more detailed information with the more dimensions the easier it is to tailor the search.
Surely the more detailed information with the more dimensions the easier it is to tailor the search.
Monday, 29 August 2016
Machine learning becomes mainstream: how to increase your competitive advantage by @Ronald_vanLoon via BigDataMadeSimple
First there was big data – extremely large data sets that made it possible to use data analytics to reveal patterns and trends, allowing businesses to improve customer relations and production efficiency.
Interesting and definitely right - machine learning is pretty much everywhere.
Interesting and definitely right - machine learning is pretty much everywhere.
Facebook's New Open Source Software Can Learn 1 Billion Words in 10 Minutes by @wheresKR via @inc
FastText is the tool that powers Facebook's AI--and now it's available for anyone.
Sounds worth accessing and having a bit of a play.
Sounds worth accessing and having a bit of a play.
Sunday, 28 August 2016
Data Partitioning in Big Data Application with Apache Hive by Vijay Aegis via CodeInnovationsBlog
Big data consulting company professionals are introducing the concept of partitioning in big data application. You need to read the post completely to understand how to do partitioning in such app using Apache Hive. If you don’t know how to do it, experts will help.
Useful blog.
Useful blog.
Blockchain’s Backers Gather to Push Governance for Technology by Olga Kharif and Peter Coy via @infomgmt
Now that companies in finance and other industries are looking to adopt blockchain -- essentially a new kind of a database for recording transactions -- more standards-setting organisation is needed.
Definitely a standard is needed especially because the finance industry is adopting it.
Definitely a standard is needed especially because the finance industry is adopting it.
Saturday, 27 August 2016
SLIDESHOW: The Top Companies for Metadata Management by David Weldon via @infomgmt
Gartner has released its Magic Quadrant for Metadata Management Solutions. The report looks at nine players in the metadata product space, including leaders, visionaries, challengers, and niche players. Here’s a look at who made the quadrant, and why.
Interesting to see who they think are the visionaries and challengers.
Interesting to see who they think are the visionaries and challengers.
Man Who Introduced Millions to Bitcoin Says Blockchain Is a Bust by Matthew Leising via @infomgmt
Industries from finance to healthcare to utilities are working with blockchains to radically change how payments are tracked, securities and derivatives trades are processed, and health records are stored,
Very interesting considering how many people and implementing it.
Very interesting considering how many people and implementing it.
Friday, 26 August 2016
How Campaigns and Companies Use Data to Win the Race by Amir Orad via @Data_Informed
Sisense CEO Amir Orad discusses how political campaigns are leveraging data analytics to target individual voters and guide their advertising spend, how campaigns’ data challenges mirror those of enterprises, and how the analytics efforts of current candidates compare.
Definitely food for thought.
Definitely food for thought.
Data without Context is No Data At All by Rajan Chandras via @Data_Informed
Reference data provides meaning to data and the information that enables data integration, analytics, and governance, as well as applications such as machine learning and natural language processing. But the market lacks a focus on skills and solutions for governing this data.
I agree - if there is no context data is just data.
I agree - if there is no context data is just data.
Thursday, 25 August 2016
WEBINAR: Combining IBM SPSS Statistics and R for competitive advantage - 1 September 2016
Overview
Title: Combining IBM SPSS Statistics and R for competitive advantage
Date: Thursday, September 01, 2016
Time: 09:00 AM Pacific Daylight Time
Duration: 1 hour
Summary
Combining IBM SPSS Statistics and R for competitive advantage
In today’s world, the data is flowing from all directions: social media, phones, weather, location and sensor equipped devices, and more. Competing in this digital age requires the ability to analyse all of this data, and use it to drive decisions that mitigate risk, increase customer satisfaction and grow revenue. Using a combination of proprietary software and open source technology can give your data scientists and statisticians the analytical power they need to find and act on insights quickly.
IBM® SPSS® Statistics provides all of the data analysis tools you need, and integrates with thousands of R extensions for maximum power and flexibility. In this next Data Science Central Webinar event, we will show how SPSS Statistics can help you keep up with the influx of new data and make faster, better business decisions without coding.
Speakers:
Murali Prakash, Portfolio Manager -- IBM Analytics
Alex Oftelie, Subject Matter Expert -- IBM Analytics
Murali Prakash, Portfolio Manager -- IBM Analytics
Alex Oftelie, Subject Matter Expert -- IBM Analytics
Hosted by:
Bill Vorhies, Editorial Director -- Data Science Central
Register here
Cloud computing as a utility is going mainstream by Bob O'Donnell via @Recode
The idea is to leverage power, storage space and fast network connection pipes to deliver computing much like power or electricity.
Interesting vision of the future.
Interesting vision of the future.
Wednesday, 24 August 2016
A CIO writes (sort of): I’m terrified of cloud lock-in - what should I do? by Mat Keep via @cloud_comp_news
I’m in a three-year relationship which I don’t think I can get out of. It started off great, but now I’m getting bled dry and I don’t have the freedom I thought I would have. What should I do?
I'm sure this is an increasing problem unless we are careful.
I'm sure this is an increasing problem unless we are careful.
Understanding Bias: A Pre-requisite For Trustworthy Results by @akelleh via Medium
Algorithms are never neutral. They routinely reflect (often unintentionally) the perceptions of their creators. In this post, Adam Kelleher explores what causes bias and how to correct it.
I really enjoyed reading this and it is worth reading and remembering.
I really enjoyed reading this and it is worth reading and remembering.
Tuesday, 23 August 2016
WEBINAR: Modern infrastructure with NoSQL and containers - 31 August 2016
Modern infrastructure with NoSQL and containers
DATE: Wednesday, August 31, 2016
TIME: 1:00PM EDT | 10:00AM PDT
The best way to deploy and scale a NoSQL database is with elastic infrastructure, with containers being one of the most efficient platforms – deployed on-premise or in the cloud. In this webinar, we’ll discuss how development and operations teams can benefit from running NoSQL databases on containers, as well as the options for doing so.
We will discuss how running NoSQL on containers:
- Improves business agility
- Increases developer productivity
- Simplifies deployment and scaling
And, we will highlight options for running a NoSQL database with:
- Docker Swarm, Docker Machine, and Docker Compose
- Docker on Kubernetes or Apache Mesos (DC/OS)
- Docker on AWS, Azure, Google Cloud
FEATURED SPEAKER:
Shane K Johnson
Manager of Product Marketing, Couchbase
Register here
Bring Your Dark Data into the Light by Mika Javanainen via @Data_Informed
These days, people are accustomed to easily finding the information they need. They simply type keywords into a search engine and access material from any corner of the digital universe.
Interesting.
Interesting.
How Artificial Intelligence Can Improve Automated Customer Care by Tara Kelly via @Data_Informed
There are notable exceptions but, in general, people are dissatisfied with the level of customer service they receive today. The American Customer Satisfaction Index shows that customer-satisfaction rates have been trending downward for eight consecutive quarters.
Anything that improves customer satisfaction is worth trying.
Anything that improves customer satisfaction is worth trying.
Monday, 22 August 2016
Data Science for Beginners: Fantastic Introductory Video Series from Microsoft by Brandon Rohrer via @kdnuggets
Recent breakthroughs in deep learning by Yann LeCun via @Quora
Yann LeCun, director of AI research at Facebook, recently had an interesting Q&A session on Quora. (Spoiler alert, AI is not an existential threat to humanity, unless we are very stupid.)
Sunday, 21 August 2016
Improving operations using data analytics by Parviz Deyhim and Arti Garg via @OReillyMedia
How combining data and applying time-series techniques can provide insights into a company’s operational strengths and weaknesses.
Great article with good examples to help with understanding. Much easier as they describe - I remember doing this exercise manually in Excel.
Great article with good examples to help with understanding. Much easier as they describe - I remember doing this exercise manually in Excel.
How to become a data driven business by @DQMartinDoyle via BigDataMadeSimple
A data driven business utilises data to inform every business decision they make. By analysing relevant data and evaluating it they are able to form a conclusion and predict trends. Data-driven businesses ensure their company culture evolves to encourage innovation and agility.
Interesting. I think it is very difficult to become a data driven business as there are all those legacy systems with data data all over the place/
Interesting. I think it is very difficult to become a data driven business as there are all those legacy systems with data data all over the place/
Saturday, 20 August 2016
Why Data and Analytics Aren’t Enough to Change Healthcare by Rich Krueger via @infomgmt
Now that hospitals have updated to their first and even second generation electronic health records, the data exists for a similar revolution in healthcare administration. All that remains is the will to change.
I could do some analytics on almost anything, but that doesn't mean to say it has a point or is useful.
I could do some analytics on almost anything, but that doesn't mean to say it has a point or is useful.
Understanding the Empirical Law of Large Numbers and the Gambler’s Fallacy by Mehmet Suzen via @kdnuggets
Law of large numbers is a important concept for practising data scientists. In this post, The empirical law of large numbers is demonstrated via simple simulation approach using the Bernoulli process.
Nice explanation
Nice explanation
Friday, 19 August 2016
WEBINAR: Enterprise Security: Deploying Defense-in-Depth - 25 August 2016
Complimentary Web Seminar
August 25, 2016
2 PM ET/11 AM PT
Hosted by Information Management
As enterprises become more dependent on mobile workers, the cloud and the Internet-of-Things, they need new tools and techniques to make sure their information resources are secure.
One approach is to implement a defense-in-depth plan, one that covers:
perimeter security protection
network access and activity monitoring
threat identification and isolation
vulnerability management and remediation
and more.
Featured Presenters:
Moderator:
David Weldon
Editor-in-Chief
Information Management
Sponsored By:
Register here
3 Big Data Housekeeping Measures You Can No Longer Overlook by Dan Potter via @Data_Informed
As big data analytics matures, measures once considered a speed bump to analysts' progress must be taken to ensure the success of big data initiatives, writes Dan Potter of Datawatch.
These things SHOULD be important no matter what.
These things SHOULD be important no matter what.
Five data science projects to learn data science by Kunal Jain via @AnalyticsVidhya
Tune up your system, refresh your skills - here we're providing you with five datasets, available for free over internet, to help you enhance your skills, improve on strategies and working on new techniques. Start practising here:
No one has an excuse not to use these.
No one has an excuse not to use these.
Thursday, 18 August 2016
WEBINAR: From Disparate Data to Tell-all Dashboards at Talkdesk -24 August 2016
Webinar Event Details
Date: August 24, 2016
Time: Noon ET/ 9:00 am PT
Duration: 60 minutes (including Q&A)
Time: Noon ET/ 9:00 am PT
Duration: 60 minutes (including Q&A)
What You'll Learn
Inefficient data pulls to update stale Excel spreadsheets. Manual SQL queries against your production database. Data locked away in apps with unusable analytics interfaces. When you don’t have a solution for data, you fall back on slow, suboptimal tactics for accessing data, denying business users and decision makers access to the most accurate, up to date, and relevant information.
Attend this webinar August 24 to hear how Talkdesk went from a chaotic world like the one above to being a company where people can easily explore data and quickly get a 360° view of every account. Using FiveTran and Looker, Talkdesk’s product manager was able to tie all their now-centralized data together to create a powerful and stable environment for data discovery. In this webinar, you will learn: • What a centralized data store can do for you and how you can achieve that • Why you should be transforming your data at the time of query, not before you load it into your database • How one man created a company-wide data platform in one month Join us to understand how to take control of your data and use it to drive results. |
Presenters
Taylor Brown is one of the Cofounders at Fivetran where he oversees marketing, sales, and partnerships. He strives to help companies access and analyze their data, while also driving to create a data driven culture at Fivetran. When he is not focused on helping customers or optimizing marketing efforts, he can be found playing soccer, or cycling around Marin.
Ernest Wong is a Product Manager by day and a Data Nerd by night - but sometimes he loses track of what time it is. When he's not prioritizing a roadmap or crunching some data, he is probably getting eaten alive by mosquitos on a hike, dining somewhere woefully overpriced, or sleeping in like a boss.
Nouras Haddad is the Director of Technology Alliances at Looker. He helps Looker offer complete analytics solutions to our customers by partnering with best of breed technologies in the big data ecosystem. Originally from Croatia, Nouras received his B.A. in Economics from the Zagreb School of Economics and Management and MBA from The Wharton School at the University of Pennsylvania.
|
Register here
WEBINAR: The Inside Scoop on Apache Sqoop - 25 August 2016
Apache Sqoop is the standard tool for loading and exporting data between Hadoop and traditional data stores such as relational databases or SaaS applications, through a standard JDBC interface. Sqoop serves as the data access layer for the Hadoop ecosystem to connect external structured data. What could go wrong connecting a massively distributed data processing system with your core business data?
Join our JDBC experts, Idaliz Baez from Progress and Alex Silva from Pluralsight, to learn about the best practices for using Sqoop and interoperability with JDBC data sources from relational to cloud.
Title: The Inside Scoop on Apache Sqoop
Date: August 25, 2016
Time: 11:00 am ET
Date: August 25, 2016
Time: 11:00 am ET
Speakers:
Idaliz Baez, Progress DataDirect
Alex Silva, Pluralsight
Idaliz Baez, Progress DataDirect
Alex Silva, Pluralsight
During this webinar, you’ll:
- Get an introduction to Apache Sqoop
- Learn which data sources are accessible via JDBC for Sqoop
- Receive the latest best practices and lessons from the field
Get in and get the Sqoop!
Design Better Data Tables by @ilikescience via Medium
We’ve all seen poor visual design of tables: left-aligned numbers? Tons of useless formatting? There’s a lot that goes into making tabular data easy to consume, and with all the attention that goes into data viz today, the UI of tabular data often gets overlooked.
Really nice post which points out many things which should be obvious but are not.
Really nice post which points out many things which should be obvious but are not.
Asymmetric Information Is Economists' Little Secret by Noah Smith via @infomgmt
When today we debate issues like financial regulation or high frequency trading, it helps to think about financial markets as being driven by differences in how much people know.
Interesting.
Interesting.
Wednesday, 17 August 2016
The Emerging Data Design: Bitemporal Data by Mike Lapenna via @infomgmt
With Microsoft joining the club, we now have Oracle, IBM (DB2), Teradata and Microsoft supporting some portion or all of the bitemporal design.
Good to see it become more mainstream - I've been adding the field to support this in data warehouses for years, but to use it properly is clunky so more support for this has got to be a good thing.
Good to see it become more mainstream - I've been adding the field to support this in data warehouses for years, but to use it properly is clunky so more support for this has got to be a good thing.
SLIDESHOW: 15 Top Paying US Job Markets For Data Scientists by David Weldon via @infomgmt
Where are the top paying job markets for data scientists? That depends on the cost to live in a particular city or region. Job site Indeed has just released its report “Where are the Highest Paying Tech Jobs in the US?” which looks at best salary offers adjusted for the cost of living. Here are the top 15.
Tuesday, 16 August 2016
WEBINAR: Supersize Your Career with Big Data - 24 August 2016
Supersize Your Career with Big Data
August 24, 2016 | 10-11 am PTYou’ve read the headlines and seen the quotes: “Big Data is Creating Big Career Opportunities” (CNBC), “Big data is becoming an effective basis of competition in pretty much every industry” (Michael Chui, McKinsey Global Institute), “It’s super hard to find the right talent” (Sherry Shah, LinkedIn head of data recruiting). Now learn how you can expand your technical skills and horizons by taking advantage of the online data science courses, certificates, and degrees offered via the Stanford Center for Professional Development. In this webinar Dr. Larry Lagerstrom will outline the variety of big data programs offered by Stanford to working professionals and answer common questions. Registration for Autumn classes is open, so don’t miss out on this opportunity to find the program that’s right for you. Explore more topics with Stanford faculty on how vast amounts of data can be harnessed to make a positive impact on business and society.
About the Speaker
Larry Lagerstrom is the Acting Director of Academic Programs for the Stanford Center for Professional Development, where he oversees eleven Master’s degree and twenty-eight graduate certificate programs in engineering and related fields. He also conducts research applying data science methods to the study of online learning and education. Before coming to Stanford he taught computer science and engineering at U.C. Berkeley and U.C. Davis for sixteen years. His degrees include a Ph.D. in history of science and an M.A. in physics from U.C. Berkeley.
Presented By
Online Stanford Data Science Programs certificate programs
Questions?
Please contact us at scpd-customerservice@stanford.edu or 650-204-3984
Register here
How Does Big Data Analytics Help in Decision Making by @farooqSL via @Datafloq
Staying ahead in the game is paramount for any business organisation to survive in this competitive world. The future poses challenges that need tackling in the present. Every decision made today has a significant impact on the future of that organisation. The rate at which a company responds to challenges in the present and the future is what determines their rate of success. Data Science and Big Data analytics can help organisations in decision making and drive the company to a realistic future.
It seems to say on top these days you HAVE to use analytics.
It seems to say on top these days you HAVE to use analytics.
With Prevention Efforts Failing, IT Security Focus Turns to Reaction by David Weldon via @infomgmt
“Organisations are increasingly focusing on detection and response, because taking a prevention approach has not been successful in blocking malicious attacks,” noted analyst Elizabeth Kim.
I can understand that approach - if you can't beat them one way, try to take another approach and beat them another way.
I can understand that approach - if you can't beat them one way, try to take another approach and beat them another way.
Monday, 15 August 2016
Economics Without Math Is Trendy But Doesn't Add Up by Noah Smith via @infomgmt
The redefinition of the foundations of economics that is currently being done by economists who do not not agree with established beliefs or standards will inevitably result in many of the models beloved of academic economists becoming obsolete.
I don't see this as a bad thing - just progress - I'm sure what we see as established were new and established once.
I don't see this as a bad thing - just progress - I'm sure what we see as established were new and established once.
Growing Acceptance Will Drive Advanced Analytics Market Growth By 22% by Bob Violino via @infomgmt
The growth of the market is due in large part to the increased acceptance of data analytics, which helps eliminate the work involved in understanding customers and data tracking processes.
I think analytics are now seen as a must have tool in business.
I think analytics are now seen as a must have tool in business.
Sunday, 14 August 2016
Machine Learning as a Service: How Data Science Is Hitting the Masses by @ledambrosio via @HuffPostTech
The world of machine learning and predictive analytics is opening up to developers and companies of all sizes, with machine learning (ML) providers offering their products through a subscription-based model or open sourcing some of their technology.
Great blog outlining the concept, availability and players.
Great blog outlining the concept, availability and players.
More Companies Turn to Machine Learning to Leverage Analytics by Bob Violino via @infomgmt
ABI Research estimates machine learning-based data analytics tools and services revenues to hit nearly $20 billion in 2021 as machine-learning-as-a-service (MlaaS) models take off.
I can definitely see everything going that way.
I can definitely see everything going that way.
Saturday, 13 August 2016
Top Data Analytics Tools to Unleash the Potential of Big Data by R Bhargav via @simplilearn
There is a data revolution going on around the world and data analytics is the shiny new thing in the job market that has been alluring professionals. If you thought it’s just hyperbole, check out these statistics:
Germany enlists machine learning to boost renewables revolution by Quirin Schiermeier via @nature
Germany is in the top three nations for both wind and solar power, with renewables already providing a third of its power needs. But weather is erratic, and that's a challenge when trying to operate an efficient grid. In June, meteorologists, engineers, and utility firms began to test whether big data and machine learning can make these power sources more grid friendly.
Interesting use of machine learning.
Interesting use of machine learning.
Friday, 12 August 2016
Statistical Data Analysis in Python by Christopher Fonnesbeck via @kdnuggets
This tutorial will introduce the use of Python for statistical data analysis, using data stored as Pandas DataFrame objects, taking the form of a set of IPython notebooks.
Useful but it does contain links to courses that don't exist.
Useful but it does contain links to courses that don't exist.
Bayesian Machine Learning, Explained by Zygmunt ZajÄ…c via @kdnuggets
Want to know about Bayesian machine learning? Sure you do! Get a great introductory explanation here, as well as suggestions where to go for further study.
Interesting.
Interesting.
Thursday, 11 August 2016
Data and electric power by Sean Murphy via @oreillymedia
Utilities face a lot of interesting challenges that most people are completely unaware of. This report from O'Reilly explores how data science and cutting-edge tools are used to manage some of the most interesting of those challenges.
Interesting report.
Interesting report.
SLIDESHOW: 4 Key IT Sectors To See Major ‘Cloud Shift’ via @infomgmt
More than $111 billion in IT spending has shifted to the cloud and that amount will increase to $216 billion in four years, according to just-released report from Gartner.
Interesting.
Interesting.
Wednesday, 10 August 2016
WEBINAR: Data Analysis to Predict Voter Turnout and Outcome - 16 August 2016
Overview
Title: Data Analysis to Predict Voter Turnout and Outcome
Date: Tuesday, August 16, 2016
Time: 09:00 AM Pacific Daylight Time
Duration: 1 hour
Summary
Data Analysis to Predict Voter Turnout and Outcome
It goes without saying, we live in a very data-rich age. In the political arena, sophisticated analytic firms like Deep Root manage and analyze an ever-growing list of data sources to project voter turnout and predict vote choice. To do this, they must first access and acquire the data, and then build complex data blending and analysis workflows to turn a variety of unlinked data sources into a single, actionable database of information. Only then can they decide which voters to speak with, with what message and through which media.
Join us for our latest Data Science Central Webinar and learn how Alteryx, Amazon Web Services, and Deep Root Analytics work together to leverage numerous data sources to quickly deliver critical insights.
You will learn how to:
- Quickly blend and analyse data from all sources - cloud and local
- Apply predictive and geo-spatial analytics to big data
- Enable data analysts with the cloud computing power of Amazon Web Services
- Empower the organisation at large with analytic visualisations from Tableau
Speakers:
Raman Kaler, Alliance Manager -- Alteryx
Nick Tussing, Solutions Engineer -- Alteryx
Danielle Mendheim, Database Analyst -- Deep Root Analytics
Moselle Freitas, Sr. Partner Manager – Big Data ISV Segment -- Amazon Web Services
Raman Kaler, Alliance Manager -- Alteryx
Nick Tussing, Solutions Engineer -- Alteryx
Danielle Mendheim, Database Analyst -- Deep Root Analytics
Moselle Freitas, Sr. Partner Manager – Big Data ISV Segment -- Amazon Web Services
Hosted by:
Bill Vorhies, Editorial Director -- Data Science Central
Register here
8 Things Your Company Needs to Know About Cyber Security by @AndrewDeen14 via @Datafloq
Understanding cyber-crime is an essential part of protecting valuable data, particularly when this data involves important business or organisational affairs. Cyber hackers have a variety of motives for hacking networks and most often cyber hackers seek financial gain through bribes, identity fraud or credit card information. These attacks cost firms dearly as cyber criminals take sensitive information and sell in the deep web where crime is primarily hidden. Therefore, here are 8 things to know about cyber security.
Interesting points.
Interesting points.
R code for model-free, data-driven confidence intervals by Claudio Lucio do Val Lopes posted by Vincent Granville via @Analyticbridge
Great piece of code which I think many R users would find useful if they don't already have their own snippet.
Tuesday, 9 August 2016
Big Data Helps Courage Kenny Connect with Patients by Greg Gillespie via @infomgmt
The rehabilitation institute analyses data from multiple sources to better identify patients who may need help.
Interesting.
Interesting.
Apache Spark: The Future of Big Data Science? by Matthew Thomson via @infomgmt
Spark is different from the myriad other solutions because it allows data scientists to develop simple code to perform distributed computing.
Yes this is definitely more flexible so is more efficient.
Yes this is definitely more flexible so is more efficient.
Monday, 8 August 2016
Data Governance Interview - Nick Keen by Nicola Askham via @infomgmt
The DG lead at the Environment Agency talks about how the organisation looks after its data.
I find this really interesting.
I find this really interesting.
What the Privacy Shield Approval Really Means by Patrick Salyer via @Data_Informed
Compliance certification began Monday for Privacy Shield, the new framework for transferring the personal information of European Union citizens from the EU to the United States. Gigya CEO Patrick Salyer discusses what companies need to know, including the costs of noncompliance and who will be watching.
Sunday, 7 August 2016
7 Steps to Understanding NoSQL Databases by Matthew Mayo via @kdnuggets
Are you a newcomer to NoSQL, interested in gaining a real understanding of the technologies and architectures it includes? This post is for you.
This is incredibly useful and a great overview. Recommended.
This is incredibly useful and a great overview. Recommended.
Data Science Statistics 101 by Jean-Nicholas Hould via @kdnuggets
Statistics can often be the most intimidating aspect of data science for aspiring data scientists to learn. Gain some personal perspective from someone who has travelled the path.
Good advice. I did a few courses with Coursera to try to plug the gaps with my own knowledge.
Good advice. I did a few courses with Coursera to try to plug the gaps with my own knowledge.
Saturday, 6 August 2016
Why Big data Fuels Significant Change in the Real Estate Market by @davidglenn97 via @Datafloq
Big data fuels significant change in the real estate market. Multiple sites offer detailed real estate data to make informed decisions. Gone are the days when real estate brokers and agents had proprietary access to information the average buyer or seller could not easily find. Depending on your role within the real estate industry, there are definitely pros and cons associated with the growth of big data.
I can definitely see how big data can hep real estate just like it can help a bank or insurance company.
I can definitely see how big data can hep real estate just like it can help a bank or insurance company.
How to Build the Internal Reputation of Your Insight Team by @LaughlinPaul via @Datafloq
Why do some insight teams have a better internal reputation than others? Did some leaders just get lucky, with a great culture & receptive directors? If so, many did not get so lucky. Lack of internal influence, being neglected or treated as just a service function are common concerns raised with insight leaders. Do you know how to build awareness or manage PR for your ‘insight brand’?
Useful 6 points to help make this a success.
Useful 6 points to help make this a success.
Friday, 5 August 2016
Why The Internet of Things is Getting Real Now by @ricknotdelgado via @Datafloq
Few things have gotten as much hype as the Internet of Things (IoT). Some say it will be the biggest technological revolution since the rise of the internet itself. To be honest, the predictions aren’t that far off. If the IoT manages to live up to expectations, the impact it will have dwarfs anything that’s come before. But what would the Internet of Things mean for the individual?
They seem to be appearing everywhere. I have no issue with something that keeps track of my exercise but I'm a little uncomfortable with the concept of my fridge being accessible over the internet.
They seem to be appearing everywhere. I have no issue with something that keeps track of my exercise but I'm a little uncomfortable with the concept of my fridge being accessible over the internet.
Why Uber Engineering Switched from Postgres to MySQL by Evan Klitzke via @UberEng
Uber Engineering explains the technical reasoning behind its switch in database technologies, from Postgres to MySQL.
I loved this explanation and the level of detail behind it.
I loved this explanation and the level of detail behind it.
Thursday, 4 August 2016
6 ‘data’ buzzwords you need to understand by Katherine Noyes via @NetworkWorld
Take one major trend spanning the business and technology worlds, add countless vendors and consultants hoping to cash in, and what do you get? A whole lot of buzzwords with unclear definitions. In the world of big data, the surrounding hype has spawned a brand-new lingo.
Not heard of all of these myself.
Not heard of all of these myself.
4 Ways to Shrink the Gap between Data Integration and Insight by Yaniv Mor via @Data_Informed
According to a study conducted last year by my company, Xplenty, nearly one-third of business intelligence professionals say they spend between 50 and 90 percent of their time just cleaning raw data for analytics. As a result of valuable time and talent devoted to preparing data, businesses are often slow to unlock its insights or to act on them.
Good to be reminded of these.
Good to be reminded of these.
Wednesday, 3 August 2016
Google Buys Machine Learning Startup Moodstocks by @jessicadavis via @InformationWeek
Google has acquired Paris-based machine learning startup Moodstocks as part of its ongoing effort to improve visual recognition technology.
I know this is one of the harder ML elements to crack so maybe they can get further working together.
I know this is one of the harder ML elements to crack so maybe they can get further working together.
A New Take on Data Discovery, Data Management, and its Relationships by Jennifer Zaino via @Dataversity
Having herself held senior roles in IT at Wall Street companies including Deutsche Bank and Morgan Stanley Smith Barney, Oksana Sokolovsky is quite familiar with the challenge of Data Management and data discovery. As co-founder and CEO of ROKITT, her goal was “to build a product that solves that challenge,” she says.
I found this really interesting.
I found this really interesting.
Tuesday, 2 August 2016
M&A in the Predictive Marketing Space: eBay Acquires SalesPredict by Allison Snow via @infomgmt
This extension would likely offer the right items to the right individuals at the right price – brought to the attention of eBay shoppers by predictive-enabled, contextual and customised recommendations.
This is definitely the way to go to increase sales.
This is definitely the way to go to increase sales.
The Making of a Data Scientist by Sarah Lukens via @infomgmt
I was lucky enough to find my calling in numerical analytics and scientific computing, but how can we inspire an entire generation to track along career paths which emphasise quantitative reasoning.
Interesting and worth reading.
Interesting and worth reading.
Monday, 1 August 2016
WEBINAR: A Pragmatic Approach to Processing and Refining Big Data - 3 August 2016
A Pragmatic Approach to
Processing and Refining Big Data Wednesday, August 3, 2016 | 8 am PT/16:00 BST
Want to learn how to approach your Hadoop data processing and analytics projects without sacrificing governance and control?
In a big data world, business users need on-demand access to governed data sets on highly diverse sources, regardless of scale. By focusing on the right principles from both existing data warehousing approaches and emerging data lake use patterns, it is possible to drive automatic processing, refinement, and publishing of Hadoop data sets for immediate interactive analysis
Join this webinar to learn how:
Register here
In a big data world, business users need on-demand access to governed data sets on highly diverse sources, regardless of scale. By focusing on the right principles from both existing data warehousing approaches and emerging data lake use patterns, it is possible to drive automatic processing, refinement, and publishing of Hadoop data sets for immediate interactive analysis
Join this webinar to learn how:
- Enterprise data warehouse and data lake design patterns fit today's analytic landscape
- Organizations can approach Hadoop data processing and analytics without sacrificing governance and control
- Pentaho provides an approach to delivering refined on-demand data marts to end users in a big data environment
- Pentaho customer FINRA was able to leverage Pentaho's big data capabilities to rapidly accelerate fraud detection
Register here
Subscribe to:
Posts (Atom)