Monday 31 August 2020

Going Beyond Superficial: Data Science MOOCs with Substance by Matthew Mayo via @kdnuggets

 Data science MOOCs are superficial. At least, a lot of them are. What are your options when looking for something more substantive?

This points you at four Data Science courses that are available via MOOCs that look substantial and well worth doing as well as you can in order to gain good skills that will be useful to you going forward. As with all MOOCs, there are free paths as well as paid ones that provide you with a certificate and a capstone. I would recommend doing the paid-for option if you can as it will be cheaper than taking a proper course at a University.

Friday 28 August 2020

10 Must-Read Books for Software Engineers by@simonholdorf via @BttrProgramming

 Getting better as an engineer is as much about reading code as it is about writing it.

Simon is completely right - I have to admit to you that I read code a lot better than I can write it - but that is ok as it is by reading code that I can improve how I write code. It also means that I can read code in some languages that I know I could not write in - but as long as I can understand it and even communicate to those who actually can (and probably actually did write it) how to improve it by doing something different with the data, that I can contribute to the understanding of and manipulation of the data to provide insights and information.

Wednesday 26 August 2020

Should You Get A Degree In Business Analytics Or Data Science by/via @Analyticsindiam

Business analytics and data science are often used interchangeably but are different at many levels. While both are used in solving business problems and decision-making processes in organisations, both have different job descriptions and skills requirements.

Interesting career advice that could prove invaluable and save making a mistake on the way you choose between these two subjects.

Monday 24 August 2020

Containerization of PySpark Using Kubernetes by Ajaykumar Baljoshi via @sigmoidInc

 This article demonstrates the approach of how to use Spark on Kubernetes. It also includes a brief comparison between various cluster managers available for Spark.

I thought this was a really good article with a great level of detail. If you are interested in doing this in real life I recommend you read this first as there are code snippets and it will get you ahead of the curve.

Friday 21 August 2020

4 Simple Ways to Refactor Your Python Code by @jhsu98 via @Medium

 Learn new techniques and optimize your existing code.

There are lots of useful Python posts on Medium by Jonathan so I suggest you read a few of them as it might improve your code a great deal.

Wednesday 19 August 2020

The cloud skills shortage and the unemployed army of the certified by @jbesw via @Medium

 Why it’s so hard to find roles in cloud technology, while jobs go unfilled.

I found this really interesting and could be insightful for anyone who is struggling to get a job even though they did certifications in the attempt to be more qualified.  I have to agree, relevant experience is key. 

Monday 17 August 2020

Augmented Intelligence is the New Artificial Intelligence by Priya Dialani via @analyticsinme

Computers are getting more clever and increasingly inventive, offering terrific prospects to improve the human condition. There’s a call to rethink Artificial Intelligence as Augmented Intelligence, to stress the capability of people working with AI instead of being replaced by AI. 

I found this very interesting and well worth a read and think about it.

Friday 14 August 2020

Why You Should Get Google’s New Machine Learning Certificate by Frederik Bussler via @kdnuggets

 Google is offering a new ML Engineer certificate, geared towards professionals who want to display their competency in topics like distributed model training and scaling to production. Is it worth it?

This was really interesting and definitely made the case as to when you should or should not take the certification exam. I like the idea that you might be better off getting lots of experience and publish some results to get known for your skill and knowledge in machine learning instead.

Wednesday 12 August 2020

Monday 10 August 2020

How to Understand COVID-19 Numbers by @CarolineYLChen via @propublica

Between case counts, deaths, hospitalizations, positivity rates, rolling averages, etc, it's hard to gauge how the pandemic is progressing. Here's a detailed guide to the numbers and, ultimately, insights into what's worth paying attention to.

I found this incredibly interesting both from a person POV to make sure I understood all the numbers politicians throw around and from a data science perspective because it makes you think a bit more about if you need to explain all the numbers and statistics at people to make sure they really understand what you are giving them.

Friday 7 August 2020

How Much Math do you need in Data Science? by Benjamin Obi Tayo via @kdnuggets

There exist so many great computational tools available for Data Scientists to perform their work. However, mathematical skills are still essential in data science and machine learning because these tools will only be black-boxes for which you will not be able to ask core analytical questions without a theoretical foundation. #DataScienceHub

My advice is to do a series of mathematical and statistics courses online - many are completely free especially via MOOCs - and bring your skills up to scratch. I certainly had to do that as my own skills were not good enough (and probably still aren't if I am honest with myself). 

Wednesday 5 August 2020

Labelling Data Using Snorkel by Alister D’Costa and others at NLP4H via @kdnuggets

In this tutorial, we walk through the process of using Snorkel to generate labels for an unlabelled dataset. We will provide you with examples of basic Snorkel components by guiding you through a real clinical application of Snorkel.

This was really interesting and could be a way to save time in the long run.

Monday 3 August 2020

Do You Know Python Has A Built-In Database? by Christopher Tao via @TDataScience

An introduction of the Python built-in library - sqlite3.

This is very useful and something that I never really understood until I read this.  Includes some code examples and a link to the documentation.