This is a blog containing data related news and information that I find interesting or relevant. Links are given to original sites containing source information for which I can take no responsibility. Any opinion expressed is my own.
Monday, 3 October 2022
Wednesday, 28 September 2022
Monday, 11 July 2022
Pydantic or dataclasses? Why not both? Convert Between Them by Diego Barba via @TDataScience
Wednesday, 29 June 2022
Primary Supervised Learning Algorithms Used in Machine Learning by Kevin Vu via @kdnuggets
In this tutorial, they are going to list some of the most common algorithms that are used in supervised learning along with a practical tutorial on such algorithms.
This is really useful and worth a bookmark or printout.
Friday, 17 June 2022
Explaining negative R-squared by Tan Nian Wei via @TDataScience
Why and when does R-squared, the coefficient of determination, go below zero?
Interesting and good to be able to confirm it is how I thought it was.
Wednesday, 15 June 2022
Python Single Responsibility Principle by/via PYTHONTUTORIAL.NET
In this tutorial, you’ll learn about the single responsibility principle and how to implement it in Python.
This was very clear and easy to use. I really think you should use this website to learn more about Python.
Monday, 6 June 2022
Parallelization with MultiProcessing in Python by Vatsal via @TDataScience
Run your data science tasks in parallel to speed up computation time.
Great examples with code to make is easier to do this with your code.
Wednesday, 18 May 2022
Data Cleaning Toolbox by Olivia Tanuwidjaja via @TDataScience
Compiling the aspects to look out for before analyzing your data.
This is a great checklist and I think could help you not to forget a step or even do something in the wrong order which could also produce wrong results.
Monday, 16 May 2022
How To Convert Pandas DataFrame Into NumPy Array by Giorgos Myrianthous via @TDataScience
Converting a pandas DataFrame into a NumPy array.
A great guide which shows just how easy it can be to do that conversion.
Wednesday, 11 May 2022
How to Use Wikipedia as a Data Source by Alan Jones via @TDataScience
How to load information from Wikipedia into Pandas by finding the best team in the English Premier League.
I like that this worked example could be used to do all sorts of things.
Monday, 9 May 2022
8 Visualizations with Python to Handle Multiple Time-Series Data by Boriharn K via @TDataScience
Visualization ideas for coping with overlapping lines in multiple time-series plots.
Some of these ideas are quite neat. Don't stick to the one you prefer the most - some sets of data may suit one of these examples more than the others.
Monday, 25 April 2022
Introduction to GraphSAGE in Python by @maximelabonne via @TDataScience
Scaling Graph Neural Networks to billions of connections.
I like this which is very clear and easy to use and understand. Great code examples.
Friday, 22 April 2022
10 SQL Queries You Should Know as a Data Scientist by Uğur Savcı via @Medium
Learn the Most Used SQL Queries in 5 Minutes with Examples
You need to keep these somewhere so you can access them. I have in the past used text files in a directory or Evernote. It is really easy then to copy and edit the code.
Wednesday, 20 April 2022
How to Run 30 Machine Learning Models with a Few Lines of Code by Ismael Araujo via @TDataScience
Learn how to run multiple machine learning models using lazy predict.
This is really neat and so you need to bookmark or add it to something like Evernote so you can use this in your Python code.
Wednesday, 13 April 2022
6 Killer Productivity Apps for Programmers by Tapajyoti Bose via @Medium
Are you a Programmer? Want to take your productivity to a whole new level? Look no further, these apps will do the job nicely.
Some of these look good - definitely a few here worth investigation.
Wednesday, 30 March 2022
Python List Print — 7 Different Ways to Print a List You Must Know by Dario Radečić via @TDataScience
Explore endless possibilities of printing and formatting lists in Python.
Some of these were new for me and so I expect others will discover something new here too.
Wednesday, 23 March 2022
50 python one-liners everyone should know by Allwin Raju via @Medium
He was always amazed by how easily things can be done using python. Some of the tedious tasks can be done in a single line of code using python. He has gathered some of his favourite one-liners from python.
This is a great resource and something that you may learn from.
Monday, 21 March 2022
8 Essential Python Techniques for Data Engineers and Analysts (with code samples) by @camwarrenm via @TDataScience
Wednesday, 16 March 2022
10 Python operations for 70% of your Data Analysis by Anmol Tomar via @Medium
Monday, 14 March 2022
Take 5 Minutes to Get Started With Multi-threading in Python by Amit Chauhan via @Medium
Run several applications at the same time.
I needed to find a way to do this so I am really grateful to Amit for showing me how to in this article. Definitely a must-read if you want to do anything interesting in Python.