Friday 29 January 2021

K-Means 8x faster, 27x lower error than Scikit-learn in 25 lines by Jakub Adamczyk via @kdnuggets

K-means clustering is a powerful algorithm for similarity searches, and Facebook AI Research's faiss library is turning out to be a speed champion. With only a handful of lines of code shared in this demonstration, faiss outperforms the implementation in scikit-learn in speed and accuracy.

Definitely, a new one to try and see if you like the results better.

How To Become A Task Automation Hero Using Python [With Examples] by @monterail via @Medium

Performing repetitive tasks can bore even the most resilient of us out of our minds. Lucky for us, the digital age we live in offers us a bevvy of tools to relieve ourselves of that sort of tedious labour.

I loved this and it looks incredibly useful - worth the time to investigate so that you can use it going foward.

Wednesday 27 January 2021

10 Things You Should Know About Tuples in Python by @ycui01 in @thestartup_

Tuples are very handy to use in Python programming. Every Python programmer should know its usage really well.

Some really useful code snippets that really help you to understand all of this.

Looking For A Profitable Coding Project? Take This One by keypressingmonkey in @gitconnected

 Here’s a project with real demand, real use and a lot of potentials.

An interesting quick read which is worth a quick read if you are able.

Monday 25 January 2021

9 Easy Steps To Make Great Charts by @Thuwarakesh via @TDataScience

 How to tell exciting stories in presentation slides with elegantly re-organized charts.

A chart can make or break a presentation and so the time you invest in it now will pay dividends later.

Google Research: Looking Back at 2020, and Forward to 2021 by Jeff Dean via @googleai

Google has a massive impact on the tools, applications and research that help steer the data science community. As with prior years, this retrospective by Jeff Dean is amazing in its scope. Includes useful summaries, screenshots, videos and linked references throughout.

I found this really interesting and showed a number of areas that I was not aware of the scale of Google involvement. The look forward was a great place to learn of areas that are going to come at some point and worth planning for as it will be there and possible soon.

Friday 22 January 2021

Algorithms For Data Scientists — Insertion Sort by Richmond Alake via @TDataScience

 One of the easiest algorithms you’ll ever learn. You might find the Wikipedia page on Insertion Sort too.

A good reminder of this algorithm and how to use it. Please note that this is only good for small datasets as it can be slow to build the final result one at a time.  Consider using Quicksort instead.

Wednesday 20 January 2021

WEBINAR: The Intelligence Revolution: Driving ROI with AI - 27 January 2021

 

Sponsored News from Data Science Central

Almost every business in every corner of the world is seeing some use of AI within its business. But the question is, are they using AI in ways that will help them drive the biggest ROI?

Join us for a lively discussion on this topic with Lou Celli, Chief Executive and Founder of ESI ThoughtLab, Peter Henstock, Machine Learning and AI Lead at Pfizer, and Ari Kaplan, AI Evangelist at DataRobot, as they discuss the findings of ESI ThoughtLab’s benchmarking study of 1200 organizations across world markets.

In the webinar, you will learn:
checkmarkInsights around the current state of AI at major companies and how those organizations plan to continue to make major investments in AI.
checkmarkThe five best practices to be successful at AI and drive high performance.
checkmarkHow Pfizer is already using AI to meet challenges and “lead from the middle.”
DataRobot_The_Intelligence_Revolution_Driving_ROI_with_AI_Sharing_card_v1.0.jpg
Register today

Stop Using Print to Debug in Python. Use Icecream Instead by @KhuyenTran16 via @TDataScience

 Are you Using Print or Log to Debug your Code? Use Icecream instead.

Definitely, something new to try that I had never heard of before. To avoid Medium if you don't have a subscription you can find this (or something similar) on Github.

Monday 18 January 2021

Best Python IDEs and Code Editors You Should Know by Claire D Costa via @kdnuggets

Developing machine learning algorithms requires implementing countless libraries and integrating many supporting tools and software packages. All this magic must be written by you in yet another tool -- the IDE - that is fundamental to all your code work and can drive your productivity. These top Python IDEs and code editors are among the best tools available for you to consider and are reviewed with their noteworthy features.

This is really useful and very clear. You might find a new favourite by going through this list.

Excel Automation Using Python Xlwings by Sandeep Burnwal via @Medium

Automation has been the buzzword for a long time and will continue to be one in the times to come. It can take many forms and this great article by Sandeep explains how you can automate Excel using Python XIwings.

A very clear and very useful thing to learn. Useful code snippets to help you get your own code right.

Friday 15 January 2021

Big Data Architecture in Data Processing and Data Access by Stephanie Shen via @DataScienceCtrl

Over the past 20+ years, it has been amazing to see how IT has been evolving to handle the ever-growing amount of data, via technologies including relational OLTP (Online Transactional Processing) database, data warehouse, ETL (Extraction, Transformation and Loading) and OLAP (Online Analytical Processing) reporting, big data and now AI, Cloud and IoT.

This was very clear and insightful. Worth a read as I think it could clear up a few misunderstandings.

All Machine Learning Algorithms You Should Know in 2021 by Terence Shin via @TDataScience

Many machine learning algorithms exist that range from simple to complex in their approach, and together provide a powerful library of tools for analyzing and predicting patterns from data. If you are learning for the first time or reviewing techniques, then these intuitive explanations of the most popular machine learning models will help you kick off the new year with confidence.

This will help you get your machine learning right by using the correct algorithm.

Wednesday 13 January 2021

SQL vs NoSQL: 7 Key Takeaways by Alex Williams via @kdnuggets

People assume that NoSQL is a counterpart to SQL. Instead, it’s a different type of database designed for use-cases where SQL is not ideal. The differences between the two are many, although some are so crucial that they define both databases at their cores.

I enjoyed reading this thoughtful article. I think it helps to clear up some potential confusion and ensures that you really understand via his careful use of diagrams.

Efficient Time-Series Analysis Using Python’s Pmdarima Library by Muriel Kosaka via @TDataScience

Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.

I think this would be worth playing with as I believe it would be worth using to improve your results of time series analysis.

Monday 11 January 2021

Optimization in Python — Peephole by Chetan Ambi via @gitconnected

A brief introduction to Python’s Peephole optimization technique.

I think this could save you time if you can learn how to use and understand it properly. Definitely worth reading.

Multicloud: A cheat sheet by/via @techrepublic

 This comprehensive guide covers the use of services from multiple cloud vendors, including the benefits businesses gain and the challenges IT teams face when using multi-cloud.

There are a lot of links in this article which acts as an index for cloud information.

Friday 8 January 2021

Python Sets are Underrated by Thomas Hikaru Clark via @TDataScience

 A guide to the elegant and helpful uses of built-in Python sets.

A great tutorial on a very important but basic functionality in Python code.

New to Python? 10 Acronyms That Every Python Programmer Should Know by @ycui01 via @TDataScience

 Programming principles, rules, and some fun facts.

I could guess at some of these but didn't know them for sure.

Wednesday 6 January 2021

Top 7 Data Libraries You Will Absolutely Need for Your Next Deep Learning Project @orhangaziyalcin via @TDataScience

You might be an expert in TensorFlow or PyTorch, but you must take advantage of these open-source Python libraries to succeed.

A very useful list of some of the most relevant libraries to use if you want to do a Deep Learning project. This could save you the time you would have spent searching for the right libraries.

Intro to Python Classes and Objects by Muriel Kosaka via @TDataScience

 Explaining the basics of Python objects and classes using examples.

Great explanations and code segments and is really helpful in understanding them.

Monday 4 January 2021