Monday 28 February 2022

6 Lesser-Known SQL Techniques to Save You 100 Hours a Month by @camwarrenm via @TDataScience

Use these simple techniques to make your analysis and data extracts easier.

I'm sure we all have our own library of useful SQL code (I certainly do), and I think these techniques would be good additions to it.

Wednesday 23 February 2022

Data Mesh & Its Distributed Data Architecture by Yash Mehta via @kdnuggets

Going forward, data professionals have found a new way to address the scalability of sources through data mesh.

I like that this is distributed and that you do not need to build a huge data warehouse or data lake.

Monday 21 February 2022

Python’s F-Strings Are A Lot More Useful Than You Might Have Thought by @emmettboudgie via @TDataScience

Some cool things most people do not realize f-strings can do in Python.

Interesting to read and think about as I had no idea about some of these things.
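
To give a flavour of what f-strings can do beyond plain substitution, here is a small sketch of my own. These are standard Python features, not necessarily the ones covered in the article, and the variable names and values are made up for illustration.

```python
from datetime import date

# Hypothetical values, chosen only to demonstrate the formatting options.
name = "pipeline"
rows = 1234567
ratio = 0.4567
day = date(2022, 2, 21)
width = 8

debug = f"{rows=}"              # self-documenting expression (Python 3.8+)
grouped = f"{rows:,}"           # thousands separator
percent = f"{ratio:.1%}"        # percentage with one decimal place
padded = f"{name:>12}"          # right-align within 12 characters
stamp = f"{day:%d %B %Y}"       # strftime codes inside the format spec
nested = f"{ratio:{width}.2f}"  # nested field supplies the width

for text in (debug, grouped, percent, padded, stamp, nested):
    print(repr(text))
```

The `=` specifier needs Python 3.8 or later; the rest should work on any version that supports f-strings.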

Thursday 17 February 2022

VIRTUAL EVENT - Zero Gravity – the Modern Cloud Data Pipeline Event - 26 May 2022

Sponsored News from Data Science Central

JOIN US LIVE AT THE INDUSTRY’S FIRST CLOUD DATA PIPELINE EVENT

Data science and analytical workloads are rapidly transitioning from on-premises infrastructure to the cloud. But while the cloud offers many benefits, the migration is riddled with challenges and potential pitfalls.

That’s why we’re bringing together technical experts, thought leaders, and industry visionaries with experience building and managing cloud data pipelines to share lessons learned and discuss emerging technologies.

This event is virtual, 100% LIVE and open to everyone—at no cost.

KEYNOTE SPEAKER

THOMAS KURIAN

Chief Executive Officer, Google Cloud

Our keynote speaker will be Thomas Kurian, CEO of Google Cloud.
New speakers will be added regularly, so keep checking back for updates!

DO YOU HAVE UNIQUE INSIGHTS OR EXPERTISE ON CLOUD DATA PIPELINES?

We are now accepting submissions from practitioners in the data architecture and engineering community who can share their experience and expertise on the following cloud data pipeline topics:

Cloud architecture

Best practices in cloud migration

Data integrity and durability

Optimizing performance

Managing costs

Leveraging native services

Emerging technologies

Open-source technologies

New analytic approaches

Security and governance

The call for speakers is now open until Feb 25th – submit your speaking proposal today!

IN PARTNERSHIP WITH

OUR SPONSORS

Incorta, AWS, Microsoft Azure

Wednesday 16 February 2022

Data Science Programming Languages and When To Use Them by Nate Rosidi via @kdnuggets

Read this guide to the most common data science programming languages and when to use them.

This has a great level of detail and is very useful if you wonder whether your choice was correct or whether you should expand into another language.

Monday 14 February 2022

Top 10 Pandas Functions for Preparing Data by Holly Dalligan via @BttrProgramming

The author chose these functions because she wanted to create useful, accurate analysis with as little work as possible.

I found this really interesting and it looks very useful too - data preparation, if done well, can help you to produce much better results from your analysis.

Wednesday 9 February 2022

A Step-by-Step Guide to Calculating Autocorrelation and Partial Autocorrelation by Eryk Lewinson via @TDataScience

How to calculate the ACF and PACF values from scratch in Python.

This was very clear and really helped me to understand how to calculate them in Python as I'm really not good at that language no matter how hard I try.
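
For my own notes, here is a minimal sketch of the idea in plain Python. It is not the article's code: the function names are my own, the ACF uses the usual sample formula (deviations from the mean, divided by the lag-0 sum of squares), and the PACF uses the Durbin-Levinson recursion, which may differ from the method the author walks through.

```python
def acf(series, max_lag):
    """Sample autocorrelation for lags 0..max_lag (assumes a non-constant series)."""
    n = len(series)
    mean = sum(series) / n
    dev = [x - mean for x in series]
    denom = sum(d * d for d in dev)  # lag-0 sum of squared deviations
    return [
        sum(dev[t] * dev[t - k] for t in range(k, n)) / denom
        for k in range(max_lag + 1)
    ]


def pacf(series, max_lag):
    """Partial autocorrelation for lags 0..max_lag via Durbin-Levinson."""
    r = acf(series, max_lag)
    result = [1.0]  # the lag-0 value is 1 by convention
    prev = []       # prev[j - 1] holds phi_{k-1, j}
    for k in range(1, max_lag + 1):
        num = r[k] - sum(prev[j - 1] * r[k - j] for j in range(1, k))
        den = 1.0 - sum(prev[j - 1] * r[j] for j in range(1, k))
        phi_kk = num / den
        # Update the lower-order coefficients, then append the new one.
        current = [prev[j - 1] - phi_kk * prev[k - j - 1] for j in range(1, k)]
        current.append(phi_kk)
        result.append(phi_kk)
        prev = current
    return result


values = [1.0, 2.0, 3.0, 4.0, 5.0]  # toy data, for illustration only
print(acf(values, 2))
print(pacf(values, 2))
```

At lag 1 the ACF and PACF agree, which is a handy sanity check when comparing against a library implementation.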

Tuesday 8 February 2022

WEBINAR: Improving Your DataOps Practices with Data Fabric - 16 February 2022

Sponsored News from Data Science Central


Webinar: Improving your DataOps Practices with Data Fabric

Today's enterprises struggle to unlock real value out of data that is stuck in legacy relational databases, data warehouses and data lakes. Modernizing disparate data silos to a single data fabric is key to managing change over time and creating data-driven innovation and value.

Join this session to learn how you can build a modern data fabric leveraging the Lumada DataOps Suite from Hitachi Vantara along with MongoDB Atlas.

In addition, hear from a utility customer that is reaping the benefits of the data fabric and see a demo of how you can create a modern data pipeline to drive great enterprise data value. Register to watch this webinar live when it airs February 16th.



Hope to see you there,
Sean Welch
Data Science Central

Hitachi

Monday 7 February 2022

A Senior Engineer at Google Reveals “The Best Programming Language To Learn in 2022” by @minhazav via @BttrProgramming

He is a senior software engineer at Google Singapore and is very often asked which programming language to learn.

I agree that learning one is great and that the choice really depends on what you are working with. Learning the basic concepts via one of them is useful no matter what. Arrays, variables, loops, etc. are all basic skills that can be transferred across languages.