Friday 29 October 2021

Write Better And Faster Python Using Einstein Notation by Bilal Himite via @TDataScience

Make your code more readable, concise, and efficient using “einsum”

I had never heard of this and was fascinated to find out more. I also found this additional article useful:

Understanding einsum for Deep learning: implement a transformer with multi-head self-attention from scratch

Wednesday 27 October 2021

Stop Using CSVs for Storage — This File Format Is 150 Times Faster by Dario Radečić via @TDataScience

CSV’s are costing you time, disk space, and money. It’s time to end it.

Definitely, CSVs are great if you want to edit the file but it's not that fast - even a text file is faster.

Monday 25 October 2021

WEBINAR: Maximizing Data Labeling Operations in High-Stakes Industries: Tips for Tools and Teams - 2 November 2021

 

Maximizing Data Labeling Operations in High-States Industries

Are you interested in learning about overcoming data annotation challenges like scaling teams, labeling complex data, and handling edge cases?

There's an art and science to choosing the processes and teams used to extract and structure the data found in images, video, and documents for AI and business insights. In this interactive LinkedIn Live chat, we're talking to Alberto Rizzoli, CEO & Co-founder of V7 Labs, a data labeling platform for text and visual data. CloudFactory and V7 regularly collaborate to optimize data labeling operations for customers and build high-quality datasets for global innovators.

Join us on November 2 at 11 am ET / 4 pm BST: Maximizing Data Labeling Operations in High-Stakes Industries: Tips for Tools and Teams

Here are a few topics we plan to discuss with V7:

Fascinating real-world examples of computer vision development in agriculture and healthcare
Maximizing data operations resources and scalability by combining SMEs like medical doctors and experienced data annotators during the AI lifecycle
Optimizing human and computer collaboration to process edge cases that baffle text annotation tools like optical character recognition (OCR)
Preparing for data annotation challenges by choosing proven tools, processes, and human in the loop workforces
Register Now

P.S. Have questions? Contact CloudFactory anytime here. You might enjoy learning about CloudFactory's collaboration with V7 on Covid-19 AI training data.

Visual Studio Code for Python and Data Science? Top 3 Plugins You Must Have by Dario Radečić via @TDataScience

Is it the best code editor for Python and Data Science?

Interesting - I had never thought of doing it that way.

Friday 22 October 2021

8 Must-Have Git Commands for Data Scientists by @snr14 via @kdnuggets

Git is a must-have skill for data scientists. Maintaining your development work within a version control system is absolutely necessary to have a collaborative and productive working environment with your colleagues. This guide will quickly start you off in the right direction for contributing to an existing project at your organization.

I good quick reminder of the commands you need to use in Git - worth a printout or adding it to something like an Evernote folder.

Thursday 21 October 2021

WEBINAR: Gain Insights from SAP Data with Qlik and Microsoft - 28 October 2021

 

Sponsored News from Data Science Central

Qlik Logo

Webinar: Gain Insights from SAP Data with Qlik and Microsoft
What: free online webinar exclusively for IT Pros
When: Thursday, October 28th, 9 AM PST / 12 PM EST
Where: From the convenience of your personal computer 

Register Now

In this latest Data Science Central Webinar, you’ll hear how global manufacturer Greene Tweed developed an efficient, cost-effective, and logical strategy to connect Qlik Data integration and Microsoft Azure Synapse to build an intelligent supply chain for better business insight.

Learn how Greene Tweed addressed their challenges and how they use data liberated from SAP for strategic analytics initiatives. In addition, you will discover:

  • How to create a plan of attack to extract insights from massive amounts of SAP data
  • How important change data capture is to building low latency extracts for massive data volumes
  • Why automation with Qlik Data Integration is crucial to expedite data availability
  • How SAP data drives insights to optimize Manufacturing & Supply Chain
  • The gains Greene Tweed realized by moving SAP data to Azure Synapse

Register today to start your journey to improving data operations, accelerating reporting, reducing systems overhead and enabling AI operations with Qlik and Azure Synapse.

Speakers:
Matt Hayes, VP of SAP Business, Qlik
David Hufnagle, Manager, Enterprise Data and Analytics, Greene Tweed
Greg Vigil, Industry Solution Director – Manufacturing, Microsoft

WEBINAR: How to build customer-oriented applications using third-party data - 28 October 2021

 

Datafloq

NEWSLETTER

.
aws marketplaceAWS Data Exchange
How to build customer-oriented applications using third-party data
REGISTER NOW
.
You’re Invited!
📅THURSDAY, OCTOBER 28
🕓11AM PT | 2PM ET
⌛60 MIN SESSION
REGISTER NOW
Join this webinar to learn how using third-party data enhances applications to better prioritize your target customer – helping you build a more customer-centric business.
In this virtual session, AWS Data Exchange will host a discussion with thought leaders from companies such as Foursquare and Nextdoor. They will share real-world examples on leveraging datasets to create a customer-centric strategy and improve business outcomes.
Key takeaways include:
✓
How Nextdoor uses point-of-interest (POI) data to help improve global business data coverage and quality, discovery, verification, and onboarding experiences
✓
How Nextdoor uses POI data to improve lead generation of local businesses and grow page claim rates
✓
How to provide personalized recommendations using Amazon
✓
How to discover, find, and use third-party data with AWS Data Exchange
Moderator
Mohsen Malik
AWS Data Team Lead, Customer Advisory
aws marketplace
Mohsen leads a team of Customer Advisors for AWS Data Exchange to help customers discover, procure, and use traditional and alternative data assets in the AWS Cloud. In this capacity, his team engages with data subscribers globally across key data domains such as location and geographic information system (GIS), consumer insights, and audience data. Mohsen’s team also works with key data providers across their data domains to reduce their operational costs and expand their customer base by leveraging cloud-native distribution.
Presenters
Josh Cohen
Senior Vice President Product, Foursquare
FOURSQUARE
Previously, Josh was at Google where he was the group product manager for Publisher Advertising Platforms and Business Product Manager for Google News, responsible for global product strategy, marketing, and publisher outreach. He was also vice president of business development for the consumer media team at Reuters Media and director of business development for SmartMoney.com, a joint venture between Dow Jones and Hearst. Josh holds degrees from the University of Michigan and Columbia Business School, where he graduated Beta Gamma Sigma.
Rahul Sureka
Engineering Leader, Nextdoor
🏠 Nextdoor
Rahul is an Engineering Leader at Nextdoor, in charge of driving search, discovery, and marketplaces. He is a results-oriented leader with over 10 years of experience in building top-performing products. Rahul joined Nextdoor in October 2015 and led several initiatives globally to develop Nextdoor‘s first monetization platform. Rahul received a Master’s in Computer Science at University of Southern California from 2006 to 2007.
Angel Goñi Oramas
Enterprise Solutions Architect, AWS
aws marketplace
Angel Goñi is an Enterprise Solutions Architect at AWS. He helps enterprise customers drive their digital transformation process by leveraging AWS services. His current focus is supporting consumer packaged goods (CPG) customers with emphasis on SAP migrations to AWS.
*The views and opinions of Foursquare, Nextdoor, and their presenters are their own and do not necessarily reflect the positions of AWS.
About AWS Data Exchange:
AWS Data Exchange makes it easy to find, subscribe to, and use third-party data in the cloud. Once subscribed to a data product, you can use the AWS Data Exchange API to load data directly into Amazon S3 and then analyze it with a wide variety of AWS analytics and machine- learning services. Click here to browse thousands of data products now available from more than 80 qualified data providers in AWS Marketplace.
Visit aws.amazon.com/data-exchange to learn more.
REGISTER NOW
aws marketplace
© 2021 AWS Marketplace.

Wednesday 20 October 2021

24 Key Concepts to Know for Mastering Python Functions by Yong Cui via @BttrProgramming

Check the breadth of your Python knowledge.

I really enjoyed going through this and it told me that I wasn't as knowledgeable as I thought I was (which is always a good thing as it meant I learnt something and improved!).

Monday 18 October 2021

Aggregations on time-series data with Pandas by @OlegZero13 by @TDataScience

Python Pandas and SQL - time aggregations and syntax explained.

This is a great reminder of the syntax and helped me to remember some things I had obviously forgotten.

Friday 15 October 2021

How Intelligent Marketers Use AI by Steve Jones via @Datafloq

Few technological innovations have delivered as many welcomed improvements to the marketplace as artificial intelligence has.

A good article that will help you to understand the use of AI in marketing.

Wednesday 13 October 2021

Is PyPy Really Faster Than Python? Here Are 5 Benchmarks

HTTP, databases, algorithms, and more.

Good to see some benchmarks and get some data behind some assumptions I'm sure we have all been making.

Monday 11 October 2021

All Top Python Libraries for Data Science Explained (with Code) by Frank Andrade via @TDataScience

Python libraries for data science in plain English and resources to learn them for free.

Some great libraries and definitely a few that I wasn't aware of.

Friday 8 October 2021

3 Cool Features of Python Altair by @snr14 via @TDataScience

It is more than a data visualization library.

This looks really cool and you might find it becomes your chosen data visualisation method in code moving forward - it's certainly worth a play and considering.

Wednesday 6 October 2021

The 4 Tiers of Digital Transformation by @profmohans via @HBR

Companies often assume that if they embrace digital technology in any way, they’re digitally transforming their business. As a result, they often make only ad hoc changes and investments in the digital arena, with ineffectual results. 

I really enjoyed these tiers which helped to compartmentalize it into workable chunks of work or areas to focus on.

Monday 4 October 2021

26 Useful Python Snippets for Lazy Developers by Jainharsh via @PY_PlainEnglish

Here are some of the author's most useful code snippets that will indefinitely make your life easier as a programmer!

These are really useful and worth a printout if not a bookmark or placement in Evernote.