Delivered once every Week. No Spam Ever.

Issue - 156

Worthy Read

In this post, we’ll create a demo to see how simple it is to develop a machine learning-based service using Python’s Flask library.
machine learning

This is an update to a old blog post from 2006 called Fastest way to uniquify a list in Python. But this, time for Python 3.6. Why, because Python 3.6 preserves the order when inserting keys to a dictionary. How, because the way dicts are implemented in 3.6, the way it does that is different and as an implementation detail the order gets preserved. Then, in Python 3.7, which isn't released at the time of writing, that order preserving is guaranteed.

This week on the show Scott talks to Data Scientist and AI expert Paige Bailey. What's the difference between machine learning and deep learning? Do I need to learn R and Python to use machine learning models? Do models need to deploy regularly or can I use them forever? All these questions and more, this week!

Imagine being able to use a camera that’s able to tell when you’re playing a guitar, or creating a new dance, or just learning new skateboard tricks. It could use the raw image data to tell if you landed a trick or not. Or if you’re doing a new dance routine, what the series of poses are, and how they fit to the music.
deep learning

Compare yourself to over 1,000 DevOps peers to see how they manage their processes.

GoCD is a continuous delivery tool supporting modern infrastructure with elastic on-demand agents and cloud deployments. With GoCD, you can easily model, orchestrate and visualize complex workflows from end to end. It’s open source, free to use and download.


I got in a small argument at a meetup about R. Something about python being a BetterLanguage[tm] than R. One of the arguments was that python is better because the language has support for generators. This was an interesting moment because I definately agree that the way that generators work in python is great. I would even argue that there are parts in python that work better for many tasks than R might (and vise versa). But I wouldn't argue that R does not a very similar feature to generators in python, but they do require you to think differently.

This post tries to explain web application performance. Performance means the number of requests per second that can be served by a deployed application.

I was curious, how much more efficient is Msgpack at packing a bunch of data into a file I can emit from a web service.

Whenever we do classification in ML, we often assume that target label is evenly distributed in our dataset. This helps the training algorithm to learn the features as we have enough examples for all the different cases. For example, in learning a spam filter, we should have good amount of data which corresponds to emails which are spam and non spam.
machine learning

Python has turned into a data science and machine learning mainstay, while Julia was built from the ground up to do the job.

Python 3.6.4 is the fourth maintenance release of Python 3.6.
new release


Simple linear regression is a statistical method that allows us to summarise and study relationships between two continuous (quantitative) variables. I hope today to prove to myself that I understand and can demonstrate linear regression by coding it from scratch in Python without using Scikit Learn.
linear regression

Do you run a web application or web service? You probably do a couple of things to optimize the performance of your site. Make sure the database response quickly and more. But did you know a well of performance improvements live in your web servers themselves?

At Instagram, we have the world’s largest deployment of the Django web framework, which is written entirely in Python. We began using Python early on because of its simplicity, but we’ve had to do many hacks over the years to keep it simple as we’ve scaled. Last year we tried dismissing the Python garbage collection (GC) mechanism (which reclaims memory by collecting and freeing unused data), and gained 10% more capacity. However, as our engineering team and number of features have continued to grow, so has memory usage. Eventually, we started losing the gains we had achieved by disabling GC. 
garbage collection


open-paperless - 1204 Stars, 50 Fork
Scan, index, and archive all of your paper documents.

nmtpytorch - 245 Stars, 23 Fork
Neural Machine Translation Framework in PyTorch.

poor-mans-deep-learning-camera - 117 Stars, 21 Fork
Build a thin client deep learning camera with the Raspberry Pi, Flask, and YOLO.

tache - 32 Stars, 1 Fork
A tag based invalidation caching library

evolving-simple-organisms - 22 Stars, 5 Fork
Evolving simple organisms using a genetic algorithm and deep learning from scratch with python.

firefox-privacy-restorer - 15 Stars, 1 Fork
A script to modify your Firefox preferences to disable telemetry, built-in advertisements, and data collection anti-features.

persistent-dict - 10 Stars, 0 Fork
A Python dict which stores data in Redis.

albert - 2 Stars, 0 Fork
An Open Source Public API for making Machine Learning powered recommendation system.

colour-detection - 2 Stars, 0 Fork
python module for doing colour detection.