Delivered once every Week. No Spam Ever.

Issue - 113

Worthy Read

Talkpython interview with Guido van Rossum aka BDFL.

We help companies like Airbnb, Pfizer, and Artsy find great developers. Let us find your next great hire. Get started today.

The people who introduced me to Python chose it because of the elegance of the language, and it's aesthetic qualities. Would they choose it again, I wonder? Would I?.

This book is for people with some experience in an object oriented programming language. This book will help you get better at module/class level design. Hopefully, it will teach you to identify good design from bad.

Python material in data science, analysis, and modeling, and optimization. Here is the youtube video channel of the site

This time, it was different though. My distributed web crawler seemed to be slowing down over time. Adding more nodes only had a temporary performance boost; the overall crawling speed gradually declined afterwards. So simply put, it couldn't scale. But why?. In this post, you'll find out what techniques and tools I used to diagnose scaling issues - and to an extent, more general performance issues - in my Python-based web crawler.

Code reuse is a very common need. It saves you time for writing the same code multiple times, enables leveraging other smart people’s work to make new things happen. Even just for one project, it helps organize code in a modular way so you can maintain each part separately. When it comes to python, it means format your project so it can be easily packaged. This is a simple instruction on how to go from nothing to a package that you can proudly put it in your portfolio to be used by other people.


Note - The video is old, but worth watching for emacs users.

deep learning

image processing

PyWren, Tfdeploy, Luigi, Kubelib, PyTorch. Note - We used luigi at my previous workplace and it's a solid library to custom pipelines for batch processing. In our case it was used to enforce database migrations.
machine learning

Type Tracing - as a program runs you trace it and record the types of variables coming in and out of functions, and being assigned to variables.

I've written a Python package called pdftabextract that contains several helpful functions for that task and I'm explaining how to use them in that blog post.
data mining


Pune, Maharashtra, India
iDatalabs ( is hiring for a junior data scientist (with 0.5-2 years of work experience).


tkui - 166 Stars, 11 Fork
A visual introspective GUI maker with live editing of the GUI and its editor at the same time

tweetfeels - 18 Stars, 1 Fork
Real-time sentiment analysis in Python using twitter's streaming api

WallpapersFromReddit - 13 Stars, 3 Fork
Download all the hot images from subreddit every 24 hours to a local device and set an image from those local files as a wallpaper, which updates automatically every 30 minutes!

lda2vec-tf - 12 Stars, 1 Fork
Tensorflow port of the lda2vec model for unsupervised learning of document + topic + word embeddings.

ieighteen - 10 Stars, 1 Fork
Speed up your Localization/Internationalization efforts by automating translation with single script

fish-hook - 8 Stars, 1 Fork
A console tool which manages your github webhooks efficiently.

ipyaml - 8 Stars, 1 Fork
IPython notebooks with YAML file format

Scrapstagram - 3 Stars, 0 Fork
An Instagram Scrapper

kimo - 3 Stars, 0 Fork
Find OS processes of MySQL queries