Jens Laufer
writes about Software Development, Data Science, Entrepreneurship, Traveling and Sports

Make A Living in Data Science and Not Trading Time for Money

5 product ideas for Single Person Entrepreneurs

You made yourself a plan. You want to make a living in Data Science. Well, why are you applying for a data science job then? There are tons of them right now, as we all know data science is hot right now. You tell me that you want to create a valuable, repeatable and scalable business model and you don’t want to trade time for money. I tell ... Read more

Example Use Cases of Docker in the Data Science Process

...or how to avoid the It-works-on-my-computer-but-nowhere-else-problem

The excellent comic by Jeff Lofvers illustrates what you often face in software development but also in data science. You are preparing a data analysis or predictive model, but when you want to share it, then it does not work on someone else machine. It fails, because libraries are missing, libraries are having the wrong version (“dependency ... Read more

Image aesthetics quantification with a convolutional neural network (CNN)

Project report for training a MobileNetV1 based convolutional neural network (CNN) with only 14,000 images with transfer learning

I. Definition Project Overview The “A picture is worth a thousand words” stresses how important images are in the modern world. The quality of images, e.g. influences our decisions in different domains. Especially in eCommerce, where we cannot touch things they are essential. They have therefore a significant influence on our product purchasin... Read more

10 Lessons Learned from Scraping Websites

Valuable insights which I gained from retrieving data from many websites over the last years which I want to share with you

“Data is the new oil. It’s valuable, but if unrefined it cannot really be used. It has to be changed into gas, plastic, chemicals, etc. to create a valuable entity that drives profitable activity; so must data be broken down, analysed for it to have value.” - Clive Humby A frequent criticism of Kaggle and MOOC (Massive Open Online Courses) ... Read more

Missing value visualization with tidyverse in R

A short practical guide how to find and visualize missing data with ggplot2, dplyr, tidyr

Finding missing values is an important task during the Exploratory Data Analysis (EDA). They can affect the quality of machine learning models and need to be cleaned before training models. Detecting the missing values let’s you also evaluate the quality of your data retrieval process. This short practical guide will show you how to find missing... Read more