Jens Laufer
writes about Software Development, Data Science, Entrepreneurship, Traveling and Sports

Image aesthetics quantification with a convolutional neural network (CNN)

Project report for training a MobileNetV1 based convolutional neural network (CNN) with only 14,000 images with transfer learning

I. Definition Project Overview The “A picture is worth a thousand words” stresses how important images are in the modern world. The quality of images, e.g. influences our decisions in different domains. Especially in eCommerce, where we cannot touch things they are essential. They have therefore a significant influence on our product purchasin... Read more

10 Lessons Learned from Scraping Websites

Valuable insights which I gained from retrieving data from many websites over the last years which I want to share with you

“Data is the new oil. It’s valuable, but if unrefined it cannot really be used. It has to be changed into gas, plastic, chemicals, etc. to create a valuable entity that drives profitable activity; so must data be broken down, analysed for it to have value.” - Clive Humby A frequent criticism of Kaggle and MOOC (Massive Open Online Courses) ... Read more

Missing value visualization with tidyverse in R

A short practical guide how to find and visualize missing data with ggplot2, dplyr, tidyr

Finding missing values is an important task during the Exploratory Data Analysis (EDA). They can affect the quality of machine learning models and need to be cleaned before training models. Detecting the missing values let’s you also evaluate the quality of your data retrieval process. This short practical guide will show you how to find missing... Read more

Case Study: How to find a Business Opportunity in Sports Psychology

…or how to be BOD (=Business Opportunity Detective) A few days ago I was at a beach bar with a friend who is a freelancer like me. We met a young lady working at the counter who just finished university in sports psychology. She was interviewing us about our freelancership and asked us many questions. The questions were mostly about administrat... Read more

Mongo DB Aggregation vs R dplyr

There are powerful options to do the processing you want: Mongo DB aggregation framework: Aggegation operations process data records results. Aggregation operations group values from multiple documents together, and can perform a variety of operations on the grouped data to return a single result. dplyr framework in R ... Read more