Jens Laufer
writes about Software Development, Data Science, Entrepreneurship, Traveling and Sports

Missing value visualization with tidyverse in R

A short practical guide on how to find and visualize missing data with ggplot2, dplyr, and tidyr

Finding missing values is an important task during Exploratory Data Analysis (EDA). Missing values can affect the quality of machine learning models and need to be cleaned before training. Detecting them also lets you evaluate the quality of your data retrieval process. This short practical guide will show you how to find missing... Read more

Case Study: How to find a Business Opportunity in Sports Psychology

…or how to be a BOD (Business Opportunity Detective). A few days ago I was at a beach bar with a friend who is a freelancer like me. We met a young woman working at the counter who had just finished her university studies in sports psychology. She interviewed us about our freelance work and asked us many questions. The questions were mostly about administrat... Read more

MongoDB Aggregation vs R dplyr

There are powerful options to do the processing you want: MongoDB aggregation framework: Aggregation operations process data records and return computed results. Aggregation operations group values from multiple documents together, and can perform a variety of operations on the grouped data to return a single result. dplyr framework in R ... Read more