Tag: Python
-
From Least Squares Benchmarks to the Marchenko–Pastur Distribution
In this blog post, I tell the story how I learned about a theorem for random matrices of the two Ukrainian🇺🇦 mathematicians Vladimir Marchenko and Leonid Pastur. It all started with benchmarking least squares solvers in scipy. Setting the Stage for Least Squares Solvers Least squares starts with a matrix and a vector and one…
-
DuckDB: Quacking SQL
“R Python” continued… DuckDB: Quacking SQL
-
Random Forests with Monotonic Constraints
“R Python” continued… Random forests with monotonic constraints
-
Personal Highlights of Scikit-Learn 1.0
Yes! After more than 10 years, scikit-learn released its 1.0 version on 24 September 2021. In this post, I’d like to point out some personal highlights apart from the release highlights. 1. Feature Names This one is listed in the release highlights, but deserves to be mentioned again. This is not yet available for all…
-
Feature Subsampling For Random Forest Regression
TLDR: The number of subsampled features is a main source of randomness and an important parameter in random forests. Mind the different default values across implementations. Randomness in Random Forests Random forests are very popular machine learning models. They are build from easily understandable and well visualizable decision trees and give usually good predictive performance…
-
Strong Random Forests with XGBoost
“R Python” continued… Strong random forests with XGBoost
-
Least Squares Minimal Norm Solution
Beyond the least squares TLDR: an illustration of the minimal norm solution in Python.
-
A Curious Fact on the Diamonds Dataset
“R Python” continued… A Curious Fact on the Diamonds Dataset
-
A Beautiful Regression Formula
“R Python” continued… A beautiful formula for R-squared.
-
Swiss Mortality
A look at Swiss mortality for 20 and over 100 years.
-
Covid-19 Deaths per Mio
“R Python” continued… How are normalized Covid-19 deaths developing across country?
-
Illustrating The Central Limit Theorem
This is the first article in our series “Lost in Translation between R and Python”. We start it by illustrating the famous Central Limit Theorem.