Beautiful Dataframes
Some Applied Data Science in Python

This is a web site on some Data Science topics.
It includes content that I taught during an academic course on ‘Data science in Python’ in 2022 and 2023 leading to a bachelor degree.
The value proposition of this page is to show how the various techniques are applied to get to a meaningful analysis of data. For each detail, I recommend to look it up on the respective page or forum.

At this start of 2024, I am preparing for the repetition of this course on ‘Data Science in Python’.

Setup and Coding

Okay, let’s get started with the Setup 2024 for Pandas and Plotly on Ubuntu

Mostly, I use Python, Pandas and Plotly here.

Special topic of interest:

Results

First Names

Let’s start with a data set that is straightforward to consume and to visualize.

The statistics department in Scotland has a great website from which a lot of interesting data can be downloaded, e.g. Birth names
How does my first name compare to the currently most popular male first name?

../assets/images/img_blog/Python_2024/FirstNames/C_25_Line_Fixed_Names.png
Total number of names per year in Scotland shown for tow specific first names.

Well, my first name has some potential for growth. However, considering the increasing diversity of names and hence the lower maximum values for the top names, the difference to the top names is getting smaller.

More on the technical part of this analysis and more on first names can be found here: Pandas - Analysis first names

Modelling - Time series

Modelling - Categorizations

Continue on Categorization

Machine Learning (i.e. no neuronal networks) concludes the part on Python. (No PyTorch here.)


Julia

In 2022 and 2023, Python was sufficient for all my requirements. No updates hence on my pages on Julia dataframes. The content on Julia reflects my version and my knowledge of Julia as of 2021.

Here are the pages on Julia dataframes you were interested in 2023.

Copyright 2021-2022