This is a web site on some Data Science topics.
It includes content that I taught during an academic course on ‘Data science in Python’ in 2022 and 2023 leading to a bachelor degree.
Update (January 2024):
At this start of 2024, I am preparing for the repetition of this course.
Okay, let’s get started with the Setup 2024 for Plotly, Pandas on Ubuntu
If you are new to the topic, you might want to start here:
Most of the work might be due to mastering Pandas and fine tuning the visualizations. For providing you an overview, I moved the details to the layer below this page.
I currently recommend to split a task into steps:
An alternative visualization using Plotly. Plotly also offers the possibility to extend graphs to a dashboard.
When selecting the number of cylinders in the drop down box, only the respective data are shown. The second graph shows the average fuel consumption.
A common use case is to predict this year’s data from last years data.
Once such a prediction was made, you might want to point out clearly the deviation of new value from the prediction.
Continue on Time Series …
Some Machine Learning algorithms are shown here.
The ‘Iris’ data is the standard data set used for clustering and classification.
Continue on Categorization …
Machine Learning (i.e. no neuronal networks) concludes the part on Python. (No PyTorch here.)
In 2022 and 2023, Python was sufficient for all my requirements. No updates hence on my pages on Julia dataframes. The content on Julia reflects my version and my knowledge of Julia as of 2021.
Here are the pages on Julia dataframes you were interested in 2023.