Group assignment 1.4#
Due: Friday, September 26th 2025. 12:30
GA 1.4 focuses on the fit and validation of a continuous parametric probability distribution to two data sets. There are three datasets to choose from (the analysis is exactly the same for all three!).
There are several files to be aware of:
README.md
3 subdirectories containing the files for each of the 3 data sets, containing the “usual” files (
Analysis.ipynb
andReport.md
)
First, you should choose a dataset to use for GA 1.4. The analysis is identical for each set, so choose that which aligns best with the interests of your group, as the context of the application may help with the interpretation of your results.
01_wind_gusts
: Compute the wind gust factor in Delft02_traffic
: Compute the traffic density at a highway near Helsinki03_river_flow_velocity
: Compute the flow velocity of the river Thames
Note that the notebook lists specific distribution that you should choose from, which are unique for each data set; read the notebook to see which distributions you should choose from!
As in previous weeks:
Complete the analysis in
Analysis.ipynb
then answer the questions inReport.md
(but some group members can start working on the Report in parallel!).Markdown cells in the notebook are primarily for your own benefit; answers in the Report are more important.
When submitting your assignment make sure your Analysis.ipynb
and Report.md
files are in the root directory of the repository (i.e., in the same folder as this README file).
Task Overview#
We recommend leaving around 45-60 minutes at the end of the session to work on the Report.
Many of the tasks in the notebook, as well as some of the Report, can be completed independently, here are a few tips:
Inspect the data together and write down a few observations that you all agree on. This will become your first answer in the Report.
You can split up and implement the code needed for many of the notebook tasks (some of the code from the WS will be useful to recycle).
Once the analysis is complete, meet with *all group members and observe the results, then decide on the “best” distribution (write down a few reasons to add to the justification in the Report)
If you are short on time, remember that it is more important to have answers in the Report than a completed notebook. Explain any issues you faced with the programming in your Report.
By Max Ramgraber, Patricia Mares Nasarre and Robert Lanzafame, Delft University of Technology. CC BY 4.0, more info on the Credits page of Workbook.