Report traffic dataset

Contents

Report traffic dataset#

Questions#

Question 1

Give a short description of the provided dataset in statistical terms, based on the tasks in the notebook. Visualize the data and choose a parametric distribution function for each variable between the indicated ones: choose between a lognormal and exponential distribution to model the vehicle flow, and between a Gaussian and Gumbel distribution for the average velocity. Justify your choice based on the previous description and visualization.

You should describe your data with only a few sentences, and be sure to use quantitative information! Refer to that description to choose the parametric distribution function. You can also include some plots that may support your reasoning.

Distribution chosen for <VARIABLE_1>: your distribution here.

Your justification here.

Distribution chosen for <VARIABLE_2>: your distribution here.

Your justification here.

Question 2

Fit and assess the goodness of fit of the selected parametric distribution functions to the dataset. Use (at least) Kolmogorov-Smirnov test and one graphical technique. For the graphical technique, choose between the QQplot and logscale plot. Describe in a few sentences how the chosen parametric distribution functions performs.

Remember to use quantitative information based on the goodness of fit metrics that you have used. Also, you can include some examples of the differences in the computed and observed non-exceedance probabilities.

Question 3

Propagate the uncertainty through the equation to estimate the traffic density, \(D\), using both observations and simulated samples. Provide a bulleted list that summarizes differences between the two obtained distributions.

  • your

  • bulleted

  • list

  • here

Compare the simulated samples and observations \((F,v)\) in a scatter plot, then prepare a bulleted list that describes the differences. Is there anything you could improve in the analysis? Provide with recommendations to improve the performed analysis. They can be both about the univariate distributions and about the propagation of uncertainty method you have used.

Include your figure here. Be sure to use high contrast data symbols/colors and a legend to differentiate the two data sets clearly.

Last Question: How did things go? (Optional)

Use this space to let us know if you ran into any challenges while working on this GA, and if you have any feedback to report.

By Max Ramgraber, Patricia Mares Nasarre and Robert Lanzafame, Delft University of Technology. CC BY 4.0, more info on the Credits page of Workbook.