Numerical integration and differentiation

Numerical integration and differentiation#

Part 1 Numerical Integration of Gumbel distribution#

Variations in stochastic parameter values can be defined by means of a distribution function, or Probability Density Function (PDF). A PDF describes the probability that a parameter has a certain value x. The most well-known PDFs are the Uniform distribution and the Normal (or Gaussian) distribution, which are both symmetric around the mean. More information about distribution functions and their use will be treated later in MUDE. Here, we will just use it as a mathematical function demonstrating numerical integration.

Besides the PDF, there is the Cumulative Distribution Function (CDF), which describes the probability that a parameter value is lower than a given value of x; it is the integral of the PDF up to x, for all possible x-values.

Gumbel is another distribution, which is non-symmetric. The Gumbel PDF is given by the following function:

\[ f(x) = e^{-(x+e^{-x})} \]

The Gumbel CDF can be obtained by analytical integration, and is defined as:

\[ F\left( x \right) = \int\limits_{ - \infty }^x {f\left( x \right)dx = {e^{ - {e^{ - x}}}}} \]

\(\text{Task 1.1:}\)

Run the cells below to define and visualise the two functions:

import matplotlib.pyplot as plt
%config InlineBackend.figure_formats = ['svg']
import numpy as np

def f(x):
    return np.exp(-(x + np.exp(-x)))


def F(x):
    return np.exp(-np.exp(-x))

x = np.linspace(-4, 6, num=100)
y = f(x)
Y = F(x)

x_target = np.array([0, 2, 4])
Y_target = F(x_target)
print(Y_target)

fig, ax = plt.subplots(1, 2, figsize=(15, 5))
ax[0].plot(x, y)
ax[1].plot(x, Y)
ax[1].scatter(x_target, Y_target, color="red")
ax[0].grid()
ax[1].grid()
ax[0].set_xlabel("x")
ax[1].set_xlabel("x")
ax[0].set_ylabel("f(x)")
ax[1].set_ylabel("F(x)")
ax[0].set_title("PDF")
ax[1].set_title("CDF")
plt.show()

The red dots in the CDF represent concrete points (x_target = [0, 2, 4]) where we will evaluate the CDF function value by numerical integration of the PDF.

Left Riemann integral:#

\(\text{Task 1.2:}\)

Formulate the Left Riemann integral solution and complete the code below:

\(\text{Tip:}\)

Practically, we can start the numerical integration from x=-4 rather than -inf.

Note

We slightly updated the code to make it clearer where the 10. in the definition of dx comes from. Because we integrate over an interval from -4 to 6, the total length of the interval is ten. In the previous version, we defined: dx = 10. / nsteps

nsteps = 10
Y1 = np.zeros(nsteps+1)

x1 = np.linspace(-4, 6, num=nsteps+1)
interval_length = x1[-1] - x1[0]
dx = interval_length / nsteps

for i in range(nsteps):
    Y1[i+1] = ### YOUR CODE HERE ###
    
def integral_at_targetvalues(x1, Y1, x_target):           # Function to extract target solutions from targeted x-values
    index = [np.where(x1 == val)[0][0] for val in x_target]
    return Y1[index]
    
print('Left Riemann rule:   ', integral_at_targetvalues(x1, Y1, x_target))
print('Analytical solution: ', integral_at_targetvalues(x1, F(x1), x_target))

Right Riemann integral:#

\(\text{Task 1.3:}\)

Formulate the Right Riemann integral solution and complete the code below:

Y1 = np.zeros(nsteps+1)

for i in range(nsteps):
    Y1[i+1] = ### YOUR CODE HERE ###

print('Right Riemann rule:  ', integral_at_targetvalues(x1, Y1, x_target))
print('Analytical solution: ', integral_at_targetvalues(x1, F(x1), x_target))

Midpoint rule:#

\(\text{Task 1.4:}\)

Formulate the Midpoint rule solution and complete the code below:

Y1 = np.zeros(nsteps+1)

for i in range(nsteps):
    Y1[i+1] = ### YOUR CODE HERE ###

print('Midpoint rule:       ', integral_at_targetvalues(x1, Y1, x_target))
print('Analytical solution: ', integral_at_targetvalues(x1, F(x1), x_target))

Trapezoidal rule:#

\(\text{Task 1.5:}\)

Formulate the Trapezoidal rule solution and complete the code below:

Y1 = np.zeros(nsteps+1)

for i in range(nsteps):
    Y1[i+1] = ### YOUR CODE HERE ###

print('Trapezoidal rule:    ', integral_at_targetvalues(x1, Y1, x_target))
print('Analytical solution: ', integral_at_targetvalues(x1, F(x1), x_target))

Now, run the code below to plot the entire CDF as obtained from numerical integration of the PDF (Trapezoidal rule), together with the analytical CDF

plt.plot(x, Y, linewidth=1)
plt.plot(x1, Y1, linewidth=1)
plt.xlabel("x")
plt.ylabel("F(x)")
plt.title("CDF")
plt.show()

\(\text{Task 1.6:}\)

Play with the number of steps (nsteps) to see how this number changes the accuracy of the numerical solution

Part 2 Numerical Integration and differentiation of earthquake record#

In this exercise we will apply numerical integration and numerical differentiation on given measurement data. The data involves a velocity time series from earthquake recordings. In fact, the original earthquake records also include acceleration and displacement time series, but the purpose of this exercise is to calculate the displacement and acceleration from the given velocity by numerical integration and differentiation, respectively. Afterwards, the results are compared with the original data.

Source: Wikimedia, Yamaguchi先生 (https://commons.wikimedia.org/wiki/File:Kinemetrics_seismograph.jpg), licensed under CC BY SA

Earthquake recordings are relevant as input signals to geotechnical and structural analysis to back-analyse or design structures for the (additional) forces and deformations that may occur during earthquakes. In a so-called one-dimensional site response analysis, geotechnical engineers can also predict whether soil liquefaction might happen at a project location if an earthquake would occur. Based on this they determine if or what measures need to be taken to improve the ground conditions.

\(\text{Task 2.1:}\)

Open the file Earthquake_velocity.txt in a text editor (like VS code) to see how the content looks like.

The file contains velocities that were recorded at a frequency of 200 per second during the event of an earthquake. Hence, the total duration of the earthquake measurements (in seconds) is the total number of readings divided by 200.

\(\text{Task 2.2:}\)

Run the code below to download the datafiles

import os
from urllib.request import urlretrieve

data_files = [
    "Earthquake_velocity.txt",
    "Earthquake_acceleration.txt",
    "Earthquake_displacement.txt",
]

for filename in data_files:
    if not os.path.isfile(filename):
        print(f"Downloading {filename}...")
        urlretrieve("http://files.mude.citg.tudelft.nl/" + filename, filename)

f = open("Earthquake_velocity.txt", "r")
v = np.array([])

while True:
    line = f.readline()
    if not line:  # If line is empty, we've reached the end of file
        break

    try:
        # Use split() to split on whitespace and filter out empty strings
        string_values = line.split()
        if len(string_values) > 1:  # to skip the first couple of lines with text
            # Convert to float array
            values = np.array([float(x) for x in string_values])
            v = np.append(v, values)
    except ValueError:
        # Skip lines that can't be parsed as numbers
        continue

f.close()  # Close the file
num = len(v)
t_end = (num - 1) / 200
t = np.linspace(
    start=0,
    stop=t_end,
    num=num,
)

Note

We slightly changed the definition of t compared to the assignment version. The last time point is now at (num - 1) / 200 instead of num / 200. This was necessary to ensure that the frequency of measurements is indeed exactly 200 per second, as explained in the text above.

\(\text{Task 2.3:}\)

Run the code below to plot the velocity time series

plt.figure(figsize=(12, 4))
plt.plot(t, v, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Velocity (cm/s)")
plt.title("Velocity vs Time")
plt.grid(True)
plt.show()

Displacement#

First, we will integrate the velocity to displacement, to see what the maximum displacement is at the measurement point and how much permanent deformation it has undergone. Thereby we will use three numerical integration methods: Left Riemann, Right Riemann and Trapezoidal rule. At the end of this block, we will compare the results from the various methods with each other and with the displacement from the earthquake record.

\(\text{Task 2.4:}\)

Formulate the Left Riemann integral solution and complete the code below, using the velocity data (v) at corresponding times (t):

d_LR = np.zeros([num])

for i in range(0,num-1):
    d_LR[i+1] = ### YOUR CODE HERE ###

Plot the displacement

plt.figure(figsize=(12, 4))
plt.plot(t, d_LR, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Displacement (cm)")
plt.title("Displacement vs Time")
plt.grid(True)
plt.show()

\(\text{Task 2.5:}\)

Formulate the Right Riemann integral solution and complete the code below:

d_RR = np.zeros([num])

for i in range(1,num):
    d_RR[i] = ### YOUR CODE HERE ###

Plot the displacement

plt.figure(figsize=(12, 4))
plt.plot(t, d_RR, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Displacement (cm)")
plt.title("Displacement vs Time")
plt.grid(True)
plt.show()

\(\text{Task 2.6:}\)

Formulate the Trapezoidal rule solution and complete the code below:

d_TR = np.zeros([num])

for i in range(0, num-1):
    d_TR[i+1] = ### YOUR CODE HERE ###

Note that in this case, where data is given only at specific time intervals, it is not possible to apply the Midpoint rule, since we only have data at the given points in time, and not in the middle between two successive points.

Plotting all together

plt.figure(figsize=(12, 4))
plt.plot(t, d_LR, linewidth=0.5)
plt.plot(t, d_RR, linewidth=0.5)
plt.plot(t, d_TR, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Displacement (cm)")
plt.title("Displacement vs Time")
plt.grid(True)
plt.show()

Since the measurements have a rather high sampling frequency, the results seem quite accurate and the differences between the integration methods are minimal in this case.

\(\text{Task 2.7:}\)

Open the ‘Earthquake_displacement.txt’ file and store the displacement data in a one-dimensional array (d_orig). Compare the calculated displacements with original displacement data.

Conclude on the accuracy of your solution.

f = open("Earthquake_displacement.txt", "r")
d_orig = np.array([])

while True:
    line = f.readline()
    if not line:  # If line is empty, we've reached the end of file
        break

    try:
        # Use split() to split on whitespace and filter out empty strings
        string_values = line.split()
        if len(string_values) > 1:  # to skip the first couple of lines with text
            # Convert to float array
            values = np.array([float(x) for x in string_values])
            d_orig = np.append(d_orig, values)
    except ValueError:
        # Skip lines that can't be parsed as numbers
        continue

f.close()  # Close the file

print(d_LR[-1], d_RR[-1], d_TR[-1], d_orig[-1])
plt.figure(figsize=(12, 4))
plt.plot(t, d_TR, linewidth=0.5)
plt.plot(t, d_orig, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Displacement (cm)")
plt.title("Displacement vs Time")
plt.grid(True)
plt.show()

\(\text{Task 2.8:}\)

Conclude on the maximum displacement and the permanent displacement (although the measurement point has not yet come to a complete rest). You may answer in [cm] and round off to integer values (zero decimals).

Acceleration#

Next, we will differentiate the velocity to acceleration, to see what the maximum acceleration is at the measurement point. Thereby we will use three numerical differentiation methods: Forward difference, Backward difference and Central difference. At the end of this block, we will compare our results with each other and with the acceleration from the earthquake record.

\(\text{Task 2.9:}\)

Formulate the solution according to the Forward Difference method and complete the code below

df_FD = np.zeros([num])

for i in range(0,num-1):
    df_FD[i] = ### YOUR CODE HERE ###

Plot the acceleration

plt.figure(figsize=(12, 4))
plt.plot(t, df_FD, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Acceleration (cm/s2)")
plt.title("Acceleration vs Time")
plt.grid(True)
plt.show()

\(\text{Task 2.10:}\)

Formulate the solution according to the Backward Difference method and complete the code below

df_BD = np.zeros([num])

for i in range(1,num):
    df_BD[i] = ### YOUR CODE HERE ###

Plot the acceleration

plt.figure(figsize=(12, 4))
plt.plot(t, df_BD, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Acceleration (cm/s2)")
plt.title("Acceleration vs Time")
plt.grid(True)
plt.show()

\(\text{Task 2.11:}\)

Formulate the solution according to the Central Difference method and complete the code below

df_CD = np.zeros([num])

for i in range(1,num-1):
    df_CD[i] = ### YOUR CODE HERE ###

Plotting all together:

plt.figure(figsize=(12, 4))
plt.plot(t, df_FD, linewidth=0.5)
plt.plot(t, df_BD, linewidth=0.5)
plt.plot(t, df_CD, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Acceleration (cm/s2)")
plt.title("Acceleration vs Time")
plt.grid(True)
plt.show()

\(\text{Task 2.12:}\)

Run the code below to open the ‘Earthquake_acceleration.txt’ file and store the acceleration data in a one-dimensional array (a_orig). Compare the calculated acceleration with original acceleration data.

Conclude on the accuracy of your solution.

f = open("Earthquake_acceleration.txt", "r")
a_orig = np.array([])

while True:
    line = f.readline()
    if not line:  # If line is empty, we've reached the end of file
        break

    try:
        # Use split() to split on whitespace and filter out empty strings
        string_values = line.split()
        if len(string_values) > 1:  # to skip the first couple of lines with text
            # Convert to float array
            values = np.array([float(x) for x in string_values])
            a_orig = np.append(a_orig, values)
    except ValueError:
        # Skip lines that can't be parsed as numbers
        continue

f.close()  # Close the file

print(df_FD[-1], df_BD[-1], df_CD[-1], a_orig[-1])
print(df_FD[-2], df_BD[-2], df_CD[-2], a_orig[-2])
plt.figure(figsize=(12, 4))
plt.plot(t, df_CD, linewidth=0.5)
plt.plot(t, a_orig, linewidth=0.5)
plt.xlabel("Time (s)")
plt.ylabel("Acceleration (cm/s2)")
plt.title("Acceleration vs Time")
plt.grid(True)
plt.show()

\(\text{Task 2.13:}\)

Conclude on the maximum acceleration that the measurement point has undergone. You may round off to multitudes of 1 m/s² (0.1 G).

What time interval do you consider the ‘heaviest’ part of the earthquake?

By Ronald Brinkgreve and Anna Störiko, Delft University of Technology. CC BY 4.0, more info on the Credits page of Workbook.