Using Continuous-Time Bayesian Networks

In [1]:

import pyagrum as gum
import pyagrum.lib.notebook as gnb
import pyagrum.ctbn as ct
import pyagrum.ctbn.notebook as ctbnnb

A CTBN(Continuous-Time Bayesian Networks) is a graphical model that allows a Bayesian Network to evolve over continuous time.

(For more details see https://pyagrum.readthedocs.io/en/latest/ctbn.html)

Creating a CTBN

In [2]:

# 1. Creating the ctbn
ctbn = ct.CTBN()

# 2. Adding the random variables
ctbn.add(gum.LabelizedVariable("cloudy?", "cloudy?"))
ctbn.add(gum.LabelizedVariable("rain?", "rain?"))

# 3. Adding the arcs
ctbn.addArc("cloudy?", "rain?")
ctbn.addArc("rain?", "cloudy?")

# 4. Filling the CIMS (Conditional Intensity Matrixes)
ctbn.CIM("cloudy?")[{"rain?": "0"}] = [[-0.1, 0.1], [1, -1]]
ctbn.CIM("cloudy?")[{"rain?": "1"}] = [[-100, 100], [0.5, -0.5]]

ctbn.CIM("rain?")[{"cloudy?": "0"}] = [[-0.5, 0.5], [1000, -1000]]
ctbn.CIM("rain?")[{"cloudy?": "1"}] = [[-2, 2], [2, -2]]

# 5. Plotting the CTBN (whith/whithout the CIMs)
ctbnnb.showCtbnGraph(ctbn)
ctbnnb.showCIMs(ctbn)

../_images/notebooks_71-PyModels_CTBN_5_0.svg

		cloudy?#j
rain?	cloudy?#i	0	1
0	0	-0.1000	0.1000
0	1	1.0000	-1.0000
1	0	-100.0000	100.0000
1	1	0.5000	-0.5000

CIM(cloudy?)

		rain?#j
cloudy?	rain?#i	0	1
0	0	-0.5000	0.5000
0	1	1000.0000	-1000.0000
1	0	-2.0000	2.0000
1	1	2.0000	-2.0000

CIM(rain?)

cloudy?#i gives the state cloudy? is currently in and cloudy?#j gives its next state.

The CIM object, built around a pyagrum.Tensor is used to store the parameters of the variables, describing the waiting time before switching to another state.

A useful tool for CIMs, mainly used for simple inference, is amalgamation. It allows to merge many CIMs into one.

In [3]:

cimA = ctbn.CIM("cloudy?")
cimB = ctbn.CIM("rain?")
amal = cimA.amalgamate(cimB)  # or cimA * cimB

amal.getTensor()  # as a Tensor

Out[3]:

			cloudy?#i
cloudy?#j	rain?#i	rain?#j	0	1
0	0	0	-0.6000	1.0000
	0	1	0.5000	0.0000
	1	0	1000.0000	0.0000
	1	1	-1100.0000	0.5000
1	0	0	0.1000	-3.0000
	0	1	0.0000	2.0000
	1	0	0.0000	2.0000
	1	1	100.0000	-2.5000

In [4]:

print("\nJoint intensity matrix as numpy array\n")
print(amal.toMatrix())  # as a numpy array


Joint intensity matrix as numpy array

[[-6.0e-01  1.0e-01  5.0e-01  0.0e+00]
 [ 1.0e+00 -3.0e+00  0.0e+00  2.0e+00]
 [ 1.0e+03  0.0e+00 -1.1e+03  1.0e+02]
 [ 0.0e+00  2.0e+00  5.0e-01 -2.5e+00]]

A random variable of a ctbn can be identified by either its name or id.

In any case, most class functions parameters allow for a variable to be identified by either its name or id (type NameOrId).

In [5]:

labels_with_id = ctbn.labels(0)
labels_with_name = ctbn.labels("cloudy?")
print(f"labels using id : {labels_with_id}, using name : {labels_with_name}")

variable_with_id = ctbn.variable(0)
variable_with_name = ctbn.variable("cloudy?")
print(f"The variables 0 and 'cloudy?' are the same ? {variable_with_id == variable_with_name}")

labels using id : ('0', '1'), using name : ('0', '1')
The variables 0 and 'cloudy?' are the same ? True

All functions with NameOrId parameters work the same way

Random CTBN generator

The CtbnGenerator module can be used to quickly generate CTBNs. The parameters value are drawn at random from a value range. The diagonal coefficients are just the negative sum of the ones on the same line (in a CIM). It is also possible to choose the number of variables, the maximum number of parents a variable can have, and the domain size of the variables.

In [6]:

randomCtbn = ct.randomCTBN(valueRange=(1, 10), n=8, parMax=2, modal=2)

ctbnnb.showCtbnGraph(randomCtbn)
ctbnnb.showCIMs(randomCtbn)

../_images/notebooks_71-PyModels_CTBN_15_0.svg

			V1#j
V2	V7	V1#i	v1_1	v1_2
v2_1	v7_1	v1_1	-4.5238	4.5238
	v7_1	v1_2	6.7567	-6.7567
	v7_2	v1_1	-9.3730	9.3730
	v7_2	v1_2	6.1091	-6.1091
v2_2	v7_1	v1_1	-2.2376	2.2376
	v7_1	v1_2	9.3658	-9.3658
	v7_2	v1_1	-5.4311	5.4311
	v7_2	v1_2	3.3397	-3.3397

CIM(V1)

			V2#j
V4	V8	V2#i	v2_1	v2_2
v4_1	v8_1	v2_1	-3.8094	3.8094
	v8_1	v2_2	3.5527	-3.5527
	v8_2	v2_1	-7.4880	7.4880
	v8_2	v2_2	1.8395	-1.8395
v4_2	v8_1	v2_1	-4.0664	4.0664
	v8_1	v2_2	9.6202	-9.6202
	v8_2	v2_1	-2.9086	2.9086
	v8_2	v2_2	3.1460	-3.1460

CIM(V2)

			V3#j
V4	V8	V3#i	v3_1	v3_2
v4_1	v8_1	v3_1	-7.2309	7.2309
	v8_1	v3_2	8.7389	-8.7389
	v8_2	v3_1	-2.5405	2.5405
	v8_2	v3_2	6.2676	-6.2676
v4_2	v8_1	v3_1	-5.0501	5.0501
	v8_1	v3_2	7.5082	-7.5082
	v8_2	v3_1	-3.1189	3.1189
	v8_2	v3_2	3.6901	-3.6901

CIM(V3)

			V4#j
V8	V1	V4#i	v4_1	v4_2
v8_1	v1_1	v4_1	-5.6298	5.6298
	v1_1	v4_2	6.4722	-6.4722
	v1_2	v4_1	-2.3080	2.3080
	v1_2	v4_2	2.5318	-2.5318
v8_2	v1_1	v4_1	-1.1925	1.1925
	v1_1	v4_2	6.0062	-6.0062
	v1_2	v4_1	-6.4354	6.4354
	v1_2	v4_2	2.2978	-2.2978

CIM(V4)

			V5#j
V1	V3	V5#i	v5_1	v5_2
v1_1	v3_1	v5_1	-5.6367	5.6367
	v3_1	v5_2	2.4886	-2.4886
	v3_2	v5_1	-5.1983	5.1983
	v3_2	v5_2	9.8270	-9.8270
v1_2	v3_1	v5_1	-8.0440	8.0440
	v3_1	v5_2	9.3535	-9.3535
	v3_2	v5_1	-6.1621	6.1621
	v3_2	v5_2	1.3689	-1.3689

CIM(V5)

		V6#j
V5	V6#i	v6_1	v6_2
v5_1	v6_1	-3.2660	3.2660
v5_1	v6_2	9.3583	-9.3583
v5_2	v6_1	-9.3525	9.3525
v5_2	v6_2	7.3381	-7.3381

CIM(V6)

		V7#j
V2	V7#i	v7_1	v7_2
v2_1	v7_1	-2.9789	2.9789
v2_1	v7_2	6.7120	-6.7120
v2_2	v7_1	-6.0397	6.0397
v2_2	v7_2	3.0019	-3.0019

CIM(V7)

			V8#j
V2	V4	V8#i	v8_1	v8_2
v2_1	v4_1	v8_1	-2.9119	2.9119
	v4_1	v8_2	9.0987	-9.0987
	v4_2	v8_1	-4.6708	4.6708
	v4_2	v8_2	8.2560	-8.2560
v2_2	v4_1	v8_1	-9.5460	9.5460
	v4_1	v8_2	3.0559	-3.0559
	v4_2	v8_1	-1.8994	1.8994
	v4_2	v8_2	1.4912	-1.4912

CIM(V8)

Inference with CTBNs

Exact inference & Forward Sampling inference

Amalgamation is used to compute exact inference. It consists in merging all the CIMs of a CTBN into one CIM. Then we use the properties of markov process : \(P(X_t) = P(X_0)*exp(Q_Xt)\) for enough time to notice convergence.
Forward sampling works this way:
- At time \(t\) :
- Draw a “time until next change” \(dt\) value for each variable’s exponential distribution of their waiting time.
- Select the variable with smallest transition time : \(next time = t + dt\).
- Draw the variable next state uniformly.
- loop until \(t = timeHorizon\).

When doing forward sampling, we first iterate this process over a given amount of iterations (called burnIn). We consider the last values as the initial configuration for sampling. The reason is that the initial state of a CTBN is unknown without real data.

In [7]:

# 1. Initialiaze Inference object
ie = ct.SimpleInference(ctbn)
ie_forward = ct.ForwardSamplingInference(ctbn)
gnb.sideBySide(ctbnnb.getCtbnGraph(ctbn), ctbnnb.getCtbnInference(ctbn, ie), ctbnnb.getCtbnInference(ctbn, ie_forward))

In any case, inference can be made using makeInference() function from every Inference object.

In [8]:

import time

# Simple inference
time1 = time.time()
ie.makeInference(t=5000)
pot1 = ie.posterior("cloudy?")
elapsed1 = time.time() - time1

# Sampling inference
time2 = time.time()
ie_forward.makeInference(timeHorizon=5000, burnIn=100)
pot2 = ie_forward.posterior("cloudy?")
elapsed2 = time.time() - time2

gnb.sideBySide(
  gnb.getTensor(pot1),
  gnb.getTensor(pot2),
  captions=[f"Simple inference {round(elapsed1 * 1000, 2)} ms", f"Sampling inference {round(elapsed2 * 1000, 2)} ms"],
)

cloudy?
0	1
0.8334	0.1666

Simple inference 0.71 ms

cloudy?
0	1
0.8379	0.1621

Sampling inference 228.64 ms

Plotting a sample

Here is an example of a trajectory plot

In [9]:

# 1_1. Find trajectory stored in the inference class
ie_forward.makeInference()
traj = ie_forward.trajectory  # a single trajectory List[Tuple[float, str, str]]

# 1_2. Or find it in a csv file
ie_forward.writeTrajectoryCSV("out/trajectory.csv", n=1, timeHorizon=1000, burnIn=100)  # to save a trajectory
traj = ct.readTrajectoryCSV("out/trajectory.csv")  # a dict of trajectories

# 2. Plot the trajectory
ct.plotTrajectory(ctbn.variable("cloudy?"), traj[0], timeHorizon=40, plotname="plot of cloudy?")

../_images/notebooks_71-PyModels_CTBN_22_0.svg

It is also possible to plot the proportions of the states a variable is in over time.

Let’s take the exemple of a variable with domain size=5.

In [10]:

# 1. Create CTBN
new_ctbn = ct.CTBN()
X = gum.LabelizedVariable("X", "X", ["a1", "a2", "a3", "a4", "a5"])
new_ctbn.add(X)
new_ctbn.CIM("X")[:] = [
  [-10, 2, 4, 0, 4],
  [0.5, -1, 0.5, 0, 0],
  [5, 0.5, -10, 3.5, 1],
  [50, 0.5, 50, -200.5, 100],
  [1, 0, 2, 7, -10],
]
# 2. Sampling
ieX = ct.ForwardSamplingInference(new_ctbn)
ieX.writeTrajectoryCSV("out/traj_quinary.csv", n=3, timeHorizon=100)
trajX = ct.readTrajectoryCSV("out/traj_quinary.csv")

# 3. Plotting
ct.plotFollowVar(X, trajX)

../_images/notebooks_71-PyModels_CTBN_24_0.svg

CTBN Learning with pyAgrum

The ctbn library has tools to learn the graph of a CTBN as well as the CIMs of its variables

Learning the variables

From a sample, we can find the variables name and their labels. However, if the sample is too short, it is possible that some information may be missed.

In [11]:

newCtbn = ct.CTBNFromData(traj)

print("variables found and their labels")
for name in newCtbn.names():
  print(f"variable {name}, labels : {newCtbn.labels(name)}")

variables found and their labels
variable cloudy?, labels : ('0', '1')
variable rain?, labels : ('0', '1')

Learning the structure of the CTBN

In [12]:

# 1. Create a Learner.
learner = ct.Learner("out/trajectory.csv")

# 2. Run learning algorithm
learned_ctbn = learner.learnCTBN()

# 3. Plot the learned CTBN
gnb.show(learned_ctbn)

../_images/notebooks_71-PyModels_CTBN_29_0.svg

Let’s note that having a test object is necessary since we want to be able to test independency between variables. For now only 2 tests are available : (Fisher and Chi2) test and Oracle test. Oracle test is only used for benchmarking since it uses the initial ctbn as a reference.

Fitting parameters

It is also possible to approximate the transition time distributions of each variable, i.e the CIMs.

Note that conditioning variables are taken from the ctbn.

In [13]:

cims_before = ctbnnb.getCIMs(ctbn)
learner.fitParameters(ctbn)
cims_after = ctbnnb.getCIMs(ctbn)

gnb.sideBySide(cims_before, cims_after, captions=["original CIMs", "learned CIMs"])

		cloudy?#j
rain?	cloudy?#i	0	1
0	0	-0.1000	0.1000
0	1	1.0000	-1.0000
1	0	-100.0000	100.0000
1	1	0.5000	-0.5000

CIM(cloudy?)

		rain?#j
cloudy?	rain?#i	0	1
0	0	-0.5000	0.5000
0	1	1000.0000	-1000.0000
1	0	-2.0000	2.0000
1	1	2.0000	-2.0000

CIM(rain?)

original CIMs

		cloudy?#j
rain?	cloudy?#i	0	1
0	0	-0.0900	0.0900
0	1	1.2300	-1.2300
1	0	-137.3930	137.3930
1	1	0.5610	-0.5610

CIM(cloudy?)

		rain?#j
cloudy?	rain?#i	0	1
0	0	-0.5140	0.5140
0	1	907.2300	-907.2300
1	0	-1.9110	1.9110
1	1	2.0430	-2.0430

CIM(rain?)

learned CIMs

Example from [Nodelman et al, 2003]

This example comes from :

U. Nodelman, C.R. Shelton, and D. Koller (2003). "Learning Continuous Time Bayesian Networks." Proc. Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI) (pp. 451-458).

In [14]:

# 1. Creating the ctbn
drugEffect = ct.CTBN()

drugEffect.add(gum.LabelizedVariable("Eating", ""))
drugEffect.add(gum.LabelizedVariable("Hungry", ""))
drugEffect.add(gum.LabelizedVariable("Full stomach", ""))
drugEffect.add(gum.LabelizedVariable("Uptake", ""))
drugEffect.add(gum.LabelizedVariable("Concentration", "?"))
drugEffect.add(gum.LabelizedVariable("Barometer", ""))
drugEffect.add(gum.LabelizedVariable("Joint pain", ""))
drugEffect.add(gum.LabelizedVariable("Drowsy", ""))

drugEffect.addArc("Hungry", "Eating")
drugEffect.addArc("Eating", "Full stomach")
drugEffect.addArc("Full stomach", "Hungry")
drugEffect.addArc("Full stomach", "Concentration")
drugEffect.addArc("Uptake", "Concentration")
drugEffect.addArc("Concentration", "Drowsy")
drugEffect.addArc("Barometer", "Joint pain")
drugEffect.addArc("Concentration", "Joint pain")

gnb.show(drugEffect)

../_images/notebooks_71-PyModels_CTBN_34_0.svg

In [15]:

from pyagrum.ctbn.CTBNGenerator import randomCIMs

randomCIMs(drugEffect, (0.1, 10))
drugEffect.CIM("Eating")[{"Hungry": 0}] = [[-0.1, 0.1], [10, -10]]
drugEffect.CIM("Eating")[{"Hungry": 1}] = [[-2, 2], [0.1, -0.1]]
drugEffect.CIM("Full stomach")[{"Eating": 0}] = [[-0.1, 0.1], [10, -10]]
drugEffect.CIM("Full stomach")[{"Eating": 1}] = [[-2, 2], [0.1, -0.1]]
drugEffect.CIM("Hungry")[{"Full stomach": 0}] = [[-0.1, 0.1], [10, -10]]
drugEffect.CIM("Hungry")[{"Full stomach": 1}] = [[-2, 2], [0.1, -0.1]]

ieX = ct.ForwardSamplingInference(drugEffect)
ieX.writeTrajectoryCSV("out/drugEffect.csv", n=3, timeHorizon=100)
trajX = ct.readTrajectoryCSV("out/drugEffect.csv")

ct.plotFollowVar(drugEffect.variable("Hungry"), trajX)
ct.plotFollowVar(drugEffect.variable("Full stomach"), trajX)
ct.plotFollowVar(drugEffect.variable("Eating"), trajX)
ct.plotFollowVar(drugEffect.variable("Concentration"), trajX)

../_images/notebooks_71-PyModels_CTBN_35_0.svg

../_images/notebooks_71-PyModels_CTBN_35_1.svg

../_images/notebooks_71-PyModels_CTBN_35_2.svg

../_images/notebooks_71-PyModels_CTBN_35_3.svg

Input/output with pickle for CTBN

In [16]:

import pickle

with open("test.pkl", "bw") as f:
  pickle.dump(drugEffect, f)
drugEffect

Out[16]:

In [17]:

with open("test.pkl", "br") as f:
  copyDrug = pickle.load(f)
copyDrug

Out[17]: