ananke.datasets

Subpackages

ananke.datasets.helpers

Helper functions that help load datasets in Ananke.

ananke.datasets.helpers.load_afixable_data()[source]

Load toy data for an adjustment fixable setting where T is the treatment and the outcome is CD4 counts.

Returns

pandas dataframe.

ananke.datasets.helpers.load_conditionally_ignorable_data()[source]

Load toy data for the conditionally ignorable model where the confounder is Viral Load, T is the treatment and the outcome is CD4 counts.

Returns

pandas dataframe.

ananke.datasets.helpers.load_frontdoor_data()[source]

Load toy data for frontdoor setting where T is the treatment and the outcome is CD4 counts.

Returns

pandas dataframe.

ananke.datasets.helpers.load_wisconsin_health_study()[source]

Load the dataset extract from the Wisconsin Health Study presented in [1]. Columns are defined as follows: X: an indicator of whether family income in 1957 was above $5k; Y: an indicator of whether the respondents income in 1992 was above $37k; M: an indicator of whether the respondent was drafted into the military; E: an indicator of whether the respondent had education beyond high school. count: the count of each event in (X, Y, M, E)

[1] R. J. Evans and T. S. Richardson, “Smooth, identifiable supermodels of discrete DAG models with latent variables,” Bernoulli, vol. 25, no. 2, pp. 848–876, May 2019, doi: 10.3150/17-BEJ1005.