Interpretable Layers and Interventions

The Low-Level API provides building blocks to create concept-based models using interpretable layers and perform interventions using a PyTorch-like interface.

Design Principles

Overview of Data Representations

In PyC, we distinguish between three types of data representations:

- Input: High-dimensional representations where exogenous and endogenous information is entangled
- Exogenous: Representations that are direct causes of endogenous variables
- Endogenous: Representations of observable quantities of interest

Layer Types

In PyC you will find three types of layers whose interfaces reflect the distinction between data representations:

- Encoder layers: Never take endogenous variables as input
- Predictor layers: Must take a set of endogenous variables as input
- Special layers: Perform operations such as memory selection or graph learning
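The distinction between encoders and predictors can be sketched in plain Python. This is only an illustration of the rule stated above, not part of PyC: the `Representation` enum and the `layer_kind` helper are hypothetical names introduced here, and special layers are deliberately left out because they are not defined by their input types.

```python
from enum import Enum


class Representation(Enum):
    """The three data representations (hypothetical helper, not a PyC class)."""
    INPUT = "input"
    EXOGENOUS = "exogenous"
    ENDOGENOUS = "endogenous"


def layer_kind(input_types):
    """Classify a layer by the representations it takes as input.

    Special layers (e.g. graph learners) fall outside this rule and are
    not handled here.
    """
    input_types = set(input_types)
    if not input_types:
        raise ValueError("a layer needs at least one input representation")
    # A predictor must take endogenous variables; an encoder never does.
    if Representation.ENDOGENOUS in input_types:
        return "predictor"
    return "encoder"


# A layer that reads only the raw input is an encoder.
print(layer_kind([Representation.INPUT]))
# A layer that reads endogenous (and possibly exogenous) variables is a predictor.
print(layer_kind([Representation.ENDOGENOUS, Representation.EXOGENOUS]))
```

The only thing the check looks at is whether an endogenous representation appears among the inputs, which mirrors how the layer interfaces are described above.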

Layer Naming Standard

To make the type of a layer easy to identify, PyC names layers according to a consistent standard. Each layer name follows the format:

<LayerType><InputType><OutputType>

where:

LayerType: describes the type of layer (e.g., Linear, HyperLinear, Selector, Transformer)
InputType and OutputType: describe the type of data representations the layer takes as input and produces as output. PyC uses the following abbreviations:
- Z: Input
- U: Exogenous
- C: Endogenous
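As a rough illustration of the naming format, the sketch below splits a layer name into its parts. It is not a PyC utility: `parse_layer_name` and `ParsedName` are hypothetical names, and the parser assumes that the LayerType ends in a lowercase letter or digit, that the trailing run of `Z`/`U`/`C` letters lists the input types first, and that the last letter is the single output type.

```python
import re
from typing import NamedTuple, Optional

# Abbreviations taken from the naming standard above.
ABBREVIATIONS = {"Z": "Input", "U": "Exogenous", "C": "Endogenous"}

# Assumption: <LayerType> ends in a lowercase letter or digit, followed by
# one or more input-type letters and exactly one output-type letter.
_NAME_PATTERN = re.compile(
    r"^(?P<layer>.*?[a-z0-9])(?P<inputs>[ZUC]+)(?P<output>[ZUC])$"
)


class ParsedName(NamedTuple):
    layer_type: str
    input_types: tuple
    output_type: str


def parse_layer_name(name: str) -> Optional[ParsedName]:
    """Split a layer name into layer type, input types, and output type.

    Returns None for names that do not follow the format, such as the
    special layers.
    """
    match = _NAME_PATTERN.match(name)
    if match is None:
        return None
    inputs = tuple(ABBREVIATIONS[letter] for letter in match["inputs"])
    return ParsedName(match["layer"], inputs, ABBREVIATIONS[match["output"]])


print(parse_layer_name("LinearZC"))
print(parse_layer_name("HyperLinearCUC"))
print(parse_layer_name("WANDAGraphLearner"))
```

Under these assumptions, a name whose input types include `Endogenous` denotes a predictor layer, and a name that does not match the pattern at all is treated as a special layer.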

For instance, a layer named LinearZC is a linear layer that takes an Input representation as input and produces an Endogenous representation. Since it does not take any endogenous variables as input, it is an encoder layer.

pyc.nn.LinearZC(in_features=10, out_features=3)

As another example, a layer named HyperLinearCUC is a hyper-network layer that takes both Endogenous and Exogenous representations as input and produces an Endogenous representation. Since it takes endogenous variables as input, it is a predictor layer.

pyc.nn.HyperLinearCUC(
    in_features_endogenous=10,
    in_features_exogenous=7,
    embedding_size=24,
    out_features=3,
)

As a final example, graph learners are special layers that learn relationships between concepts. They do not follow the naming standard used for encoders and predictors, but their purpose should be clear from their names.

wanda = pyc.nn.WANDAGraphLearner(
    ['c1', 'c2', 'c3'],
    ['task A', 'task B', 'task C'],
)

Detailed Guides

Next Steps

- Explore the full Low-Level API documentation
- Try the Mid-Level API for probabilistic modeling
- Try the Mid-Level API for causal modeling
- Check out the example notebooks