BDM

Elixir module that implements the Block Decomposition Method (BDM) developed by Hector Zenil et al to approximate the algorithmic complexity of datasets by decomposing them into smaller blocks and using precomputed CTM (Coding Theorem Method) values.

This implementation is based on PyBDM by Szymon Talaga et al.

Installation

The package can be installed by adding bdm to your list of dependencies in mix.exs:

def deps do
  [
    {:bdm, "~> 0.5.0"}
  ]
end

Key Features

Supports both 1D (binary strings) and 2D (binary matrices) data
Two boundary conditions: :ignore and :correlated
- Ignore: Discards incomplete blocks
- Correlated: Uses sliding window approach with fixed window size of 1
Backend could be selected for BDM using the backend option. The default is CTM (:ctm), Lempel–Ziv Complexity (:lzc) [Kaspar, Schuster, 1987] is a performant, but less accurate alternative. Useful for large data sets
Precomputed CTM values for small binary strings and matrices
- 1D lists, with maximum block size 11
- 2D matrices with maximum block size 4*4
Fallback mechanism for missing values (max CTM + 1 bit)

Main Modules and Functions

BDM.new/6 - Creates a new BDM analysis structure
BDM.compute/2 - Computes the BDM complexity of a dataset. The input argument can either be a 1D list, a 2D matrix (list of lists) or an %Nx.Tensor{}
BDM.PerturbationAnalysis - Performs perturbation analysis, identifies complexity-driving elements
- single_bit_perturbations/1 - Single-bit flip perturbations
- random_perturbations/3- Random noise perturbations with configurable noise levels
- calculate_perturbation_effects/3, perturbation_landscape/3 - More insight into the perturbation behavior
- sensitivity_profile/2, detect_critical_positions/2, stability_coefficient/4 - Comprehensive sensitivity analysis
BDM.MinimalInformationLoss - Minimal Information Loss algorithm for feature selection and dimension reduction
- feature_scores/2, select_features/3 - Feature scoring based on the change in the object’s estimated algorithmic information content
- reduce_dimensions/2 - Dimension reduction based on feature selection

Usage

bdm = BDM.new(2, 2, 2)

large_matrix = [
  [0, 1, 0, 1, 0, 1],
  [1, 0, 1, 0, 1, 0],
  [0, 1, 0, 1, 0, 1],
  [1, 0, 1, 0, 1, 0],
  [0, 1, 0, 1, 0, 1],
  [1, 0, 1, 0, 1, 0]
]

complexity_2x2 = BDM.compute(bdm, large_matrix)

For more details and explanations, check the Livebook.

How BDM Works

The Block Decomposition Method decomposes an object into smaller parts for which there exist, thanks to CTM, good approximations to their algorithmic complexity, and then aggregates these quantities by following the rules of algorithmic information theory.

Foundation: Coding Theorem Method (CTM)

The Coding Theorem Method (CTM) is a numerical approximation to the algorithmic complexity of single objects.

BDM builds upon the Coding Theorem Method (CTM), which approximates algorithmic complexity using this formula:

$$ K(s) ≈ -log_2(P(s)) $$

where P(s) is the algorithmic probability of string s. CTM approximates algorithmic probability by exploring spaces of Turing machines with n symbols and m states, counting how many produce a given output, and dividing by the total number of machines that halt.

The BDM Process

The Block Decomposition Method operates in three main stages: decomposition, lookup, and aggregation.

Step 1: Precomputation First precompute CTM values for all possible small objects of a given type (e.g. all binary strings of up to 12 digits or all possible square binary matrices up to 4x4) and store them in an efficient lookup table.

Step 2: Decomposition Any arbitrarily large object can be decomposed into smaller slices of appropriate sizes for which CTM values can be looked up very fast. The method partitions the input data into blocks of predetermined sizes.

Step 3: Lookup For each unique slice created during decomposition, the method looks up the precomputed CTM value from the lookup table.

Step 4: Aggregation The CTM values for slices can be aggregated back to a global estimate of Kolmogorov complexity for the entire object using the BDM formula:

$$ BDM(X) = \sum_i{CTM(sᵢ) + log_2(nᵢ)} $$

where:

$i$ indexes the set of all unique slices
$CTM(sᵢ)$ is the complexity of slice $i$
$nᵢ$ is the number of occurrences of slice $i$

Boundary Conditions

If the object’s size is not a multiple of the block size, the boundary (or residual) region remains. This module handles two boundary conditions:

Ignore: Discard incomplete blocks at the edges
Correlated: Use sliding window instead of slicing. By choosing the right window size, no residual blocks will remain at the boundary

Key Advantages

Computational Efficiency: Instead of computing CTM for each large dataset (which is extremely expensive), BDM uses precomputed values for small blocks
Scalability: Can handle arbitrarily large datasets by decomposing them into manageable pieces
Practical Approximation: Provides a computable approximation to the theoretically uncomputable Kolmogorov complexity

The method essentially transforms an intractable global computation into a series of fast local lookups, making algorithmic complexity estimation practical for real-world datasets.

Citations

Soler-Toscano F., Zenil H., Delahaye J.-P. and Gauvrit N. (2014) Calculating Kolmogorov Complexity from the Output Frequency Distributions of Small Turing Machines. PLoS ONE 9(5): e96223.
Zenil H., Soler-Toscano F., Kiani N.A., Hernández-Orozco S., Rueda-Toicen A. (2016) A Decomposition Method for Global Evaluation of Shannon Entropy and Local Estimations of Algorithmic Complexity. arXiv:1609.00110
Zenil H, Kiani NA, Tegnér J. Algorithmic Information Dynamics: A Computational Approach to Causality with Applications to Living Systems. Cambridge University Press; 2023.
F. Kaspar, H. G. Schuster (1987) Easily calculable measure for the complexity of spatiotemporal patterns. Phys. Rev. A 36, 842
Hector Zenil and Narsis A. Kiani and Alyssa Adams and Felipe S. Abrahão and Antonio Rueda-Toicen and Allan A. Zea and Luan Ozelim and Jesper Tegnér (2018) Minimal Algorithmic Information Loss Methods for Dimension Reduction, Feature Selection and Network Sparsification. arXiv:1802.05843