![]() |
Figure: Visual description of random stream chunking, M.Suzen (2017) |
The art of data science and scientific computing
by Mehmet Süzen
See also: Science Memo
Request one-on-one session
Friday, 9 February 2024
Exact reproducibility of stochastic simulations for parallel and serial algorithms simultaneously
Random Stream Chunking
Tuesday, 28 November 2023
What's the purpose of randomness in causal discovery techniques?
![]() |
Roulette Wheel (Wikipedia) |
Preamble
In this short exposition, we inquire about the purpose of randomness and how this related to discovering or testing causal inferential problem solving using data and causal models. In his seminal work by Holland (1986) point out something striking that was not put in such form earlier works. He stated the "obvious" that almost all data sets addressing interventional nature, such as treatment vs. non-treatment, that a person or unit we study, can not be treated and not-treated at the same time. We delve question of randomness from this perspective, i.e., so called fundamental problem of causal inference.
Group assignment for causal inference
Group assignment probably one of the most fundamental approach in statistical research, such as in the famous Lady tea tasting problem. The idea of assignment in causal inference, we need to find a matching person or unit that is not-treated if we have a treated sample or the other-way around, so called a matching or balancing.
Randomness in causality: Removal of pseudo-confounders
Randomness doesn't only allow fair representation of control and treatment group assignments, reducing bias essentially. The primary effect of randomness is removal of pseudo-confounders, this is not well studied in the literature. What it means, if we don't randomise there would be other causal connections that would really shouldn't be there.
Conclusion
Here, we hint about something called pseudo-confounders. Randomisation in both matching and other causal techniques primarily removes bias but removal of pseudo-confounders is not commonly mentioned and an open research.
Further reading
Saturday, 25 November 2023
Why should there be no simultaneity rule for causal models?
Dominos in motion
(Wikipedia)
Preamble
- looper : Causality Resource
- Looper Nuggets 1 (LN1) : Definition of wDAGs with no simultaneity rule.
Saturday, 14 October 2023
Ising-Conway lattice-games: Understanding increasing entropy
Preamble
![]() |
Figure: Evolution of Ising-Conway Game (arXiv:2310.01458) |
Ising-Lenz model is probably one of the landmark models in physics, remarkably provides beyond its idealised case of magnetic domains, now impacts even quantum computational research. However, computing entropy of Ising-Lenz models are still quite difficult. On the other hand, Conway introduce a game with simple rules generating complexity in various orders, via simple dynamical rules. By analogy to these two modelling approach, we recently introduce game like physical system of spins or lattice sides on a finite space with constraints. This gives a physically plausible dynamics but simpler dynamical evolution to generate the trajectories. Because vanilla Ising-Models requires more complicated Monte Carlo techniques. Here is the configuration and dynamics of Ising-Conway games,
- $M$ sites as a fixed space.
- $N$ occupied sites, or 1s.
- Configuration $C(M,N,t)=C(i)$ over time changes. But at $t=0$ all occupied sites live in at the corner.
- Configuration can only change to neighbouring sites if they are empty. This is closely related to spin-flip dynamics of the Ising Model.
- No sites occupy the same lattice cell, Pauli exclusion
- Should be contained within $M$ Cell.
Defining ensemble Entropy on ICG
Now we are in position to define the entropy for ICGs, which easy to grasp conceptually and computationally. $C(i, t) \in \{1,0\}$ defines the states of the game. We build an ensemble at a given time $t$ by defining a region enclosed by 1s. Then dimensionality of the ensemble $ k(t) = argmax[\mathbb{I}(C(i))] - argmin [\mathbb{I}(C(i)) ]$. Here, $\mathbb{I}$ returns index of $1$s on the lattice. This ensemble closely track maximum entropy of the system at a given time.
Conclusions
A new game-like system that helps us to understand entropy increase that has a plausible physical characteristics that one can easily simulate.
Further reading
- H-theorem do-conjecture, M.Süzen, arXiv:2310.01458
- Effective ergodicity in single-spin-flip dynamics, Mehmet Süzen. Phys. Rev. E 90, 03214 url
- do_ensemble module provides such simulation via simulate_single_spin_flip_game from the repo h-do-conjecture