<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Math on Sebastian Spicker</title>
    <link>https://sebastianspicker.github.io/tags/math/</link>
    <description>Recent content in Math on Sebastian Spicker</description>
    <image>
      <title>Sebastian Spicker</title>
      <url>https://sebastianspicker.github.io/og-image.png</url>
      <link>https://sebastianspicker.github.io/og-image.png</link>
    </image>
    <generator>Hugo -- 0.160.0</generator>
    <language>en</language>
    <lastBuildDate>Tue, 14 Oct 2025 00:00:00 +0000</lastBuildDate>
    <atom:link href="https://sebastianspicker.github.io/tags/math/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>A Gas at Temperature T: Xenakis and the Physics of Stochastic Music</title>
      <link>https://sebastianspicker.github.io/posts/xenakis-stochastic-music/</link>
      <pubDate>Tue, 14 Oct 2025 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/xenakis-stochastic-music/</guid>
      <description>Iannis Xenakis applied the Maxwell-Boltzmann velocity distribution, Markov chains, and game theory to orchestral composition. In Pithoprakta (1955–56), 46 string parts are molecules of a gas, each following the kinetic theory distribution. In Duel and Stratégie (1959–62), two conductors play a zero-sum game with payoff matrices on stage. This post works through the physics and mathematics, and asks what it means when a composer treats an orchestra as a thermodynamic system.</description>
      <content:encoded><![CDATA[<p><em>Iannis Xenakis (1922–2001) was trained as a civil engineer at the Athens
Polytechnic, joined the Greek Resistance during the Second World War and the
subsequent Greek Civil War, survived a British army tank shell in January 1945
that cost him the sight in his left eye and part of his jaw, was sentenced to
death in absentia by the Greek military government, fled to Paris in 1947, and
worked for twelve years as an architect in Le Corbusier&rsquo;s atelier — where he
contributed structural engineering to the Unité d&rsquo;Habitation in Marseille and
designed the Philips Pavilion for Expo 58. In parallel, already in his thirties,
he taught himself composition — approaching Honegger (who was too ill to teach) and then studying with Messiaen
— and became one of the central figures of the post-war avant-garde. I mention
the biography not as background colour but because it bears on the physics. A
person who has been through what Xenakis had been through by 1950 is not likely
to be intimidated by the kinetic theory of gases.</em></p>
<p><em>He was not. In 1955–56 he composed</em> Pithoprakta <em>— &ldquo;actions through
probability&rdquo; — for 46 strings, each of which is, in his own account, a
molecule of an ideal gas. This post works through the mathematics he
used and asks what it means when a composer takes statistical mechanics
seriously as a compositional tool.</em></p>
<hr>
<h2 id="the-problem-with-post-war-serialism">The Problem with Post-War Serialism</h2>
<p>To understand why Xenakis did what he did, it helps to know what everyone
else was doing. By the early 1950s, the dominant tendency in European
new music was total serialism: the systematic extension of Schoenberg&rsquo;s
twelve-tone technique to rhythm, dynamics, articulation, and register. Every
parameter of every note was determined by a series. Messiaen had sketched
this direction in <em>Mode de valeurs et d&rsquo;intensités</em> (1949); Boulez and
Stockhausen had taken it to its logical extreme.</p>
<p>The result, as Xenakis observed with characteristic bluntness in <em>Formalized
Music</em> (1963/1992), was a kind of sonic indistinguishability: because every
parameter varied according to independent deterministic series, the textures
produced by total serialism sounded essentially like random noise. The
maximum of local determinism had produced the appearance of global chaos.</p>
<p>His diagnosis was precise and, I think, correct: if the perceptual result of
maximum determinism and maximum randomness is the same, then the path forward
is not to find a better deterministic scheme but to embrace randomness
explicitly, at the level that governs the <em>macroscopic</em> structure. Control the
distribution; let the individual events vary within it. This is exactly what
statistical mechanics does for a gas: it does not track every molecule, but
it knows with great precision what the distribution of velocities will be.</p>
<hr>
<h2 id="statistical-mechanics-in-brief">Statistical Mechanics in Brief</h2>
<p>In a classical ideal gas of $N$ molecules at thermal equilibrium with
temperature $T$, the molecules move in all directions with speeds distributed
according to the Maxwell-Boltzmann speed distribution:</p>
$$f(v) = \sqrt{\frac{2}{\pi}}\, \frac{v^2}{a^3}\, \exp\!\left(-\frac{v^2}{2a^2}\right), \qquad a = \sqrt{\frac{k_B T}{m}},$$<p>where $m$ is the molecular mass and $k_B$ is Boltzmann&rsquo;s constant. The
parameter $a$ sets the characteristic speed scale: it grows with temperature
(hotter gas means faster molecules) and shrinks with molecular mass (heavier
molecules move more slowly at the same temperature).</p>
<p>The distribution has a characteristic shape: it rises as $v^2$ for small
speeds (few molecules are nearly stationary), peaks at the most probable
speed $v_p = a\sqrt{2}$, and falls off as $e^{-v^2/2a^2}$ for large speeds
(very fast molecules are exponentially rare). The three characteristic
speeds are:</p>
$$v_p = a\sqrt{2}, \qquad \langle v \rangle = a\sqrt{\tfrac{8}{\pi}}, \qquad v_\mathrm{rms} = a\sqrt{3}.$$<p>No individual molecule is tracked. The distribution is everything: once you
know $f(v)$, you know all macroscopic properties of the gas — pressure,
mean kinetic energy, thermal conductivity — without knowing the trajectory
of a single molecule. The individual is sacrificed to the ensemble.</p>
<hr>
<h2 id="pithoprakta-and-the-orchestra-as-gas"><em>Pithoprakta</em> and the Orchestra as Gas</h2>
<p>In <em>Pithoprakta</em> (1955–56), Xenakis assigns each of the 46 string instruments
to a molecule of a gas. The musical analogue of molecular speed is the
<em>velocity of a glissando</em>: the rate at which a glissando moves through
pitch, measured in semitones per second. Slow glissandi are cold molecules;
fast glissandi are hot ones.</p>
<p>For a given passage with a specified musical &ldquo;temperature&rdquo; (an
intensity-and-density parameter he could set as a compositional choice),
the 46 glissando speeds are drawn from the Maxwell-Boltzmann distribution
for that temperature. No two strings play the same glissando at the same
speed. The effect, to a listener, is a dense sound-mass — a shimmer or
a roar — whose internal texture varies but whose overall character (the
temperature, the density) is under the composer&rsquo;s control at exactly the
level that matters perceptually.</p>
<p>Xenakis worked out the velocities numerically by hand. The score of
<em>Pithoprakta</em> was among the first in which the individual parts were derived
from a statistical distribution rather than from a melody, a row, or an
improvisation instruction. The calculation is tedious but not difficult:
for each time window, choose a temperature, compute $f(v)$ for the 46
values of $v$ that tile the distribution, and assign one speed to each
instrument.</p>
<p>The connection between macroscopic structure and microscopic liberty is
deliberately preserved. The shape of the sound-mass — its brightness,
its turbulence, its rate of change — is controlled. Each individual line
is unpredictable. This is, structurally, the same trade-off that makes
thermodynamics work: you give up on the individual trajectory and gain
exact knowledge of the aggregate.</p>
<hr>
<h2 id="musical-temperature-as-a-compositional-parameter">Musical Temperature as a Compositional Parameter</h2>
<p>The analogy is worth making precise. In the physical gas, raising the
temperature $T$ increases $a = \sqrt{k_B T / m}$, which shifts the
peak of $f(v)$ to the right and widens the distribution. More molecules
have high speeds; the variance of speeds increases.</p>
<p>In <em>Pithoprakta</em>, raising the musical &ldquo;temperature&rdquo; has the same
effect: more instruments perform rapid glissandi; the pitch-space
trajectories are more varied; the texture becomes more active and
more turbulent. Lowering the temperature concentrates the glissando
speeds near zero — slow motion, near-stasis, long sustained tones
that change pitch only gradually. The orchestra cools.</p>
<p>This mapping is not metaphorical. Xenakis computed it. The score
contains numerically derived glissando speeds; the connection between the
perceptual temperature of the texture and the statistical parameter $T$ is
quantitative. When musicians speak of a passage &ldquo;heating up,&rdquo; they are
usually using a figure of speech. In <em>Pithoprakta</em>, they are describing
a thermodynamic fact.</p>
<hr>
<h2 id="the-poisson-distribution-and-event-density">The Poisson Distribution and Event Density</h2>
<p><em>Pithoprakta</em> uses a second physical model alongside the Maxwell-Boltzmann
distribution: the Poisson process, which governs the density of
independent, randomly occurring events.</p>
<p>If musical events (pizzicato attacks, bow changes, individual note entries)
occur at a mean rate of $\lambda$ events per second, the probability of
exactly $k$ events occurring in a time window of length $T$ is:</p>
$$P(N = k) = \frac{(\lambda T)^k\, e^{-\lambda T}}{k!}.$$<p>The Poisson distribution has a single parameter $\lambda$ that controls
both the mean and the variance (they are equal: $\langle N \rangle =
\mathrm{Var}(N) = \lambda T$). A high $\lambda$ produces a dense cluster
of events; a low $\lambda$ produces sparse, widely spaced events.</p>
<p>Xenakis used this to control the density of pizzicato attacks independently
of the glissando texture. A passage can be cool (slow glissandi) and dense
(many pizzicati), or hot and sparse, or any combination. The two
distributions operate on independent musical parameters — pitch motion and
event density — giving the composer a two-dimensional thermodynamic control
space over the texture.</p>
<hr>
<h2 id="markov-chains-analogique-a-and-analogique-b">Markov Chains: <em>Analogique A</em> and <em>Analogique B</em></h2>
<p>In <em>Analogique A</em> (for string orchestra, 1958–59) and its companion
<em>Analogique B</em> (for sinusoidal tones, same year), Xenakis moved to a
different stochastic framework: Markov chains.</p>
<p>A Markov chain is a sequence of states where the probability of
transitioning to the next state depends only on the current state. The
chain is specified by a transition matrix $P$, where $P_{ij}$ is the
probability of moving from state $i$ to state $j$:</p>
$$P_{ij} \geq 0, \qquad \sum_j P_{ij} = 1 \quad \forall\, i.$$<p>Under mild conditions (irreducibility and aperiodicity), the chain
converges to a unique stationary distribution $\pi$ satisfying:</p>
$$\pi P = \pi, \qquad \sum_i \pi_i = 1.$$<p>The convergence is geometric: if $\lambda_2$ is the second-largest eigenvalue
of $P$ in absolute value, then after $n$ steps the distribution $\pi^{(n)}$
satisfies $\|\pi^{(n)} - \pi\| \leq C |\lambda_2|^n$ for some constant $C$.
The gap $1 - |\lambda_2|$ — the <em>spectral gap</em> — controls how quickly the
chain forgets its initial state. A transition matrix with a large spectral
gap produces rapid convergence; one with $|\lambda_2| \approx 1$ produces
long-memory dependence between distant states. This is a compositional
choice: the spectral gap determines how quickly a piece&rsquo;s texture changes
character.</p>
<p>In <em>Analogique A</em>, Xenakis divided the sonic space into a grid of
cells defined by pitch register (high/middle/low), density
(sparse/medium/dense), and dynamic (soft/loud). Each &ldquo;screen&rdquo; — a brief
time window — occupies one cell in this grid. The progression of screens
through the piece is governed by transition probabilities: from a
high/dense/loud screen, there is some probability of moving to each
adjacent cell, specified by Xenakis&rsquo;s chosen transition matrix.</p>
<p>This is a Markov chain on a discrete state space of sonic textures. The
macroscopic trajectory of the piece — its overall movement through sound-
quality space — is determined by the transition matrix, which the composer
sets. The details of each screen are filled in stochastically, within the
parameters of the current state. Again, the individual is sacrificed to the
aggregate; control is exercised at the level of the distribution rather
than the event.</p>
<hr>
<h2 id="game-theory-duel-and-stratégie">Game Theory: <em>Duel</em> and <em>Stratégie</em></h2>
<p>The most extreme and, to my mind, most interesting of Xenakis&rsquo;s
formalisations is the use of game theory in <em>Duel</em> (1959) and <em>Stratégie</em>
(1962).</p>
<p>A <strong>two-player zero-sum game</strong> is specified by a payoff matrix $A \in
\mathbb{R}^{m \times n}$. Player 1 (the &ldquo;maximiser&rdquo;) chooses a row $i$;
Player 2 (the &ldquo;minimiser&rdquo;) chooses a column $j$; Player 1 receives payoff
$A_{ij}$ and Player 2 receives $-A_{ij}$. In a pure-strategy game, each
player selects a single action. In a <strong>mixed-strategy game</strong>, each player
chooses a probability distribution over their actions: Player 1 uses
$\mathbf{x} \in \Delta_m$ and Player 2 uses $\mathbf{y} \in \Delta_n$,
where $\Delta_k$ denotes the standard $(k-1)$-simplex.</p>
<p>The expected payoff to Player 1 under mixed strategies is:</p>
$$E(\mathbf{x}, \mathbf{y}) = \mathbf{x}^\top A\, \mathbf{y}.$$<p>Von Neumann&rsquo;s minimax theorem (1928) guarantees that:</p>
$$\max_{\mathbf{x} \in \Delta_m} \min_{\mathbf{y} \in \Delta_n}
\mathbf{x}^\top A\, \mathbf{y}
\;=\;
\min_{\mathbf{y} \in \Delta_n} \max_{\mathbf{x} \in \Delta_m}
\mathbf{x}^\top A\, \mathbf{y}
\;=\; v^*,$$<p>where $v^*$ is the <strong>value</strong> of the game. The pair $(\mathbf{x}^*,
\mathbf{y}^*)$ that achieves this saddle point is the Nash equilibrium:
neither player can improve their expected payoff by unilaterally deviating
from their equilibrium strategy.</p>
<p>In <em>Stratégie</em>, each conductor leads one orchestra. Each has nineteen
&ldquo;tactics&rdquo; — six basic musical textures (e.g., sustained chords, staccato
pizzicati, glissandi masses, silence) plus thirteen combinatorial tactics
that combine two or three of the basics. The payoff matrix is a
$19 \times 19$ integer matrix, also defined by Xenakis, specifying how
many points Conductor 1 scores when their orchestra plays tactic $i$ against
Conductor 2&rsquo;s tactic $j$. A referee tracks the score.</p>
<p>The conductors make decisions in real time during the performance, choosing
tactics based on what the other conductor is doing and on the evolving score.
The piece ends when one conductor reaches a predetermined score threshold.</p>
<p>The Nash equilibrium of the payoff matrix tells each conductor, in principle,
the optimal <em>distribution</em> over tactics to play: if both play optimally, the
expected score trajectory is determined. In practice, conductors are not
expected to compute mixed strategies on the podium; Xenakis&rsquo;s point is
structural. The game-theoretic formalism is used to design the payoff matrix
so that no tactic dominates — every choice has consequences that depend on
the opponent&rsquo;s choice — guaranteeing that the piece will always contain
genuine strategic tension regardless of who is conducting.</p>
<p><em>Duel</em> (1959) is the earlier, simpler version for two chamber orchestras.
<em>Stratégie</em> (1962) was premiered in April 1963 at the Venice Biennale with two conductors
competing live. The audience was aware of the game, of the score, and of
the payoff matrix. The premiere was by most accounts a success, though the
practical complications of running a zero-sum game in a concert hall
(including the question of whether conductors were actually computing Nash
equilibria or just following intuition) were never fully resolved.</p>
<hr>
<h2 id="formalized-music"><em>Formalized Music</em></h2>
<p>Xenakis assembled his theoretical framework in <em>Musiques formelles</em> (1963),
translated and expanded as <em>Formalized Music</em> (1971; revised edition 1992).
The book is one of the strangest documents in twentieth-century music theory:
part treatise, part manifesto, part mathematical appendix. It covers
stochastic composition, Markov chains, game theory, set theory, group theory,
and symbolic logic — all presented with the confidence of someone who is
equally at home in the engineering faculty and the concert hall, and with
the occasional obscurity of someone writing simultaneously for two audiences
who share almost no vocabulary.</p>
<p>The core argument is that musical composition can and should be treated as
the application of mathematical structures to sonic material, not because
mathematics makes music &ldquo;better&rdquo; but because mathematical structures are
the most powerful available tools for controlling relationships between
sounds at multiple scales simultaneously. The statistical distributions
control the macroscopic; the individual values vary within them. The game-
theoretic payoff matrix controls the strategic interaction; the individual
tactics fill in the details. Mathematics operates at the structural level
and leaves the acoustic surface free.</p>
<p>This is a different relationship between mathematics and music from the
ones in my earlier posts on <a href="/posts/messiaen-modes-group-theory/">group theory and Messiaen</a>
or <a href="/posts/euclidean-rhythms/">the Euclidean algorithm and world rhythms</a>.
In those cases, mathematics describes structure that already exists in the
music — structure the composers arrived at by ear. In Xenakis, mathematics
is the generative tool: the score is derived from the calculation.</p>
<hr>
<h2 id="what-the-analogy-does-and-does-not-do">What the Analogy Does and Does Not Do</h2>
<p>The Maxwell-Boltzmann analogy in <em>Pithoprakta</em> is exact in one direction
and approximate in another.</p>
<p>It is exact in the following sense: the glissando speeds Xenakis computed
for his 46 strings genuinely follow the Maxwell-Boltzmann distribution with
the parameters he chose. The score is a realisation of that distribution.
If you collect the glissando speeds from the score and plot their histogram,
you will find the characteristic $v^2 e^{-v^2/2a^2}$ shape.</p>
<p>It is approximate — or rather, it is analogical — in the sense that strings
in an orchestra are not molecules of a gas. They do not collide. They have
mass and inertia in a physical sense that has no direct mapping to
musical parameters. The temperature $T$ is not a temperature in any
thermodynamic sense; it is a compositional variable that Xenakis chose to
parameterise with the same symbol because the formal relationship is the
same. The analogy is structural, not ontological.</p>
<p>This is worth saying plainly because it is easy to be misled in both
directions: either to over-claim (the orchestra <em>is</em> a gas) or to dismiss
(the orchestra is <em>merely</em> labelled with physical vocabulary). The actual
claim is more modest and more interesting: the mathematical structure of the
Maxwell-Boltzmann distribution is the right tool for specifying a certain
kind of orchestral texture, namely one where individual elements vary
stochastically around a controlled macroscopic envelope. The physics
provides the formalism; the music provides the application. This is how
mathematics works in engineering, too.</p>
<hr>
<h2 id="the-centenary-and-what-remains">The Centenary and What Remains</h2>
<p>Xenakis died in 2001, by then partially deaf and with dementia. His centenary
in 2022 produced a wave of new performances, recordings, and scholarship
— including the <em>Meta-Xenakis</em> volume (Open Book Publishers, 2022), which
collects analyses of his compositional mathematics, his architectural work
(he designed the Philips Pavilion for Le Corbusier&rsquo;s Expo 58 in Brussels
using the same ruled-surface geometry he was using in <em>Metastaseis</em>), and
his political biography.</p>
<p>What remains resonant about his project is not the specific distributions
he chose — the Maxwell-Boltzmann is not the only or even necessarily the
best distribution for many musical applications — but the epistemological
position it represents. Xenakis insisted that the right question to ask
about a musical texture is not &ldquo;what is the note at beat 3 of bar 47?&rdquo; but
&ldquo;what is the distribution from which the events in this section are drawn?&rdquo;
This shift from individual determination to statistical control is precisely
the shift that makes thermodynamics possible as a science, and Xenakis was
the first composer to apply it deliberately and systematically.</p>
<p>When a composer writes &ldquo;let the
orchestra be a gas at temperature $T$&rdquo; and then actually computes the
consequences with Boltzmann&rsquo;s constant in front of him, I do not feel that
physics has been appropriated. I feel that it has been recognised — seen,
from a different direction, as the same thing it always was: a set of tools
for thinking about ensembles of interacting elements whose individual
behaviour is too complex to track but whose collective behaviour is not.</p>
<p>The orchestra is not a gas. But the Maxwell-Boltzmann distribution describes
it anyway.</p>
<hr>
<h2 id="references">References</h2>
<ul>
<li>
<p>Ames, C. (1989). The Markov process as a compositional model: A survey and
tutorial. <em>Leonardo</em>, 22(2), 175–187. <a href="https://doi.org/10.2307/1575226">https://doi.org/10.2307/1575226</a></p>
</li>
<li>
<p>Jedrzejewski, F. (2006). <em>Mathematical Theory of Music.</em> Delatour France /
IRCAM.</p>
</li>
<li>
<p>Nash, J. F. (1950). Equilibrium points in $n$-person games. <em>Proceedings of
the National Academy of Sciences</em>, 36(1), 48–49.
<a href="https://doi.org/10.1073/pnas.36.1.48">https://doi.org/10.1073/pnas.36.1.48</a></p>
</li>
<li>
<p>Nierhaus, G. (2009). <em>Algorithmic Composition: Paradigms of Automated Music
Generation.</em> Springer.</p>
</li>
<li>
<p>Matossian, N. (2005). <em>Xenakis</em> (revised ed.). Moufflon Publications.</p>
</li>
<li>
<p>Solomos, M. (Ed.). (2022). <em>Meta-Xenakis.</em> Open Book Publishers.
<a href="https://doi.org/10.11647/OBP.0313">https://doi.org/10.11647/OBP.0313</a></p>
</li>
<li>
<p>von Neumann, J. (1928). Zur Theorie der Gesellschaftsspiele. <em>Mathematische
Annalen</em>, 100(1), 295–320. <a href="https://doi.org/10.1007/BF01448847">https://doi.org/10.1007/BF01448847</a></p>
</li>
<li>
<p>von Neumann, J., &amp; Morgenstern, O. (1944). <em>Theory of Games and Economic
Behavior.</em> Princeton University Press.</p>
</li>
<li>
<p>Xenakis, I. (1992). <em>Formalized Music: Thought and Mathematics in
Composition</em> (revised ed.). Pendragon Press.
(Originally published as <em>Musiques formelles</em>, La Revue Musicale, 1963.)</p>
</li>
</ul>
<hr>
<h2 id="changelog">Changelog</h2>
<ul>
<li><strong>2026-01-14</strong>: Corrected the description of <em>Stratégie</em> (1962): each conductor has nineteen tactics (six basic plus thirteen combinatorial), with a 19 x 19 payoff matrix — not six tactics and a 6 x 6 matrix. The six-tactic, 6 x 6 description applies to the earlier <em>Duel</em> (1959).</li>
<li><strong>2026-01-14</strong>: Added &ldquo;in April 1963&rdquo; to the <em>Stratégie</em> premiere sentence. The composition date is 1962; the premiere took place on 25 April 1963 at the Venice Biennale.</li>
<li><strong>2026-01-14</strong>: Changed &ldquo;studying briefly with Honegger&rdquo; to &ldquo;approaching Honegger (who was too ill to teach).&rdquo; Xenakis sought instruction from Honegger circa 1949, but Honegger was in declining health and did not take him as a student.</li>
</ul>
]]></content:encoded>
    </item>
    <item>
      <title>The Oldest Algorithm in the World Plays the Clave</title>
      <link>https://sebastianspicker.github.io/posts/euclidean-rhythms/</link>
      <pubDate>Mon, 07 Apr 2025 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/euclidean-rhythms/</guid>
      <description>Euclid&amp;rsquo;s algorithm for computing greatest common divisors, applied to the problem of distributing k drum beats as evenly as possible among n time slots, generates rhythmic patterns that match traditional timelines from West Africa, Cuba, Brazil, Turkey, and the Balkans. An algorithm devised in Alexandria around 300 BCE encodes the rhythmic structure of musical cultures that had no contact with ancient Greek mathematics.</description>
      <content:encoded><![CDATA[<p><em>The first time I encountered the West African standard bell pattern it was in a
Music and Physics seminar. The lecturer played a twelve-beat cycle on a wood
block — seven strokes distributed unevenly but with a rightness that arrested the
room. She then played the Cuban clave, the bossa nova timeline, a Bulgarian
aksak rhythm. Different cultures, different instruments, different centuries. She
asked whether there was a pattern. There was. It is named after a mathematician
who died around 270 BCE.</em></p>
<hr>
<h2 id="euclids-algorithm">Euclid&rsquo;s Algorithm</h2>
<p>Every student who has taken a number theory course has encountered the algorithm
for computing the greatest common divisor of two positive integers. Given $a \geq
b$, repeatedly replace $(a, b)$ with $(b, a \bmod b)$ until the remainder is
zero; the last non-zero remainder is the GCD.</p>
<p>For example, $\gcd(8, 3)$:</p>
$$8 = 2 \times 3 + 2 \;\Rightarrow\; \gcd(8, 3) = \gcd(3, 2)$$<p>
</p>
$$3 = 1 \times 2 + 1 \;\Rightarrow\; \gcd(3, 2) = \gcd(2, 1)$$<p>
</p>
$$2 = 2 \times 1 + 0 \;\Rightarrow\; \gcd(2, 1) = 1.$$<p>Three steps, result 1 (8 and 3 are coprime). The algorithm is efficient: the
number of steps is proportional to the number of digits in the smaller input.
It appears in Book VII of Euclid&rsquo;s <em>Elements</em>, composed around 300 BCE, making
it the oldest non-trivial algorithm in the Western mathematical tradition.</p>
<hr>
<h2 id="distributing-onsets-toussaints-observation">Distributing Onsets: Toussaint&rsquo;s Observation</h2>
<p>In 2005, Godfried Toussaint — a computer scientist and ethnomusicologist at
McGill University — published the observation that the problem of distributing
$k$ musical onsets as evenly as possible among $n$ time slots has the same
recursive structure as Euclid&rsquo;s algorithm applied to the pair $(k, n-k)$
(Toussaint, 2005).</p>
<p>The algorithm that solves this distribution problem was independently discovered
in nuclear physics. Bjorklund (2003), working on timing systems for the
Spallation Neutron Source particle accelerator at Oak Ridge, needed to distribute
$k$ beam-extraction pulses as evenly as possible among $n$ machine cycles.
The algorithm he derived — Bjorklund&rsquo;s algorithm — is mathematically equivalent
to the Euclidean algorithm applied to the same pair of integers.</p>
<p>The resulting pattern is denoted $E(k, n)$: the <strong>Euclidean rhythm</strong> with $k$
onsets distributed among $n$ pulses. A 1 denotes an onset; a 0 denotes a rest.</p>
<hr>
<h2 id="working-through--the-tresillo">Working Through $E(3, 8)$: The Tresillo</h2>
<p>Let us derive $E(3, 8)$ — 3 onsets distributed in 8 pulses — step by step.</p>
<p><strong>Start</strong>: 3 onset groups and 5 rest groups:</p>
$$[1]\; [1]\; [1]\; [0]\; [0]\; [0]\; [0]\; [0]$$<p><strong>Step 1</strong>: Distribute one rest group into each onset group, pairing until the
shorter list is exhausted. Three pairs, with $5 - 3 = 2$ rest groups remaining:</p>
$$[1,0]\; [1,0]\; [1,0]\; [0]\; [0]$$<p><strong>Step 2</strong>: Now 3 longer groups and 2 shorter groups. Distribute one shorter group
into each longer group, $3 - 2 = 1$ longer group unpaired:</p>
$$[1,0,0]\; [1,0,0]\; [1,0]$$<p><strong>Step 3</strong>: The two group types have different lengths and only one group of the
shorter type remains; no further pairing is possible. Read the sequence
left to right:</p>
$$E(3, 8) = [1, 0, 0, 1, 0, 0, 1, 0].$$<p>This is the <strong>Cuban tresillo</strong> — one of the foundational rhythmic cells of
Afro-Cuban music, used across son, salsa, and mambo. Its onset positions are
$\{0, 3, 6\}$, giving gap sizes $[3, 3, 2]$: two wide gaps and one narrow gap,
arranged as evenly as the integers allow.</p>
<p>The parallel with Euclid&rsquo;s algorithm is direct. In the division $8 = 2 \times 3</p>
<ul>
<li>2$, the quotient 2 gives the number of pairing steps before a remainder
appears, and the remainder 2 gives the number of groups in the shorter list at
each intermediate stage. The recursion $\gcd(8, 3) \to \gcd(3, 2) \to \gcd(2,
1)$ mirrors the three steps above.</li>
</ul>
<hr>
<h2 id="the-gap-structure">The Gap Structure</h2>
<p>For any $E(k, n)$, the spacing between consecutive onsets takes exactly two
values:</p>
$$\text{gap} \in \left\{\left\lfloor \frac{n}{k} \right\rfloor,\
\left\lceil \frac{n}{k} \right\rceil\right\}.$$<p>The number of each gap size is determined by the constraint that all $k$ gaps
sum to $n$. Writing $\alpha = n \bmod k$:</p>
$$\alpha \cdot \left\lceil \frac{n}{k} \right\rceil \;+\; (k - \alpha) \cdot
\left\lfloor \frac{n}{k} \right\rfloor = n.$$<p>So $E(k,n)$ has $\alpha$ gaps of the larger size and $k - \alpha$ gaps of the
smaller size. The Euclidean property is that these two gap types are distributed
<em>as evenly as possible</em> among themselves — not clustered at one end of the cycle
but interleaved. A cycle that maximises the minimum distance between any two
consecutive onsets has this property; it is called <strong>maximally even</strong> (Clough
and Douthett, 1991).</p>
<p>For $E(3, 8)$: $\lfloor 8/3 \rfloor = 2$, $\lceil 8/3 \rceil = 3$,
$\alpha = 8 \bmod 3 = 2$. Two gaps of 3, one gap of 2. Gap sequence $[3, 3, 2]$.
Maximum-evenness is why the tresillo sounds &ldquo;right&rdquo; even though it is
asymmetric: the asymmetry is the smallest possible deviation from perfect
regularity.</p>
<hr>
<h2 id="a-gallery-of-world-rhythms">A Gallery of World Rhythms</h2>
<p>The following table, derived from Toussaint (2005, 2020), shows Euclidean rhythms
alongside their ethnomusicological identifications. Asterisks mark patterns given
as rotations of the canonical form.</p>
<table>
  <thead>
      <tr>
          <th>Pattern</th>
          <th>Gap structure</th>
          <th>Musical tradition</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td>$E(2,3) = [1,0,1]$</td>
          <td>$[2,1]$</td>
          <td>Iambic foot; West African, Balkan</td>
      </tr>
      <tr>
          <td>$E(3,8) = [1,0,0,1,0,0,1,0]$</td>
          <td>$[3,3,2]$</td>
          <td>Cuban <em>tresillo</em>; Flamenco</td>
      </tr>
      <tr>
          <td>$E(5,8) = [1,0,1,1,0,1,1,0]$</td>
          <td>$[2,1,2,1,2]^*$</td>
          <td>Cuban <em>cinquillo</em></td>
      </tr>
      <tr>
          <td>$E(4,9) = [1,0,0,1,0,1,0,1,0]^*$</td>
          <td>$[3,2,2,2]^*$</td>
          <td>Turkish <em>aksak</em> patterns</td>
      </tr>
      <tr>
          <td>$E(7,12) = [1,0,1,1,0,1,0,1,1,0,1,0]$</td>
          <td>$[2,1,2,2,1,2,2]^*$</td>
          <td>West African standard bell</td>
      </tr>
      <tr>
          <td>$E(9,16)$</td>
          <td>$[2,1,2,2,1,2,2,1,2,1]^*$</td>
          <td>Brazilian and West African</td>
      </tr>
      <tr>
          <td>$E(13,24)$</td>
          <td></td>
          <td>South Indian (Carnatic) <em>tāla</em></td>
      </tr>
  </tbody>
</table>
<p>Three of these are worth examining in more detail.</p>
<p><strong>$E(5,8)$: the cinquillo.</strong> Five onsets in eight pulses: $\lfloor 8/5 \rfloor =
1$, $\lceil 8/5 \rceil = 2$, $\alpha = 3$. Three gaps of 2 and two gaps of 1.
Gap sequence $[2,1,2,1,2]$ or a rotation thereof. The <em>cinquillo</em> is a
fundamental pattern in Cuban music, used as a melodic rhythmic figure in the
nineteenth-century contradanza and in much of what followed.</p>
<p><strong>$E(7,12)$: the West African standard bell.</strong> Seven onsets in a twelve-beat
cycle: $\lfloor 12/7 \rfloor = 1$, $\lceil 12/7 \rceil = 2$, $\alpha = 5$.
Five gaps of 2 and two gaps of 1. This timeline — used across the Ewe, Akan,
and many other traditions in West Africa — is the cyclic reference structure
against which other rhythmic layers are measured in ensemble drumming. It is also
the pitch-class set $\{0, 2, 4, 5, 7, 9, 11\}$ — the Western diatonic scale,
translated from pitch to rhythm. That the same maximally-even distribution
describes both the diatonic scale in pitch space and the standard bell in rhythm
is one of the more remarkable coincidences in mathematical music theory.</p>
<p><strong>Universality across non-connected cultures.</strong> The tresillo $E(3,8)$ appears
independently in Cuban music, Flamenco, Namibian Juǀ&rsquo;hoansi music, and
medieval Persian music (Toussaint, 2020). These traditions had no common musical
ancestor that could have transmitted the pattern. The Euclidean algorithm
produces what maximum evenness demands, and maximum evenness turns out to be
what these rhythmic traditions independently converged on.</p>
<hr>
<h2 id="circular-notation-and-necklaces">Circular Notation and Necklaces</h2>
<p>Euclidean rhythms are most naturally represented as <strong>necklaces</strong> — equivalence
classes of binary sequences under cyclic rotation. All rotations of $E(3,8)$
represent the same rhythmic structure with a different starting downbeat: the
musical identity is independent of which position is designated &ldquo;beat 1.&rdquo;</p>
<p>In circular notation, place $n$ equally spaced dots on a circle and mark the $k$
onset positions. The pattern is immediately visible: the $k$ onset-dots divide
the circle as evenly as possible. For $E(7,12)$, the seven onset dots on a
twelve-position circle look like the seven vertices of a near-regular heptagon
inscribed in a dodecagon. For $E(3,8)$, the three onset dots form a near-
equilateral triangle.</p>
<p>This geometric representation makes the maximum-evenness property transparent in
a way that the linear binary string does not. It also makes clear why Euclidean
rhythms feel &ldquo;balanced&rdquo; when played: the onset dots distribute the &ldquo;weight&rdquo; of
the cycle as uniformly as the integer constraints allow.</p>
<p>The mathematical theory of necklaces belongs to combinatorics on words.
Euclidean rhythms correspond to specific equivalence classes of binary sequences
known as <em>Christoffel words</em> (Lothaire, 2002): words over the alphabet $\{0,1\}$
whose combinatorial properties encode the slope of a line segment, which brings
us to the third independent context in which the same algorithm appears.</p>
<hr>
<h2 id="the-bresenham-connection">The Bresenham Connection</h2>
<p>Jack Bresenham&rsquo;s line algorithm (1965) rasterises a line from $(0,0)$ to $(n,k)$
on a grid of integer pixels. At each column $x$, the algorithm tracks whether the
fractional error accumulated since the last row increment exceeds $\frac{1}{2}$,
and if so, increments the row and resets the error. The sequence of column
positions at which the row increments is the onset pattern $E(k,n)$.</p>
<p>Formally, an onset occurs at position $m$ in $E(k,n)$ if and only if:</p>
$$\left\lfloor \frac{(m+1)\, k}{n} \right\rfloor > \left\lfloor \frac{m\, k}{n} \right\rfloor.$$<p>Equivalently, the onset positions themselves form the sequence:</p>
$$s_j = \left\lfloor \frac{j \cdot n}{k} \right\rfloor, \qquad j = 0, 1, \ldots, k-1.$$<p>For $E(3,8)$: $s_0 = 0$, $s_1 = \lfloor 8/3 \rfloor = 2$,
$s_2 = \lfloor 16/3 \rfloor = 5$, giving onset positions $\{0, 2, 5\}$ — a
rotation of the tresillo.</p>
<p>This is exactly the Bresenham increment condition. Drawing the line from $(0,0)$
to $(8,3)$ and marking where the $y$-coordinate takes a step produces the onset
positions $\{2, 5, 7\}$ — a rotation of the tresillo $\{0, 3, 6\}$.</p>
<p>Three independent fields — ancient Greek number theory, Afro-Caribbean percussion,
and 1960s computer graphics — converge on the same mathematical object. This is
not a coincidence. All three are solving the same fundamental problem: how to
distribute $k$ discrete events as evenly as possible among $n$ slots. When the
problem is universal, its solution is too.</p>
<hr>
<h2 id="euclidean-rhythms-in-contemporary-practice">Euclidean Rhythms in Contemporary Practice</h2>
<p>Toussaint&rsquo;s 2005 paper was primarily a contribution to computational
ethnomusicology, but it reached electronic music production rapidly. Euclidean
rhythm sequencers are now standard in modular synthesis (dedicated Eurorack
hardware modules exist under names including &ldquo;Euclidean&rdquo; and &ldquo;Erica Synths
Pico&rdquo;) and digital audio workstations (as Max for Live devices and software
plug-ins). The interface is minimal: set $k$ and $n$, adjust the rotation offset,
and hear the resulting timeline immediately.</p>
<p>This has opened a compositional mode in which the mathematical structure is
operational: a producer constructing a layered African-style polyrhythm by
stacking $E(3,8)$, $E(5,8)$, and $E(7,8)$ on different instruments is — whether
they know it or not — computing the Euclidean algorithm three times and listening
to the result.</p>
<hr>
<h2 id="implications-for-teaching-rhythm">Implications for Teaching Rhythm</h2>
<p>Music conservatories in the European tradition teach rhythm almost entirely
through Western notation: time signatures, note values, dotted notes, ties. This
system is well-suited to the repertoire it was designed for. It handles Euclidean
rhythms awkwardly. The tresillo $E(3,8)$ requires either a triplet feel against
a binary pulse or a notation involving a dotted quarter note followed by a dotted
quarter and a quarter, which correctly represents the sound but obscures the
structural principle entirely.</p>
<p>The Euclidean framework suggests a different pedagogical starting point. Rather
than beginning from the bar line and asking how notes fill it, begin from the
cycle length $n$ and the onset count $k$ and ask how to distribute the onsets
as evenly as possible. The answer is always computable and always produces a
recognisable rhythm.</p>
<p>For students who encounter West African, Afro-Cuban, or Middle Eastern music —
which conservatory students increasingly do — having a framework that makes these
rhythms <em>structurally necessary</em> rather than <em>culturally exotic</em> changes the
pedagogical relationship fundamentally. The tresillo is not a deviation from
&ldquo;normal&rdquo; rhythm. It is the unique maximally even solution to the problem of
placing three beats in eight pulses. That the same algorithm appeared in a 300
BCE Alexandrian text on number theory is an accident of the history of mathematics.
That it sounds right is not.</p>
<p>Whether conservatory curricula are ready to incorporate the Euclidean framework
alongside Western notation is a separate question. The mathematics does not
demand it. But it offers a language for rhythm that transcends the Western
bar-line without abandoning precision — and that seems worth something, especially
in a world where the music students will perform and teach is no longer
exclusively European.</p>
<hr>
<h2 id="references">References</h2>
<ul>
<li>
<p>Bjorklund, E. (2003). <em>The theory of rep-rate pattern generation in the SNS
timing system.</em> Technical Report SNS-NOTE-CNTRL-99, Spallation Neutron Source,
Oak Ridge National Laboratory.</p>
</li>
<li>
<p>Bresenham, J. E. (1965). Algorithm for computer control of a digital plotter.
<em>IBM Systems Journal</em>, 4(1), 25–30. <a href="https://doi.org/10.1147/sj.41.0025">https://doi.org/10.1147/sj.41.0025</a></p>
</li>
<li>
<p>Clough, J., &amp; Douthett, J. (1991). Maximally even sets. <em>Journal of Music
Theory</em>, 35(1–2), 93–173. <a href="https://doi.org/10.2307/843811">https://doi.org/10.2307/843811</a></p>
</li>
<li>
<p>Lothaire, M. (2002). <em>Algebraic Combinatorics on Words.</em> Cambridge University
Press.</p>
</li>
<li>
<p>Toussaint, G. T. (2005). The Euclidean algorithm generates traditional musical
rhythms. In R. Sarhangi &amp; J. Sharp (Eds.), <em>Proceedings of BRIDGES 2005:
Mathematical Connections in Art, Music, and Science</em> (pp. 47–56). Bridges
Conference.</p>
</li>
<li>
<p>Toussaint, G. T. (2020). <em>The Geometry of Musical Rhythm: What Makes a &ldquo;Good&rdquo;
Rhythm Good?</em> (2nd ed.). Chapman &amp; Hall/CRC Press.</p>
</li>
</ul>
]]></content:encoded>
    </item>
    <item>
      <title>The Charm of Impossibilities: Group Theory and Messiaen&#39;s Modes of Limited Transposition</title>
      <link>https://sebastianspicker.github.io/posts/messiaen-modes-group-theory/</link>
      <pubDate>Wed, 19 Apr 2023 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/messiaen-modes-group-theory/</guid>
      <description>Messiaen&amp;rsquo;s seven modes of limited transposition cannot be fully transposed through all twelve keys — not by convention, but because of group theory. The modes are pitch-class sets whose stabiliser subgroups in ℤ₁₂ are non-trivial. The orbit–stabiliser theorem gives the exact count of distinct transpositions for each mode, and the subgroup lattice of ℤ₁₂ maps directly onto the hierarchy of the seven modes.</description>
      <content:encoded><![CDATA[<p><em>I first encountered Messiaen&rsquo;s second mode — the octatonic scale — in an
analysis seminar during my physics studies, played by a colleague on an upright
piano in a rehearsal room with terrible acoustics. She demonstrated something
that stopped me: no matter how many times she transposed the scale up by a minor
third, she could never find a &ldquo;new&rdquo; version. After three transpositions she was
back where she started. She called it the charm of impossibilities. It took me
years to understand why it is impossible, and longer still to see that the answer
is not musical but algebraic.</em></p>
<p><em>This post is a companion to <a href="/posts/fibonacci-lateralus/">Fibonacci, the Golden Ratio, and Tool&rsquo;s
Lateralus</a>, which found number theory in a prog-rock
song. Here we find abstract algebra in twentieth-century sacred music.</em></p>
<hr>
<h2 id="pitch-classes-and-the-chromatic-clock">Pitch Classes and the Chromatic Clock</h2>
<p>Western music divides the octave into twelve equal semitones. For purposes of
harmony and counterpoint, the absolute pitch is often less important than the
pitch <em>class</em> — the equivalence class of all pitches related by octave
transposition. Middle C and the C two octaves above belong to the same pitch
class.</p>
<p>We label the twelve pitch classes $0, 1, 2, \ldots, 11$, with $0 = \mathrm{C}$,
$1 = \mathrm{C}\sharp/\mathrm{D}\flat$, $2 = \mathrm{D}$, and so on up to
$11 = \mathrm{B}$. Addition is taken modulo 12 — the integers wrap around like
a clock face, with $11 + 2 = 1$ (one semitone above B is C$\sharp$).</p>
<p>The set of pitch classes with this operation is a group:</p>
$$\mathbb{Z}_{12} = \{0, 1, 2, \ldots, 11\}, \qquad x \oplus y = (x + y) \bmod 12.$$<p>This is the cyclic group of order 12. It has an identity element ($0$, &ldquo;no
transposition&rdquo;), every element has an inverse ($-n \bmod 12$), and the operation
is associative. If you are used to thinking about the chromatic scale as a linear
sequence ending at the octave, $\mathbb{Z}_{12}$ is the insistence that it is
actually a circle.</p>
<hr>
<h2 id="musical-operations-as-group-elements">Musical Operations as Group Elements</h2>
<p>Two operations are fundamental in tonal and post-tonal music theory.</p>
<p><strong>Transposition</strong> by $n$ semitones maps every pitch class up by $n$:</p>
$$T_n \colon x \mapsto x + n \pmod{12}.$$<p>The twelve transpositions $T_0, T_1, \ldots, T_{11}$ are exactly the elements of
$\mathbb{Z}_{12}$, with $T_n$ corresponding to the integer $n$. Composing two
transpositions gives a transposition: $T_m \circ T_n = T_{m+n}$.</p>
<p><strong>Inversion</strong> reflects the pitch-class circle:</p>
$$I \colon x \mapsto -x \pmod{12}.$$<p>Inversion maps C to C, D to B$\flat$, E to A$\flat$, and so on — it is the
mirror symmetry of the chromatic circle about the C/F$\sharp$ axis. Combining
inversion with transposition gives the <em>inversional transpositions</em>:</p>
$$I_n \colon x \mapsto n - x \pmod{12}.$$<p>The transpositions and inversional transpositions together generate a group of
order 24:</p>
$$D_{12} = \langle T_1, I \rangle.$$<p>This is the <em>dihedral group</em> $D_{12}$ — the same abstract group that describes
the symmetries of a regular 12-gon (twelve rotations and twelve reflections). The
identification is not coincidental: the twelve pitch classes arranged in a circle
<em>are</em> the vertices of a regular 12-gon, and the musical operations are
geometrically the symmetries of that polygon.</p>
<p>Twelve-tone composition — Schoenberg&rsquo;s method — is almost entirely a
working-out of the consequences of $D_{12}$ acting on ordered sequences of the
twelve pitch classes. The four canonical row forms (prime, inversion, retrograde,
retrograde-inversion) correspond to cosets of $\mathbb{Z}_{12}$ (the transposition subgroup).</p>
<hr>
<h2 id="orbits-and-stabilisers">Orbits and Stabilisers</h2>
<p>Let $S \subseteq \mathbb{Z}_{12}$ be a pitch-class set — a chord, a scale, a
collection of any size.</p>
<p>The <strong>orbit</strong> of $S$ under $\mathbb{Z}_{12}$ is the collection of all distinct
transpositions of $S$:</p>
$$\mathrm{Orb}(S) = \{ T_n(S) : n \in \mathbb{Z}_{12} \}.$$<p>For most sets, all twelve transpositions produce a different set, so
$|\mathrm{Orb}(S)| = 12$. The C major scale, for example, has twelve distinct
transpositions, one for each key.</p>
<p>But some sets are symmetric under certain transpositions: there exists $n \neq 0$
such that $T_n(S) = S$. The collection of all symmetry transpositions of $S$ is
the <strong>stabiliser</strong>:</p>
$$\mathrm{Stab}(S) = \{ T_n \in \mathbb{Z}_{12} : T_n(S) = S \}.$$<p>Because composing two symmetry transpositions yields another, $\mathrm{Stab}(S)$
is a <em>subgroup</em> of $\mathbb{Z}_{12}$.</p>
<p>The <strong>orbit–stabiliser theorem</strong> gives the fundamental count:</p>
$$|\mathrm{Orb}(S)| \cdot |\mathrm{Stab}(S)| = |\mathbb{Z}_{12}| = 12.$$<p>The number of distinct transpositions of $S$ equals $12$ divided by the number
of transpositions that leave $S$ unchanged. The more internally symmetric $S$ is,
the fewer new versions you can produce by transposing it.</p>
<p>A set with $|\mathrm{Stab}(S)| > 1$ — one that is invariant under some
non-trivial transposition — is a <strong>mode of limited transposition</strong>.</p>
<hr>
<h2 id="mode-1-the-whole-tone-scale">Mode 1: The Whole-Tone Scale</h2>
<p>The whole-tone scale contains the six pitch classes at even intervals:</p>
$$\mathrm{Mode\ 1} = \{0, 2, 4, 6, 8, 10\}.$$<p>Transposing by $T_2$:</p>
$$T_2(\{0, 2, 4, 6, 8, 10\}) = \{2, 4, 6, 8, 10, 0\} = \{0, 2, 4, 6, 8, 10\}. \checkmark$$<p>The set is unchanged. The same holds for $T_4, T_6, T_8, T_{10}$. The stabiliser
is the full subgroup of even transpositions:</p>
$$\mathrm{Stab}(\mathrm{Mode\ 1}) = \{T_0, T_2, T_4, T_6, T_8, T_{10}\} \cong \mathbb{Z}_6.$$<p>By the orbit–stabiliser theorem:</p>
$$|\mathrm{Orb}(\mathrm{Mode\ 1})| = \frac{12}{6} = 2.$$<p>There are exactly two distinct whole-tone scales. Every pianist learns this: the
one on C and the one on C$\sharp$. Composing with whole-tone harmony means
working from a stock of only two harmonic pools with no way to modulate into a
genuinely new version of the scale. This is Messiaen&rsquo;s first charm of
impossibility.</p>
<hr>
<h2 id="mode-2-the-octatonic-scale">Mode 2: The Octatonic Scale</h2>
<p>The octatonic (diminished) scale alternates half-step and whole-step intervals.
Starting on C:</p>
$$\mathrm{Mode\ 2} = \{0, 1, 3, 4, 6, 7, 9, 10\}.$$<p>Does $T_3$ leave this set invariant?</p>
$$T_3(\{0, 1, 3, 4, 6, 7, 9, 10\}) = \{3, 4, 6, 7, 9, 10, 0, 1\} = \{0, 1, 3, 4, 6, 7, 9, 10\}. \checkmark$$<p>Also $T_6$ and $T_9$. The stabiliser is the subgroup generated by transposition
by a minor third:</p>
$$\mathrm{Stab}(\mathrm{Mode\ 2}) = \{T_0, T_3, T_6, T_9\} \cong \mathbb{Z}_4.$$<p>The orbit size:</p>
$$|\mathrm{Orb}(\mathrm{Mode\ 2})| = \frac{12}{4} = 3.$$<p>There are exactly three distinct octatonic scales. Composers from Rimsky-Korsakov
and Bartók to Coltrane have exploited this closed system. The three scales
correspond to the three cosets of the subgroup $\langle T_3 \rangle$ in
$\mathbb{Z}_{12}$: the cosets $\{0, 3, 6, 9\}$, $\{1, 4, 7, 10\}$, and
$\{2, 5, 8, 11\}$ are the &ldquo;starting-point classes&rdquo; that generate each scale.
Note that the scales themselves are not pairwise disjoint — each has eight
pitch classes, so any two share four — but the coset structure determines
which transpositions produce the same scale and which produce a different one.</p>
<hr>
<h2 id="the-subgroup-lattice-and-all-seven-modes">The Subgroup Lattice and All Seven Modes</h2>
<p>The orbit–stabiliser theorem constrains which stabiliser sizes are algebraically
possible. Since $\mathrm{Stab}(S)$ is a subgroup of $\mathbb{Z}_{12}$, its order
must divide 12. The <em>proper non-trivial</em> subgroups of $\mathbb{Z}_{12}$ — those
with order strictly between 1 and 12 — are precisely:</p>
<table>
  <thead>
      <tr>
          <th>Subgroup</th>
          <th>Generator</th>
          <th>Order</th>
          <th>Orbit size</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td>$\langle T_2 \rangle = \{T_0, T_2, T_4, T_6, T_8, T_{10}\}$</td>
          <td>$T_2$</td>
          <td>6</td>
          <td>2</td>
      </tr>
      <tr>
          <td>$\langle T_3 \rangle = \{T_0, T_3, T_6, T_9\}$</td>
          <td>$T_3$</td>
          <td>4</td>
          <td>3</td>
      </tr>
      <tr>
          <td>$\langle T_4 \rangle = \{T_0, T_4, T_8\}$</td>
          <td>$T_4$</td>
          <td>3</td>
          <td>4</td>
      </tr>
      <tr>
          <td>$\langle T_6 \rangle = \{T_0, T_6\}$</td>
          <td>$T_6$</td>
          <td>2</td>
          <td>6</td>
      </tr>
  </tbody>
</table>
<p>These four subgroups exist because the proper divisors of 12 that are greater
than 1 are exactly $\{2, 3, 4, 6\}$. The subgroups of $\mathbb{Z}_n$ are in
bijection with the divisors of $n$ — a consequence of the fundamental theorem of
cyclic groups. Since $12 = 2^2 \times 3$, the proper divisors are $1, 2, 3, 4,
6$.</p>
<p>Each row of the table maps onto a level in Messiaen&rsquo;s system:</p>
<ul>
<li><strong>Mode 1</strong> (whole-tone scale): stabiliser $\langle T_2 \rangle$, 2 transpositions</li>
<li><strong>Mode 2</strong> (octatonic scale): stabiliser $\langle T_3 \rangle$, 3 transpositions</li>
<li><strong>Mode 3</strong>: stabiliser $\langle T_4 \rangle$, 4 transpositions</li>
<li><strong>Modes 4 – 7</strong>: stabiliser $\langle T_6 \rangle$, 6 transpositions each</li>
</ul>
<p>The subgroup lattice of $\mathbb{Z}_{12}$ — its Hasse diagram of containment
relationships — maps directly onto the hierarchy of Messiaen&rsquo;s modes. The more
symmetric the stabiliser subgroup, the fewer distinct transpositions the mode
admits.</p>
<p>The containment relations are: $\langle T_2 \rangle \supset \langle T_4 \rangle$
and $\langle T_2 \rangle \supset \langle T_6 \rangle$ and
$\langle T_3 \rangle \supset \langle T_6 \rangle$. Correspondingly, Mode 1
(stabiliser $\langle T_2 \rangle$, order 6) is &ldquo;more limited&rdquo; than Mode 3
(stabiliser $\langle T_4 \rangle$, order 3), in the sense that $\langle T_4
\rangle \subset \langle T_2 \rangle$: every symmetry of Mode 3 is also a symmetry
of Mode 1&rsquo;s stabiliser.</p>
<hr>
<h2 id="why-exactly-seven-modes">Why Exactly Seven Modes?</h2>
<p>Messiaen was not enumerating all pitch-class sets with non-trivial stabilisers —
there are many more than seven. At the level of the stabiliser $\langle T_6
\rangle$, for example, there are numerous pitch-class sets invariant under the
tritone transposition $T_6$: any set $S$ such that $S = S + 6$ qualifies. Some
of these sets are large (ten pitch classes), some are small (two pitch classes),
some are musically coherent and some are not.</p>
<p>Messiaen selected seven that he found aesthetically and compositionally viable:
scales of moderate cardinality, with a balance of interval types, that he could
use as raw material for his harmonic language. The group theory explains the
<em>constraint</em> (modes are possible only at the four stabiliser types listed above),
not the <em>selection</em> (which specific sets Messiaen chose among the many that
satisfy the constraint).</p>
<p>The question &ldquo;why seven?&rdquo; is therefore partly combinatorial and partly
compositional. What is group-theoretically determined is the number of <em>levels</em>
(four: orbit sizes 2, 3, 4, 6) and the <em>impossibility</em> of any mode with, say,
five distinct transpositions (since 5 does not divide 12).</p>
<hr>
<h2 id="what-messiaen-knew--and-did-not-know">What Messiaen Knew — and Did Not Know</h2>
<p>Messiaen described his modes in <em>Technique de mon langage musical</em> (1944). His
account is entirely musical and phenomenological. He lists each mode by its
interval sequence, notes how many transpositions it admits, and names the
limitation a &ldquo;charm.&rdquo; The impossibility is for him a spiritual property, a form
of harmonic stasis that he associated — as a devout Catholic — with divine
eternity. A mode that cannot depart is, in his compositional theology, a glimpse
of the unchanging.</p>
<p>He was not doing group theory. The orbit–stabiliser theorem (in its abstract form)
postdates Lagrange (1771), Cauchy (early 19th century), and Galois (1832). But
the concepts were not part of music-theoretic discourse until Milton Babbitt&rsquo;s
work in the 1950s, and they were not formalised in the pitch-class set framework
I have used here until Allen Forte&rsquo;s <em>The Structure of Atonal Music</em> (1973) and
David Lewin&rsquo;s <em>Generalized Musical Intervals and Transformations</em> (1987).</p>
<p>What Messiaen had was a musician&rsquo;s ear for symmetry. He could <em>hear</em> that the
modes were closed, without having the algebraic vocabulary to explain why. The
group theory shows that he was correct, and why he was correct with a precision
that no amount of phenomenological description could provide.</p>
<hr>
<h2 id="from-messiaen-to-lewin">From Messiaen to Lewin</h2>
<p>Lewin&rsquo;s transformational theory (1987) generalises the $\mathbb{Z}_{12}$ framework
to arbitrary musical spaces. A <em>Generalized Interval System</em> is a triple
$(S, G, \mathrm{int})$ where $S$ is a set of musical objects, $G$ is a group, and
$\mathrm{int} : S \times S \to G$ assigns an interval to each ordered pair of
objects in a way that is consistent with the group structure.</p>
<p>This framework treats musical transformations — not just pitch-class transpositions
but rhythmic augmentations, timbral shifts, any structurally defined operation —
as elements of a group. The mathematics does not privilege any particular musical
parameter; it applies wherever a transformation group acts on a set of musical
objects.</p>
<p>Neo-Riemannian theory, which emerged from Lewin&rsquo;s work in the 1980s and 1990s
and was systematised by Cohn (1998), applies this framework to triadic
transformations (the operations L, P, and R that map major and minor triads to
their relatives, parallels, and leading-tone exchanges). The group generated by
L, P, and R on the set of 24 major and minor triads is isomorphic to $D_{12}$
— the same dihedral group that governs Messiaen&rsquo;s modes, but acting on a
different musical space.</p>
<p>Emmanuel Amiot&rsquo;s more recent work (2016) applies the discrete Fourier transform
to pitch-class sets, using the DFT coefficients on $\mathbb{Z}_{12}$ as a
continuous measure of a set&rsquo;s similarity to the modes of limited transposition.
The Fourier coefficients detect the algebraic symmetries that stabilisers measure
discretely: a set with large coefficient at frequency $k$ (in the DFT over
$\mathbb{Z}_{12}$) is close, in a precise sense, to having the stabiliser
$\langle T_{12/k} \rangle$.</p>
<p>The group-theoretic perspective has moved, over seventy years, from a marginal
curiosity to the dominant mathematical framework in music theory. Messiaen&rsquo;s
modes — which once seemed like personal compositional idiosyncrasies — are
revealed as structurally constrained: the possible stabiliser orders are fixed
by the divisors of 12, and the orbit sizes that Messiaen&rsquo;s ear discovered are
exactly those that Lagrange&rsquo;s theorem permits. Many pitch-class sets have
non-trivial stabilisers; Messiaen found the seven that are musically viable.
Their limitation is not a personal choice but an algebraic fact.</p>
<p>The charm of impossibilities is a theorem of group theory. And it is exactly as
beautiful as Messiaen heard it to be.</p>
<hr>
<h2 id="references">References</h2>
<ul>
<li>
<p>Amiot, E. (2016). <em>Music Through Fourier Space: Discrete Fourier Transform in
Music Theory.</em> Springer (Computational Music Science).</p>
</li>
<li>
<p>Babbitt, M. (1960). Twelve-tone invariants as compositional determinants.
<em>The Musical Quarterly</em>, 46(2), 246–259.
<a href="https://doi.org/10.1093/mq/XLVI.2.246">https://doi.org/10.1093/mq/XLVI.2.246</a></p>
</li>
<li>
<p>Cohn, R. (1998). Introduction to neo-Riemannian theory: A survey and a
historical perspective. <em>Journal of Music Theory</em>, 42(2), 167–180.
<a href="https://doi.org/10.2307/843871">https://doi.org/10.2307/843871</a></p>
</li>
<li>
<p>Forte, A. (1973). <em>The Structure of Atonal Music.</em> Yale University Press.</p>
</li>
<li>
<p>Lewin, D. (1987). <em>Generalized Musical Intervals and Transformations.</em> Yale
University Press. (Reissued Oxford University Press, 2007.)</p>
</li>
<li>
<p>Messiaen, O. (1944). <em>Technique de mon langage musical.</em> Alphonse Leduc.
(English translation: Satterfield, J., 1956.)</p>
</li>
<li>
<p>Tymoczko, D. (2006). The geometry of musical chords. <em>Science</em>, 313(5783),
72–74. <a href="https://doi.org/10.1126/science.1126287">https://doi.org/10.1126/science.1126287</a></p>
</li>
<li>
<p>Tymoczko, D. (2011). <em>A Geometry of Music: Harmony and Counterpoint in the
Extended Common Practice.</em> Oxford University Press.</p>
</li>
</ul>
<hr>
<h2 id="changelog">Changelog</h2>
<ul>
<li><strong>2026-01-14</strong>: Changed &ldquo;cosets of $D_{12}$&rdquo; to &ldquo;cosets of $\mathbb{Z}_{12}$ (the transposition subgroup)&rdquo; in the twelve-tone composition paragraph. $D_{12}$ (order 24) already includes both transpositions and inversions, yielding only 2 cosets in the full serial group. The four row forms {P, I, R, RI} correspond to 4 cosets of the transposition-only subgroup $\mathbb{Z}_{12}$ (order 12) in the full group of order 48.</li>
</ul>
]]></content:encoded>
    </item>
  </channel>
</rss>
