<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Music on Sebastian Spicker</title>
    <link>https://sebastianspicker.github.io/tags/music/</link>
    <description>Recent content in Music on Sebastian Spicker</description>
    <image>
      <title>Sebastian Spicker</title>
      <url>https://sebastianspicker.github.io/og-image.png</url>
      <link>https://sebastianspicker.github.io/og-image.png</link>
    </image>
    <generator>Hugo -- 0.160.0</generator>
    <language>en</language>
    <lastBuildDate>Tue, 10 Feb 2026 00:00:00 +0000</lastBuildDate>
    <atom:link href="https://sebastianspicker.github.io/tags/music/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Automate the Boring Stuff: Setlist to Playlist</title>
      <link>https://sebastianspicker.github.io/posts/setlist-to-playlist/</link>
      <pubDate>Tue, 10 Feb 2026 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/setlist-to-playlist/</guid>
      <description>I love concerts. I love setlists. I hate building the playlist manually afterward. But do I really? A small automation project, a Deftones show in Dortmund, and the question of whether you should automate something you kind of enjoy.</description>
      <content:encoded><![CDATA[<p>Saturday was the Deftones at the Westfalenhalle in Dortmund. One of those concerts where the setlist is part of the experience — where you register, with something close to physical relief, that the arc landed exactly right, and you spend the Uber home mentally replaying the order.</p>
<p>Sunday I built a playlist from it. It took about forty minutes.</p>
<p>This is the post about why that number is already too low, and also possibly too high.</p>
<h2 id="the-ritual">The Ritual</h2>
<p>There is a specific kind of concert listening that happens in the days after a show. You go home, you look up the setlist — setlist.fm is the canonical archive, maintained with an almost academic precision by people who care — and you build a playlist from it in whatever streaming app you use. Then you play it through, in order, and what comes back is not just the music but the spatial memory of the room, the sound mix, the moment the lights dropped for that particular song.</p>
<p>I have been doing this for years. It is a ritual, and like most rituals, part of its meaning is in the doing. The forty minutes of searching song by song, the occasional discovery that a deep cut is on Apple Music in one version but not another, the fiddling with live versus studio — that friction is not purely annoying. It is part of the processing.</p>
<p>And yet. The pile of unprocessed setlists sits in a folder. Shows I attended and never got around to. Setlists I meant to build into playlists and didn&rsquo;t, because the forty minutes were not available that week, and then the moment passed. The ritual unrealised is just a list of song titles.</p>
<p>This is the dilemma, and it is not entirely trivial.</p>
<h2 id="why-this-is-harder-than-it-should-be">Why This Is Harder Than It Should Be</h2>
<p>The setlist.fm API is excellent. It gives you structured data: artist, venue, date, song titles in order, with notations for encores, covers, and dropped songs. What it does not give you is streaming IDs. The song title is a string; the Apple Music track is an object with a catalog ID, a duration, multiple versions, regional availability, and the possibility of not existing at all in the catalog of your country.</p>
<p>The matching problem — connecting a string like &ldquo;Change (In the House of Flies)&rdquo; to the correct Apple Music track, filtered for the right album version, ignoring the live recordings you did not ask for — is not hard, but it is fiddly. You can get 80% of a setlist matched automatically without much effort. The remaining 20% are the covers, the deep cuts, the songs with subtitles in parentheses that differ between the setlist record and the catalog metadata.</p>
<p>Spotify has a fairly rich ecosystem of community tools for exactly this workflow, because Spotify&rsquo;s API is permissive and well-documented and the auth flow is reasonable for third-party developers. Apple Music is harder. The MusicKit framework is real and capable, but the authentication requires managing a private key and JWT tokens signed with developer credentials — not the OAuth dance most developers are used to. The result is that the setlist → Apple Music pipeline is significantly underbuilt compared to the Spotify equivalent.</p>
<p>This is partly why I built <a href="https://github.com/sebastianspicker/setlist-to-playlist">setlist-to-playlist</a> as a PWA rather than reaching for an existing tool.</p>
<h2 id="how-it-works">How It Works</h2>
<p>The app is a Progressive Web App — installable, mobile-friendly, works as a small tool you open on your phone in the taxi home from a show — built on Next.js with a monorepo structure managed by pnpm and Turbo. The architecture is in three phases:</p>
<p><strong>Import.</strong> You paste a setlist.fm URL or ID. The app queries setlist.fm through a server-side proxy — the API key lives on the server and never touches the client — and returns the structured setlist data: songs in order, with metadata about covers, medleys, and notes.</p>
<p><strong>Preview and matching.</strong> The core package runs a matching algorithm against the Apple Music catalog, using the MusicKit JS API for browser-based catalog search. For each song title, it searches Apple Music and presents the best candidate, giving you the chance to confirm or swap before anything is written. This is the step where the 20% problem is addressed manually — the app handles the obvious cases automatically and surfaces the ambiguous ones for human judgement.</p>
<p><strong>Export.</strong> Once you are happy with the track list, the app creates a playlist in your Apple Music library. MusicKit handles the authentication in-browser; the backend generates the JWT tokens using credentials from Apple Developer, signing with the private key server-side so it stays off the client.</p>
<p>The whole thing is local-first in the sense that matters: the Apple Music authentication is between your browser and Apple, and no playlist data or listening history is stored by the app. The only thing the server touches is the API key proxying and the JWT generation.</p>
<h2 id="the-actual-experience">The Actual Experience</h2>
<p>After the Deftones show: opened the app on the phone, pasted the setlist.fm URL, had the playlist in Apple Music in about four minutes. Three tracks needed manual confirmation — two because of live-versus-studio ambiguity, one because a cover required a search adjustment, the kind of edge case where the name setlist.fm records differs from what appears in regional streaming catalogs.</p>
<p>Four minutes instead of forty. Mission accomplished.</p>
<p>And yet.</p>
<p>I noticed, processing the setlist that quickly, that something was missing. Not the music — the music was all there, in order, correct. What was missing was the time spent inside the setlist. The forty minutes of handling each song is also forty minutes of thinking about each song, of remembering where in the set it fell, of deciding which album version you want to hear. The automation removed the friction and also removed the processing.</p>
<p>I am not sure this is a problem. It is probably more accurate to say that it is a trade-off, and that what trade-off you want depends on what you are doing with the ritual. If the backlog is the problem — the pile of unprocessed shows — the automation solves it cleanly. If the processing itself is the point, you probably should not automate it, and the tool is there for when you want it.</p>
<p>That is the correct relationship to automation, I think. Not &ldquo;this should always be automated&rdquo; or &ldquo;this should never be automated&rdquo;, but &ldquo;here is a tool that removes the mechanical part; use it when the mechanical part is not the point&rdquo;.</p>
<h2 id="a-note-on-the-tech-stack">A Note on the Tech Stack</h2>
<p>For the interested: Next.js 15 with App Router, pnpm workspaces with Turbo for the monorepo, MusicKit JS for Apple Music integration, setlist.fm REST API. The JWT for Apple Music uses the <code>jose</code> library for token signing. The matching logic lives in a standalone <code>packages/core</code> module, which makes it testable in isolation and reusable if anyone wants to port this to a different frontend or a CLI.</p>
<p>The repo is at <a href="https://github.com/sebastianspicker/setlist-to-playlist">github.com/sebastianspicker/setlist-to-playlist</a>. PRs welcome, particularly around the matching heuristics — that is the part where there is the most room for improvement.</p>
<hr>
<p>The Deftones were exceptional, for the record. The Westfalenhalle was loud in the way that only a concrete hall that size can be loud, which is to say: correctly loud.</p>
<p>The playlist is good. I am glad it took four minutes and not forty.</p>
<p>I am also glad I know what I gave up.</p>
]]></content:encoded>
    </item>
    <item>
      <title>A Gas at Temperature T: Xenakis and the Physics of Stochastic Music</title>
      <link>https://sebastianspicker.github.io/posts/xenakis-stochastic-music/</link>
      <pubDate>Tue, 14 Oct 2025 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/xenakis-stochastic-music/</guid>
      <description>Iannis Xenakis applied the Maxwell-Boltzmann velocity distribution, Markov chains, and game theory to orchestral composition. In Pithoprakta (1955–56), 46 string parts are molecules of a gas, each following the kinetic theory distribution. In Duel and Stratégie (1959–62), two conductors play a zero-sum game with payoff matrices on stage. This post works through the physics and mathematics, and asks what it means when a composer treats an orchestra as a thermodynamic system.</description>
      <content:encoded><![CDATA[<p><em>Iannis Xenakis (1922–2001) was trained as a civil engineer at the Athens
Polytechnic, joined the Greek Resistance during the Second World War and the
subsequent Greek Civil War, survived a British army tank shell in January 1945
that cost him the sight in his left eye and part of his jaw, was sentenced to
death in absentia by the Greek military government, fled to Paris in 1947, and
worked for twelve years as an architect in Le Corbusier&rsquo;s atelier — where he
contributed structural engineering to the Unité d&rsquo;Habitation in Marseille and
designed the Philips Pavilion for Expo 58. In parallel, already in his thirties,
he taught himself composition — approaching Honegger (who was too ill to teach) and then studying with Messiaen
— and became one of the central figures of the post-war avant-garde. I mention
the biography not as background colour but because it bears on the physics. A
person who has been through what Xenakis had been through by 1950 is not likely
to be intimidated by the kinetic theory of gases.</em></p>
<p><em>He was not. In 1955–56 he composed</em> Pithoprakta <em>— &ldquo;actions through
probability&rdquo; — for 46 strings, each of which is, in his own account, a
molecule of an ideal gas. This post works through the mathematics he
used and asks what it means when a composer takes statistical mechanics
seriously as a compositional tool.</em></p>
<hr>
<h2 id="the-problem-with-post-war-serialism">The Problem with Post-War Serialism</h2>
<p>To understand why Xenakis did what he did, it helps to know what everyone
else was doing. By the early 1950s, the dominant tendency in European
new music was total serialism: the systematic extension of Schoenberg&rsquo;s
twelve-tone technique to rhythm, dynamics, articulation, and register. Every
parameter of every note was determined by a series. Messiaen had sketched
this direction in <em>Mode de valeurs et d&rsquo;intensités</em> (1949); Boulez and
Stockhausen had taken it to its logical extreme.</p>
<p>The result, as Xenakis observed with characteristic bluntness in <em>Formalized
Music</em> (1963/1992), was a kind of sonic indistinguishability: because every
parameter varied according to independent deterministic series, the textures
produced by total serialism sounded essentially like random noise. The
maximum of local determinism had produced the appearance of global chaos.</p>
<p>His diagnosis was precise and, I think, correct: if the perceptual result of
maximum determinism and maximum randomness is the same, then the path forward
is not to find a better deterministic scheme but to embrace randomness
explicitly, at the level that governs the <em>macroscopic</em> structure. Control the
distribution; let the individual events vary within it. This is exactly what
statistical mechanics does for a gas: it does not track every molecule, but
it knows with great precision what the distribution of velocities will be.</p>
<hr>
<h2 id="statistical-mechanics-in-brief">Statistical Mechanics in Brief</h2>
<p>In a classical ideal gas of $N$ molecules at thermal equilibrium with
temperature $T$, the molecules move in all directions with speeds distributed
according to the Maxwell-Boltzmann speed distribution:</p>
$$f(v) = \sqrt{\frac{2}{\pi}}\, \frac{v^2}{a^3}\, \exp\!\left(-\frac{v^2}{2a^2}\right), \qquad a = \sqrt{\frac{k_B T}{m}},$$<p>where $m$ is the molecular mass and $k_B$ is Boltzmann&rsquo;s constant. The
parameter $a$ sets the characteristic speed scale: it grows with temperature
(hotter gas means faster molecules) and shrinks with molecular mass (heavier
molecules move more slowly at the same temperature).</p>
<p>The distribution has a characteristic shape: it rises as $v^2$ for small
speeds (few molecules are nearly stationary), peaks at the most probable
speed $v_p = a\sqrt{2}$, and falls off as $e^{-v^2/2a^2}$ for large speeds
(very fast molecules are exponentially rare). The three characteristic
speeds are:</p>
$$v_p = a\sqrt{2}, \qquad \langle v \rangle = a\sqrt{\tfrac{8}{\pi}}, \qquad v_\mathrm{rms} = a\sqrt{3}.$$<p>No individual molecule is tracked. The distribution is everything: once you
know $f(v)$, you know all macroscopic properties of the gas — pressure,
mean kinetic energy, thermal conductivity — without knowing the trajectory
of a single molecule. The individual is sacrificed to the ensemble.</p>
<hr>
<h2 id="pithoprakta-and-the-orchestra-as-gas"><em>Pithoprakta</em> and the Orchestra as Gas</h2>
<p>In <em>Pithoprakta</em> (1955–56), Xenakis assigns each of the 46 string instruments
to a molecule of a gas. The musical analogue of molecular speed is the
<em>velocity of a glissando</em>: the rate at which a glissando moves through
pitch, measured in semitones per second. Slow glissandi are cold molecules;
fast glissandi are hot ones.</p>
<p>For a given passage with a specified musical &ldquo;temperature&rdquo; (an
intensity-and-density parameter he could set as a compositional choice),
the 46 glissando speeds are drawn from the Maxwell-Boltzmann distribution
for that temperature. No two strings play the same glissando at the same
speed. The effect, to a listener, is a dense sound-mass — a shimmer or
a roar — whose internal texture varies but whose overall character (the
temperature, the density) is under the composer&rsquo;s control at exactly the
level that matters perceptually.</p>
<p>Xenakis worked out the velocities numerically by hand. The score of
<em>Pithoprakta</em> was among the first in which the individual parts were derived
from a statistical distribution rather than from a melody, a row, or an
improvisation instruction. The calculation is tedious but not difficult:
for each time window, choose a temperature, compute $f(v)$ for the 46
values of $v$ that tile the distribution, and assign one speed to each
instrument.</p>
<p>The connection between macroscopic structure and microscopic liberty is
deliberately preserved. The shape of the sound-mass — its brightness,
its turbulence, its rate of change — is controlled. Each individual line
is unpredictable. This is, structurally, the same trade-off that makes
thermodynamics work: you give up on the individual trajectory and gain
exact knowledge of the aggregate.</p>
<hr>
<h2 id="musical-temperature-as-a-compositional-parameter">Musical Temperature as a Compositional Parameter</h2>
<p>The analogy is worth making precise. In the physical gas, raising the
temperature $T$ increases $a = \sqrt{k_B T / m}$, which shifts the
peak of $f(v)$ to the right and widens the distribution. More molecules
have high speeds; the variance of speeds increases.</p>
<p>In <em>Pithoprakta</em>, raising the musical &ldquo;temperature&rdquo; has the same
effect: more instruments perform rapid glissandi; the pitch-space
trajectories are more varied; the texture becomes more active and
more turbulent. Lowering the temperature concentrates the glissando
speeds near zero — slow motion, near-stasis, long sustained tones
that change pitch only gradually. The orchestra cools.</p>
<p>This mapping is not metaphorical. Xenakis computed it. The score
contains numerically derived glissando speeds; the connection between the
perceptual temperature of the texture and the statistical parameter $T$ is
quantitative. When musicians speak of a passage &ldquo;heating up,&rdquo; they are
usually using a figure of speech. In <em>Pithoprakta</em>, they are describing
a thermodynamic fact.</p>
<hr>
<h2 id="the-poisson-distribution-and-event-density">The Poisson Distribution and Event Density</h2>
<p><em>Pithoprakta</em> uses a second physical model alongside the Maxwell-Boltzmann
distribution: the Poisson process, which governs the density of
independent, randomly occurring events.</p>
<p>If musical events (pizzicato attacks, bow changes, individual note entries)
occur at a mean rate of $\lambda$ events per second, the probability of
exactly $k$ events occurring in a time window of length $T$ is:</p>
$$P(N = k) = \frac{(\lambda T)^k\, e^{-\lambda T}}{k!}.$$<p>The Poisson distribution has a single parameter $\lambda$ that controls
both the mean and the variance (they are equal: $\langle N \rangle =
\mathrm{Var}(N) = \lambda T$). A high $\lambda$ produces a dense cluster
of events; a low $\lambda$ produces sparse, widely spaced events.</p>
<p>Xenakis used this to control the density of pizzicato attacks independently
of the glissando texture. A passage can be cool (slow glissandi) and dense
(many pizzicati), or hot and sparse, or any combination. The two
distributions operate on independent musical parameters — pitch motion and
event density — giving the composer a two-dimensional thermodynamic control
space over the texture.</p>
<hr>
<h2 id="markov-chains-analogique-a-and-analogique-b">Markov Chains: <em>Analogique A</em> and <em>Analogique B</em></h2>
<p>In <em>Analogique A</em> (for string orchestra, 1958–59) and its companion
<em>Analogique B</em> (for sinusoidal tones, same year), Xenakis moved to a
different stochastic framework: Markov chains.</p>
<p>A Markov chain is a sequence of states where the probability of
transitioning to the next state depends only on the current state. The
chain is specified by a transition matrix $P$, where $P_{ij}$ is the
probability of moving from state $i$ to state $j$:</p>
$$P_{ij} \geq 0, \qquad \sum_j P_{ij} = 1 \quad \forall\, i.$$<p>Under mild conditions (irreducibility and aperiodicity), the chain
converges to a unique stationary distribution $\pi$ satisfying:</p>
$$\pi P = \pi, \qquad \sum_i \pi_i = 1.$$<p>The convergence is geometric: if $\lambda_2$ is the second-largest eigenvalue
of $P$ in absolute value, then after $n$ steps the distribution $\pi^{(n)}$
satisfies $\|\pi^{(n)} - \pi\| \leq C |\lambda_2|^n$ for some constant $C$.
The gap $1 - |\lambda_2|$ — the <em>spectral gap</em> — controls how quickly the
chain forgets its initial state. A transition matrix with a large spectral
gap produces rapid convergence; one with $|\lambda_2| \approx 1$ produces
long-memory dependence between distant states. This is a compositional
choice: the spectral gap determines how quickly a piece&rsquo;s texture changes
character.</p>
<p>In <em>Analogique A</em>, Xenakis divided the sonic space into a grid of
cells defined by pitch register (high/middle/low), density
(sparse/medium/dense), and dynamic (soft/loud). Each &ldquo;screen&rdquo; — a brief
time window — occupies one cell in this grid. The progression of screens
through the piece is governed by transition probabilities: from a
high/dense/loud screen, there is some probability of moving to each
adjacent cell, specified by Xenakis&rsquo;s chosen transition matrix.</p>
<p>This is a Markov chain on a discrete state space of sonic textures. The
macroscopic trajectory of the piece — its overall movement through sound-
quality space — is determined by the transition matrix, which the composer
sets. The details of each screen are filled in stochastically, within the
parameters of the current state. Again, the individual is sacrificed to the
aggregate; control is exercised at the level of the distribution rather
than the event.</p>
<hr>
<h2 id="game-theory-duel-and-stratégie">Game Theory: <em>Duel</em> and <em>Stratégie</em></h2>
<p>The most extreme and, to my mind, most interesting of Xenakis&rsquo;s
formalisations is the use of game theory in <em>Duel</em> (1959) and <em>Stratégie</em>
(1962).</p>
<p>A <strong>two-player zero-sum game</strong> is specified by a payoff matrix $A \in
\mathbb{R}^{m \times n}$. Player 1 (the &ldquo;maximiser&rdquo;) chooses a row $i$;
Player 2 (the &ldquo;minimiser&rdquo;) chooses a column $j$; Player 1 receives payoff
$A_{ij}$ and Player 2 receives $-A_{ij}$. In a pure-strategy game, each
player selects a single action. In a <strong>mixed-strategy game</strong>, each player
chooses a probability distribution over their actions: Player 1 uses
$\mathbf{x} \in \Delta_m$ and Player 2 uses $\mathbf{y} \in \Delta_n$,
where $\Delta_k$ denotes the standard $(k-1)$-simplex.</p>
<p>The expected payoff to Player 1 under mixed strategies is:</p>
$$E(\mathbf{x}, \mathbf{y}) = \mathbf{x}^\top A\, \mathbf{y}.$$<p>Von Neumann&rsquo;s minimax theorem (1928) guarantees that:</p>
$$\max_{\mathbf{x} \in \Delta_m} \min_{\mathbf{y} \in \Delta_n}
\mathbf{x}^\top A\, \mathbf{y}
\;=\;
\min_{\mathbf{y} \in \Delta_n} \max_{\mathbf{x} \in \Delta_m}
\mathbf{x}^\top A\, \mathbf{y}
\;=\; v^*,$$<p>where $v^*$ is the <strong>value</strong> of the game. The pair $(\mathbf{x}^*,
\mathbf{y}^*)$ that achieves this saddle point is the Nash equilibrium:
neither player can improve their expected payoff by unilaterally deviating
from their equilibrium strategy.</p>
<p>In <em>Stratégie</em>, each conductor leads one orchestra. Each has nineteen
&ldquo;tactics&rdquo; — six basic musical textures (e.g., sustained chords, staccato
pizzicati, glissandi masses, silence) plus thirteen combinatorial tactics
that combine two or three of the basics. The payoff matrix is a
$19 \times 19$ integer matrix, also defined by Xenakis, specifying how
many points Conductor 1 scores when their orchestra plays tactic $i$ against
Conductor 2&rsquo;s tactic $j$. A referee tracks the score.</p>
<p>The conductors make decisions in real time during the performance, choosing
tactics based on what the other conductor is doing and on the evolving score.
The piece ends when one conductor reaches a predetermined score threshold.</p>
<p>The Nash equilibrium of the payoff matrix tells each conductor, in principle,
the optimal <em>distribution</em> over tactics to play: if both play optimally, the
expected score trajectory is determined. In practice, conductors are not
expected to compute mixed strategies on the podium; Xenakis&rsquo;s point is
structural. The game-theoretic formalism is used to design the payoff matrix
so that no tactic dominates — every choice has consequences that depend on
the opponent&rsquo;s choice — guaranteeing that the piece will always contain
genuine strategic tension regardless of who is conducting.</p>
<p><em>Duel</em> (1959) is the earlier, simpler version for two chamber orchestras.
<em>Stratégie</em> (1962) was premiered in April 1963 at the Venice Biennale with two conductors
competing live. The audience was aware of the game, of the score, and of
the payoff matrix. The premiere was by most accounts a success, though the
practical complications of running a zero-sum game in a concert hall
(including the question of whether conductors were actually computing Nash
equilibria or just following intuition) were never fully resolved.</p>
<hr>
<h2 id="formalized-music"><em>Formalized Music</em></h2>
<p>Xenakis assembled his theoretical framework in <em>Musiques formelles</em> (1963),
translated and expanded as <em>Formalized Music</em> (1971; revised edition 1992).
The book is one of the strangest documents in twentieth-century music theory:
part treatise, part manifesto, part mathematical appendix. It covers
stochastic composition, Markov chains, game theory, set theory, group theory,
and symbolic logic — all presented with the confidence of someone who is
equally at home in the engineering faculty and the concert hall, and with
the occasional obscurity of someone writing simultaneously for two audiences
who share almost no vocabulary.</p>
<p>The core argument is that musical composition can and should be treated as
the application of mathematical structures to sonic material, not because
mathematics makes music &ldquo;better&rdquo; but because mathematical structures are
the most powerful available tools for controlling relationships between
sounds at multiple scales simultaneously. The statistical distributions
control the macroscopic; the individual values vary within them. The game-
theoretic payoff matrix controls the strategic interaction; the individual
tactics fill in the details. Mathematics operates at the structural level
and leaves the acoustic surface free.</p>
<p>This is a different relationship between mathematics and music from the
ones in my earlier posts on <a href="/posts/messiaen-modes-group-theory/">group theory and Messiaen</a>
or <a href="/posts/euclidean-rhythms/">the Euclidean algorithm and world rhythms</a>.
In those cases, mathematics describes structure that already exists in the
music — structure the composers arrived at by ear. In Xenakis, mathematics
is the generative tool: the score is derived from the calculation.</p>
<hr>
<h2 id="what-the-analogy-does-and-does-not-do">What the Analogy Does and Does Not Do</h2>
<p>The Maxwell-Boltzmann analogy in <em>Pithoprakta</em> is exact in one direction
and approximate in another.</p>
<p>It is exact in the following sense: the glissando speeds Xenakis computed
for his 46 strings genuinely follow the Maxwell-Boltzmann distribution with
the parameters he chose. The score is a realisation of that distribution.
If you collect the glissando speeds from the score and plot their histogram,
you will find the characteristic $v^2 e^{-v^2/2a^2}$ shape.</p>
<p>It is approximate — or rather, it is analogical — in the sense that strings
in an orchestra are not molecules of a gas. They do not collide. They have
mass and inertia in a physical sense that has no direct mapping to
musical parameters. The temperature $T$ is not a temperature in any
thermodynamic sense; it is a compositional variable that Xenakis chose to
parameterise with the same symbol because the formal relationship is the
same. The analogy is structural, not ontological.</p>
<p>This is worth saying plainly because it is easy to be misled in both
directions: either to over-claim (the orchestra <em>is</em> a gas) or to dismiss
(the orchestra is <em>merely</em> labelled with physical vocabulary). The actual
claim is more modest and more interesting: the mathematical structure of the
Maxwell-Boltzmann distribution is the right tool for specifying a certain
kind of orchestral texture, namely one where individual elements vary
stochastically around a controlled macroscopic envelope. The physics
provides the formalism; the music provides the application. This is how
mathematics works in engineering, too.</p>
<hr>
<h2 id="the-centenary-and-what-remains">The Centenary and What Remains</h2>
<p>Xenakis died in 2001, by then partially deaf and with dementia. His centenary
in 2022 produced a wave of new performances, recordings, and scholarship
— including the <em>Meta-Xenakis</em> volume (Open Book Publishers, 2022), which
collects analyses of his compositional mathematics, his architectural work
(he designed the Philips Pavilion for Le Corbusier&rsquo;s Expo 58 in Brussels
using the same ruled-surface geometry he was using in <em>Metastaseis</em>), and
his political biography.</p>
<p>What remains resonant about his project is not the specific distributions
he chose — the Maxwell-Boltzmann is not the only or even necessarily the
best distribution for many musical applications — but the epistemological
position it represents. Xenakis insisted that the right question to ask
about a musical texture is not &ldquo;what is the note at beat 3 of bar 47?&rdquo; but
&ldquo;what is the distribution from which the events in this section are drawn?&rdquo;
This shift from individual determination to statistical control is precisely
the shift that makes thermodynamics possible as a science, and Xenakis was
the first composer to apply it deliberately and systematically.</p>
<p>When a composer writes &ldquo;let the
orchestra be a gas at temperature $T$&rdquo; and then actually computes the
consequences with Boltzmann&rsquo;s constant in front of him, I do not feel that
physics has been appropriated. I feel that it has been recognised — seen,
from a different direction, as the same thing it always was: a set of tools
for thinking about ensembles of interacting elements whose individual
behaviour is too complex to track but whose collective behaviour is not.</p>
<p>The orchestra is not a gas. But the Maxwell-Boltzmann distribution describes
it anyway.</p>
<hr>
<h2 id="references">References</h2>
<ul>
<li>
<p>Ames, C. (1989). The Markov process as a compositional model: A survey and
tutorial. <em>Leonardo</em>, 22(2), 175–187. <a href="https://doi.org/10.2307/1575226">https://doi.org/10.2307/1575226</a></p>
</li>
<li>
<p>Jedrzejewski, F. (2006). <em>Mathematical Theory of Music.</em> Delatour France /
IRCAM.</p>
</li>
<li>
<p>Nash, J. F. (1950). Equilibrium points in $n$-person games. <em>Proceedings of
the National Academy of Sciences</em>, 36(1), 48–49.
<a href="https://doi.org/10.1073/pnas.36.1.48">https://doi.org/10.1073/pnas.36.1.48</a></p>
</li>
<li>
<p>Nierhaus, G. (2009). <em>Algorithmic Composition: Paradigms of Automated Music
Generation.</em> Springer.</p>
</li>
<li>
<p>Matossian, N. (2005). <em>Xenakis</em> (revised ed.). Moufflon Publications.</p>
</li>
<li>
<p>Solomos, M. (Ed.). (2022). <em>Meta-Xenakis.</em> Open Book Publishers.
<a href="https://doi.org/10.11647/OBP.0313">https://doi.org/10.11647/OBP.0313</a></p>
</li>
<li>
<p>von Neumann, J. (1928). Zur Theorie der Gesellschaftsspiele. <em>Mathematische
Annalen</em>, 100(1), 295–320. <a href="https://doi.org/10.1007/BF01448847">https://doi.org/10.1007/BF01448847</a></p>
</li>
<li>
<p>von Neumann, J., &amp; Morgenstern, O. (1944). <em>Theory of Games and Economic
Behavior.</em> Princeton University Press.</p>
</li>
<li>
<p>Xenakis, I. (1992). <em>Formalized Music: Thought and Mathematics in
Composition</em> (revised ed.). Pendragon Press.
(Originally published as <em>Musiques formelles</em>, La Revue Musicale, 1963.)</p>
</li>
</ul>
<hr>
<h2 id="changelog">Changelog</h2>
<ul>
<li><strong>2026-01-14</strong>: Corrected the description of <em>Stratégie</em> (1962): each conductor has nineteen tactics (six basic plus thirteen combinatorial), with a 19 x 19 payoff matrix — not six tactics and a 6 x 6 matrix. The six-tactic, 6 x 6 description applies to the earlier <em>Duel</em> (1959).</li>
<li><strong>2026-01-14</strong>: Added &ldquo;in April 1963&rdquo; to the <em>Stratégie</em> premiere sentence. The composition date is 1962; the premiere took place on 25 April 1963 at the Venice Biennale.</li>
<li><strong>2026-01-14</strong>: Changed &ldquo;studying briefly with Honegger&rdquo; to &ldquo;approaching Honegger (who was too ill to teach).&rdquo; Xenakis sought instruction from Honegger circa 1949, but Honegger was in declining health and did not take him as a student.</li>
</ul>
]]></content:encoded>
    </item>
    <item>
      <title>Star Polygons and Drum Machines</title>
      <link>https://sebastianspicker.github.io/posts/tool-star-polygons-drum-machines/</link>
      <pubDate>Mon, 07 Jul 2025 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/tool-star-polygons-drum-machines/</guid>
      <description>The {7/2} heptagram is not only a symbol. It is a traversal algorithm over seven beat positions. Because 7 is prime, that traversal never gets trapped in a sub-orbit.</description>
      <content:encoded><![CDATA[<p>Two star polygons appear in Danny Carey&rsquo;s visual vocabulary, and they are not the same star. One is open, almost friendly — seven points connected by relatively shallow angles. The other is sharper, the points more acute. They look like variations on a theme, which is accurate: both are drawn on seven equally spaced vertices, but one connects every second vertex and the other connects every third.</p>
<p>In Schläfli notation — the system for naming regular star polygons — these are $\{7/2\}$ and $\{7/3\}$ <a href="#ref-1">[1]</a>. Both appear in Tool&rsquo;s artwork, in Thelemic symbolism, in medieval Islamic geometric patterns, and on the floor plans of cathedrals. They are the most visually intricate star polygons that can be drawn in a single closed stroke before the figure becomes illegible.</p>
<p>Both of them have a property that five-pointed and six-pointed stars do not share: they visit every vertex before closing. This is a consequence of 7 being prime. And it turns out to matter for how rhythmic accent cycles are built.</p>
<h2 id="the-schläfli-symbol">The Schläfli Symbol</h2>
<p>A regular star polygon $\{n/k\}$ is constructed by placing $n$ points evenly on a circle and connecting every $k$-th point in sequence until the path closes. The structural key is a single number:</p>
$$d = \gcd(n, k).$$<p>If $d = 1$, the traversal visits all $n$ vertices before returning to the start — a single connected figure. If $d > 1$, the path visits only $n/d$ vertices before closing, and the full figure consists of $d$ separate copies of the smaller star $\{(n/d)\,/\,(k/d)\}$.</p>
<p>The most familiar example of the disconnected case: $\{6/2\}$, the Star of David. Here $\gcd(6,2) = 2$, so the figure breaks into two copies of $\{3/1\} = \{3\}$ — two overlapping equilateral triangles. The traversal starting at vertex 1 visits $1 \to 3 \to 5 \to 1$, leaving vertices 2, 4, 6 entirely unvisited.</p>
<p>The pentagram $\{5/2\}$ is connected: $\gcd(5,2)=1$, traversal $1 \to 3 \to 5 \to 2 \to 4 \to 1$, all five vertices.</p>
<p>For $n=7$:</p>
<ul>
<li>$\{7/2\}$: $\gcd(7,2)=1$, traversal $1 \to 3 \to 5 \to 7 \to 2 \to 4 \to 6 \to 1$, all seven vertices.</li>
<li>$\{7/3\}$: $\gcd(7,3)=1$, traversal $1 \to 4 \to 7 \to 3 \to 6 \to 2 \to 5 \to 1$, all seven vertices.</li>
</ul>
<p>Both connected. Neither leaves any vertex unvisited.</p>
<h2 id="the-group-theory">The Group Theory</h2>
<p>The traversal of $\{n/k\}$ is an instance of a standard construction in modular arithmetic: the <strong>orbit</strong> of an element under repeated addition in $\mathbb{Z}/n\mathbb{Z}$.</p>
<p>Label the $n$ vertices $0, 1, \ldots, n-1$. Starting at vertex 0, the traversal visits:</p>
$$0, \quad k \bmod n, \quad 2k \bmod n, \quad 3k \bmod n, \quad \ldots$$<p>The orbit of 0 under the action of $+k$ is the subgroup of $\mathbb{Z}/n\mathbb{Z}$ generated by $k$. By a standard result, this subgroup has size $n / \gcd(n,k)$.</p>
<ul>
<li>When $\gcd(n,k) = 1$: orbit size $= n$. The traversal visits every vertex.</li>
<li>When $\gcd(n,k) = d > 1$: orbit size $= n/d$. The traversal visits only a fraction of the vertices.</li>
</ul>
<p>For prime $n$: $\gcd(n,k) = 1$ for every $1 \leq k \leq n-1$, without exception. <strong>Every traversal is complete.</strong> There is no step size that traps the path in a proper sub-orbit before visiting all vertices. This follows directly from the fact that a prime has no divisors other than 1 and itself, so $\mathbb{Z}/p\mathbb{Z}$ has no non-trivial subgroups (Lagrange&rsquo;s theorem: any subgroup of a group of prime order must have order 1 or $p$).</p>
<p>This is the specific property that makes 7 — and any prime — rhythmically fertile.</p>
<h2 id="the-contrast-with-six">The Contrast with Six</h2>
<p>The comparison with $n = 6$ is the clearest illustration.</p>
<p>In $\mathbb{Z}/6\mathbb{Z}$, the possible step sizes are 1, 2, 3, 4, 5. Their orbits:</p>
<table>
  <thead>
      <tr>
          <th>Step $k$</th>
          <th>$\gcd(6,k)$</th>
          <th>Orbit size</th>
          <th>Vertices visited</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td>1</td>
          <td>1</td>
          <td>6</td>
          <td>0,1,2,3,4,5 (the hexagon)</td>
      </tr>
      <tr>
          <td>2</td>
          <td>2</td>
          <td>3</td>
          <td>0,2,4 only</td>
      </tr>
      <tr>
          <td>3</td>
          <td>3</td>
          <td>2</td>
          <td>0,3 only</td>
      </tr>
      <tr>
          <td>4</td>
          <td>2</td>
          <td>3</td>
          <td>0,2,4 only</td>
      </tr>
      <tr>
          <td>5</td>
          <td>1</td>
          <td>6</td>
          <td>0,5,4,3,2,1 (the hexagon reversed)</td>
      </tr>
  </tbody>
</table>
<p>The only step sizes that visit all six vertices are 1 and 5 — both of which just traverse the hexagon itself, not a star. Every non-trivial star polygon on six points gets trapped. $\{6/2\}$ visits only half the vertices. $\{6/3\}$ visits only two. There is no connected six-pointed star that isn&rsquo;t either the hexagon or a compound figure.</p>
<p>In $\mathbb{Z}/7\mathbb{Z}$, every step from 2 to 5 generates the full group:</p>
<table>
  <thead>
      <tr>
          <th>Step $k$</th>
          <th>$\gcd(7,k)$</th>
          <th>Orbit size</th>
          <th>Traversal</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td>2</td>
          <td>1</td>
          <td>7</td>
          <td>1,3,5,7,2,4,6</td>
      </tr>
      <tr>
          <td>3</td>
          <td>1</td>
          <td>7</td>
          <td>1,4,7,3,6,2,5</td>
      </tr>
      <tr>
          <td>4</td>
          <td>1</td>
          <td>7</td>
          <td>1,5,2,6,3,7,4</td>
      </tr>
      <tr>
          <td>5</td>
          <td>1</td>
          <td>7</td>
          <td>1,6,4,2,7,5,3</td>
      </tr>
  </tbody>
</table>
<p>All four non-trivial step sizes give connected traversals. Both are stars. Both visit every vertex. This is not a coincidence: it is the algebraic signature of primality.</p>
<h2 id="from-geometry-to-rhythm">From Geometry to Rhythm</h2>
<p>The connection to drumming is direct. Here is the mechanism.</p>
<p>Consider a repeating rhythmic figure of 7 beats — a bar of 7/8, say, with positions 1 through 7. An <a href="/posts/euclidean-rhythms/">earlier post</a> discussed Euclidean rhythms: the algorithm that distributes $k$ onset positions as evenly as possible among $n$ slots. That is a problem of <em>selection</em> — which of the $n$ positions to activate.</p>
<p>The star polygon traversal asks a different question. Given that all $n$ positions are present, in what <em>order of emphasis</em> should they be related, such that each accent is a fixed distance from the last? The traversal of $\{n/k\}$ answers this: accent position $1$, then $1+k$, then $1+2k$, and so on modulo $n$.</p>
<p>For $\{7/2\}$: the accent cycle within a single bar runs $1 \to 3 \to 5 \to 7 \to 2 \to 4 \to 6$. Each featured beat is two positions ahead of the last.</p>
<p>Now project this across multiple bars. In bar 1, the primary accent sits on beat 1. In bar 2, if the accent shifts by 2, it lands on beat 3. Bar 3: beat 5. Bar 4: beat 7. Bar 5: beat 2. Bar 6: beat 4. Bar 7: beat 6. Bar 8: beat 1 again.</p>
<p>The accent takes <strong>seven bars</strong> to return to its starting position. Because $\gcd(2,7) = 1$, the step of 2 generates all of $\mathbb{Z}/7\mathbb{Z}$: every beat position receives the accent exactly once before the cycle resets. The resulting large-scale figure is $7 \times 7 = 49$ beats long — a super-phrase built from a single local rule.</p>
<p>The $\{7/3\}$ traversal generates the same exhaustiveness with a different path. Step 3 gives $1 \to 4 \to 7 \to 3 \to 6 \to 2 \to 5$: a seven-bar accent cycle that visits every position before repeating, but with wider spacing between accented beats, creating a different feel over the same underlying meter.</p>
<p>A six-beat figure with step 2 cannot do this. The accent visits only beats 1, 3, 5 — half the cycle — and loops back without touching beats 2, 4, 6. A drummer building phrase-level architecture from a six-beat grid is working with a more fragmented material.</p>
<h2 id="two-problems-one-prime">Two Problems, One Prime</h2>
<p>It is worth stating the relationship between the star polygon approach and Euclidean rhythms precisely, because the two are sometimes conflated <a href="#ref-2">[2]</a>.</p>
<p>The Euclidean algorithm distributes $k$ onsets among $n$ positions with maximal evenness. The result is a <em>subset</em> of the $n$ positions — a selection. The primality of $n$ matters here too: because $\gcd(k,p) = 1$ for prime $p$ and any $1 \leq k \leq p-1$, the Euclidean rhythm $E(k,p)$ always achieves its theoretical maximum of evenness. There are no divisibility shortcuts that cause clumping.</p>
<p>The star polygon traversal selects <em>no subset</em> — it relates all $n$ positions via a cyclic permutation. The primality of $n$ matters here because it guarantees that every non-trivial cyclic permutation (every step size $k$ with $1 < k < n$) generates the full group, visiting all positions before repeating.</p>
<p>Same arithmetic property — $\gcd(k,p) = 1$ for all non-zero $k$ — but the two problems ask different things of it. Euclidean rhythms use it to guarantee dense coverage. Star polygon traversals use it to guarantee no sub-orbit trapping.</p>
<h2 id="the-compound-structure">The Compound Structure</h2>
<p>Written out explicitly, the $\{7/2\}$ accent pattern over seven bars looks like this — with bold marking the featured beat in each bar:</p>
$$\begin{array}{rccccccc}
\text{bar 1:} & \mathbf{1} & 2 & 3 & 4 & 5 & 6 & 7 \\
\text{bar 2:} & 1 & 2 & \mathbf{3} & 4 & 5 & 6 & 7 \\
\text{bar 3:} & 1 & 2 & 3 & 4 & \mathbf{5} & 6 & 7 \\
\text{bar 4:} & 1 & 2 & 3 & 4 & 5 & 6 & \mathbf{7} \\
\text{bar 5:} & 1 & \mathbf{2} & 3 & 4 & 5 & 6 & 7 \\
\text{bar 6:} & 1 & 2 & 3 & \mathbf{4} & 5 & 6 & 7 \\
\text{bar 7:} & 1 & 2 & 3 & 4 & 5 & \mathbf{6} & 7 \\
\end{array}$$<p>Each bar is metrically identical. The large-scale accent — which beat carries the phrase-level emphasis — traces the traversal path of the $\{7/2\}$ star polygon across the seven-bar cycle.</p>
<p>This is the kind of large-scale rhythmic architecture visible in a great deal of Tool&rsquo;s output. Whether Danny Carey explicitly constructs accent cycles from star polygon traversal paths, or whether the same structure emerges from his intuitive sense of how prime time signatures behave, produces the same result. The mathematics and the musical instinct point toward the same pattern.</p>
<h2 id="why-the-heptagram">Why the Heptagram</h2>
<p>The full mathematical picture of why seven-fold symmetry is special — why the regular heptagon cannot be constructed by compass and straightedge, what the minimal polynomial of $\cos(2\pi/7)$ implies about the heptagon&rsquo;s position outside the constructible world, and how the Galois group of the cyclotomic field over $\mathbb{Q}$ carries the obstruction — is developed in the companion post <a href="/posts/tool-impossible-heptagon/">The Impossible Heptagon</a>.</p>
<p>The short version, for the purposes of this post: seven is the smallest odd prime that is not a Fermat prime ($2^{2^j}+1$). This algebraic accident places it outside the reach of ruler-and-compass construction — the heptagon exists as an ideal but cannot be manifested by the classical tools. Its star polygons are the accessible shadows of an inaccessible form. And its primality, in both the constructibility sense and the traversal sense, is precisely what makes it inexhaustible as a rhythmic resource.</p>
<p>The Fibonacci structure in &ldquo;Lateralus&rdquo; <a href="#ref-3">[3]</a>, the group theory underlying twelve-tone equal temperament <a href="#ref-4">[4]</a>, and the Euclidean rhythm algorithm <a href="#ref-5">[5]</a> are all different facets of the same observation: mathematical structure, introduced as compositional constraint, generates musical complexity that cannot easily be produced by intuition alone. The star polygon is another instance. The drummer who keeps a heptagram on his kit has found, by a non-mathematical route, an object with a precise and interesting mathematical identity.</p>
<h2 id="references">References</h2>
<p><span id="ref-1"></span>[1] Coxeter, H.S.M. (1973). <em>Regular Polytopes</em> (3rd ed.). Dover. Ch. 2.</p>
<p><span id="ref-2"></span>[2] Toussaint, G. (2013). <em>The Geometry of Musical Rhythm: What Makes a &ldquo;Good&rdquo; Rhythm Good?</em> CRC Press.</p>
<p><span id="ref-3"></span>[3] See <a href="/posts/fibonacci-lateralus/">Fibonacci and Lateralus</a> on this blog.</p>
<p><span id="ref-4"></span>[4] See <a href="/posts/twelve-tet-group-theory-musical-tuning/">Twelve-TET and Group Theory</a> on this blog.</p>
<p><span id="ref-5"></span>[5] See <a href="/posts/euclidean-rhythms/">Euclidean Rhythms</a> on this blog.</p>
]]></content:encoded>
    </item>
    <item>
      <title>The Oldest Algorithm in the World Plays the Clave</title>
      <link>https://sebastianspicker.github.io/posts/euclidean-rhythms/</link>
      <pubDate>Mon, 07 Apr 2025 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/euclidean-rhythms/</guid>
      <description>Euclid&amp;rsquo;s algorithm for computing greatest common divisors, applied to the problem of distributing k drum beats as evenly as possible among n time slots, generates rhythmic patterns that match traditional timelines from West Africa, Cuba, Brazil, Turkey, and the Balkans. An algorithm devised in Alexandria around 300 BCE encodes the rhythmic structure of musical cultures that had no contact with ancient Greek mathematics.</description>
      <content:encoded><![CDATA[<p><em>The first time I encountered the West African standard bell pattern it was in a
Music and Physics seminar. The lecturer played a twelve-beat cycle on a wood
block — seven strokes distributed unevenly but with a rightness that arrested the
room. She then played the Cuban clave, the bossa nova timeline, a Bulgarian
aksak rhythm. Different cultures, different instruments, different centuries. She
asked whether there was a pattern. There was. It is named after a mathematician
who died around 270 BCE.</em></p>
<hr>
<h2 id="euclids-algorithm">Euclid&rsquo;s Algorithm</h2>
<p>Every student who has taken a number theory course has encountered the algorithm
for computing the greatest common divisor of two positive integers. Given $a \geq
b$, repeatedly replace $(a, b)$ with $(b, a \bmod b)$ until the remainder is
zero; the last non-zero remainder is the GCD.</p>
<p>For example, $\gcd(8, 3)$:</p>
$$8 = 2 \times 3 + 2 \;\Rightarrow\; \gcd(8, 3) = \gcd(3, 2)$$<p>
</p>
$$3 = 1 \times 2 + 1 \;\Rightarrow\; \gcd(3, 2) = \gcd(2, 1)$$<p>
</p>
$$2 = 2 \times 1 + 0 \;\Rightarrow\; \gcd(2, 1) = 1.$$<p>Three steps, result 1 (8 and 3 are coprime). The algorithm is efficient: the
number of steps is proportional to the number of digits in the smaller input.
It appears in Book VII of Euclid&rsquo;s <em>Elements</em>, composed around 300 BCE, making
it the oldest non-trivial algorithm in the Western mathematical tradition.</p>
<hr>
<h2 id="distributing-onsets-toussaints-observation">Distributing Onsets: Toussaint&rsquo;s Observation</h2>
<p>In 2005, Godfried Toussaint — a computer scientist and ethnomusicologist at
McGill University — published the observation that the problem of distributing
$k$ musical onsets as evenly as possible among $n$ time slots has the same
recursive structure as Euclid&rsquo;s algorithm applied to the pair $(k, n-k)$
(Toussaint, 2005).</p>
<p>The algorithm that solves this distribution problem was independently discovered
in nuclear physics. Bjorklund (2003), working on timing systems for the
Spallation Neutron Source particle accelerator at Oak Ridge, needed to distribute
$k$ beam-extraction pulses as evenly as possible among $n$ machine cycles.
The algorithm he derived — Bjorklund&rsquo;s algorithm — is mathematically equivalent
to the Euclidean algorithm applied to the same pair of integers.</p>
<p>The resulting pattern is denoted $E(k, n)$: the <strong>Euclidean rhythm</strong> with $k$
onsets distributed among $n$ pulses. A 1 denotes an onset; a 0 denotes a rest.</p>
<hr>
<h2 id="working-through--the-tresillo">Working Through $E(3, 8)$: The Tresillo</h2>
<p>Let us derive $E(3, 8)$ — 3 onsets distributed in 8 pulses — step by step.</p>
<p><strong>Start</strong>: 3 onset groups and 5 rest groups:</p>
$$[1]\; [1]\; [1]\; [0]\; [0]\; [0]\; [0]\; [0]$$<p><strong>Step 1</strong>: Distribute one rest group into each onset group, pairing until the
shorter list is exhausted. Three pairs, with $5 - 3 = 2$ rest groups remaining:</p>
$$[1,0]\; [1,0]\; [1,0]\; [0]\; [0]$$<p><strong>Step 2</strong>: Now 3 longer groups and 2 shorter groups. Distribute one shorter group
into each longer group, $3 - 2 = 1$ longer group unpaired:</p>
$$[1,0,0]\; [1,0,0]\; [1,0]$$<p><strong>Step 3</strong>: The two group types have different lengths and only one group of the
shorter type remains; no further pairing is possible. Read the sequence
left to right:</p>
$$E(3, 8) = [1, 0, 0, 1, 0, 0, 1, 0].$$<p>This is the <strong>Cuban tresillo</strong> — one of the foundational rhythmic cells of
Afro-Cuban music, used across son, salsa, and mambo. Its onset positions are
$\{0, 3, 6\}$, giving gap sizes $[3, 3, 2]$: two wide gaps and one narrow gap,
arranged as evenly as the integers allow.</p>
<p>The parallel with Euclid&rsquo;s algorithm is direct. In the division $8 = 2 \times 3</p>
<ul>
<li>2$, the quotient 2 gives the number of pairing steps before a remainder
appears, and the remainder 2 gives the number of groups in the shorter list at
each intermediate stage. The recursion $\gcd(8, 3) \to \gcd(3, 2) \to \gcd(2,
1)$ mirrors the three steps above.</li>
</ul>
<hr>
<h2 id="the-gap-structure">The Gap Structure</h2>
<p>For any $E(k, n)$, the spacing between consecutive onsets takes exactly two
values:</p>
$$\text{gap} \in \left\{\left\lfloor \frac{n}{k} \right\rfloor,\
\left\lceil \frac{n}{k} \right\rceil\right\}.$$<p>The number of each gap size is determined by the constraint that all $k$ gaps
sum to $n$. Writing $\alpha = n \bmod k$:</p>
$$\alpha \cdot \left\lceil \frac{n}{k} \right\rceil \;+\; (k - \alpha) \cdot
\left\lfloor \frac{n}{k} \right\rfloor = n.$$<p>So $E(k,n)$ has $\alpha$ gaps of the larger size and $k - \alpha$ gaps of the
smaller size. The Euclidean property is that these two gap types are distributed
<em>as evenly as possible</em> among themselves — not clustered at one end of the cycle
but interleaved. A cycle that maximises the minimum distance between any two
consecutive onsets has this property; it is called <strong>maximally even</strong> (Clough
and Douthett, 1991).</p>
<p>For $E(3, 8)$: $\lfloor 8/3 \rfloor = 2$, $\lceil 8/3 \rceil = 3$,
$\alpha = 8 \bmod 3 = 2$. Two gaps of 3, one gap of 2. Gap sequence $[3, 3, 2]$.
Maximum-evenness is why the tresillo sounds &ldquo;right&rdquo; even though it is
asymmetric: the asymmetry is the smallest possible deviation from perfect
regularity.</p>
<hr>
<h2 id="a-gallery-of-world-rhythms">A Gallery of World Rhythms</h2>
<p>The following table, derived from Toussaint (2005, 2020), shows Euclidean rhythms
alongside their ethnomusicological identifications. Asterisks mark patterns given
as rotations of the canonical form.</p>
<table>
  <thead>
      <tr>
          <th>Pattern</th>
          <th>Gap structure</th>
          <th>Musical tradition</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td>$E(2,3) = [1,0,1]$</td>
          <td>$[2,1]$</td>
          <td>Iambic foot; West African, Balkan</td>
      </tr>
      <tr>
          <td>$E(3,8) = [1,0,0,1,0,0,1,0]$</td>
          <td>$[3,3,2]$</td>
          <td>Cuban <em>tresillo</em>; Flamenco</td>
      </tr>
      <tr>
          <td>$E(5,8) = [1,0,1,1,0,1,1,0]$</td>
          <td>$[2,1,2,1,2]^*$</td>
          <td>Cuban <em>cinquillo</em></td>
      </tr>
      <tr>
          <td>$E(4,9) = [1,0,0,1,0,1,0,1,0]^*$</td>
          <td>$[3,2,2,2]^*$</td>
          <td>Turkish <em>aksak</em> patterns</td>
      </tr>
      <tr>
          <td>$E(7,12) = [1,0,1,1,0,1,0,1,1,0,1,0]$</td>
          <td>$[2,1,2,2,1,2,2]^*$</td>
          <td>West African standard bell</td>
      </tr>
      <tr>
          <td>$E(9,16)$</td>
          <td>$[2,1,2,2,1,2,2,1,2,1]^*$</td>
          <td>Brazilian and West African</td>
      </tr>
      <tr>
          <td>$E(13,24)$</td>
          <td></td>
          <td>South Indian (Carnatic) <em>tāla</em></td>
      </tr>
  </tbody>
</table>
<p>Three of these are worth examining in more detail.</p>
<p><strong>$E(5,8)$: the cinquillo.</strong> Five onsets in eight pulses: $\lfloor 8/5 \rfloor =
1$, $\lceil 8/5 \rceil = 2$, $\alpha = 3$. Three gaps of 2 and two gaps of 1.
Gap sequence $[2,1,2,1,2]$ or a rotation thereof. The <em>cinquillo</em> is a
fundamental pattern in Cuban music, used as a melodic rhythmic figure in the
nineteenth-century contradanza and in much of what followed.</p>
<p><strong>$E(7,12)$: the West African standard bell.</strong> Seven onsets in a twelve-beat
cycle: $\lfloor 12/7 \rfloor = 1$, $\lceil 12/7 \rceil = 2$, $\alpha = 5$.
Five gaps of 2 and two gaps of 1. This timeline — used across the Ewe, Akan,
and many other traditions in West Africa — is the cyclic reference structure
against which other rhythmic layers are measured in ensemble drumming. It is also
the pitch-class set $\{0, 2, 4, 5, 7, 9, 11\}$ — the Western diatonic scale,
translated from pitch to rhythm. That the same maximally-even distribution
describes both the diatonic scale in pitch space and the standard bell in rhythm
is one of the more remarkable coincidences in mathematical music theory.</p>
<p><strong>Universality across non-connected cultures.</strong> The tresillo $E(3,8)$ appears
independently in Cuban music, Flamenco, Namibian Juǀ&rsquo;hoansi music, and
medieval Persian music (Toussaint, 2020). These traditions had no common musical
ancestor that could have transmitted the pattern. The Euclidean algorithm
produces what maximum evenness demands, and maximum evenness turns out to be
what these rhythmic traditions independently converged on.</p>
<hr>
<h2 id="circular-notation-and-necklaces">Circular Notation and Necklaces</h2>
<p>Euclidean rhythms are most naturally represented as <strong>necklaces</strong> — equivalence
classes of binary sequences under cyclic rotation. All rotations of $E(3,8)$
represent the same rhythmic structure with a different starting downbeat: the
musical identity is independent of which position is designated &ldquo;beat 1.&rdquo;</p>
<p>In circular notation, place $n$ equally spaced dots on a circle and mark the $k$
onset positions. The pattern is immediately visible: the $k$ onset-dots divide
the circle as evenly as possible. For $E(7,12)$, the seven onset dots on a
twelve-position circle look like the seven vertices of a near-regular heptagon
inscribed in a dodecagon. For $E(3,8)$, the three onset dots form a near-
equilateral triangle.</p>
<p>This geometric representation makes the maximum-evenness property transparent in
a way that the linear binary string does not. It also makes clear why Euclidean
rhythms feel &ldquo;balanced&rdquo; when played: the onset dots distribute the &ldquo;weight&rdquo; of
the cycle as uniformly as the integer constraints allow.</p>
<p>The mathematical theory of necklaces belongs to combinatorics on words.
Euclidean rhythms correspond to specific equivalence classes of binary sequences
known as <em>Christoffel words</em> (Lothaire, 2002): words over the alphabet $\{0,1\}$
whose combinatorial properties encode the slope of a line segment, which brings
us to the third independent context in which the same algorithm appears.</p>
<hr>
<h2 id="the-bresenham-connection">The Bresenham Connection</h2>
<p>Jack Bresenham&rsquo;s line algorithm (1965) rasterises a line from $(0,0)$ to $(n,k)$
on a grid of integer pixels. At each column $x$, the algorithm tracks whether the
fractional error accumulated since the last row increment exceeds $\frac{1}{2}$,
and if so, increments the row and resets the error. The sequence of column
positions at which the row increments is the onset pattern $E(k,n)$.</p>
<p>Formally, an onset occurs at position $m$ in $E(k,n)$ if and only if:</p>
$$\left\lfloor \frac{(m+1)\, k}{n} \right\rfloor > \left\lfloor \frac{m\, k}{n} \right\rfloor.$$<p>Equivalently, the onset positions themselves form the sequence:</p>
$$s_j = \left\lfloor \frac{j \cdot n}{k} \right\rfloor, \qquad j = 0, 1, \ldots, k-1.$$<p>For $E(3,8)$: $s_0 = 0$, $s_1 = \lfloor 8/3 \rfloor = 2$,
$s_2 = \lfloor 16/3 \rfloor = 5$, giving onset positions $\{0, 2, 5\}$ — a
rotation of the tresillo.</p>
<p>This is exactly the Bresenham increment condition. Drawing the line from $(0,0)$
to $(8,3)$ and marking where the $y$-coordinate takes a step produces the onset
positions $\{2, 5, 7\}$ — a rotation of the tresillo $\{0, 3, 6\}$.</p>
<p>Three independent fields — ancient Greek number theory, Afro-Caribbean percussion,
and 1960s computer graphics — converge on the same mathematical object. This is
not a coincidence. All three are solving the same fundamental problem: how to
distribute $k$ discrete events as evenly as possible among $n$ slots. When the
problem is universal, its solution is too.</p>
<hr>
<h2 id="euclidean-rhythms-in-contemporary-practice">Euclidean Rhythms in Contemporary Practice</h2>
<p>Toussaint&rsquo;s 2005 paper was primarily a contribution to computational
ethnomusicology, but it reached electronic music production rapidly. Euclidean
rhythm sequencers are now standard in modular synthesis (dedicated Eurorack
hardware modules exist under names including &ldquo;Euclidean&rdquo; and &ldquo;Erica Synths
Pico&rdquo;) and digital audio workstations (as Max for Live devices and software
plug-ins). The interface is minimal: set $k$ and $n$, adjust the rotation offset,
and hear the resulting timeline immediately.</p>
<p>This has opened a compositional mode in which the mathematical structure is
operational: a producer constructing a layered African-style polyrhythm by
stacking $E(3,8)$, $E(5,8)$, and $E(7,8)$ on different instruments is — whether
they know it or not — computing the Euclidean algorithm three times and listening
to the result.</p>
<hr>
<h2 id="implications-for-teaching-rhythm">Implications for Teaching Rhythm</h2>
<p>Music conservatories in the European tradition teach rhythm almost entirely
through Western notation: time signatures, note values, dotted notes, ties. This
system is well-suited to the repertoire it was designed for. It handles Euclidean
rhythms awkwardly. The tresillo $E(3,8)$ requires either a triplet feel against
a binary pulse or a notation involving a dotted quarter note followed by a dotted
quarter and a quarter, which correctly represents the sound but obscures the
structural principle entirely.</p>
<p>The Euclidean framework suggests a different pedagogical starting point. Rather
than beginning from the bar line and asking how notes fill it, begin from the
cycle length $n$ and the onset count $k$ and ask how to distribute the onsets
as evenly as possible. The answer is always computable and always produces a
recognisable rhythm.</p>
<p>For students who encounter West African, Afro-Cuban, or Middle Eastern music —
which conservatory students increasingly do — having a framework that makes these
rhythms <em>structurally necessary</em> rather than <em>culturally exotic</em> changes the
pedagogical relationship fundamentally. The tresillo is not a deviation from
&ldquo;normal&rdquo; rhythm. It is the unique maximally even solution to the problem of
placing three beats in eight pulses. That the same algorithm appeared in a 300
BCE Alexandrian text on number theory is an accident of the history of mathematics.
That it sounds right is not.</p>
<p>Whether conservatory curricula are ready to incorporate the Euclidean framework
alongside Western notation is a separate question. The mathematics does not
demand it. But it offers a language for rhythm that transcends the Western
bar-line without abandoning precision — and that seems worth something, especially
in a world where the music students will perform and teach is no longer
exclusively European.</p>
<hr>
<h2 id="references">References</h2>
<ul>
<li>
<p>Bjorklund, E. (2003). <em>The theory of rep-rate pattern generation in the SNS
timing system.</em> Technical Report SNS-NOTE-CNTRL-99, Spallation Neutron Source,
Oak Ridge National Laboratory.</p>
</li>
<li>
<p>Bresenham, J. E. (1965). Algorithm for computer control of a digital plotter.
<em>IBM Systems Journal</em>, 4(1), 25–30. <a href="https://doi.org/10.1147/sj.41.0025">https://doi.org/10.1147/sj.41.0025</a></p>
</li>
<li>
<p>Clough, J., &amp; Douthett, J. (1991). Maximally even sets. <em>Journal of Music
Theory</em>, 35(1–2), 93–173. <a href="https://doi.org/10.2307/843811">https://doi.org/10.2307/843811</a></p>
</li>
<li>
<p>Lothaire, M. (2002). <em>Algebraic Combinatorics on Words.</em> Cambridge University
Press.</p>
</li>
<li>
<p>Toussaint, G. T. (2005). The Euclidean algorithm generates traditional musical
rhythms. In R. Sarhangi &amp; J. Sharp (Eds.), <em>Proceedings of BRIDGES 2005:
Mathematical Connections in Art, Music, and Science</em> (pp. 47–56). Bridges
Conference.</p>
</li>
<li>
<p>Toussaint, G. T. (2020). <em>The Geometry of Musical Rhythm: What Makes a &ldquo;Good&rdquo;
Rhythm Good?</em> (2nd ed.). Chapman &amp; Hall/CRC Press.</p>
</li>
</ul>
]]></content:encoded>
    </item>
    <item>
      <title>When Musicians Lock In: Coupled Oscillators and the Physics of Ensemble Synchronisation</title>
      <link>https://sebastianspicker.github.io/posts/kuramoto-ensemble-sync/</link>
      <pubDate>Thu, 08 Feb 2024 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/kuramoto-ensemble-sync/</guid>
      <description>Every ensemble faces the same physical problem: N oscillators with slightly different natural frequencies trying to synchronise through a shared coupling channel. The Kuramoto model — developed by a statistical physicist to describe fireflies, neurons, and power grids — applies directly to musicians. It predicts a phase transition between incoherence and synchrony, quantifies why latency destroys networked ensemble performance, and connects to recent EEG studies of inter-brain synchronisation.</description>
      <content:encoded><![CDATA[<p><em>The problem is ancient and the language for it is recent. In any ensemble — a
string quartet, a jazz rhythm section, an orchestra — musicians with slightly
different internal tempos must stay together. They do this by listening to each
other. But what, exactly, does &ldquo;listening to each other&rdquo; do to their timing? And
what happens when the listening channel is imperfect — delayed by the speed of
sound across a wide stage, or by a network cable crossing a continent? The answer
involves a differential equation that was not written to describe music.</em></p>
<p><em>This post extends the latency analysis in <a href="/posts/nmp-latency-lola-mvtp/">Latency in Networked Music
Performance</a> with the dynamical systems framework
that underlies it.</em></p>
<hr>
<h2 id="two-clocks-on-a-board">Two Clocks on a Board</h2>
<p>The first documented observation of coupled-oscillator synchronisation was made
not by a musician but by a physicist. In 1665, Christiaan Huygens, confined to
bed with illness, was watching two pendulum clocks mounted on the same wooden
beam. Over the course of the night, the pendulums had synchronised into
<em>anti-phase</em> oscillation — swinging in opposite directions in exact unison.
He reported it to his father:</p>
<blockquote>
<p>&ldquo;I have noticed a remarkable effect which no-one has observed before&hellip; two
clocks on the same board always end up in mutual synchrony.&rdquo;</p>
</blockquote>
<p>The mechanism was mechanical coupling through the beam. Each pendulum&rsquo;s swing
imparted a small impulse to the wood; the other pendulum felt this as a
perturbation to its rhythm. Small perturbations, accumulated over hours, drove
the clocks into a shared frequency and a fixed phase relationship.</p>
<p>This is the prototype of every ensemble synchronisation problem. Each musician
is a clock. The acoustic environment — the air in the room, the reflected sound
from the walls, the vibrations through the stage floor — is the wooden beam.</p>
<hr>
<h2 id="the-kuramoto-model">The Kuramoto Model</h2>
<p>Yoshiki Kuramoto formalised the mathematics of coupled oscillators in 1975,
motivated by biological synchronisation problems: firefly flashing, circadian
rhythms, cardiac pacemakers. His model considers $N$ oscillators, each with a
phase $\theta_i(t)$ evolving according to:</p>
$$\frac{d\theta_i}{dt} = \omega_i + \frac{K}{N} \sum_{j=1}^{N} \sin(\theta_j - \theta_i), \qquad i = 1, \ldots, N.$$<p>The first term, $\omega_i$, is the oscillator&rsquo;s <em>natural frequency</em> — the tempo it
would maintain in isolation. These are drawn from a distribution $g(\omega)$, which
in a real ensemble reflects the spread of individual preferred tempos among the
players. The second term is the coupling: each oscillator is attracted toward the
phases of all others, with strength $K/N$. The factor $1/N$ keeps the total
coupling intensive (independent of ensemble size) as $N$ grows large.</p>
<p>Musically: $\theta_i$ is the phase of musician $i$&rsquo;s internal pulse at a given
moment, $\omega_i$ is their preferred tempo if playing alone, and $K$ is the
coupling strength — how much they adjust their tempo in response to what they
hear from the others.</p>
<hr>
<h2 id="the-order-parameter-and-the-phase-transition">The Order Parameter and the Phase Transition</h2>
<p>To measure the degree of synchronisation, Kuramoto introduced the complex order
parameter:</p>
$$r(t)\, e^{i\psi(t)} = \frac{1}{N} \sum_{j=1}^{N} e^{i\theta_j(t)},$$<p>where $r(t) \in [0, 1]$ is the <em>coherence</em> of the ensemble and $\psi(t)$ is the
collective mean phase. When $r = 0$, the phases are uniformly spread around the
unit circle — the ensemble is incoherent. When $r = 1$, all phases coincide —
perfect synchrony. In a live ensemble, $r$ is a direct measure of rhythmic
cohesion, though of course not one you can read off a score.</p>
<p>Substituting the order parameter into the equation of motion:</p>
$$\frac{d\theta_i}{dt} = \omega_i + K r \sin(\psi - \theta_i).$$<p>Each oscillator now interacts only with the mean-field quantities $r$ and $\psi$,
not with every other oscillator individually. The coupling pulls each musician
toward the collective mean phase with a force proportional to both $K$ (how
attentively they listen) and $r$ (how coherent the group already is).</p>
<p>This mean-field form reveals the essential physics. For small $K$, oscillators
with widely differing $\omega_i$ cannot follow the mean field — they drift at
their own frequencies, and $r \approx 0$. At a critical coupling strength $K_c$,
a macroscopic fraction of oscillators suddenly locks to a shared frequency, and
$r$ begins to grow continuously from zero. For a unimodal,
symmetric frequency distribution $g(\omega)$ with density $g(\bar\omega)$ at the
mean:</p>
$$K_c = \frac{2}{\pi\, g(\bar\omega)}.$$<p>Above $K_c$, the coherence grows as:</p>
$$r \approx \sqrt{\frac{K - K_c}{K_c}}, \qquad K \gtrsim K_c.$$<p>This is a <strong>second-order (continuous) phase transition</strong> — the same
mathematical structure as a ferromagnet approaching the Curie temperature,
where spontaneous magnetisation appears continuously above a critical coupling.
The musical ensemble and the magnetic material belong to the same universality
class, governed by the same mean-field exponent $\frac{1}{2}$.</p>
<p>Above $K_c$, the fraction of oscillators that are <em>locked</em> (synchronised to the
mean-field frequency) can be computed explicitly. An oscillator with natural
frequency $\omega_i$ locks to the mean field if $|\omega_i - \bar\omega| \leq
Kr$. For a Lorentzian distribution $g(\omega) = \frac{\gamma/\pi}{(\omega -
\bar\omega)^2 + \gamma^2}$, this yields:</p>
$$r = \sqrt{1 - \frac{K_c}{K}}, \qquad K_c = 2\gamma,$$<p>which is the exact self-consistency equation for the Kuramoto model with
Lorentzian frequency spread (Strogatz, 2000).</p>
<p>The physical reading is direct: whether an ensemble locks into a shared pulse or
drifts apart is a threshold phenomenon. A group of musicians with similar
preferred tempos has a peaked $g(\bar\omega)$, giving a low $K_c$ — they
synchronise easily with minimal attentive listening. A group with widely varying
individual tempos needs stronger, more sustained coupling to cross the threshold.
This is not a matter of musical discipline; it is a material property of the
ensemble.</p>
<hr>
<h2 id="concert-hall-applause-neda-et-al-2000">Concert Hall Applause: Neda et al. (2000)</h2>
<p>The Kuramoto model is not only a theoretical construction. Neda et al. (2000)
applied it to concert hall applause — one of the most direct real-world
demonstrations of coupled-oscillator dynamics in a musical context.</p>
<p>They recorded applause in Romanian and Hungarian theaters and found that audiences
spontaneously alternate between two distinct states. In the <em>incoherent</em> regime,
each audience member claps at their own preferred rate (typically 2–3 Hz). Through
acoustic coupling — each person hears the room-averaged sound and adjusts their
clapping — the audience gradually synchronises to a shared, slower frequency
(around 1.5 Hz): the <em>synchronised</em> regime.</p>
<p>The transitions between the two regimes are quantitatively consistent with the
Kuramoto phase transition: the emergence of synchrony corresponds to $K$ crossing
$K_c$ as people progressively pay more attention to the collective sound.
Furthermore, Neda et al. document a characteristic phenomenon when synchrony
breaks down: individual clapping frequency approximately <em>doubles</em> as audience
members attempt to re-establish coherence. This frequency-doubling — a feature of
nonlinear oscillator systems near instability — is exactly what the delayed
response of coupling near $K_c$ predicts.</p>
<p>The paper is a useful pedagogical artefact: every music student has experienced
concert hall applause, and hearing that it undergoes a physically measurable phase
transition makes the connection between physics and musical experience concrete.</p>
<hr>
<h2 id="latency-and-the-limits-of-networked-ensemble-performance">Latency and the Limits of Networked Ensemble Performance</h2>
<p>In standard acoustic ensemble playing, the coupling delay is the propagation time
for sound to cross the ensemble: at $343\ \text{m/s}$, across a ten-metre stage,
roughly 30 ms. This is why orchestral seating is arranged with attention to who
needs to hear whom first.</p>
<p>In networked music performance (NMP), the coupling delay $\tau$ is much larger:
tens to hundreds of milliseconds depending on geographic distance and network
infrastructure. The Kuramoto model generalises naturally to include this delay:</p>
$$\frac{d\theta_i}{dt} = \omega_i + \frac{K}{N} \sum_{j=1}^{N} \sin\!\bigl(\theta_j(t - \tau) - \theta_i(t)\bigr).$$<p>Each musician hears the others&rsquo; phases as they were $\tau$ seconds ago, not as
they are now.</p>
<p>In a synchronised state where all oscillators share the collective frequency
$\bar\omega$ and phase $\psi(t) = \bar\omega t$, the delayed phase signal is
$\psi(t - \tau) = \bar\omega t - \bar\omega\tau$. The effective coupling
force contains a factor $\cos(\bar\omega\tau)$: the delay introduces a phase
shift that reduces the useful component of the coupling. The critical coupling
with delay is therefore:</p>
$$K_c(\tau) = \frac{K_c(0)}{\cos(\bar\omega \tau)}.$$<p>As $\tau$ increases, $K_c(\tau)$ grows: synchronisation requires progressively
stronger coupling (more attentive adjustment) to compensate for the information
lag. The denominator $\cos(\bar\omega\tau)$ reaches zero when
$\bar\omega\tau = \pi/2$. At this point $K_c(\tau) \to \infty$: no finite coupling
strength can maintain synchrony. The critical delay is:</p>
$$\tau_c = \frac{\pi}{2\bar\omega}.$$<p>For an ensemble performing at 120 BPM, the beat frequency is
$\bar\omega = 2\pi \times 2\ \text{Hz} = 4\pi\ \text{rad/s}$:</p>
$$\tau_c = \frac{\pi}{2 \times 4\pi} = \frac{1}{8}\ \text{s} = 125\ \text{ms}.$$<p>This is a remarkably clean result. The Kuramoto model with delay predicts that
ensemble synchronisation collapses at around 125 ms one-way delay for a standard
performance tempo. The empirical literature on NMP — from LoLa deployments across
European conservatories to controlled latency studies in the lab — consistently
finds that rhythmic coherence degrades noticeably above 50–80 ms and becomes
essentially unworkable above 100–150 ms one-way. The model and the data agree.</p>
<p>The derivation also shows why faster tempos are harder in NMP: $\tau_c \propto
1/\bar\omega$, so doubling the tempo halves the tolerable latency. An ensemble
performing at 240 BPM in a distributed setting faces a theoretical ceiling of
62 ms — which rules out transcontinental performance for most repertoire.</p>
<hr>
<h2 id="brains-in-sync-eeg-hyperscanning">Brains in Sync: EEG Hyperscanning</h2>
<p>The Kuramoto framework has recently been applied at a neural level.
EEG hyperscanning — simultaneous EEG recording from multiple participants during
a shared musical activity — has shown that musicians performing together exhibit
<em>inter-brain synchronisation</em>: coherent cortical oscillations at the frequency of
the music are measurable between players (Lindenberger et al., 2009; Müller et
al., 2013). The phase coupling between brains during joint performance is
significantly higher than during solo performance and higher than for musicians
playing simultaneously but without acoustic coupling.</p>
<p>This suggests that the Kuramoto coupling operates at two levels: the acoustic
(each musician hears the other and adjusts physical timing) and the neural (each
musician&rsquo;s cortical oscillators entrain to the shared musical pulse). The
question of which level is primary — whether neural synchrony causes or follows
from acoustic synchrony — remains open.</p>
<p>A 2023 review by Demos and Palmer argues that pairwise Kuramoto-type coupling is
insufficient to capture full ensemble dynamics. Group-level effects — the
differentiation between leader and follower roles, the emergence of collective
timing that no individual would produce alone — require nonlinear dynamical
frameworks that go beyond mean-field averaging. The model that adequately
describes a string quartet may need to be richer than the one that describes a
population of identical fireflies.</p>
<hr>
<h2 id="what-this-means-for-teaching">What This Means for Teaching</h2>
<p>The Kuramoto model reframes standard rehearsal intuitions in physical terms.</p>
<p><strong>&ldquo;Listen more&rdquo;</strong> translates to &ldquo;increase your effective coupling constant $K$.&rdquo;
A musician who plays without attending to others has set $K \approx 0$ and will
drift freely according to their own $\omega_i$. Listening — actively adjusting
tempo in response to what you hear — is not metaphorical. It is the physical
mechanism of coupling, and its effect is to pull you toward the mean phase $\psi$
with a force $Kr\sin(\psi - \theta_i)$.</p>
<p><strong>&ldquo;Our tempos are too different&rdquo;</strong> is a claim about $g(\bar\omega)$ and therefore
about $K_c$. A group with a wide spread of natural tempos needs more and stronger
listening to synchronise. This is not a moral failing but a parameter; it
suggests that ensemble warm-up time or explicit tempo negotiation before a
performance serves to reduce the spread of natural frequencies before the coupling
has to do all the work.</p>
<p><strong>Latency as a rehearsal experiment</strong> can be made explicit. Artificially delaying
the acoustic return to one musician in an ensemble — via headphone monitoring with
variable delay — allows students to experience directly how the coordination
degrades as $\tau$ increases toward $\tau_c$. They feel the system approaching
the phase transition without the theoretical framework, but the framework makes
the experience interpretable afterward.</p>
<p><strong>The click track</strong> replaces peer-to-peer Kuramoto coupling with an external
forcing term: each musician locks to a shared reference with fixed $\omega$
rather than adjusting dynamically to the group mean. This eliminates the phase
transition but also eliminates the adaptive dynamics — the micro-timing
fluctuations and expressive rubato — that characterise live ensemble playing. It
is a pedagogically important distinction, even if studios routinely make the
pragmatic choice.</p>
<hr>
<h2 id="references">References</h2>
<ul>
<li>
<p>Demos, A. P., &amp; Palmer, C.
(2023). Social and nonlinear dynamics unite: Musical group synchrony. <em>Trends
in Cognitive Sciences</em>, 27(11), 1008–1018.
<a href="https://doi.org/10.1016/j.tics.2023.08.005">https://doi.org/10.1016/j.tics.2023.08.005</a></p>
</li>
<li>
<p>Huygens, C. (1665). Letter to his father Constantijn Huygens, 26 February
1665. In <em>Œuvres complètes de Christiaan Huygens</em>, Vol. 5, p. 243. Martinus
Nijhoff, 1893.</p>
</li>
<li>
<p>Kuramoto, Y. (1975). Self-entrainment of a population of coupled non-linear
oscillators. In H. Araki (Ed.), <em>International Symposium on Mathematical
Problems in Theoretical Physics</em> (Lecture Notes in Physics, Vol. 39,
pp. 420–422). Springer.</p>
</li>
<li>
<p>Kuramoto, Y. (1984). <em>Chemical Oscillations, Waves, and Turbulence.</em> Springer.</p>
</li>
<li>
<p>Lindenberger, U., Li, S.-C., Gruber, W., &amp; Müller, V. (2009). Brains swinging
in concert: Cortical phase synchronization while playing guitar.
<em>BMC Neuroscience</em>, 10, 22. <a href="https://doi.org/10.1186/1471-2202-10-22">https://doi.org/10.1186/1471-2202-10-22</a></p>
</li>
<li>
<p>Müller, V., Sänger, J., &amp; Lindenberger, U. (2013). Intra- and inter-brain
synchronization during musical improvisation on the guitar. <em>PLOS ONE</em>, 8(9),
e73852. <a href="https://doi.org/10.1371/journal.pone.0073852">https://doi.org/10.1371/journal.pone.0073852</a></p>
</li>
<li>
<p>Neda, Z., Ravasz, E., Vicsek, T., Brechet, Y., &amp; Barabási, A.-L. (2000).
Physics of the rhythmic applause. <em>Physical Review E</em>, 61(6), 6987–6992.
<a href="https://doi.org/10.1103/PhysRevE.61.6987">https://doi.org/10.1103/PhysRevE.61.6987</a></p>
</li>
<li>
<p>Strogatz, S. H. (2000). From Kuramoto to Crawford: Exploring the onset of
synchronization in populations of coupled oscillators. <em>Physica D: Nonlinear
Phenomena</em>, 143(1–4), 1–20.
<a href="https://doi.org/10.1016/S0167-2789(00)00094-4">https://doi.org/10.1016/S0167-2789(00)00094-4</a></p>
</li>
<li>
<p>Strogatz, S. H. (2003). <em>Sync: How Order Emerges from Chaos in the Universe,
Nature, and Daily Life.</em> Hyperion.</p>
</li>
</ul>
<hr>
<h2 id="changelog">Changelog</h2>
<ul>
<li><strong>2026-01-14</strong>: Updated the author list for the Demos (2023) <em>Trends in Cognitive Sciences</em> reference to the published two authors (Demos &amp; Palmer). The five names previously listed were from a different Demos paper.</li>
<li><strong>2026-01-14</strong>: Changed &ldquo;period-doubling&rdquo; to &ldquo;frequency-doubling.&rdquo; When the clapping frequency doubles, the period halves; &ldquo;frequency-doubling&rdquo; is the precise term in this context.</li>
</ul>
]]></content:encoded>
    </item>
    <item>
      <title>The Impossible Heptagon</title>
      <link>https://sebastianspicker.github.io/posts/tool-impossible-heptagon/</link>
      <pubDate>Mon, 15 Jan 2024 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/tool-impossible-heptagon/</guid>
      <description>Danny Carey calls it sacred geometry. Gauss proved it non-constructible. They are, unexpectedly, describing the same object.</description>
      <content:encoded><![CDATA[<p>Danny Carey — drummer of Tool, one of the most rhythmically inventive musicians in rock — keeps a seven-pointed star on his kit and speaks about it using the language of sacred geometry. The heptagram appears in Tool&rsquo;s visual artwork, in the Thelemic symbolism Carey draws on, in pre-modern cosmological diagrams, and in the decorative traditions of several cultures that had no contact with each other. The claim, loosely stated, is that seven-fold symmetry is privileged: that it reflects something structurally true, that its forms carry significance beyond the aesthetic.</p>
<p>The scientific reflex here is usually impatience. &ldquo;Sacred geometry&rdquo; occupies an uncomfortable cultural space — mathematically dressed, factually thin, reliant on the listener not checking claims too carefully. The golden ratio does not appear everywhere in nature. Most things described as sacred in this tradition are better described as things the speaker found surprising before learning a more precise vocabulary.</p>
<p>But the heptagon is genuinely strange. Not for the reasons usually given. For a different reason — a theorem.</p>
<p><strong>The regular heptagon cannot be constructed with compass and straightedge.</strong></p>
<p>Not &ldquo;it is difficult.&rdquo; Not &ldquo;no one has found a construction yet.&rdquo; The regular seven-sided polygon — all sides equal, all interior angles equal — is <em>provably impossible</em> to construct using an unmarked ruler and compass in finitely many steps. This has been known since 1801.</p>
<h2 id="the-classical-constraint">The Classical Constraint</h2>
<p>Greek geometry restricted its tools deliberately. An unmarked straightedge draws lines through two known points. A compass draws circles centred at a known point with a given radius. No angle trisection. No markings. No graduated instruments. Just these two operations, applied one at a time, finitely many times.</p>
<p>Within this constraint, a great deal is achievable. A perpendicular bisector. An equilateral triangle. A regular pentagon — which requires the golden ratio and takes some work, but is reachable. A regular hexagon (trivially: six equilateral triangles around a centre).</p>
<p>Then: nothing for the heptagon. Greek geometers left no construction. Medieval Islamic mathematicians, who knew the regular polygon problem well, left no construction. Albrecht Dürer, in his 1525 <em>Underweysung der Messung</em>, gave an approximate construction that falls short by a small but nonzero margin. Each generation encountered the same wall.</p>
<p>In 1796, an 18-year-old Gauss proved that the regular 17-gon <em>is</em> constructible — a result so unexpected that he reportedly decided at that moment to become a mathematician rather than a philologist. In his 1801 <em>Disquisitiones Arithmeticae</em> he gave the complete characterisation of which regular polygons are constructible and which are not <a href="#ref-1">[1]</a>. The heptagon was definitively placed among the impossible.</p>
<h2 id="gausss-theorem">Gauss&rsquo;s Theorem</h2>
<p>A regular $n$-gon is constructible with compass and straightedge if and only if $n$ has the form</p>
$$n = 2^k \cdot p_1 \cdot p_2 \cdots p_m$$<p>where $k \geq 0$ and the $p_i$ are distinct <strong>Fermat primes</strong> — primes of the form $2^{2^j} + 1$.</p>
<p>The Fermat primes currently known:</p>
<table>
  <thead>
      <tr>
          <th>$j$</th>
          <th>$F_j = 2^{2^j}+1$</th>
          <th>Prime?</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td>0</td>
          <td>3</td>
          <td>✓</td>
      </tr>
      <tr>
          <td>1</td>
          <td>5</td>
          <td>✓</td>
      </tr>
      <tr>
          <td>2</td>
          <td>17</td>
          <td>✓</td>
      </tr>
      <tr>
          <td>3</td>
          <td>257</td>
          <td>✓</td>
      </tr>
      <tr>
          <td>4</td>
          <td>65537</td>
          <td>✓</td>
      </tr>
      <tr>
          <td>5</td>
          <td>4 294 967 297</td>
          <td>✗ (Euler, 1732)</td>
      </tr>
      <tr>
          <td>6</td>
          <td>18 446 744 073 709 551 617</td>
          <td>✗</td>
      </tr>
      <tr>
          <td>⋮</td>
          <td>⋮</td>
          <td>no further Fermat primes known</td>
      </tr>
  </tbody>
</table>
<p>Five Fermat primes are known, all identified by the seventeenth century. Fermat himself conjectured that all numbers of this form are prime; he was wrong from $j = 5$ onward. Whether any further Fermat primes exist remains an open problem.</p>
<p>The constructible regular polygons therefore include the triangle (3), square (4), pentagon (5), hexagon (6), octagon (8), decagon (10), 15-gon, 17-gon, 257-gon, 65537-gon, and products of these with powers of 2. The 65537-gon was actually fully constructed by Johann Gustav Hermes, who spent around ten years on the computation in the 1880s and deposited a manuscript reportedly filling a large trunk at the University of Göttingen, where it remains.</p>
<p>Seven is prime, but $7 \neq 2^{2^j} + 1$ for any $j$ — it is not a Fermat prime. Therefore the regular heptagon is not on the list. It is not constructible.</p>
<h2 id="the-algebra-behind-the-geometry">The Algebra Behind the Geometry</h2>
<p>Why does the structure of Fermat primes determine constructibility? The connection goes through algebra <a href="#ref-2">[2]</a><a href="#ref-3">[3]</a>.</p>
<p>Every compass-and-straightedge construction corresponds to solving a sequence of equations of degree at most 2. Bisecting an angle, finding an intersection of a line and a circle — each step is a quadratic operation. After $k$ such steps, the numbers reachable lie in some field extension of $\mathbb{Q}$ (the rationals) with degree over $\mathbb{Q}$ at most $2^k$. Constructibility therefore requires the degree of the relevant extension to be a power of 2.</p>
<p>To construct a regular $n$-gon, you need to construct the angle $2\pi/n$, which requires constructing $\cos(2\pi/n)$. The question is: over what kind of field extension does $\cos(2\pi/n)$ sit?</p>
<p>For $n = 7$: let $\omega = e^{2\pi i/7}$, a primitive 7th root of unity. The minimal polynomial of $\omega$ over $\mathbb{Q}$ is the 7th cyclotomic polynomial</p>
$$\Phi_7(x) = x^6 + x^5 + x^4 + x^3 + x^2 + x + 1,$$<p>which is irreducible over $\mathbb{Q}$, giving $[\mathbb{Q}(\omega) : \mathbb{Q}] = 6$. Since $\cos(2\pi/7) = (\omega + \omega^{-1})/2$, and since $\omega$ satisfies a degree-2 polynomial over $\mathbb{Q}(\cos 2\pi/7)$, we get</p>
$$[\mathbb{Q}(\cos 2\pi/7) : \mathbb{Q}] = 3.$$<p>Specifically, $c = \cos(2\pi/7)$ is the root of the irreducible cubic</p>
$$8c^3 + 4c^2 - 4c - 1 = 0,$$<p>or equivalently, $\alpha = 2\cos(2\pi/7)$ satisfies</p>
$$\alpha^3 + \alpha^2 - 2\alpha - 1 = 0.$$<p>The three roots of this cubic are $2\cos(2\pi/7)$, $2\cos(4\pi/7)$, and $2\cos(6\pi/7)$. By Vieta&rsquo;s formulas their sum is $-1$ and their product is $1$ — which can be verified directly from the identity $\cos(2\pi/7) + \cos(4\pi/7) + \cos(6\pi/7) = -1/2$.</p>
<p>The degree of the extension is 3. Three is not a power of 2. Therefore $\cos(2\pi/7)$ cannot be reached by any tower of quadratic extensions of $\mathbb{Q}$. Therefore the regular heptagon is not constructible. $\square$</p>
<p>Compare the pentagon: $\cos(2\pi/5) = (\sqrt{5}-1)/4$, satisfying the quadratic $4x^2 + 2x - 1 = 0$. Degree 2 — a power of 2. Constructible.</p>
<p>The 17-gon: the Galois group of $\mathbb{Q}(\zeta_{17})/\mathbb{Q}$ is $(\mathbb{Z}/17\mathbb{Z})^* \cong \mathbb{Z}/16\mathbb{Z}$, order $16 = 2^4$. The extension decomposes into four quadratic steps. This is exactly what Gauss computed at 18.</p>
<p>For 7: $(\mathbb{Z}/7\mathbb{Z})^* \cong \mathbb{Z}/6\mathbb{Z}$, order $6 = 2 \times 3$. The factor of 3 is the obstruction. The Galois group is not a 2-group, so the extension cannot be decomposed into quadratic steps. The heptagon is out of reach.</p>
<h2 id="sacred-precisely">Sacred, Precisely</h2>
<p>The phrase &ldquo;sacred geometry&rdquo; usually does work that &ldquo;elegant mathematics&rdquo; could do more honestly. But the heptagon is a case where something with genuine mathematical content sits underneath the mystical framing.</p>
<p>The Platonic tradition held that certain geometric forms exist as ideals — perfect, unchanging, more real than their physical approximations. The philosopher&rsquo;s claim is that the heptagon exists in a realm beyond its material instantiation. The mathematician&rsquo;s claim is: the heptagon is perfectly well-defined — seven equal sides, seven equal angles — but it cannot be reached from $\mathbb{Q}$ by the operations available to ruler and compass. You can approximate it to any desired precision. You can construct it exactly using origami, which allows angle trisection and is strictly more powerful than compass and straightedge <a href="#ref-4">[4]</a>. But the classical constructive program — the one that reaches the pentagon, the hexagon, the 17-gon, the 65537-gon — cannot reach the heptagon.</p>
<p>There is a precise mathematical sense in which it lies outside the constructible world. Whether that constitutes sacredness is a question for a different kind of argument. But it is not nothing. The Pythagoreans were working without Galois theory; they had an intuition without the theorem. The theorem, when it came, confirmed that intuition about seven while explaining it more clearly than they could.</p>
<p>Carey&rsquo;s intuition — that 7 sits outside the ordinary — is, by this route, formally correct.</p>
<h2 id="what-the-heptagram-is">What the Heptagram Is</h2>
<p>The regular heptagon may be impossible to construct exactly, but the heptagram — the seven-pointed star — is perfectly drawable. Connecting every second vertex of an approximate regular heptagon gives $\{7/2\}$ in Schläfli notation <a href="#ref-5">[5]</a>; connecting every third vertex gives $\{7/3\}$. Both are closed figures. Both appear throughout pre-modern symbolic traditions, which is unsurprising: they are the most intricate star polygons drawable with a single pen stroke before complexity outruns visibility.</p>
<p>They are also generators of rhythmic structure. Because 7 is prime, every star polygon on seven points visits all seven vertices in a single closed traversal — a property that does not hold for six-pointed or eight-pointed stars. This turns out to matter for how drum patterns are built across multiple bars. That connection — from the primality of 7 to the architecture of rhythmic accent cycles — is the subject of the companion post, <a href="/posts/tool-star-polygons-drum-machines/">Star Polygons and Drum Machines</a>.</p>
<p>The broader series on mathematics in Tool&rsquo;s music began with the Fibonacci structure embedded in the time signatures and syllable counts of &ldquo;Lateralus&rdquo; <a href="#ref-6">[6]</a>, and the group-theoretic structure underlying twelve-tone equal temperament provides the same algebraic scaffolding seen here <a href="#ref-7">[7]</a>.</p>
<h2 id="references">References</h2>
<p><span id="ref-1"></span>[1] Gauss, C.F. (1801). <em>Disquisitiones Arithmeticae</em>. Leipzig: Fleischer. (§VII.)</p>
<p><span id="ref-2"></span>[2] Stewart, I. (2004). <em>Galois Theory</em> (3rd ed.). CRC Press. Ch. 4.</p>
<p><span id="ref-3"></span>[3] Conway, J.H. &amp; Guy, R.K. (1996). <em>The Book of Numbers</em>. Springer. pp. 190–202.</p>
<p><span id="ref-4"></span>[4] Hull, T. (2011). Solving cubics with creases: The work of Beloch and Lill. <em>The American Mathematical Monthly</em>, 118(4), 307–315. <a href="https://doi.org/10.4169/amer.math.monthly.118.04.307">DOI: 10.4169/amer.math.monthly.118.04.307</a></p>
<p><span id="ref-5"></span>[5] Coxeter, H.S.M. (1973). <em>Regular Polytopes</em> (3rd ed.). Dover. Ch. 2.</p>
<p><span id="ref-6"></span>[6] See <a href="/posts/fibonacci-lateralus/">Fibonacci and Lateralus</a> on this blog.</p>
<p><span id="ref-7"></span>[7] See <a href="/posts/twelve-tet-group-theory-musical-tuning/">Twelve-TET and Group Theory</a> on this blog.</p>
]]></content:encoded>
    </item>
    <item>
      <title>Twelve Is Not an Accident: The Group Theory of Musical Tuning</title>
      <link>https://sebastianspicker.github.io/posts/twelve-tet-group-theory-musical-tuning/</link>
      <pubDate>Fri, 15 Dec 2023 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/twelve-tet-group-theory-musical-tuning/</guid>
      <description>Why does the octave divide into twelve? The answer lies not in aesthetics but in the continued-fraction convergents of log₂(3/2) — and the same group structure that gives Messiaen his modes.</description>
      <content:encoded><![CDATA[<p>Sit down at a piano and count the keys in one octave. Twelve. Seven white, five black, twelve total pitch classes before the pattern repeats. Ask a musician why twelve and they will probably say something about Western tradition, the church modes, or maybe vaguely gesture at the circle of fifths. Ask a musicologist and you might hear about Pythagoras, or the development of equal temperament in the Baroque period, or the well-tempered tuning systems of J. S. Bach. All of that history is real and worth knowing. But none of it explains <em>why</em> the number 12 works, and why every serious attempt at a usable keyboard instrument across widely separated cultures converges on the same cardinality.</p>
<p>The real answer is in number theory. Specifically, it is in the continued fraction expansion of a single irrational number: $\log_2(3/2)$. The number 12 is not a cultural choice. It is the smallest integer that gives a genuinely good rational approximation to that number — subject to the constraint that a human hand can navigate the resulting keyboard. Once you see the argument, the feeling of contingency evaporates completely. Twelve is forced on us.</p>
<p>Along the way, the same mathematical structure — the cyclic group $\mathbb{Z}_{12}$ — explains why Messiaen&rsquo;s modes of limited transposition exist, why the circle of fifths closes exactly, and why certain chord types (augmented triads, diminished seventh chords, the whole-tone scale) have a strange self-similar quality that composers have exploited for centuries. If you want the full treatment of the Messiaen connection, I wrote a dedicated post: <a href="/posts/messiaen-modes-group-theory/">Messiaen, Modes, and the Group Theory of Harmony</a>. Here I want to build the foundations from scratch, starting with the one interval that makes all of this necessary.</p>
<hr>
<h2 id="the-interval-that-started-everything">The interval that started everything</h2>
<p>The perfect fifth has a frequency ratio of exactly 3:2. Play two strings in that ratio and the sound is stable, open, and unmistakably consonant — second only to the octave (2:1) in the hierarchy of simple intervals. The reason is physics: the overtone series of any vibrating string includes the fundamental frequency $f$, then $2f$, $3f$, $4f$, and so on. Two notes a perfect fifth apart share the overtone at $3f$ (for the lower note) and $2f'$ (for the upper note, where $f' = 3f/2$): those are the same frequency, $3f$. Shared overtones mean the two notes reinforce rather than fight each other. This is why the fifth sounds stable: it is literally built into the harmonic structure of physical vibration.</p>
<p>Humans discovered the fifth independently in ancient Greece, China, India, and Mesopotamia. It is not a cultural artifact <a href="#ref-4">[4]</a>. Given that stability, it is natural to ask: can we build a complete pitch system by stacking fifths? Take a starting note, go up a fifth, up another, up another, and keep going. The notes you produce — C, G, D, A, E, B, F♯, … — are acoustically related to the starting point in a simple way, and they sound good together. This is the Pythagorean tuning system, and it underlies the construction of diatonic scales.</p>
<p>But here is the problem. A fifth raises the pitch by a factor of 3/2. An octave raises it by a factor of 2. These are independent: one is a power of 3 and the other a power of 2, and no power of 3/2 will ever equal a power of 2 exactly. In the language of modern mathematics, $\log_2(3/2)$ is irrational — this follows directly from the fundamental theorem of arithmetic, since no product of powers of 2 can equal a product of powers of 3. Whether it is also transcendental is an open question; a proof would follow from Schanuel&rsquo;s conjecture, but that conjecture remains unresolved. What matters for tuning is the irrationality alone. Stacking pure fifths and stacking octaves are incommensurable operations. The circle of fifths can never close in pure Pythagorean tuning. We will always end up slightly sharp or flat relative to where we started.</p>
<p>This incommensurability is the central problem of musical tuning. Everything else — equal temperament, just intonation, meantone tuning, the Pythagorean comma, the whole apparatus of tuning theory — is a response to it.</p>
<hr>
<h2 id="equal-temperament-and-the-approximation-problem">Equal temperament and the approximation problem</h2>
<p>In an equal temperament with $N$ notes per octave, we divide the octave into $N$ equal logarithmic steps. Each step corresponds to a frequency ratio of $2^{1/N}$. We then ask: how many steps $k$ gives the best approximation to a perfect fifth?</p>
<p>The condition is simply that $2^{k/N}$ should be close to $3/2$, which means $k/N$ should be close to $\log_2(3/2)$. So we need a good rational approximation to</p>
$$\log_2\!\left(\frac{3}{2}\right) = \log_2 3 - 1 \approx 0.584962\ldots$$<p>The classical tool for finding best rational approximations is the continued fraction. Any real number $x$ can be written as</p>
$$x = a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \cfrac{1}{a_3 + \cdots}}}$$<p>where the $a_i$ are non-negative integers (positive for $i \geq 1$), called the partial quotients. For $\log_2(3/2)$ the expansion is</p>
$$\log_2\!\left(\frac{3}{2}\right) = [0;\, 1,\, 1,\, 2,\, 2,\, 3,\, 1,\, 5,\, 2,\, 23,\, 2,\, \ldots]$$<p>The truncated continued fractions — the convergents — give the sequence of best rational approximations:</p>
$$\frac{0}{1},\quad \frac{1}{1},\quad \frac{1}{2},\quad \frac{3}{5},\quad \frac{7}{12},\quad \frac{24}{41},\quad \frac{31}{53},\quad \frac{179}{306},\quad \ldots$$<p>Each convergent $k/N$ corresponds to a tuning system: the denominator $N$ is the number of equal steps per octave, and the numerator $k$ is the number of steps that best approximates a fifth. So we get: 1-TET (trivial), 2-TET (trivial), 5-TET, 12-TET, 41-TET, 53-TET, 306-TET, and so on <a href="#ref-1">[1]</a>, <a href="#ref-2">[2]</a>.</p>
<p>The key property of convergents is that they give <em>uniquely good</em> approximations. No rational number with a smaller denominator comes closer to the true value than a convergent does. So 7/12 is not merely a decent approximation to $\log_2(3/2)$ — it is provably the best approximation with denominator at most 12. To do better with a denominator below 41, you cannot.</p>
<p>To put numbers on it: in 12-TET, the fifth is $2^{7/12} \approx 1.498307\ldots$, while the true fifth is exactly $1.500000$. The error is about 0.11%, or roughly 2 cents (hundredths of a semitone). In 53-TET, the fifth is $2^{31/53} \approx 1.499941\ldots$, an error of less than 0.004%, about 0.07 cents — essentially indistinguishable from pure. Both 12 and 53 are convergents. Intermediate values like 19-TET or 31-TET are not convergents (they are not best approximations), and their fifths, while sometimes used in experimental or microtonal music, are less accurate relative to their complexity.</p>
<p>Why does this matter? Because a tuning system that approximates the fifth poorly will produce harmonies that beat visibly — the slight mistuning causes the sound to waver in a way that trained ears find uncomfortable in sustained chords. A good fifth approximation is not a luxury; it is the condition for the system to be musically usable in the harmonic practice that most of the world&rsquo;s music assumes.</p>
<hr>
<h2 id="the-pythagorean-comma">The Pythagorean comma</h2>
<p>Before equal temperament became standard (roughly the 18th century in Western Europe), instruments were tuned using pure Pythagorean fifths: exact 3:2 ratios, stacked on top of each other. This gives beautiful, stable individual fifths, but it collects a debt.</p>
<p>After stacking 12 pure fifths, you have climbed in frequency by $(3/2)^{12}$:</p>
$$(3/2)^{12} = \frac{3^{12}}{2^{12}} = \frac{531441}{4096} \approx 129.746\ldots$$<p>Meanwhile, 7 octaves is $2^7 = 128$. The ratio between these is</p>
$$\frac{(3/2)^{12}}{2^7} = \frac{3^{12}}{2^{19}} = \frac{531441}{524288} \approx 1.01364$$<p>This is the Pythagorean comma: roughly 23.46 cents, or about a quarter of a semitone <a href="#ref-4">[4]</a>. In Pythagorean tuning, the circle of fifths never closes. After 12 fifths you arrive at a note that is nominally the same pitch class as the starting point — but sharp by 23.46 cents. That final fifth, the one that &ldquo;should&rdquo; close the circle, sounds badly out of tune. It was historically called the &ldquo;wolf fifth&rdquo; because it howls.</p>
<p>Equal temperament solves this by distributing the comma across all 12 fifths. Each fifth is flattened by $23.46/12 \approx 1.955$ cents. The individual fifths are no longer pure, but the error is small enough to be acceptable — and crucially, it is <em>uniform</em>, so every key sounds equally good (or equally impure, depending on your perspective).</p>
<p>The Pythagorean comma being small — about 1.96% of the octave — is precisely why 12-TET works. It is small because 7/12 is an unusually good convergent of $\log_2(3/2)$. The two facts are the same fact. The comma is the numerator of the error when you approximate $\log_2(3/2)$ by $7/12$, multiplied up by 12 fifths&rsquo; worth of accumulation. When the approximation is good, the comma is small, and the distribution is imperceptible. This is why the piano is tuned the way it is.</p>
<hr>
<h2 id="the-group-theory">The group theory</h2>
<p>We are now ready for the algebra. In 12-TET, pitch classes form the set $\{0, 1, 2, \ldots, 11\}$ where we identify 0 with C, 1 with C♯, 2 with D, 3 with D♯, 4 with E, 5 with F, 6 with F♯, 7 with G, 8 with G♯, 9 with A, 10 with A♯, and 11 with B. Addition is modulo 12: after 11 comes 0 again, because after B comes C in the next octave (same pitch class). This is $\mathbb{Z}_{12}$, the integers mod 12, and it is a group under addition <a href="#ref-1">[1]</a>.</p>
<p>Transposition by a semitone is addition of 1. Transposition by a perfect fifth is addition of 7, because the fifth is 7 semitones in 12-TET. Start from C (0) and repeatedly add 7, always reducing modulo 12:</p>
$$0 \to 7 \to 14 \equiv 2 \to 9 \to 16 \equiv 4 \to 11 \to 18 \equiv 6 \to 13 \equiv 1 \to 8 \to 15 \equiv 3 \to 10 \to 17 \equiv 5 \to 12 \equiv 0$$<p>In note names: C, G, D, A, E, B, F♯, C♯, G♯, D♯/E♭, A♯/B♭, F, C. That is the circle of fifths — all 12 pitch classes visited exactly once before returning to the start. The circle of fifths is the orbit of 0 under repeated addition of 7 in $\mathbb{Z}_{12}$.</p>
<p>Why does the orbit visit all 12 elements? Because $\gcd(7, 12) = 1$. This is Bézout&rsquo;s theorem applied to cyclic groups: an element $g$ generates $\mathbb{Z}_n$ (i.e., its orbit under repeated addition covers all of $\mathbb{Z}_n$) if and only if $\gcd(g, n) = 1$. The generators of $\mathbb{Z}_{12}$ are exactly the elements coprime to 12: that is $\{1, 5, 7, 11\}$. Musically: transposition by 1 semitone (chromatic scale), by 5 semitones (perfect fourth), by 7 semitones (perfect fifth), or by 11 semitones (major seventh) each generates all 12 pitch classes. Transposition by 2 (a whole tone) does not — it produces only the 6-element whole-tone scale. Transposition by 3 (a minor third) produces only the 4-element diminished seventh chord.</p>
<p>This is not a curiosity; it is the algebraic skeleton of tonal music. The circle of fifths closes because 7 and 12 are coprime. That coprimality is guaranteed by the continued fraction structure: the numerator and denominator of a convergent in lowest terms are always coprime (as they must be, being a reduced fraction), and 7/12 is such a convergent.</p>
<p>Now consider the subgroups of $\mathbb{Z}_{12}$. By Lagrange&rsquo;s theorem, subgroups of a finite group must have orders dividing the group order. The divisors of 12 are 1, 2, 3, 4, 6, and 12, so these are the only possible subgroup orders. For cyclic groups there is exactly one subgroup of each order dividing $n$, and it is generated by $n/d$ where $d$ is the subgroup order. The full list:</p>
<p>The trivial subgroup of order 1 is just $\{0\}$. The subgroup of order 2 is $\{0, 6\}$, generated by 6 — that is, the tritone axis, the interval of exactly half an octave. The subgroup of order 3 is $\{0, 4, 8\}$, generated by 4 — this is the augmented triad, three notes equally spaced around the octave by major thirds. The subgroup of order 4 is $\{0, 3, 6, 9\}$, generated by 3 — the diminished seventh chord, four notes equally spaced by minor thirds. The subgroup of order 6 is $\{0, 2, 4, 6, 8, 10\}$, generated by 2 — the whole-tone scale. And the full group of order 12 is all of $\mathbb{Z}_{12}$.</p>
<p>Each of these has a musical life. The augmented triad ($\{0, 4, 8\}$) sounds ambiguous because it maps onto itself under transposition by a major third — there are only 4 distinct augmented triads total, not 12. Composers exploit this ambiguity when they want harmonic instability without committing to a direction. The diminished seventh ($\{0, 3, 6, 9\}$) is similarly ambiguous: it has only 3 distinct forms and can resolve to any of several keys, which is why it appears so often at structural pivots in Romantic music. These properties are direct consequences of the subgroup structure of $\mathbb{Z}_{12}$.</p>
<hr>
<h2 id="messiaens-modes-as-cosets">Messiaen&rsquo;s modes as cosets</h2>
<p>Olivier Messiaen described his &ldquo;modes of limited transposition&rdquo; in his 1944 treatise <em>Technique de mon langage musical</em>. He identified seven scales — including the whole-tone scale and the octatonic scale — that have the peculiar property of mapping onto themselves under some transposition strictly smaller than an octave. He found them by ear, by introspection, and by exhaustive search at the keyboard. He did not have the group theory. But the group theory makes their existence not merely explainable but <em>inevitable</em>.</p>
<p>Here is the key definition. A scale $S \subseteq \mathbb{Z}_{12}$ is a mode of limited transposition if there exists some $t \in \{1, 2, \ldots, 11\}$ such that $S + t \equiv S \pmod{12}$ (as a set). In other words, transposing the scale by $t$ semitones maps the scale onto itself. The integer $t$ is called a period of the scale.</p>
<p>Now, the set of all periods of $S$ — together with 0 — forms a subgroup of $\mathbb{Z}_{12}$ (it is closed under addition modulo 12, since if both $t_1$ and $t_2$ are periods then so is $t_1 + t_2$). Call this subgroup $H$. The condition for $S$ to be a mode of limited transposition is simply that $H$ is nontrivial — that is, $H \neq \{0\}$.</p>
<p>Moreover, if $H$ is the period subgroup of $S$, then $S$ must be a union of cosets of $H$ in $\mathbb{Z}_{12}$. This follows immediately from the fact that $H$ acts on $S$ by translation and maps $S$ to itself: every element of $S$ belongs to exactly one coset of $H$, and $S$ is a union of whole cosets. The size of $S$ must therefore be a multiple of $|H|$.</p>
<p>The whole-tone scale $\{0, 2, 4, 6, 8, 10\}$ is itself the unique subgroup of order 6 in $\mathbb{Z}_{12}$. Its period subgroup is the whole-tone scale itself. Transposing by any even number (2, 4, 6, 8, or 10) maps it to itself. Transposing by an odd number gives the complementary whole-tone scale $\{1, 3, 5, 7, 9, 11\}$. There are therefore only 2 distinct transpositions of the whole-tone scale, not 12.</p>
<p>The octatonic (diminished) scale $\{0, 1, 3, 4, 6, 7, 9, 10\}$ has period subgroup $\{0, 3, 6, 9\}$ — the subgroup of order 4. It is a union of two cosets: $\{0, 3, 6, 9\}$ itself and $\{1, 4, 7, 10\}$. Transposing by 3 maps it onto itself. There are only 3 distinct transpositions. Messiaen calls this his Mode 2.</p>
<p>The general formula is clean: a mode of limited transposition with period subgroup of order $d$ has exactly $12/d$ distinct transpositions. For the whole-tone scale, $d = 6$ gives $12/6 = 2$ transpositions. For the octatonic scale, $d = 4$ gives $12/4 = 3$ transpositions.</p>
<p>What Messiaen found by ear was the complete classification of subsets of $\mathbb{Z}_{12}$ that are unions of cosets of a nontrivial subgroup <a href="#ref-5">[5]</a>. The group theory makes their existence a theorem rather than a discovery. I find this genuinely beautiful: a composer&rsquo;s intuition about harmonic symmetry turns out to be an exercise in the theory of cosets of cyclic groups. For the full analysis of each of Messiaen&rsquo;s seven modes in these terms, see <a href="/posts/messiaen-modes-group-theory/">Messiaen, Modes, and the Group Theory of Harmony</a>.</p>
<hr>
<h2 id="why-not-53">Why not 53?</h2>
<p>Given that 53-TET approximates the fifth with an error of less than 0.004% — compared to 12-TET&rsquo;s 0.11% — one might ask why we do not simply use 53-TET. The mathematical case is overwhelming. In addition to the nearly perfect fifth, 53-TET gives excellent approximations to the just major third (frequency ratio 5:4) and the just minor third (6:5). It was seriously advocated by the 19th-century theorist Robert Holford Macdowall Bosanquet, who even built a 53-key harmonium to demonstrate it. The Chinese theorist Jing Fang described a 53-note system in the 1st century BC. The Arabic music theorist Al-Farabi considered 53-division scales in the 10th century. Everyone who has ever thought carefully about tuning arrives at 53 eventually.</p>
<p>And yet no 53-TET instrument has ever entered widespread use. The reason is anatomical, not mathematical. A piano with 53 keys per octave spans more than 2 metres per octave at any reasonable key size — impossible to play. A guitar with 53 frets per octave has frets spaced roughly 3–4 millimetres apart in the upper register: no human fingertip is narrow enough to press a single fret without touching its neighbours. Even if you could play it, reading 53-TET notation would require an entirely new theoretical and pedagogical apparatus.</p>
<p>The constraint is: we want the largest $N$ such that (a) $N$ is a convergent denominator of $\log_2(3/2)$, so the fifth approximation is genuinely good, and (b) $N$ is small enough to navigate with human hands and readable at a glance. The convergent denominators are 1, 2, 5, 12, 41, 53, 306, &hellip; Of these, 12 is the largest that satisfies condition (b). The next convergent, 41, already strains human dexterity — 41-TET keyboard instruments have been built experimentally but never mass-produced. At 53 the case is closed.</p>
<p>One might argue about where exactly the cutoff is, and reasonable people might draw it at 19 or 31 (which are not convergents but have other virtues). But the point is that 12 is not merely a local optimum found by trial and error. It is the specific value where the continued fraction and human physiology intersect.</p>
<hr>
<h2 id="closing">Closing</h2>
<p>There is something I find genuinely satisfying about this argument. Music feels like the most human of activities — expressive, cultural, steeped in history and tradition. And yet the number 12, which lies at the foundation of so much of the world&rsquo;s music, is not a human choice at all. It is the continued-fraction convergent of an irrational number that was fixed by the physics of vibrating strings long before any human struck a tuning fork.</p>
<p>The circle of fifths closes because $\gcd(7, 12) = 1$: a fact about integers, not about culture. Messiaen&rsquo;s modes exist because $\mathbb{Z}_{12}$ has nontrivial proper subgroups: a fact about cyclic groups, not about 20th-century French aesthetics. The augmented triad sounds ambiguous because it is a coset of the order-3 subgroup of $\mathbb{Z}_{12}$: a fact about quotient groups, not about Romantic harmony conventions.</p>
<p>I came to music theory sideways — through acoustics, then signal processing, then the mathematics of scales. What surprised me, when I finally worked through the continued fraction argument properly, was not that the math existed but that it was so <em>tight</em>. There is essentially no freedom in the answer. Given the constraint that a musical scale should be built around the most consonant interval (after the octave), should form a closed group structure, and should be navigable by a human performer, the answer is 12. Not approximately 12, not 12 as a historical compromise. Exactly 12.</p>
<p>The number is not a tradition. It is a theorem.</p>
<hr>
<p>For more on related themes: the Fibonacci sequence and golden ratio in music appear in <a href="/posts/fibonacci-lateralus/">Fibonacci, Lateralus, and the Golden Ratio</a>. The Euclidean algorithm and rhythmic structure are explored in <a href="/posts/euclidean-rhythms/">Euclidean Rhythms</a> — a sister post to this one in the math-and-music thread. And for the physics of audio sampling rates, where a similar interplay of number theory and practical constraints forces another specific number, see <a href="/posts/why-44100-hz-cd-sampling-rate/">Why 44,100 Hz?</a>.</p>
<hr>
<h2 id="references">References</h2>
<p><span id="ref-1"></span>[1] Balzano, G. J. (1980). The group-theoretic description of 12-fold and microtonal pitch systems. <em>Computer Music Journal</em>, 4(4), 66–84.</p>
<p><span id="ref-2"></span>[2] Carey, N., &amp; Clampitt, D. (1989). Aspects of well-formed scales. <em>Music Theory Spectrum</em>, 11(2), 187–206.</p>
<p><span id="ref-3"></span>[3] Milne, A., Sethares, W. A., &amp; Plamondon, J. (2007). Isomorphic controllers and dynamic tuning. <em>Computer Music Journal</em>, 31(4), 15–32.</p>
<p><span id="ref-4"></span>[4] Lloyd, L. S., &amp; Boyle, H. (1978). <em>Intervals, Scales and Temperaments</em>. St. Martin&rsquo;s Press.</p>
<p><span id="ref-5"></span>[5] Douthett, J., &amp; Steinbach, P. (1998). Parsimonious graphs: A study in parsimony, contextual transformations, and modes of limited transposition. <em>Journal of Music Theory</em>, 42(2), 241–263.</p>
<hr>
<h2 id="changelog">Changelog</h2>
<ul>
<li><strong>2025-11-20</strong>: Updated the spelling of &ldquo;Robert Holford Macdowall Bosanquet&rdquo; (previously rendered as &ldquo;Macdowell&rdquo;).</li>
<li><strong>2025-11-20</strong>: Changed &ldquo;about 1.36% of the octave&rdquo; to &ldquo;about 1.96% of the octave.&rdquo; The 1.36% figure is the frequency ratio above unity (531441/524288 ≈ 1.01364); the logarithmic fraction of the 1200-cent octave is 23.46/1200 ≈ 1.96%.</li>
<li><strong>2025-11-20</strong>: Changed &ldquo;12 octaves&rsquo; worth of accumulation&rdquo; to &ldquo;12 fifths&rsquo; worth of accumulation.&rdquo; The Pythagorean comma accumulates over 12 stacked fifths (which span approximately 7 octaves), not 12 octaves.</li>
</ul>
]]></content:encoded>
    </item>
    <item>
      <title>The Charm of Impossibilities: Group Theory and Messiaen&#39;s Modes of Limited Transposition</title>
      <link>https://sebastianspicker.github.io/posts/messiaen-modes-group-theory/</link>
      <pubDate>Wed, 19 Apr 2023 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/messiaen-modes-group-theory/</guid>
      <description>Messiaen&amp;rsquo;s seven modes of limited transposition cannot be fully transposed through all twelve keys — not by convention, but because of group theory. The modes are pitch-class sets whose stabiliser subgroups in ℤ₁₂ are non-trivial. The orbit–stabiliser theorem gives the exact count of distinct transpositions for each mode, and the subgroup lattice of ℤ₁₂ maps directly onto the hierarchy of the seven modes.</description>
      <content:encoded><![CDATA[<p><em>I first encountered Messiaen&rsquo;s second mode — the octatonic scale — in an
analysis seminar during my physics studies, played by a colleague on an upright
piano in a rehearsal room with terrible acoustics. She demonstrated something
that stopped me: no matter how many times she transposed the scale up by a minor
third, she could never find a &ldquo;new&rdquo; version. After three transpositions she was
back where she started. She called it the charm of impossibilities. It took me
years to understand why it is impossible, and longer still to see that the answer
is not musical but algebraic.</em></p>
<p><em>This post is a companion to <a href="/posts/fibonacci-lateralus/">Fibonacci, the Golden Ratio, and Tool&rsquo;s
Lateralus</a>, which found number theory in a prog-rock
song. Here we find abstract algebra in twentieth-century sacred music.</em></p>
<hr>
<h2 id="pitch-classes-and-the-chromatic-clock">Pitch Classes and the Chromatic Clock</h2>
<p>Western music divides the octave into twelve equal semitones. For purposes of
harmony and counterpoint, the absolute pitch is often less important than the
pitch <em>class</em> — the equivalence class of all pitches related by octave
transposition. Middle C and the C two octaves above belong to the same pitch
class.</p>
<p>We label the twelve pitch classes $0, 1, 2, \ldots, 11$, with $0 = \mathrm{C}$,
$1 = \mathrm{C}\sharp/\mathrm{D}\flat$, $2 = \mathrm{D}$, and so on up to
$11 = \mathrm{B}$. Addition is taken modulo 12 — the integers wrap around like
a clock face, with $11 + 2 = 1$ (one semitone above B is C$\sharp$).</p>
<p>The set of pitch classes with this operation is a group:</p>
$$\mathbb{Z}_{12} = \{0, 1, 2, \ldots, 11\}, \qquad x \oplus y = (x + y) \bmod 12.$$<p>This is the cyclic group of order 12. It has an identity element ($0$, &ldquo;no
transposition&rdquo;), every element has an inverse ($-n \bmod 12$), and the operation
is associative. If you are used to thinking about the chromatic scale as a linear
sequence ending at the octave, $\mathbb{Z}_{12}$ is the insistence that it is
actually a circle.</p>
<hr>
<h2 id="musical-operations-as-group-elements">Musical Operations as Group Elements</h2>
<p>Two operations are fundamental in tonal and post-tonal music theory.</p>
<p><strong>Transposition</strong> by $n$ semitones maps every pitch class up by $n$:</p>
$$T_n \colon x \mapsto x + n \pmod{12}.$$<p>The twelve transpositions $T_0, T_1, \ldots, T_{11}$ are exactly the elements of
$\mathbb{Z}_{12}$, with $T_n$ corresponding to the integer $n$. Composing two
transpositions gives a transposition: $T_m \circ T_n = T_{m+n}$.</p>
<p><strong>Inversion</strong> reflects the pitch-class circle:</p>
$$I \colon x \mapsto -x \pmod{12}.$$<p>Inversion maps C to C, D to B$\flat$, E to A$\flat$, and so on — it is the
mirror symmetry of the chromatic circle about the C/F$\sharp$ axis. Combining
inversion with transposition gives the <em>inversional transpositions</em>:</p>
$$I_n \colon x \mapsto n - x \pmod{12}.$$<p>The transpositions and inversional transpositions together generate a group of
order 24:</p>
$$D_{12} = \langle T_1, I \rangle.$$<p>This is the <em>dihedral group</em> $D_{12}$ — the same abstract group that describes
the symmetries of a regular 12-gon (twelve rotations and twelve reflections). The
identification is not coincidental: the twelve pitch classes arranged in a circle
<em>are</em> the vertices of a regular 12-gon, and the musical operations are
geometrically the symmetries of that polygon.</p>
<p>Twelve-tone composition — Schoenberg&rsquo;s method — is almost entirely a
working-out of the consequences of $D_{12}$ acting on ordered sequences of the
twelve pitch classes. The four canonical row forms (prime, inversion, retrograde,
retrograde-inversion) correspond to cosets of $\mathbb{Z}_{12}$ (the transposition subgroup).</p>
<hr>
<h2 id="orbits-and-stabilisers">Orbits and Stabilisers</h2>
<p>Let $S \subseteq \mathbb{Z}_{12}$ be a pitch-class set — a chord, a scale, a
collection of any size.</p>
<p>The <strong>orbit</strong> of $S$ under $\mathbb{Z}_{12}$ is the collection of all distinct
transpositions of $S$:</p>
$$\mathrm{Orb}(S) = \{ T_n(S) : n \in \mathbb{Z}_{12} \}.$$<p>For most sets, all twelve transpositions produce a different set, so
$|\mathrm{Orb}(S)| = 12$. The C major scale, for example, has twelve distinct
transpositions, one for each key.</p>
<p>But some sets are symmetric under certain transpositions: there exists $n \neq 0$
such that $T_n(S) = S$. The collection of all symmetry transpositions of $S$ is
the <strong>stabiliser</strong>:</p>
$$\mathrm{Stab}(S) = \{ T_n \in \mathbb{Z}_{12} : T_n(S) = S \}.$$<p>Because composing two symmetry transpositions yields another, $\mathrm{Stab}(S)$
is a <em>subgroup</em> of $\mathbb{Z}_{12}$.</p>
<p>The <strong>orbit–stabiliser theorem</strong> gives the fundamental count:</p>
$$|\mathrm{Orb}(S)| \cdot |\mathrm{Stab}(S)| = |\mathbb{Z}_{12}| = 12.$$<p>The number of distinct transpositions of $S$ equals $12$ divided by the number
of transpositions that leave $S$ unchanged. The more internally symmetric $S$ is,
the fewer new versions you can produce by transposing it.</p>
<p>A set with $|\mathrm{Stab}(S)| > 1$ — one that is invariant under some
non-trivial transposition — is a <strong>mode of limited transposition</strong>.</p>
<hr>
<h2 id="mode-1-the-whole-tone-scale">Mode 1: The Whole-Tone Scale</h2>
<p>The whole-tone scale contains the six pitch classes at even intervals:</p>
$$\mathrm{Mode\ 1} = \{0, 2, 4, 6, 8, 10\}.$$<p>Transposing by $T_2$:</p>
$$T_2(\{0, 2, 4, 6, 8, 10\}) = \{2, 4, 6, 8, 10, 0\} = \{0, 2, 4, 6, 8, 10\}. \checkmark$$<p>The set is unchanged. The same holds for $T_4, T_6, T_8, T_{10}$. The stabiliser
is the full subgroup of even transpositions:</p>
$$\mathrm{Stab}(\mathrm{Mode\ 1}) = \{T_0, T_2, T_4, T_6, T_8, T_{10}\} \cong \mathbb{Z}_6.$$<p>By the orbit–stabiliser theorem:</p>
$$|\mathrm{Orb}(\mathrm{Mode\ 1})| = \frac{12}{6} = 2.$$<p>There are exactly two distinct whole-tone scales. Every pianist learns this: the
one on C and the one on C$\sharp$. Composing with whole-tone harmony means
working from a stock of only two harmonic pools with no way to modulate into a
genuinely new version of the scale. This is Messiaen&rsquo;s first charm of
impossibility.</p>
<hr>
<h2 id="mode-2-the-octatonic-scale">Mode 2: The Octatonic Scale</h2>
<p>The octatonic (diminished) scale alternates half-step and whole-step intervals.
Starting on C:</p>
$$\mathrm{Mode\ 2} = \{0, 1, 3, 4, 6, 7, 9, 10\}.$$<p>Does $T_3$ leave this set invariant?</p>
$$T_3(\{0, 1, 3, 4, 6, 7, 9, 10\}) = \{3, 4, 6, 7, 9, 10, 0, 1\} = \{0, 1, 3, 4, 6, 7, 9, 10\}. \checkmark$$<p>Also $T_6$ and $T_9$. The stabiliser is the subgroup generated by transposition
by a minor third:</p>
$$\mathrm{Stab}(\mathrm{Mode\ 2}) = \{T_0, T_3, T_6, T_9\} \cong \mathbb{Z}_4.$$<p>The orbit size:</p>
$$|\mathrm{Orb}(\mathrm{Mode\ 2})| = \frac{12}{4} = 3.$$<p>There are exactly three distinct octatonic scales. Composers from Rimsky-Korsakov
and Bartók to Coltrane have exploited this closed system. The three scales
correspond to the three cosets of the subgroup $\langle T_3 \rangle$ in
$\mathbb{Z}_{12}$: the cosets $\{0, 3, 6, 9\}$, $\{1, 4, 7, 10\}$, and
$\{2, 5, 8, 11\}$ are the &ldquo;starting-point classes&rdquo; that generate each scale.
Note that the scales themselves are not pairwise disjoint — each has eight
pitch classes, so any two share four — but the coset structure determines
which transpositions produce the same scale and which produce a different one.</p>
<hr>
<h2 id="the-subgroup-lattice-and-all-seven-modes">The Subgroup Lattice and All Seven Modes</h2>
<p>The orbit–stabiliser theorem constrains which stabiliser sizes are algebraically
possible. Since $\mathrm{Stab}(S)$ is a subgroup of $\mathbb{Z}_{12}$, its order
must divide 12. The <em>proper non-trivial</em> subgroups of $\mathbb{Z}_{12}$ — those
with order strictly between 1 and 12 — are precisely:</p>
<table>
  <thead>
      <tr>
          <th>Subgroup</th>
          <th>Generator</th>
          <th>Order</th>
          <th>Orbit size</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td>$\langle T_2 \rangle = \{T_0, T_2, T_4, T_6, T_8, T_{10}\}$</td>
          <td>$T_2$</td>
          <td>6</td>
          <td>2</td>
      </tr>
      <tr>
          <td>$\langle T_3 \rangle = \{T_0, T_3, T_6, T_9\}$</td>
          <td>$T_3$</td>
          <td>4</td>
          <td>3</td>
      </tr>
      <tr>
          <td>$\langle T_4 \rangle = \{T_0, T_4, T_8\}$</td>
          <td>$T_4$</td>
          <td>3</td>
          <td>4</td>
      </tr>
      <tr>
          <td>$\langle T_6 \rangle = \{T_0, T_6\}$</td>
          <td>$T_6$</td>
          <td>2</td>
          <td>6</td>
      </tr>
  </tbody>
</table>
<p>These four subgroups exist because the proper divisors of 12 that are greater
than 1 are exactly $\{2, 3, 4, 6\}$. The subgroups of $\mathbb{Z}_n$ are in
bijection with the divisors of $n$ — a consequence of the fundamental theorem of
cyclic groups. Since $12 = 2^2 \times 3$, the proper divisors are $1, 2, 3, 4,
6$.</p>
<p>Each row of the table maps onto a level in Messiaen&rsquo;s system:</p>
<ul>
<li><strong>Mode 1</strong> (whole-tone scale): stabiliser $\langle T_2 \rangle$, 2 transpositions</li>
<li><strong>Mode 2</strong> (octatonic scale): stabiliser $\langle T_3 \rangle$, 3 transpositions</li>
<li><strong>Mode 3</strong>: stabiliser $\langle T_4 \rangle$, 4 transpositions</li>
<li><strong>Modes 4 – 7</strong>: stabiliser $\langle T_6 \rangle$, 6 transpositions each</li>
</ul>
<p>The subgroup lattice of $\mathbb{Z}_{12}$ — its Hasse diagram of containment
relationships — maps directly onto the hierarchy of Messiaen&rsquo;s modes. The more
symmetric the stabiliser subgroup, the fewer distinct transpositions the mode
admits.</p>
<p>The containment relations are: $\langle T_2 \rangle \supset \langle T_4 \rangle$
and $\langle T_2 \rangle \supset \langle T_6 \rangle$ and
$\langle T_3 \rangle \supset \langle T_6 \rangle$. Correspondingly, Mode 1
(stabiliser $\langle T_2 \rangle$, order 6) is &ldquo;more limited&rdquo; than Mode 3
(stabiliser $\langle T_4 \rangle$, order 3), in the sense that $\langle T_4
\rangle \subset \langle T_2 \rangle$: every symmetry of Mode 3 is also a symmetry
of Mode 1&rsquo;s stabiliser.</p>
<hr>
<h2 id="why-exactly-seven-modes">Why Exactly Seven Modes?</h2>
<p>Messiaen was not enumerating all pitch-class sets with non-trivial stabilisers —
there are many more than seven. At the level of the stabiliser $\langle T_6
\rangle$, for example, there are numerous pitch-class sets invariant under the
tritone transposition $T_6$: any set $S$ such that $S = S + 6$ qualifies. Some
of these sets are large (ten pitch classes), some are small (two pitch classes),
some are musically coherent and some are not.</p>
<p>Messiaen selected seven that he found aesthetically and compositionally viable:
scales of moderate cardinality, with a balance of interval types, that he could
use as raw material for his harmonic language. The group theory explains the
<em>constraint</em> (modes are possible only at the four stabiliser types listed above),
not the <em>selection</em> (which specific sets Messiaen chose among the many that
satisfy the constraint).</p>
<p>The question &ldquo;why seven?&rdquo; is therefore partly combinatorial and partly
compositional. What is group-theoretically determined is the number of <em>levels</em>
(four: orbit sizes 2, 3, 4, 6) and the <em>impossibility</em> of any mode with, say,
five distinct transpositions (since 5 does not divide 12).</p>
<hr>
<h2 id="what-messiaen-knew--and-did-not-know">What Messiaen Knew — and Did Not Know</h2>
<p>Messiaen described his modes in <em>Technique de mon langage musical</em> (1944). His
account is entirely musical and phenomenological. He lists each mode by its
interval sequence, notes how many transpositions it admits, and names the
limitation a &ldquo;charm.&rdquo; The impossibility is for him a spiritual property, a form
of harmonic stasis that he associated — as a devout Catholic — with divine
eternity. A mode that cannot depart is, in his compositional theology, a glimpse
of the unchanging.</p>
<p>He was not doing group theory. The orbit–stabiliser theorem (in its abstract form)
postdates Lagrange (1771), Cauchy (early 19th century), and Galois (1832). But
the concepts were not part of music-theoretic discourse until Milton Babbitt&rsquo;s
work in the 1950s, and they were not formalised in the pitch-class set framework
I have used here until Allen Forte&rsquo;s <em>The Structure of Atonal Music</em> (1973) and
David Lewin&rsquo;s <em>Generalized Musical Intervals and Transformations</em> (1987).</p>
<p>What Messiaen had was a musician&rsquo;s ear for symmetry. He could <em>hear</em> that the
modes were closed, without having the algebraic vocabulary to explain why. The
group theory shows that he was correct, and why he was correct with a precision
that no amount of phenomenological description could provide.</p>
<hr>
<h2 id="from-messiaen-to-lewin">From Messiaen to Lewin</h2>
<p>Lewin&rsquo;s transformational theory (1987) generalises the $\mathbb{Z}_{12}$ framework
to arbitrary musical spaces. A <em>Generalized Interval System</em> is a triple
$(S, G, \mathrm{int})$ where $S$ is a set of musical objects, $G$ is a group, and
$\mathrm{int} : S \times S \to G$ assigns an interval to each ordered pair of
objects in a way that is consistent with the group structure.</p>
<p>This framework treats musical transformations — not just pitch-class transpositions
but rhythmic augmentations, timbral shifts, any structurally defined operation —
as elements of a group. The mathematics does not privilege any particular musical
parameter; it applies wherever a transformation group acts on a set of musical
objects.</p>
<p>Neo-Riemannian theory, which emerged from Lewin&rsquo;s work in the 1980s and 1990s
and was systematised by Cohn (1998), applies this framework to triadic
transformations (the operations L, P, and R that map major and minor triads to
their relatives, parallels, and leading-tone exchanges). The group generated by
L, P, and R on the set of 24 major and minor triads is isomorphic to $D_{12}$
— the same dihedral group that governs Messiaen&rsquo;s modes, but acting on a
different musical space.</p>
<p>Emmanuel Amiot&rsquo;s more recent work (2016) applies the discrete Fourier transform
to pitch-class sets, using the DFT coefficients on $\mathbb{Z}_{12}$ as a
continuous measure of a set&rsquo;s similarity to the modes of limited transposition.
The Fourier coefficients detect the algebraic symmetries that stabilisers measure
discretely: a set with large coefficient at frequency $k$ (in the DFT over
$\mathbb{Z}_{12}$) is close, in a precise sense, to having the stabiliser
$\langle T_{12/k} \rangle$.</p>
<p>The group-theoretic perspective has moved, over seventy years, from a marginal
curiosity to the dominant mathematical framework in music theory. Messiaen&rsquo;s
modes — which once seemed like personal compositional idiosyncrasies — are
revealed as structurally constrained: the possible stabiliser orders are fixed
by the divisors of 12, and the orbit sizes that Messiaen&rsquo;s ear discovered are
exactly those that Lagrange&rsquo;s theorem permits. Many pitch-class sets have
non-trivial stabilisers; Messiaen found the seven that are musically viable.
Their limitation is not a personal choice but an algebraic fact.</p>
<p>The charm of impossibilities is a theorem of group theory. And it is exactly as
beautiful as Messiaen heard it to be.</p>
<hr>
<h2 id="references">References</h2>
<ul>
<li>
<p>Amiot, E. (2016). <em>Music Through Fourier Space: Discrete Fourier Transform in
Music Theory.</em> Springer (Computational Music Science).</p>
</li>
<li>
<p>Babbitt, M. (1960). Twelve-tone invariants as compositional determinants.
<em>The Musical Quarterly</em>, 46(2), 246–259.
<a href="https://doi.org/10.1093/mq/XLVI.2.246">https://doi.org/10.1093/mq/XLVI.2.246</a></p>
</li>
<li>
<p>Cohn, R. (1998). Introduction to neo-Riemannian theory: A survey and a
historical perspective. <em>Journal of Music Theory</em>, 42(2), 167–180.
<a href="https://doi.org/10.2307/843871">https://doi.org/10.2307/843871</a></p>
</li>
<li>
<p>Forte, A. (1973). <em>The Structure of Atonal Music.</em> Yale University Press.</p>
</li>
<li>
<p>Lewin, D. (1987). <em>Generalized Musical Intervals and Transformations.</em> Yale
University Press. (Reissued Oxford University Press, 2007.)</p>
</li>
<li>
<p>Messiaen, O. (1944). <em>Technique de mon langage musical.</em> Alphonse Leduc.
(English translation: Satterfield, J., 1956.)</p>
</li>
<li>
<p>Tymoczko, D. (2006). The geometry of musical chords. <em>Science</em>, 313(5783),
72–74. <a href="https://doi.org/10.1126/science.1126287">https://doi.org/10.1126/science.1126287</a></p>
</li>
<li>
<p>Tymoczko, D. (2011). <em>A Geometry of Music: Harmony and Counterpoint in the
Extended Common Practice.</em> Oxford University Press.</p>
</li>
</ul>
<hr>
<h2 id="changelog">Changelog</h2>
<ul>
<li><strong>2026-01-14</strong>: Changed &ldquo;cosets of $D_{12}$&rdquo; to &ldquo;cosets of $\mathbb{Z}_{12}$ (the transposition subgroup)&rdquo; in the twelve-tone composition paragraph. $D_{12}$ (order 24) already includes both transpositions and inversions, yielding only 2 cosets in the full serial group. The four row forms {P, I, R, RI} correspond to 4 cosets of the transposition-only subgroup $\mathbb{Z}_{12}$ (order 12) in the full group of order 48.</li>
</ul>
]]></content:encoded>
    </item>
    <item>
      <title>Spiral Out: Tool&#39;s Lateralus, the Fibonacci Sequence, and the Mathematics of Musical Structure</title>
      <link>https://sebastianspicker.github.io/posts/fibonacci-lateralus/</link>
      <pubDate>Tue, 08 Nov 2022 00:00:00 +0000</pubDate>
      <guid>https://sebastianspicker.github.io/posts/fibonacci-lateralus/</guid>
      <description>Alongside physics and astronomy, two other things have occupied an unreasonable share of my attention since adolescence: mathematics and music. Lateralus by Tool — released 2001, still in rotation — is the piece that most conspicuously occupies the intersection. The song is structurally built around the Fibonacci sequence, from the syllable counts in Maynard Keenan&amp;rsquo;s vocals to the time signature pattern that concatenates to F(16). This post works through the mathematics in some detail and asks why it works musically.</description>
      <content:encoded><![CDATA[<h2 id="two-passions-one-song">Two Passions, One Song</h2>
<p>Physics training means coming to mathematics as a tool before arriving at it as
an object of aesthetic interest, and it
took me longer than it should have to notice that a proof can be
beautiful in the same way a piece of music can be beautiful — not
despite its rigour but because of it. Both reward attention to
structure. Both have surfaces accessible to a casual listener and depths
that only reveal themselves when you look harder.</p>
<p>Lateralus, the title track of Tool&rsquo;s 2001 album, is a convenient case
study for the overlap. It is not the only piece of music built around
Fibonacci numbers — Bartók made the connection decades earlier, and it
appears in scattered places across Western and non-Western traditions —
but it is among the most thoroughly and deliberately constructed, and
the mathematical structure is audible rather than merely theoretical.</p>
<p>What follows is an attempt to do justice to both dimensions: the
mathematics of the Fibonacci sequence and the golden ratio, and the
musical mechanics of how those structures show up and what they do.</p>
<hr>
<h2 id="the-fibonacci-sequence">The Fibonacci Sequence</h2>
<p>The sequence is defined by a recurrence relation. Starting from the
initial values $F(1) = 1$ and $F(2) = 1$, each subsequent term is the
sum of the two preceding ones:</p>
$$F(n) = F(n-1) + F(n-2), \quad n \geq 3$$<p>This gives:</p>
$$1,\; 1,\; 2,\; 3,\; 5,\; 8,\; 13,\; 21,\; 34,\; 55,\; 89,\; 144,\; 233,\; 377,\; 610,\; \mathbf{987},\; 1597,\; \ldots$$<p>The term $987$ is the sixteenth Fibonacci number, $F(16)$. Keep that
in mind.</p>
<p>The recurrence can be encoded compactly in a matrix formulation. For
$n \geq 1$:</p>
$$\begin{pmatrix} F(n+1) \\ F(n) \end{pmatrix} = \begin{pmatrix} 1 & 1 \\ 1 & 0 \end{pmatrix}^n \begin{pmatrix} 1 \\ 0 \end{pmatrix}$$<p>This is more than notational tidiness — it connects the Fibonacci
sequence to the eigenvalues of the matrix
$\mathbf{A} = \bigl(\begin{smallmatrix}1 & 1 \\ 1 & 0\end{smallmatrix}\bigr)$,
which are exactly $\varphi$ and $-1/\varphi$ where $\varphi$ is the
golden ratio. That connection gives us Binet&rsquo;s formula, a closed-form
expression for the $n$-th Fibonacci number:</p>
$$F(n) = \frac{\varphi^n - \psi^n}{\sqrt{5}}, \quad \varphi = \frac{1+\sqrt{5}}{2},\quad \psi = \frac{1-\sqrt{5}}{2} = -\frac{1}{\varphi}$$<p>Since $|\psi| < 1$, the term $\psi^n / \sqrt{5}$ diminishes rapidly,
and for large $n$ we have the convenient approximation:</p>
$$F(n) \approx \frac{\varphi^n}{\sqrt{5}}$$<p>This means Fibonacci numbers grow <em>exponentially</em>, at a rate governed by
the golden ratio. The sequence does not grow linearly or polynomially; it
spirals outward.</p>
<hr>
<h2 id="the-golden-ratio">The Golden Ratio</h2>
<p>The golden ratio $\varphi$ appears as the limit of consecutive Fibonacci
ratios:</p>
$$\varphi = \lim_{n \to \infty} \frac{F(n+1)}{F(n)} = \frac{1+\sqrt{5}}{2} \approx 1.61803\ldots$$<p>It can be derived from a simple geometric proportion: divide a line
segment into two parts such that the ratio of the whole segment to the
longer part equals the ratio of the longer part to the shorter part.
Calling those ratios $r$:</p>
$$\frac{a+b}{a} = \frac{a}{b} = r \implies r^2 - r - 1 = 0 \implies r = \frac{1+\sqrt{5}}{2} = \varphi$$<p>What makes $\varphi$ mathematically distinctive is its continued fraction
representation:</p>
$$\varphi = 1 + \cfrac{1}{1 + \cfrac{1}{1 + \cfrac{1}{1 + \cdots}}}$$<p>This is the simplest possible infinite continued fraction. It is also, in
a precise sense, the <em>hardest</em> real number to approximate by rational
fractions. The convergents of a continued fraction are the best rational
approximations to a real number at each level of precision; the
convergents of $\varphi$ are exactly the ratios of consecutive Fibonacci
numbers: $1/1$, $2/1$, $3/2$, $5/3$, $8/5$, $13/8$, $\ldots$ These
converge more slowly to $\varphi$ than the convergents of any other
irrational number. $\varphi$ is, in this sense, maximally irrational.</p>
<p>That property has a physical consequence. In botanical phyllotaxis — the
arrangement of leaves, seeds, and petals on plants — structures that grow
by adding new elements at a fixed angular increment will pack most
efficiently when that increment is as far as possible from any rational
fraction of a full rotation. The optimal angle is:</p>
$$\theta = \frac{2\pi}{\varphi^2} \approx 137.508°$$<p>This is the <em>golden angle</em>, and it is the reason sunflower seed spirals
count $55$ and $89$ (consecutive Fibonacci numbers) in their two
counter-rotating sets. The mathematics of efficient growth in nature
and the mathematics of the Fibonacci sequence are the same mathematics.</p>
<p>The golden spiral — the logarithmic spiral whose growth factor per
quarter turn is $\varphi$ — is the visual representation of this: it
is self-similar, expanding without bound while maintaining constant
proportionality.</p>
<hr>
<h2 id="fibonacci-numbers-in-music-before-tool">Fibonacci Numbers in Music: Before Tool</h2>
<p>The connection between the Fibonacci sequence and musical structure is
not Tool&rsquo;s invention. The most carefully documented case is Béla
Bartók, whose Music for Strings, Percussion and Celesta (1936) has been
analysed exhaustively by Ernő Lendvai. In the first movement, the
climax arrives at bar 55 (a Fibonacci number), and Lendvai counted the
overall structure as 89 bars — the score has 88, but he added an implied
final rest bar to reach the Fibonacci number — dividing at bar 55 with
near-mathematical precision. Lendvai argued that Bartók consciously embedded Fibonacci
proportions into formal structure, tonal architecture, and thematic
development throughout much of his output.</p>
<p>Whether these proportions were conscious design or an instinct that
selected naturally resonant proportions is contested. The same question
applies to claims about Mozart and Chopin. What is more defensible is
a structural observation about the piano keyboard and Western scales
that requires no attribution of intent:</p>
<p>A single octave on the piano keyboard has <strong>13 keys</strong>, comprising <strong>8
white keys</strong> and <strong>5 black keys</strong>. The black keys are grouped as <strong>2</strong>
and <strong>3</strong>. The numbers $2, 3, 5, 8, 13$ are five consecutive Fibonacci
numbers — $F(3)$ through $F(7)$.</p>
<p>The standard Western scales make this concrete. The major scale
contains <strong>7 distinct pitches</strong> within an octave of <strong>12 semitones</strong>.
The pentatonic scale (ubiquitous in folk, blues, rock) contains <strong>5</strong>
pitches. The chromatic scale contains <strong>12</strong> pitch classes per octave;
counting both endpoints of the octave (C to C) gives <strong>13</strong> chromatic
notes, the next Fibonacci number.</p>
<p>Harmonic intervals in just intonation are rational approximations of
simple frequency ratios: the octave (2:1), the perfect fifth (3:2),
the perfect fourth (4:3), the major third (5:4), the minor third (6:5).
The numerators and denominators are small integers, often Fibonacci
numbers or their neighbours. The major triad — the structural foundation
of tonal Western music — consists of intervals in frequency ratios
$4:5:6$, three consecutive integers that bracket the Fibonacci-adjacent
range.</p>
<p>This does not mean that Western music is secretly Fibonacci. It means
that the integer frequency ratios that produce consonant intervals are
the small integers, and small integers include the small Fibonacci
numbers. The connection is genuine but not exclusive.</p>
<hr>
<h2 id="lateralus">Lateralus</h2>
<p>Tool&rsquo;s <em>Lateralus</em> (2001, album of the same name) is unusual in that
the Fibonacci construction is not an analytical inference applied after
the fact — it was discussed publicly by the band. Drummer Danny Carey has
spoken about his engagement with sacred geometry and mathematical
structure, and the song&rsquo;s construction has been described as intentional
by multiple band members.</p>
<p>There are two primary levels of Fibonacci structure in the song. The
third — the thematic content of the lyrics — makes the mathematical
frame explicit.</p>
<h3 id="the-syllable-count">The Syllable Count</h3>
<p>The opening verses are constructed so that successive lines contain
syllable counts following the Fibonacci sequence ascending:
$1, 1, 2, 3, 5, 8, 13$. The first syllable count is a single word.
The second is another. The third is a two-syllable phrase. The sequence
continues, each line adding the weight of the previous two, until the
thirteenth-syllable line, which in structure and delivery feels like the
crest of a wave.</p>
<p>The second half of the verse then descends: $13, 8, 5, 3, 2, 1, 1$.
Or, in some analyses, the chorus and pre-chorus sections begin a new
ascending Fibonacci run before the full descent, creating a nested
structure of expansions and contractions.</p>
<p>The audible effect of this design is not arbitrary. A sequence of lines
whose syllable counts follow $1, 1, 2, 3, 5, 8, 13$ creates a
consistently accelerating density of text over the same musical time.
The vocal line becomes more compressed as the syllable count rises,
building tension — and then the descent releases it. This is not how
most pop or rock lyrics are structured. It produces a breathing,
organic quality, the way a plant reaches toward light.</p>
<h3 id="the-time-signature-987">The Time Signature: 987</h3>
<p>The verse sections of the song cycle through three time signatures in
succession: $9/8$, then $8/8$, then $7/8$.</p>
$$9/8 + 8/8 + 7/8$$<p>This three-bar pattern repeats. Now: the sequence of numerators is $9$,
$8$, $7$. Written as a three-digit number: <strong>987</strong>. And as noted above,
$987 = F(16)$, the sixteenth Fibonacci number.</p>
<p>Whether this is a deliberate encoding or a remarkable coincidence is a
matter of interpretation. The time signature sequence is definitely
deliberate — asymmetric meters of this kind require careful compositional
choice. The fact that their numerators concatenate to a Fibonacci number
is either intentional and clever or accidental and still remarkable.
Either way, the time signature pattern has a musical function independent
of the Fibonacci reading.</p>
<p>In standard rock, time is almost always $4/4$: four even beats per bar,
a pulse that is maximally predictable and maximally amenable to groove.
The $9/8 + 8/8 + 7/8$ pattern is the opposite. Each bar has a different
length. The listener&rsquo;s internal metronome, calibrated to $4/4$, cannot
lock onto the pattern. The music generates forward momentum not through
a repeated downbeat but through the continuous, non-periodic unfolding
of measures whose lengths shift. This is the rhythmic analogue of a
spiral: no two revolutions are identical in length, but the growth is
consistent.</p>
<p>The chorus and other sections use different time signatures, including
stretches in $5/8$ and $7/8$ — Fibonacci numbers again, and specifically
the $5, 8, 13$ triplet that appears so often in this context.</p>
<h3 id="the-thematic-content">The Thematic Content</h3>
<p>The lyrics are explicitly about spirals, Fibonacci growth, and the
experience of reaching beyond a current state of development. They
reference the idea of expanding one&rsquo;s perception outward through
accumulating cycles, each containing and exceeding the previous one.
The chorus refrain — about spiralling outward — names the mathematical
structure of the golden spiral directly. The song is, in its own terms,
about the process that the mathematics describes.</p>
<p>This kind of thematic coherence between structure and content is what
makes the construction interesting rather than merely clever. The
Fibonacci structure is not decorative. It is the argument of the song
made manifest in its form.</p>
<hr>
<h2 id="why-fibonacci-structure-works-in-music">Why Fibonacci Structure Works in Music</h2>
<p>The most interesting question is not whether the Fibonacci structure is
there — it clearly is — but why it produces the musical effect it does.</p>
<p>Consider what the Fibonacci sequence represents physically. It is the
growth law of structures that build on their own preceding state:
$F(n) = F(n-1) + F(n-2)$. Unlike arithmetic growth (add a constant)
or geometric growth (multiply by a constant), Fibonacci growth is
<em>self-referential</em>. Each term contains the memory of the previous two.
The sequence is expansive but not uniform; it accelerates, but always
in proportion to what came before.</p>
<p>Musical tension and release are, in an important sense, the same
mechanism. A phrase creates an expectation; its continuation either
confirms or subverts that expectation; resolution reduces the tension.
What makes a musical phrase feel like it is building toward something
is precisely the progressive accumulation of expectation — each bar
adding its weight to the previous, the accumulated tension requiring
resolution at a scale proportional to the build-up. The Fibonacci
syllable structure in Lateralus generates this literally: each line is
denser than the previous two lines&rsquo; combined syllable count would
suggest is comfortable, until the structure has to breathe.</p>
<p>The time signature asymmetry works similarly. In $4/4$, the beat is
predictable, and the listener&rsquo;s body can lock to it and then coast on
that lock. In $9/8 + 8/8 + 7/8$, the beat is never fully locked — the
pattern is periodic (it repeats) but the internal structure of each
repetition is shifting. The listener is perpetually catching up,
perpetually leaning slightly into the music to find the next downbeat.
This is not discomfort — it is engagement. The mathematical reason is
that the pattern is large enough to be periodic (it does repeat) but
small enough to be audible as a unit. The brain can learn the 24-beat
super-pattern; it just requires attention that $4/4$ does not.</p>
<p>There is a deeper reason why golden-ratio proportions feel right in
musical form. The golden section of a piece — the point at which the
piece divides in the $\varphi : 1$ ratio — is the point of maximum
accumulated development before the final resolution. In a five-minute
piece, the golden section falls at roughly 3:05. This is, empirically,
where the emotional and structural climax tends to sit in a wide range
of well-regarded music, from Baroque to jazz. Whether composers
consciously target this proportion or whether the proportion is what
accumulated development looks like when done well is not easily
separable. But the mathematical reason it is <em>a</em> proportion worth
targeting is that $\varphi$ is the only division point that is
self-similar: the ratio of the whole to the longer part equals the ratio
of the longer part to the shorter part. There is no arbitrary scale
associated with the golden section; it is scale-invariant, the same
proportion at every level of analysis.</p>
<hr>
<h2 id="a-brief-note-on-binet-and-limits">A Brief Note on Binet and Limits</h2>
<p>The closed-form expression for Fibonacci numbers,</p>
$$F(n) = \frac{\varphi^n - \psi^n}{\sqrt{5}},$$<p>has a pleasing consequence for large $n$. Since $|\psi| \approx 0.618 < 1$,
the term $\psi^n \to 0$, and $F(n)$ is simply the nearest integer to
$\varphi^n / \sqrt{5}$. The integers produced by the Fibonacci recurrence
are the integers that $\varphi^n / \sqrt{5}$ passes closest to. The
exponential growth of $\varphi^n$ and the rounding to integers together
give the sequence.</p>
<p>This is also why the ratios $F(n+1)/F(n)$ converge to $\varphi$
exponentially fast — the error is $\mathcal{O}(|\psi/\varphi|^n)
= \mathcal{O}(\varphi^{-2n})$ — and why, for musical purposes, the
Fibonacci ratios $8:5$, $13:8$, $21:13$ are already excellent
approximations of the golden ratio, close enough that the ear cannot
distinguish them from $\varphi$ in any direct sense.</p>
<hr>
<h2 id="what-lateralus-is">What Lateralus Is</h2>
<p><em>Lateralus</em> is not a math lecture set to music. It is a nine-minute
progressive metal track that is physically involving, rhythmically
complex, and lyrically coherent. The Fibonacci structure would be
worthless if the song were not also, on purely musical terms, good.</p>
<p>What the mathematics adds is a vocabulary for something the song achieves
anyway: the sense of growing without ever arriving, of each section being
both a resolution of what came before and an opening toward something
larger. The golden spiral does not end. The Fibonacci sequence does not
converge. The song does not resolve in the sense that a classical sonata
resolves; it spirals to a close.</p>
<p>The reason this is worth writing about is that it makes concrete a
connection that is usually stated vaguely: mathematics and music are
similar. They are similar in specific and articulable ways. The
self-referential structure of the Fibonacci recurrence, the scale-
invariance of the golden ratio, the information-theoretic account of
tension and expectation — these are not metaphors for musical experience.
They are, in this case, the actual mechanism.</p>
<hr>
<h2 id="references">References</h2>
<p>Lendvai, E. (1971). <em>Béla Bartók: An Analysis of His Music.</em> Kahn &amp;
Averill.</p>
<p>Benson, D. J. (2006). <em>Music: A Mathematical Offering.</em> Cambridge
University Press. <em>(For an introduction to the general theory of tuning,
temperament, and harmonic series.)</em></p>
<p>Tool. (2001). <em>Lateralus.</em> Volcano Records.</p>
<p>Livio, M. (2002). <em>The Golden Ratio: The Story of Phi, the World&rsquo;s Most
Astonishing Number.</em> Broadway Books.</p>
<p>Knott, R. (2013). Fibonacci numbers and the golden section in art,
architecture and music. <em>University of Surrey Mathematics Department.</em>
<a href="https://r-knott.surrey.ac.uk/Fibonacci/fibInArt.html">https://r-knott.surrey.ac.uk/Fibonacci/fibInArt.html</a></p>
<hr>
<h2 id="changelog">Changelog</h2>
<ul>
<li><strong>2025-11-20</strong>: Clarified the Bartók bar count: the written score has 88 bars; Lendvai&rsquo;s analysis counted 89 by adding an implied final rest bar to reach the Fibonacci number. Previously stated as &ldquo;89 bars&rdquo; without qualification.</li>
</ul>
]]></content:encoded>
    </item>
  </channel>
</rss>
