Sunday, April 12, 2026

Tonal Cognition and Quantum Search Analogies

The analogy to quantum amplitude amplification should be understood as a conceptual description of global hypothesis redistribution, closely related to attractor dynamics and winner-take-all competition in neural models of perception.

Grover-like amplification is similar to attractor dynamics in neural systems.

Tonal listening can be modeled as a probabilistic search process over competing tonal hypotheses. Musical cues act as operators that progressively amplify compatible interpretations while suppressing others. The resulting dynamics resemble continuous amplitude amplification, analogous to Grover search, where probability mass rotates toward a target state. In neural terms this corresponds to attractor dynamics within predictive processing systems, where stable tonal centers emerge as low free-energy states of interpretation.

Musical feeling becomes a visible trace of inference dynamics.

[MAIN ARTICLE: TONAL CONSTANCY]

An interesting analogy can be drawn between tonal cognition and mechanisms used in quantum amplitude amplification algorithms, particularly those related to Grover's algorithm. Although the comparison is not meant literally, the mathematical intuition behind these algorithms provides a useful conceptual framework for understanding how listeners stabilize tonal interpretations over time. At any moment during listening, the auditory system can be thought of as maintaining a distribution over possible tonal interpretations. A listener may simultaneously entertain several competing hypotheses: that a passage belongs to a particular key, that a certain pitch functions as a tonic, that an interval acts as a leading tone, or even that the passage is not tonal at all. Before sufficient musical context accumulates, none of these interpretations is fully determined.

In this sense the perceptual state resembles a superposition of tonal hypotheses. Importantly, however, the initial distribution is not uniform. Extensive exposure to tonal systems, such as the twelve-tone equal-tempered framework, creates strong prior expectations. As a result, familiar tonal structures begin with higher probability weight than unfamiliar alternatives. This situation resembles a quantum search initialized with a biased amplitude distribution, where some candidate solutions already carry greater weight because they are more strongly expected by the system.

In quantum search algorithms, the role of the oracle is to mark candidate states and allow subsequent transformations to amplify their probability. Musical events play a somewhat analogous role in perception. Individual chords, melodic motions, and rhythmic articulations act as cues that selectively reinforce or weaken particular tonal hypotheses. For instance, a dominant-seventh chord strongly favors certain key interpretations; a cadential progression sharply increases the likelihood of a specific tonic; conversely, an unexpected harmonic event can reduce confidence in previously favored interpretations.

(Note on the oracle: Music is a "Stochastic Oracle", unlike a database search where the target is fixed, the "target" in music is often being constructed by the oracle itself as it speaks. It’s a self-referential search.)

Within predictive models of perception, these events effectively reshape the error landscape of competing tonal hypotheses. Each new observation alters the relative plausibility of candidate interpretations, not simply by incrementally adding probability mass but by suppressing alternatives as well. In this respect the dynamics resemble amplitude amplification: evidence does not merely accumulate but redistributes probability across the entire hypothesis space.

As a piece unfolds within a coherent tonal grammar, repeated cues gradually concentrate probability on a single tonal interpretation. Cadential events illustrate this particularly clearly. Prior to a cadence, several tonal centers may remain plausible. As the cadential progression unfolds, the probability associated with the intended tonic increases rapidly while competing interpretations lose support. After the cadence, the system effectively commits to a single tonal interpretation. This description is compatible with Bayesian models of perceptual inference, yet the quantum analogy highlights an additional property of tonal cognition: updates operate globally across the hypothesis space rather than strictly on a note-by-note basis. Predictions interact with incoming information, reinforcing some interpretations while actively suppressing others, producing a winner-take-all stabilization of tonal structure.

Another useful parallel concerns robustness. Quantum amplitude amplification algorithms are designed to tolerate moderate noise or imperfect marking of candidate states while still converging on the correct solution. Tonal perception shows similar resilience. Even when pitches drift, tuning systems are distorted, or melodic lines deviate from strict intonation, listeners typically maintain a stable tonal interpretation until accumulated deviations exceed a perceptual threshold. From this perspective, learned tuning systems such as twelve-tone equal temperament function partly as cognitive stabilization mechanisms(an "error-correcting code"). Once internalized, they provide discrete categorical anchors toward which continuous pitch variation can be perceptually corrected. Melodic expectation and harmonic context repeatedly steer pitch interpretations back toward these categories, maintaining stable tonal structure even under conditions of acoustic variability.

Although the analogy to quantum search should not be taken as a literal claim about neural implementation, it offers a useful conceptual model for tonal cognition. Musical perception can be understood as an iterative process in which competing tonal hypotheses interact, reinforce, and suppress one another until a coherent interpretation of the tonal environment emerges.

Tonal interpretation may therefore be understood as the emergence of an attractor state in a dynamically competing hypothesis space, where musical events progressively amplify one tonal interpretation while suppressing alternatives.



Tonal tension and resolution behave very much like a continuous search process converging to an attractor which is extremely similar to continuous versions of Grover search used in quantum mechanics. The standard picture of Grover's algorithm is discrete: superposition of states, oracle marks target, amplitude amplification, collapse to solution. But physicists also describe Grover search as a continuous rotation of probability amplitude between states. Probability gradually shifts from: many possibilities towards a target state, through repeated small transformations.

Tonal perception behaves similarly, in tonal listening the brain maintains something like a probability field over tonal interpretations.

possible states: tonic = C, tonic = G, ..., modal / ambiguous, non-tonal interpretation,

At the beginning of a piece probability is spread across many states, as music unfolds dominant chords, leading tones, cadential motion, gradually rotate the probability distribution toward one tonal center. This is not a sudden update, it’s a gradual steering process, exactly what happens in continuous Grover dynamics.

Some cadences work best because they apply multiple reinforcing cues simultaneously.

Example: V7 → I
Contains: leading tone resolution, dominant function, root motion, harmonic expectation.

All cues point to the same attractor so the system rapidly converges. This connects directly to tonal tension which happens when probability mass is still distributed. Resolution happens when one attractor dominates. That explains several musical phenomena:

suspension: probability temporarily ambiguous
deceptive cadence:amplification redirected to unexpected attractor
modulation: system slowly steered toward new attractor basin
tonal ambiguity: multiple attractors competing

The Free Energy Principle basically says biological systems minimize prediction error / surprise. In tonal listening brain tries to maintain a stable generative model of the music.

If the model predicts well: low free energy, stable tonal center.
If not: model updates, new tonal interpretation.

So tonal attractors are essentially low free-energy states of musical interpretation. 12-EDO acts like a cognitive error-correcting system because tonal perception does something like: continuous pitch input > snap toward discrete tonal categories. Just like phonemes in speech. Even if tuning drifts, the brain pulls notes back into stable tonal bins, classic attractor behavior.

Tonal harmony starts to resemble navigation through an energy landscape.

---

Beyond the Analogy: What Can We Actually Model?

When we move past the metaphor, it becomes clear that traditional Bayesian models of music perception struggle to account for the global phase transitions inherent in tonal cognition. When a cadence resolves, it doesn't merely increase the probability of the tonic by a discrete percentage; it triggers a total redistribution of the hypothesis space; a global shift that stabilizes the tonal center and collapses previous ambiguities.

1. Music as a Mode of Perception
This approach aligns with a specific philosophical stance: music is not an acoustic object, but a mode of perception. It is a cognitive state facilitated by sensory stimuli that possess regular, almost universal characteristics. From these stimuli, "object-like" attributes are stabilized and eventually codified into what we call "Music Theory."

Tonality is the primary component of this experience: a hierarchical interpretation of pitch categories defined by their functional roles. While traditional theory focuses on rhythm, melody, and harmony, these cannot exist without axiomatization. Even a single note is not "music" until a perceiver adopts a generative model to interpret it.

2. Rhythm and the Information Theory of Randomness
Rhythm is not "in" the signal; it is an inference of predictability. To a perceiver, rhythm is the successful extraction of a pattern from a temporal signal (auditory or visual).

This follows the information-theoretic approach to randomness: randomness is not defined by the process (like a dice roll), but by the sequence's interaction with a predictive model. There is no "absolute" music or "absolute" randomness, only a measure of prediction success.

The Stochastic Oracle: Current deep learning models show that human "musicality" ratings correlate strongly with a model's prediction errors. Music act as a "Stochastic Oracle", a system that writes about its own predictability in a self-referential search for order.

3. The Tonal Object and Categorical Perception
If music requires a generative model, then the Tonal Object is a construct of the perceiver. We model this through Categorical Perception, which allows the system to remain stable under the "noise" of microtonality or xenharmonic intervals.

The Tonal Object is the stable configuration of primitives that allows for a continuous interpretation of sound. It functions as a "Gestalt," similar to face perception, where the whole is recognized before the parts.

4. Dynamics: Bottom-Up vs. Top-Down
Research into probe-tone data (Krumhansl, Milne) suggests that tonal hierarchies reflect "bottom-up" sensory influences (acoustics and spectra). However, perception is a two-way street. We must account for the "Soulware", the internal, running interpretation that converges with the stimuli.

The Problem with Bottom-Up Only: A "mode" is simply a rotation of a scale; its interval content is identical. Performing a Discrete Fourier Transform (DFT) on a pitch set reveals a result that is invariant to rotation.

The Temporal Solution: A mode acquires its character through temporal grouping and beat-dependence. To model this, we move from a static DFT to a windowed interval matrix vectorization. This captures how attention is redirected toward a specific tonic, transforming the same acoustic set into a different musical sensation.

5. Historical Priors vs. Enculturation
The ability to distinguish modes is likely a deep perceptual prior rather than a result of modern mass-exposure. Ancient Mesopotamian and Greek theories already assigned specific "moral characters" or deities to different scales (e.g., the "lazy" Lydians of Plato or the planetary mappings in Chaldean theory).

The fact that these rotations were recognized as distinct "personalities" thousands of years ago suggests that distinguishing diatonic rotations doesn't require a lifetime of cultural conditioning. It is a fundamental feature of how our "search algorithm" navigates interval space.

[...]
By utilizing a hybrid Quantum-Probabilistic model, we can move beyond description and begin to simulate:

Rhythm Emergence: As the stabilization of predictability.
Tonal Cognition: As a global, winner-take-all search for an attractor.
Pitch Categorization: As a dynamical, error-correcting lattice.
Scale Emergence: As the inevitable result of a system minimizing free energy in a spectral landscape.

Sunday, March 22, 2026

Spectral Congruence & Pitch Cyclicity

Pitch cyclicity (including octave equivalence) can be understood as an emergent property of spectral self‑similarity under frequency scaling; by designing or analyzing spectra for scaling congruence one can predict or create alternative perceptual pitch cycles and compatible tuning systems.

(DRAFT)

Existing models of pitch perception successfully account for consonance and sensory dissonance relationships between sounds, but they do not generally explain why pitch space is perceived as cyclic at a specific interval. We propose that octave equivalence arises as a special case of a more general principle of spectral congruence: pitch categories emerge when the spectral structure defining a timbre remains approximately invariant under frequency scaling. Under this formulation, the perceived pitch cycle (equave) is not fixed but depends on the spectral properties that define pitch identity: pitch equivalence as a consequence of spectral congruence under frequency scaling

Computational demonstrations show how spectra with controlled scaling symmetries can produce alternative pitch cycles and corresponding tuning systems. An interactive implementation enables the co-design of timbre and tuning by directly manipulating the spectral parameters that determine the distance of the perceptual equave. The approach provides a framework for constructing compatible timbres and musically practical tuning systems while also revealing edge cases in which pitch cyclicity weakens or fails to emerge.


1. Introduction

The concept of pitch is broadly consistent across fields such as auditory perception, psychoacoustics, and music theory. It is commonly defined as “that auditory attribute of sound according to which sounds can be ordered on a scale from low to high.” For simple stimuli such as pure tones, this ordering closely corresponds to frequency. However, for complex sounds, pitch is often described as something that must be extracted from the signal rather than directly given (Oxenham, 2004).

Importantly, not all sets of sounds can be meaningfully arranged along a single low–high continuum, even when each sound individually is perceived as pitched. This suggests that the existence of a continuous pitch dimension depends on constraints beyond pitch itself. In practice, such ordering is facilitated when sounds share a similar timbre that is, a comparable spectral composition, such as notes produced on a single instrument. Under these conditions, pitch becomes a stable perceptual attribute, supported by approximate invariance under spectral scaling and the formation of coherent auditory objects (Bregman; Terhardt).

However, the existence of pitch within a given timbre does not guarantee comparability across timbres. Sounds with distinct spectral structures may each exhibit clear pitch height, yet fail to align perceptually. For example, highly inharmonic or irregular resonant systems can produce well-defined pitch-like percepts that do not admit a clear unison relationship with harmonic tones. This suggests that pitch equivalence, the degree to which two sounds are perceived as equivalent or substitutable, is itself dependent on timbre.

Pitch equivalance, or also affinity, refers to the perceptual similarity between sounds: the extent to which one sound may be confused with, or replace, another. Within a given timbre, the strongest equivalence occurs at unison, followed by the octave, corresponding to a doubling of frequency (2:1). This relationship is widely observed across musical cultures and has historically served as the foundational interval for scale construction. The resulting phenomenon, known as octave equivalence, is often treated as universal (Burns & Ward).

Crucially, octave equivalence does more than establish similarity between discrete sounds. Because it arises from a continuous transformation (frequency scaling), it induces a topological structure on pitch space: a cycle. In this way, pitch is not merely ordered linearly, but organized into repeating classes. This cyclic structure provides a stable perceptual framework that supports categorization, largely independent of individual differences in hearing range or discrimination thresholds. Pitch equivalence thus functions as a perceptual strategy for structuring an otherwise continuous auditory dimension.

Empirical studies, however, reveal that the octave is not perceived as a fixed interval. For example, Lola L. Cuddy (1982) found that listeners tend to stretch the octave when tuning sine waves, while accuracy improves in musically structured contexts such as triads. Similarly, tuning practices in instruments such as the piano exhibit systematic deviations from the ideal 2:1 ratio. These findings suggest that octave equivalence, while robust, is not exact, and may reflect underlying perceptual and physiological constraints. At the same time, its presence even in simple stimuli has been linked to internal auditory templates and mechanisms associated with virtual pitch (Terhardt).

Moreover, pitch perception is strongly shaped by context. Musical expectation and cognitive factors influence how sounds are categorized, beyond their raw spectral content. Phenomena such as the ambiguity of Shepard tones, or the functional reinterpretation of identical pitch material in different harmonic contexts, illustrate that pitch perception is not a passive reflection of acoustic input. These effects have been extensively studied by researchers such as Diana Deutsch and Carol Krumhansl.

The origins of octave equivalence remain debated. Hermann von Helmholtz proposed that it arises from shared spectral components between tones separated by a 2:1 ratio. In contrast, more recent work by Peter A. Cariani and Bertrand Delgutte (1996) suggests that octave equivalence may emerge from neural coding strategies, rather than from spectral similarity alone.

Closely related to equivalence is the concept of consonance. Originally formalized by Helmholtz, modern psychoacoustics explains sensory dissonance in terms of roughness arising from interactions within critical bandwidths. Building on this, William A. Sethares developed a model linking timbre and tuning, showing that consonant intervals correspond to minima in the dissonance curve derived from spectral interactions (following Reinier Plomp and Willem J. M. Levelt). This framework enables the derivation of optimal tuning systems for a given timbre, although the inverse problem, constructing timbres for a desired set of intervals, remains computationally difficult.

Despite these advances, existing models do not explain why certain intervals such as the octave become perceptual equivalence classes, nor why pitch space assumes a cyclic structure. In other words, while consonance models account for interval preference, they do not account for the emergence of categorical periodicity in pitch.

The model introduced in this work addresses this gap by explaining pitch equivalence and cyclicity without requiring a commitment to specific mechanisms of pitch encoding, such as place-based or temporal theories.

The next section reviews the conventional model of pitch organization distinguishing pitch height, pitch class, and chroma and introduces standard representations such as the pitch helix.


2. Pitch Height, Chroma, and Cyclic Representations

Géza Révész, William L. Idson, and Dominic W. Massaro proposed that pitch should be understood as a two-dimensional perceptual attribute, in which pitch height constitutes only one dimension. The second dimension, termed chroma, refers to the cyclical, categorical aspect of pitch. Within this framework, a tone is characterized both by its height (low to high) and by its position within a repeating set of pitch categories, often associated with musical functions such as “fifth-ness,” “leading-tone-ness,” or, most fundamentally, “octave-ness,” which provides the reference frame for the others.

This dual structure is commonly represented using a helical model. In this representation, pitch height corresponds to vertical position, while chroma is mapped to angular position around the helix. Tones sharing the same chroma (e.g., those labeled with the same pitch class in musical notation) align vertically, separated by octave intervals.

The most influential formalization of this idea is the pitch helix introduced by Roger Shepard and later developed by Diana Deutsch. In this model, pitch height corresponds to logarithmic frequency, while chroma corresponds to position modulo octave. Formally, this can be expressed as a mapping of frequency onto a circular dimension, such that tones separated by a factor of 2 occupy the same angular position, reflecting octave equivalence.

Conventional accounts typically explain octave equivalence by noting that harmonic spectra exhibit self-similarity under doubling of frequency. The central claim of the present work generalizes this idea: octave equivalence is not unique, but rather a specific instance of a broader principle. Pitch cycles arise whenever a timbre exhibits sufficient spectral congruence under frequency scaling. Under this view, the equave depends on the spectral structure of the sound, and need not be fixed at a 2:1 ratio. If a spectrum is approximately invariant under scaling by a factor k, then pitch equivalence may emerge at that ratio.

It is important to distinguish this perceptual notion of equivalence from the concept of pitch class as used in music theory. In many theoretical contexts, pitch classes function as abstract labels akin to equivalence classes in algebra used for analytical purposes. While in standard twelve-tone equal temperament chroma and pitch class coincide, this correspondence is not necessary. Musicians frequently impose alternative analytical structures, for example by redefining equivalence relationships for purposes of reharmonization or compositional experimentation.

Such distinctions become more evident in non-standard tuning systems. For example, in the Bohlen–Pierce scale, the period is often described as a “tritave” (3:1 ratio). Although equal divisions of this interval (e.g., 13-EDT) define a repeating structure analytically, this does not imply perceptual equivalence in the same sense as octave equivalence in harmonic spectra. When realized on harmonic instruments, such systems may produce continuously expanding chroma rather than stable repetition, and the assigned pitch classes do not necessarily correspond to perceptual identity.

This highlights a key point: analytical structure does not guarantee perceptual equivalence. In practice, however, musical systems tend to align these two aspects. A clear example is found in transcription across instruments. For instance, Clair de Lune by Claude Debussy spans a wide range on the piano, yet can be effectively adapted to instruments with a more limited range, such as the guitar, by compressing distant octaves. Despite these transformations, the piece remains recognizable because octave equivalence preserves functional relationships. By contrast, substituting intervals based on a different periodicity (e.g., tritave equivalence) would fundamentally disrupt these relationships and compromise recognition.

At the same time, pitch perception is not strictly determined by chroma. Experimental evidence shows that pitch contour can dominate categorical identity, and that listeners tolerate significant deviations from exact tuning (e.g., stretched octaves) without loss of recognition. While some have argued that chroma is therefore a weaker or secondary perceptual dimension, such conclusions may be overstated. Octave equivalence does not imply identity of sound, but rather a structured form of perceptual similarity.

Particularly revealing are the stimuli described by Roger Shepard as “perfect octaves,” which give rise to well-known auditory illusions such as Shepard tones and the endlessly ascending or descending scale. In these cases, spectra are constructed to exhibit near-perfect self-similarity across octave shifts, making tones separated by a factor of 2 difficult to distinguish. The resulting perceptual ambiguity can produce bistable interpretations of pitch direction, depending on context.

This phenomenon extends to the tritone paradox described by Diana Deutsch, in which pairs of tones separated by a tritone can be perceived as ascending or descending depending on prior context. Within the helical framework, this can be understood as a consequence of cyclic structure: if pitch space contains a point of return (octave equivalence), it must also contain perceptual oppositions that generate directional ambiguity. However, such effects depend on specific spectral conditions and do not arise for all pitched sounds.

This suggests that the pitch helix is not a universal representation directly tied to all pitch perception, but rather an emergent structure that applies under particular spectral conditions. In this sense, cyclic pitch organization may reflect a perceptual strategy rather than a fixed property of auditory processing.

The notion of “perfect octaves” can be generalized as a form of autocorrelation in log-frequency space. This principle underlies the synthesis methods used in the present work to construct and test alternative pitch cycles. While some generated sounds may occupy ambiguous positions with respect to pitch, the presence of analogous perceptual effects such as cyclic equivalence and directional ambiguity suggests that spectral self-similarity plays a central role in the formation of pitch categories.

The next section reviews existing models of timbre, harmonicity, and dissonance, which account for consonant interval structures but do not directly explain why pitch equivalence arises at specific intervals.


3. Harmony, Consonance, and Spectral Structure

The systematic study of harmony, as traditionally conceived in music theory, is complicated by the influence of aesthetic and cultural factors, which often obscure more fundamental perceptual mechanisms. Models that classify harmony in terms of chords, dyads, or triads tend to produce inconsistent results, as perceived consonance depends strongly on musical context. For example, a major triad evaluated in isolation may receive a moderate rating, yet be judged significantly more consonant when it follows a dominant or leading-tone context. Conversely, the same chord, removed from context, may be perceived as less stable. This suggests that consonance is not an inherent property of the triad itself, but emerges within a broader tonal framework.

Traditional theory emphasizes intervals defined by simple integer ratios as the foundation of consonance, a view supported by the harmonic series of vibrating strings. However, this principle has remained largely heuristic, functioning more as an intuitive guideline than as a predictive or explanatory model.

To address this, psychoacoustic research has focused on sensory consonance, isolating it from musical structure. This line of work examines roughness and beating phenomena arising from interactions between nearby frequency components.

Hermann von Helmholtz first proposed that consonance is governed by auditory roughness. Building on this idea, Reinier Plomp, Willem J. M. Levelt, and others formalized the relationship between roughness and interval perception through the concept of critical bandwidth, leading to the development of dissonance curves for both pure and complex tones.

While early interpretations suggested that these curves did not align with musically significant intervals, later work by William A. Sethares demonstrated that, when extended to complex spectra, the minima of total roughness across all frequency pairs do in fact correspond closely to conventional musical intervals. In harmonic spectra, these minima often align with intervals found in 12-tone equal temperament or simple rational ratios, thereby linking sensory dissonance with established tuning systems.

This framework provides a powerful method for relating timbre to optimal tuning systems. However, it primarily accounts for simultaneous (vertical) combinations of tones. It does not fully explain the perception of intonation in sequential (melodic) contexts, where judgments of pitch accuracy cannot be reduced to instantaneous spectral interactions alone. Moreover, its integration with broader theories of perceptual organization and category formation remains limited.

An additional complication arises from the presence of combination tones generated by nonlinear processes in the cochlea, as shown by Sylvain Pressnitzer and Roy D. Patterson (2001). These effects reintroduce spectral components that are not explicitly present in the stimulus, making precise control of perceived timbre more difficult.

In summary, given a timbre understood as a spectral profile it is possible to compute a dissonance function and identify intervals that are physiologically consonant. However, this still leaves an open question: which of these intervals, if any, becomes a perceptual equivalence class, and why?

As argued in the previous sections, this problem cannot be resolved solely in terms of sensory consonance. Instead, it is necessary to consider the role of spectral congruence under frequency scaling. When a timbre exhibits approximate self-similarity across scales, stable equivalence relationships may emerge. In this sense, pitch cyclicity can be understood as arising from structured invariances in the spectrum.

The next section introduces a formal model of spectral congruence and demonstrates how perceptual equivalence and the corresponding pitch cycles can emerge from these properties, along with methods for constructing compatible tones and tuning systems.


4. Spectral Congruence and the Emergence of Pitch Cycles

The idea that pitch equivalence may arise from spectral self-similarity across frequency scaling is not new. As discussed previously, various researchers have suggested that perceptual equivalence, particularly octave equivalence, can be explained either through acoustic structure or through neural coding mechanisms. Computational approaches have also explored related ideas by analyzing self-similarity within existing sounds (e.g., work by Andrew Milne).

Rather than analyzing arbitrary signals for such patterns, the approach taken here is constructive. Instead of searching for spectral self-similarity, we directly generate spectra that contain it by design. The well-known example of Shepard tones provides a clear illustration of this principle: spectra composed of octave-spaced partials exhibit perfect self-similarity under scaling by a factor of two. However, there is nothing intrinsically unique about the octave in this construction. Similar structures can be generated using other scaling ratios.

The central idea is simple. A timbre can be understood as a spectral distribution, and frequency scaling corresponds to multiplying all frequencies by a constant factor. When a spectrum closely matches a scaled copy of itself, we say that the sound exhibits spectral congruence.

4.1 Spectral Self-Similarity Under Scaling

Let the spectrum of a timbre be represented as \( S(f) \), where \( S(f) \) denotes the amplitude (or energy) at frequency \( f \). In other words, the spectrum describes how acoustic energy is distributed along the tonotopic frequency axis.

Spectral congruence occurs when the spectrum is approximately invariant under scaling by a factor ( r ):

\[S(f) \approx S(rf)\]

Here \( r \) is a scaling factor that produces maximal alignment between the spectrum and its scaled copy.

If this condition holds strongly for some value of \( r \), then tones related by this scaling are expected to produce highly similar spectral patterns in the auditory system. Under these circumstances, listeners may treat tones separated by the factor \( r \) as perceptually equivalent.

The scaling factor \( r \) therefore defines a pitch cycle, or equave.

4.2 Simple Examples

The principle becomes clear in simple synthetic spectra.

A spectrum constructed from octave-spaced partials

\[f, 2f, 4f, 8f, 16f, \dots\]

is invariant under scaling by a factor of two. Scaling the spectrum by two simply translates the pattern:

\[2f, 4f, 8f, 16f, 32f, \dots\]

The same idea applies to other scaling ratios. For example, a spectrum built from powers of three,

\[f, 3f, 9f, 27f, \dots\]

exhibits self-similarity under scaling by a factor of three, producing a “tritave” cycle rather than an octave cycle.

In general, constructing spectra using a multiplicative generator automatically embeds a preferred scaling periodicity into the sound. When such spectra are synthesized, the resulting tones provide minimal examples of timbres with a specified pitch cycle.

These structures also inherit many of the perceptual effects associated with Shepard tones, including cyclic pitch relationships and directional ambiguities.

In this framework, octave equivalence is not a special property of pitch perception, but a consequence of the spectral structure of harmonic sounds.


The next section extends this framework to richer spectral constructions, demonstrating how timbre and tuning can be co-designed to produce musically usable systems with alternative pitch cycles.


5. Constructing Timbres Compatible with a Given Pitch Cycle

The previous section introduced spectral congruence as the mechanism underlying pitch cycles. The next question is how to construct richer timbres that preserve this property while supporting musically usable tuning systems.

Two complementary approaches are explored. The first begins by specifying the pitch cycle (equave) and derives compatible timbres and scales. The second starts from spectra and constructs a tuning structure that reinforce it.


5.1 Starting from the Equave

Suppose a pitch cycle is defined by a scaling ratio (q). For example, consider the case

\[q = 2.71\]

A minimal spectrum exhibiting this cycle can be generated from the sequence

\[f, qf, q^2 f, q^3 f, \dots\]

This construction produces a spectrum that is invariant under scaling by \(q\), ensuring strong spectral congruence at that ratio.

While such spectra already demonstrate the principle, musical practice typically requires more structure. In particular, tuning systems usually satisfy two practical conditions:

- they allow transposition and modulation, which favors equal divisions of the equave;
- they contain a manageable number of notes, typically corresponding to step sizes between roughly 50 and 200 cents.

Given an equave \(q\), one can therefore explore equal divisions of the equave, analogous to equal temperament but with a different periodic interval.


5.2 Equal Divisions of the Equave

Let \(N\) denote the number of divisions of the equave. The step size in cents is then

\[s = \frac{1200 \log_2(q)}{N}\]

For example, if

\[q = 2.71\]

then the equave size is approximately

\[1200 \log_2(2.71) \approx 1725 \text{ cents}\]

If the equave is divided into \(N=11\) equal steps, the step size becomes

\[s \approx 156 \text{ cents}.\]

The resulting tuning consists of the set

\[{ ks \pmod{q} \mid k = 0,\dots,N-1 }.\]


5.3 Generator Structure

The structure of such tuning systems depends on the modular properties of \(N\). In particular, the interaction between spectral partials and scale steps can be understood in terms of generators of the cyclic group \( \mathbb{Z}_N \).

If \(N\) is prime, every non-zero step is coprime with \(N\), and therefore acts as a generator of the group. In this case, interactions between partials and scale degrees distribute across all pitch classes.

For example, if \(N=11\), any step size generates the entire set of pitch classes.

By contrast, when \(N\) is composite, only those integers that are coprime with \(N\) act as generators. For instance, when \(N=12\), the generators are

\[{1,5,7,11}.\]

Using partial generators corresponding to non-coprime values (such as 2 or 3) produces spectra whose interactions with the tuning occupy only a subset of pitch classes. In such cases additional chromatic material may emerge when higher partials are considered.


5.4 Timbre–Tuning Interaction

This perspective highlights an important point: spectral structure and tuning structure interact through their shared modular organization.

Partial generators define the distribution of spectral energy, while the equal division determines the available pitch classes. Their interaction determines how spectral components reinforce or destabilize particular intervals.

In the computational implementation developed for this work, once an equave and division number are specified, the system enumerates possible generators and visualizes the resulting spectral–tuning interactions.


The next section explores the complementary approach, in which a desired spectra is specified first, and compatible tuning systems are derived to maximize congruence.


5.5 Finding a Compatible Equal Division

In the previous construction the equave was fixed first, and equal subdivisions were explored afterward. This revealed that a given equave admits many possible timbre–tuning combinations through different spectral generators.

A complementary situation arises when part of the spectral structure is already fixed. For example, while exploring timbre one might choose an equave \(q\) together with an additional spectral generator \(g\), producing a spectrum containing components

\[f, qf, g f, qg f, q^2 f, \dots\]

In this case the available freedom is reduced: not every equal division of the equave will align well with the spectral structure. Instead, the problem becomes determining whether there exists an equal division of the equave that approximates the interaction between the generators.

Logarithmic representation

The relationship between the two generators becomes simpler in logarithmic coordinates. Let

\[x = \log_q(g)\]

which expresses the generator \(g\) as a fraction of the equave in log space.

For example, if

\[q = 2.71, \qquad g = 1.49\]

then

\[x = \log_q(g) \approx 0.3999.\]

In this representation, successive powers of \(g\) correspond to rotations on the unit interval:

\[k x \pmod{1}.\]

This sequence determines how the spectral generator distributes partials across the pitch cycle.

Approximating the rotation with an equal division

To construct an equal division that captures this structure, we approximate \(x\) by a rational number

\[x \approx \frac{m}{N}.\]

When such an approximation is good, the relationship

\[q^{m} \approx g^{N}\]

holds approximately. This means that \(N\) equal divisions of the equave produce a step size compatible with the spectral generator.

A practical way to obtain good rational approximations is to expand \(x\) as a continued fraction and examine the denominators of its convergents. These denominators provide candidate values for \(N\), the number of divisions of the equave.

Example

In the example above,

\[x \approx 0.4 \approx \frac{2}{5}.\]

This suggests using an equal division of the equave into \(N = 5\) steps. In this tuning system, the second step approximates the generator \(g\), since

\[q^{2/5} \approx g.\]

Equivalently,

\[q^2 \approx g^5.\]

Thus a 5-division of the equave provides a tuning system whose intervals closely reflect the spectral relationships of the chosen timbre.

Interpretation

This construction reveals a direct connection between spectral generators and tuning systems. Spectral generators determine rotations on the logarithmic pitch circle, while equal divisions correspond to rational approximations of those rotations.

Small denominators in the continued fraction expansion of \(x\) therefore correspond to tuning systems that efficiently capture the spectral structure of the sound.

In this sense, designing a tuning system compatible with a given timbre becomes a problem of approximating spectral rotations with a finite cyclic structure.

This formulation reveals that the problem of matching timbre and tuning reduces to a classical problem of Diophantine approximation on the logarithmic pitch circle.


5.5 Multiple generators

When several spectral generators are present, the problem becomes one of simultaneous rational approximation. Each generator \(g_i\) defines a rotation \(x_i = \log_q(g_i)\) on the logarithmic pitch circle. A compatible equal division corresponds to finding a denominator \(N\) such that all \(x_i\) are well approximated by fractions \(m_i/N\).

When the x_i are irrationally independent, the partials become equidistributed over pitch classes, which produces timbres that refuse to stabilize any tuning.

Irrational rotation (\(x \notin \mathbb{Q}\)) orbit is dense in the circle. So the tuning-finding procedure is finding rational approximations of circle rotations.

When you add multiple generators

\[g_1, g_2, ..., g_n\]

Then

\[x_i = \log_q(g_i)\]

and the orbit becomes

\[(k x_1, k x_2, ..., k x_n) \pmod{1}\]

an n-torus

\[\mathbb{T}^n\]

This is why the problem becomes simultaneous Diophantine approximation.

Finding a tuning means finding \(N\) such that

\[x_i \approx \frac{m_i}{N}\]

for all \(i\).

If the numbers \(x_1,...,x_n\) are rationally independent, the orbit

\[k(x_1,...,x_n)\]

becomes dense in the torus.

That implies partials generated by those spectral relations will wander through pitch space without forming a finite cycle.

Which means: no stable equave. no stable pitch classes. no simple equal division captures the structure well. When the logarithmic generators are irrationally related, the resulting spectral interactions do not produce a finite pitch cycle and instead distribute across the pitch continuum.


This problem is closely related to the classical theory of musical temperaments, where equal divisions of the octave are chosen to approximate several harmonic ratios simultaneously. Systems such as 31-EDO arise because they provide particularly good approximations to ratios such as 5/4, 3/2 and 7/4 ( \(2^{10/31}, 2^{18/31}, 2^{25/31}\)). In the present framework, however, the generators are derived from the spectral structure of the timbre itself rather than from a fixed set of harmonic intervals.


31ED2 Mathematical "lucky strike"


5.6 Alternative Sound Design

In earlier sections it was noted that the presence of pitch does not necessarily place all sounds within a single unified dimension of pitch height. While strongly inharmonic spectra may fail to establish clear unisons with other instruments, more familiar musical examples illustrate this separation.

In drum performance, for instance, players often tune the main drum components (snare, toms, and kick)so that each has a recognizable pitch. However, these pitches rarely correspond to the tuning system used by the melodic instruments of the ensemble. Even when drummers spend considerable time adjusting their instruments, the goal is usually internal balance within the drum set rather than harmonic alignment with the rest of the music. As a result, two largely independent pitch domains coexist: the harmonic pitch space of melodic instruments and the relative pitch relationships within the percussion set.

Cymbals provide an even more ambiguous example. They are rarely assigned a definite pitch in musical practice, yet when cymbal samples are mapped across a keyboard, listeners often report the emergence of a pitch sensation. Interestingly, cymbals frequently produce different perceived pitches during the attack and sustain portions of the sound, making them an unusual case of temporally shifting pitch.

Such sounds provide useful material for exploring spectral congruence experimentally. By deliberately imposing self-similar scaling relationships onto an existing sound, it is possible to construct spectra that exhibit controlled pitch cycles. Conceptually, this process resembles the generation of fractal or procedural textures (such as Perlin noise), where a signal is iteratively scaled, blended, and combined with transformed copies of itself.

A simple example can be constructed using a cymbal sample. When a cymbal recording is mapped across a sampler, octave transpositions typically do not produce a convincing sense of octave equivalence due to the strongly inharmonic spectrum. However, if the sample is layered with a version of itself transposed by a factor of two, optionally shaped through filtering or equalization, and this process is repeated recursively, the resulting composite spectrum begins to exhibit self-similarity under octave scaling. The new sound therefore contains built-in spectral congruence, and the perceptual quality of “octaveness” becomes more apparent.

Although such procedures do not always produce musically useful sounds, they illustrate a more exploratory and artistic approach to constructing spectra with controlled pitch cycles.


6. Discussion


6.1 Implications for music theory

The proposed framework has both analytical and ontological consequences for music theory. Musicians who experiment with alternative tuning systems quickly encounter a familiar paradox when parameterizing step sizes. For example, if a scale is defined as an equal division such as 11.5-EDO, it becomes unclear how many pitch classes the system actually contains. Systems that do not align with an octave or other perceptually stable cycle often exhibit what might be called chromatic inflation: the absence of a clear repeating interval makes it difficult to determine where pitch classes recur.

In practice this difficulty is usually resolved by introducing an arbitrary analytical equivalence. A chosen interval is declared to represent a cycle, allowing musical manipulation of pitches through familiar operations such as transposition and scale construction.

Within the present framework this paradox is largely avoided. Because the spectral structure of a timbre determines the interval of spectral congruence, the system directly implies a perceptual pitch cycle and therefore a specific set of chromas. As shown earlier, even with only two spectral generators it is possible to construct multiple timbres that share the same pitch cycle while exhibiting different spectral interactions. The resulting sounds are not restricted to a single timbral character; rather, they can resemble a wide variety of instrumental types, ranging from bell-like to organ-like, string-like, or pad-like textures.

This flexibility suggests practical possibilities for ensemble writing. Different instruments within a group could employ distinct timbral realizations of the same spectral cycle while remaining compatible with a shared tuning system. One performer might use a bass-oriented spectrum, another a lead-oriented timbre, and another a midrange texture, all operating within the same pitch framework. Familiar musical operations such as defining scales as subsets of the pitch cycle, constructing chords as subsets of scales, or assigning pitch names to chromas remain available and function in much the same way as in conventional tonal systems.


6.2 Implications for psychoacoustics and interval affect

These observations also have implications for research in psychoacoustics. Definitions of pitch vary across the literature, and models of pitch perception often emphasize different mechanisms, ranging from spectral pattern matching to temporal coding. Flexibility and paradoxes appear at many stages of this process, including phenomena such as the tritone paradox or context-dependent reinterpretations of pitch within tonal frameworks.

Studies investigating the affective qualities of intervals already suggest that perceptual judgments depend strongly on timbre. Experiments examining the perceived character of intervals within systems such as the Bohlen–Pierce scale have produced inconsistent results when different sound sources are used, for example piano tones, guitar-like timbres, or pure sine waves.

From the perspective proposed here, such variability is not surprising. If pitch cycles emerge from spectral congruence, then the perceptual meaning of intervals depends not only on the tuning system but also on the spectral structure of the sound producing those intervals. The domain of possible timbre–tuning combinations is therefore extremely large. Even a familiar tuning system such as 12-EDO may produce significantly different perceptual results if the underlying spectral cycle is displaced or altered.

Consequently, systematic studies of interval affect may be exploring only a small region of a much larger parameter space. The relative stability of traditional tonal systems may therefore reflect a historically convergent combination of spectral properties, tuning practices, and musical conventions rather than a uniquely determined perceptual optimum.


Multiple Spectral Cycles and Equave Precedence

When the abstract model is examined from a more practical perspective, several consequences of the spectral congruence hypothesis become apparent.

A natural question arises: what happens when a timbre contains more than one scaling symmetry?

If pitch equivalence emerges from spectral self-similarity under scaling, then a spectrum that is invariant under multiple scaling factors might appear to support multiple equaves simultaneously. At first glance, this seems paradoxical.

Consider a simple constructed spectrum composed of two independent harmonic chains:

an octave chain: 1, 2, 4, 8, …

a tritave chain: 1, 3, 9, 27, …

If a tuning system is chosen that accommodates both ratios, the question becomes: which ratio functions as the perceptual cycle?

Several observations help clarify the situation.

First, for any finite spectral range, different generators populate that range with different densities. Within the same bandwidth, the 1:2 generator produces more repetitions than the 1:3 generator simply because its growth rate is smaller. Consequently, the octave chain forms more frequent alignments across the spectrum.

Second, the relative amplitude and decay of each harmonic chain strongly influence perceptual dominance. In practical synthesis, partial families rarely have equal strength. One generator typically dominates the spectral energy distribution, while others appear weaker or decay faster. In such cases, the perceptually relevant equave corresponds to the most structurally reinforced scaling symmetry.

These considerations suggest that equave precedence emerges from spectral weighting, not merely from the mathematical presence of scaling invariances.

However, the parameter space grows rapidly when multiple generators, amplitudes, and spectral limits are considered. A systematic exploration of equave precedence in such spectra remains an open direction for further study.

Clarifying the Meaning of Scaling Symmetry

This discussion also highlights an important conceptual distinction.

Consider a spectrum generated by repeated multiplication by √2:

f, √2f, 2f, 2√2f, 4f, …

Because √2² = 2, this sequence contains frequencies related by a 1:2 ratio. One might therefore say that the sequence “contains octaves.” Strictly speaking, however, this is not correct.

A ratio of 1:2 only functions as an octave when the spectrum itself supports that ratio as a cycle of equivalence. In other words, the octave is not defined purely by a numerical interval, but by a spectral congruence that makes tones separated by that ratio perceptually interchangeable.

This distinction illustrates the central conceptual shift of the present framework:

an octave is not simply the ratio 1:2; it is the perceptual cycle that arises when a spectrum supports 1:2 as an equivalence. 

Practical Implications

In real musical sounds, spectra may contain multiple approximate scaling symmetries. Yet musical timbres are rarely perfectly balanced across them. Differences in amplitude, spectral density, and decay typically make one symmetry perceptually dominant.

As a result, the ambiguity predicted by the abstract model is rarely problematic in practice. Instead, it provides a useful perspective on edge cases and perceptual illusions, including phenomena such as the tritone paradox, where competing spectral cues can destabilize directional pitch perception.

Interactive experimentation makes these relationships particularly clear. By constructing spectra with controlled scaling structures, one can observe how different spectral weightings promote different perceptual cycles.


7.

In retrospect, the emergence of pitch cycles from spectral scaling symmetries appears almost inevitable. However, previous research typically approached pitch from either neural, harmonic, or musical perspectives. The present formulation attempts to bridge these viewpoints by treating pitch equivalence as a consequence of spectral congruence under frequency scaling.


APP:



A.1 Mathematical Representation of Chroma

This section introduces the equations and concepts necessary for a precise analysis of chroma and its relationship to musical intervals.

Cyclic equivalence is mathematically captured by defining chroma as the fractional part of the base-q logarithm of a pitch frequency ratio, expressed in terms of the equave cycle (1:1 represented as a power of q):

\( \text{chroma}(x)=q^{\log_q(x)\mod 1} \)

Alternatively, expressed in terms of a normalized ratio modulo operation:

\( \Xi(x) = x \mod 1:q \)

This signifies that the chroma of a pitch is invariant under equave multiplication or division (scaling by \(q^n\), where \(n \in \mathbb{Z}\)). This approach identifies their equivalent "color" regardless of absolute frequency.

The following mathematical expressions, formally defining an equivalence class and an isomorphism of topological groups, are familiar in principle to musicians. These equations, which define structure preservation, enable the construction of pitch class diagrams, such as the well-known "circle of fifths."

Chroma can be formalized in terms of ratio equivalence relations. For \( x, y \in (0, \infty) \)

\( x \sim y \Leftrightarrow x = q^n \times y \, \) for some \( n \in \mathbb{Z} \)

The following mapping is established:

\( \frac{(0, \infty)}{\sim} \xrightarrow{\log_q(\bullet)} \mathbb{\frac{R}{Z}} \xrightarrow{\exp(2\pi i \bullet)} \mathbb{S^1} \subseteq \mathbb{C} \)

In general, the mapping can be expressed as:

\( [x] \mapsto \log_q(x) + \mathbb{Z} \mapsto e^{2\pi i \log_q(x)} \)

The mathematical nature of chromas reveals that melodies and chords necessitate more than equaves alone; other "colors" or fractional parts of the \(log_q\) scale are essential.

A.2 Measuring Spectral Congruence

To quantify this property, we can define a spectral similarity function comparing the spectrum with a scaled version of itself:

\[C(r)=\int_{f_{min}}^{f_{max}} S(f)S(rf)df\]

This function measures the degree of overlap between the original spectrum and the scaled spectrum. When many spectral components align under the scaling transformation, the value of \( C(r) \) increases.

Peaks in \( C(r) \) therefore indicate candidate scaling ratios that produce strong spectral congruence.

For harmonic spectra, the largest peak occurs at \( r = 2 \), corresponding to octave equivalence.

A.3 Log-Frequency Representation

Because auditory pitch perception is approximately logarithmic, it is often convenient to express frequency on a logarithmic scale. Let

\[x = \log(f)\]

Under this transformation, frequency scaling becomes translation. The congruence function can then be expressed as a shift correlation:

\[C(\Delta) = \int_{x_{min}}^{x_{max}} S(x) S(x+\Delta), dx\]

Here \( \Delta \) represents an interval size in log-frequency space.

In this representation, spectral congruence appears as periodic structure along the log-frequency axis, and peaks of \( C(\Delta) \) correspond directly to candidate pitch cycles. Mathematically, this operation corresponds to an autocorrelation of the spectrum in log-frequency space.

Thursday, February 5, 2026

Sophistication-to-era ratio

The Myth of the "Non-Mathy" Ancient

We have developed a strange, collective amnesia regarding what the Assyrians and Babylonians actually achieved. We treat their sophistication as a fluke rather than a foundation.

Consider the data, these were cultures that didn't just "watch" the sky; they calculated it. They predicted eclipses derived from long-period cycles, tracked lunar anomalies, and accounted for nodal motion with terrifying precision. They solved quadratic problems on wet clay and navigated the complex world of sexagesimal reciprocals and sexagesimal approximations without breaking a sweat.

And yet, when the topic of music theory arises, the modern skeptic says:

"But surely they didn't think in harmonic ratios? That feels too abstract."

Let’s be serious.
If you can compile an eclipse syzygy table, you can grasp the relationship between 2 and 3. To suggest otherwise is like saying a master architect can design a cathedral but can't figure out how a seesaw works.

Music isn't "harder" math; it is simply friendlier math, Math with a better User Interface. It is the audible manifestation of the same numerical ratios they were already using to stop the sun from disappearing. If they had the geometry for the stars, they certainly had the arithmetic for the strings.



Historians often suffer from a fear of anachronism—they are so terrified of projecting "modern" concepts backward that they end up projecting simplicity instead.

They look at a tablet like the Enuma Anu Enlil and see "Superstition" or "Omens," while ignoring the fact that the engine driving those omens was high-precision calculus. If you are calculating the "speed of the shadow" in 665 BCE, you aren't just guessing; you are modeling a four-dimensional event on a piece of mud.

The "Chromatic" Denial
The irony of the "they didn't work out the octave" argument is that the Octave is a physical constant. It’s not an invention; it’s a discovery. If you have a string and you're not a complete klutz, you notice that 1/2 the length sounds like the "same" note. To suggest that a culture capable of sexagesimal reciprocals didn't notice a 1:2 ratio is almost insulting.


Historians want "the first mention." If they don't see a literal diagram of a vibrating string with the number 2 written next to it, they assume it didn't exist.

 Anyone who has ever tuned a harp or a lyre knows that the math finds you. Harmonic ratios are the path of least resistance for the ear.

Tuesday, December 2, 2025

Chapter X Dürer’s The Lute Designer: The Epistemology of Iconographic Accuracy


When an Artwork Teaches You How to Read Art:

Iconographic analysis of musical instruments walks a fine line between data and illusion.

Paintings may present convincing but geometrically impossible fretboards, stylized images that accidentally mimic equal temperament, or intentional, measured depictions reflecting real workshop practices.

Most artworks leave us guessing about intention, training, and technical fidelity.
But Albrecht Dürer’s The Lute Designer is unique: it is not merely a picture of an instrument, 
it is a picture about how instruments are pictured.


© GrandPalaisRmn (Musée du Louvre) / Tony Querrec
 

It is the only major Renaissance artwork that openly displays a projection grid, measurement instruments, a workshop-like setting, the translation of 3D form into 2D geometry.
 
This painting is meta-evidence, it depicts the very apparatus through which accuracy enters representation.

It’s the only known case where the act of scientific representation of a musical instrument is itself the subject of the artwork. 

Thus, Dürer’s work becomes a calibration point for the entire method of inferring historical tunings from visual materials.

 
The Epistemic Problem: Realism vs Accuracy

Historical tuning reconstruction from iconography suffers from a fundamental paradox:

-Some highly realistic paintings fail to produce any coherent tuning system under projection correction.
-Some crudely stylized medieval paintings unexpectedly snap cleanly to 12edo or meantone after geometric reconstruction.

This generates a central methodological challenge: Visual realism does not guarantee geometric or acoustic accuracy, stylization does not guarantee ignorance, and randomness can masquerade as intention.

Dürer shows exactly how precision is manufactured.
 

Dürer’s Demonstration: Representation as a Technical Act

In The Lute Designer, we see:

-a craftsman measuring a lute with a stick,
-an assistant drawing on a grid plane,
-a perspectival device mediating the translation between 3D and 2D,
-the lute represented twice: once physically, once as projection.

Dürer is visually documenting what his treatises openly discuss: the accuracy of representation is not a matter of eye, but of procedure.

Thus, the fretboard drawn here is not filtered through symbolism, idealization, or expressive distortion.
It is the output of a technical system.

This makes The Lute Designer the nearest thing to a “photograph” available in Renaissance visual culture but more importantly, it reveals how photographic accuracy was laboriously constructed.

 
A. Musical Iconography

A.1 The Tuning Reconstruction Problem

Reconstructing the tuning of a historical fretted instrument is non-trivial:

Mathematical treatises are often contradictory or incomplete.
Rational systems (Pythagorean, meantone) cannot explain aligned frets across multiple strings.
Surviving instruments were frequently modified, repaired, or mis-labeled.
Paintings range widely in accuracy and intent.

Yet many artworks even very early ones depict perfectly aligned frets.
 
A.2 The Equal Temperament Implication

Aligned frets across all strings on a multi-course lute require irrational divisions.
No rational tuning system (including Pythagorean or meantone) can produce identical fret positions across strings unless all strings are in unison (they are not), or the system is an equal division of the octave.
Thus, when an artwork displays consistent fret spacing, perspective-correctable parallelism, proportional alignment across strings, It strongly implies that the artist is referencing an actual physical instrument tuned with an empirical equal-step system, or a constructional practice that uses equal divisions intuitively, without theoretical formalization. Dürer’s painting proves artists could and did intentionally encode such geometry.


The Painting That Reveals the Method

Dürer is the only Renaissance artist for whom we have treatises on measurement, projection, and proportion, didactic illustrations of gridded drawing systems, explicit discussions of geometric accuracy, a workshop context of scientific instrument-making.
It provides not only an unusually accurate depiction of a historical instrument,
but a visual explanation of accuracy itself. His painting becomes the theoretical key to interpreting all earlier and later images. It lets us distinguish intention, error, and randomness.
It retroactively validates the plausibility that empirical equal-step fret systems existed long before theoretical equal temperament was formalized and it places iconographic reconstruction on firmer epistemological ground.


E.3. Music, Instruments And Tuning Iconographic Analysis:


The implementation of a particular tuning system on a musical instrument, as well as the analytical reconstruction of the pitch sets it produces, are complex and demanding tasks even for experienced musicians, luthiers, and theorists. Consequently, historians and musicologists can hardly be faulted for drawing uncertain or even incorrect inferences about ancient musical practices from iconographic, literary, or theoretical sources. Such materials frequently rely on ambiguous or inconsistent mathematical formulations and on numerical systems fraught with their own internal debates and interpretive challenges.

What, then, substantiates the claim that forms of equal temperament may have been practiced long before they were formally theorized?
The most direct and abundant evidence derives from Ancient Egypt and Babylon, where numerous surviving artworks depict stringed instruments with visibly aligned frets, a feature that, in practice, presupposes some form of equal step system, potentially an octave division.


figurine -2004 / -1763 (Isin-Larsa [?])
© 1998 GrandPalaisRmn (musée du Louvre) / Hervé Lewandowski

Subtle ambiguities and inconsistencies in tuning practice persisted from the medieval period through the Renaissance and well into modernity. While many visual representations of instruments such as the lute portray perfectly aligned frets, contemporary theoretical treatises and even surviving design schematics consistently reflect a Pythagorean framework, grounded in rational-number ratios. Vincenzo Galilei’s well-known attempt to construct a rational twelve-tone division using a constant ratio of 18/17 is a revealing case: although conceptually elegant, it produced an imperfect octave ((18/17)¹² ≈ 1.9855), demonstrating the intrinsic limitations of a purely rational approach.

Most instruments of the lute family in the Renaissance were conceived according to either the Pythagorean scale or one of the various meantone temperaments, both of which relied on rational intervallic calculations. The critical methodological oversight lies in the assumption that these ratios could be uniformly applied across all strings: a single fret position extended orthogonally across the neck, as if the instrument functioned as a monochord. Once any inter-string tuning pattern is introduced, however, this rational model fails, as each string generates its own distinct scalar framework. The result is a proliferation of pitch positions, the pitch set gets multiplied in number with each string.
Yet, in practice, these instruments performed effectively. The discrepancy was either tacitly accepted or simply disregarded, as the resulting differences are perceptually negligible. On fretted instruments, this produces a structural contradiction fundamentally unlike that of keyboard instruments: whereas keyboards merely exhibit the chromatic inflation inherent in unequal divisions, fretted instruments multiply these discrepancies across their strings.

A single, rationally derived Pythagorean scale applied to a multi-stringed, fretted instrument could never yield aligned frets, regardless of the tuning relationships between strings. The only systems capable of resolving this geometric inconsistency are those based on irrational divisions, such as equal temperament.

This tension invites a reinterpretation of the Renaissance theorists’ position:

“The lute has existed for millennia; it possesses multiple strings and aligned frets and functions flawlessly in practice. Yet my theoretical framework cannot account for it without contradiction.”

Thus, when ancient or early artworks (sculptures, reliefs, or paintings) depict stringed instruments with proportionally consistent and geometrically aligned fret patterns, these representations may reasonably be read as evidence of empirical equal-division systems. Whether these systems were arrived at through intuitive craftsmanship or through procedural mathematics remains uncertain. Indeed, an approach would later be formalized by Pythagoras, who recognized the small but persistent discrepancy, the “comma”, that arises when one attempts to reconcile such divisions using only rational numbers.

Such observations underscore the potential of iconographic analysis not merely as a descriptive tool but as a methodological bridge between visual representation, material design, and theoretical acoustics. By assessing the geometric accuracy of depicted instruments, their fret alignments, proportional spacing, and constructional logic, one may begin to distinguish between idealized imagery and depictions that encode authentic technical knowledge.

DRAFT//

A Hierarchy of Epistemic Trust

Zone A Scientific Representation
(e.g., Dürer, workshop schematics, treatises)
→ High-confidence tuning inference

Zone B  Geometric Realism
(optical accuracy but not explicitly technical)
→ Medium-confidence inference

Zone C  Ordinary Realism
(good but inconsistent perspective)
→ Medium-to-low confidence

Zone D  Stylized Iconography
(medieval, Byzantine, Islamic manuscripts)
→ Low confidence, but occasional random 12edo matches

Zone E  Symbolic Depictions
(allegories, angels, genre scenes)
→ No reliable inference


----

Museum Google Street View, rare fretboards and artifacts in the world!:

These images are special because user-uploaded photos on Google Maps exist in a completely different 'vault' than standard search results. Not every museum has a complete, high-resolution digital catalog; some of these artifacts would remain totally invisible to the world if it weren't for a random visitor taking a photo, geo-tagging it, and uploading it for the rest of us to find. 


museo musica barcelona








military rusia music



rome /national museum musical instruments



italy:


venezia

italy multiethnic



italy museo violino


españa museo etnico


museo guitarra almeria




interactive museum spain





italy degli strumenti



bolivia la paz
https://maps.app.goo.gl/JwSRyncXA1e2gz9WA penta charango Andean fret skipping!


belgium



palazzo della pilotta, parma, girolamo cittern



misc:

https://maps.app.goo.gl/pc1KkRuCuw56dWLN6
https://maps.app.goo.gl/drhhZJ5bBGX3apom7
https://maps.app.goo.gl/7Hbd5GYrDkYdqV4Z7
https://maps.app.goo.gl/reSPUacyqtn3X6M4A
https://maps.app.goo.gl/ZEZbMV7KYEzJrHeL7
https://maps.app.goo.gl/zzKBS2vR15dcgh9A8
https://maps.app.goo.gl/eMJx9k6CwGxrPEzB9
https://maps.app.goo.gl/BxuWXfv8PRMhmou98
https://maps.app.goo.gl/Y2vSZ5jaSh6BLEnd6

----

references text/images:
https://www.researchgate.net/publication/348809751_Numerus_surdus_y_armonia_musical_Sobre_el_temperamento_igual_y_el_fin_del_reinado_pitagorico_de_los_numeros
vicenzo galilei and 18:17 17/18 (string ratio) 
de musica libri septem - francisco de salinas 1577 - meantone /mesotonico
Sopplimenti musicali 1588- Gioseffo Zarlino- laud/lute 12 edo, mesolabio / euclidean theorem

Sunday, March 30, 2025

Dual Group Structures in Diophantine Approximations


This page describes an algorithm that extracts continued-fraction convergents of an irrational parameter by observing return events in a dynamical system. Rather than computing the function value (e.g. log or sin) and/or explicitly expanding its continued fraction, the method tracks a normalized orbit and an integer cocycle, detecting convergents via gap-structure collapse as described by the Three Gap Theorem. The approach applies uniformly to multiplicative dynamics (logarithms) and rotational dynamics (trigonometric functions), revealing a shared group-theoretic structure underlying these computations.

The algorithm is a single dynamical template for different coordinate systems: additive (circle rotations), multiplicative (logs / exp / MLA),  angular (trig), and, implicitly, lattice / torus dynamics.
The core object is a rotation on a 1-torus, everything reduces to \(x \mapsto x+\alpha \pmod 1 \) for irrational \(\alpha\) : (Logs: \(\alpha = \log_b a\), angles: \(\alpha = \theta / 2\pi\), continued fractions: best rational returns of this rotation). The MLA is rotation dynamics written multiplicatively.
At a convergent \(p/q\): \(q\alpha \approx p\), the orbit nearly closes, the circle is partitioned into exactly two gap lengths. That’s the Three Gap Theorem at a convergent, this is why the ordering stabilizes, the gap structure simplifies, and discrete group actions suddenly appear. (The “dual cyclic groups”, at a convergent \(p/q\))

Object              |   Group    |   Generator
sorted indices   |   \(\mathbb{Z}/q\mathbb{Z}\)    |   \(p^{-1} \bmod q\)
overflow terms |   \(\mathbb{Z}/p\mathbb{Z}\)   |   \(q^{-1} \bmod p\)

That symmetry is forced by \(p q' - q p' = \pm 1\) from CFs. So each convergent induces a pair of mutually dual cyclic actions, one “horizontal” (ordering) and one “vertical” (overflow). A lattice/tprus.
The “overflow” sequence is key, from the inequalities: \(\left|\alpha - \frac pq\right| < \frac{1}{q^2}\), MLA tracks modular advancement (floor terms, wrap counts) so that turns Diophantine approximation into explicit dynamics, group actions generated by irrational flow.
Trig case is the same object, replace multiplication with addition, \(\mathbb{R}^+/\langle b\rangle\) with \(S^1\) since \(a^q \approx b^p \quad \leftrightarrow \quad q\theta \approx 2\pi\).
All these computations (Pythagorean Scale/Sanfen-Sunyi, Grover Serach), are the same algorithm acting on different groups, with convergence limited by Diophantine structure.


-Python implementation for the logarithm case, simple version dosnt handle arguments in the 0–1 range, provides the list of consecutive convergents found, excluding the first one if integer.


def mla(a, b, max_q):
    if b <= 1 or a < 1:
        return []
    if a == 1:
        return ["0/1"]  # log_b(1) = 0
    if a == b:
        return ["1/1"]  # log_b(b) = 1

 for p in results]
    results = []
    link, lower, upper = a, 1, b
    p, q = 1, 1
    while q < max_q:
        while link < 1:
            link *= b; p += 1
        while link > b:
            link /= b; p += 1
        if q == 1:
            H = b / link; p += p - 1
        q += 1
        if link == b:
            results.append(f"{p-1}/1"); break
        if lower < link < upper:
            commas = [link/lower, upper/link]
            if (max(commas) - 1) / (min(commas) - 1) <= 2:
                results.append(f"{p}/{q}")
            lower, upper = (link, upper) if link < H else (lower, link)
        link *= a
    return results


(Dual cyclic structure at convergents) /rotation-theory + groups

Let \(\alpha \in \mathbb{R}\setminus \mathbb{Q}\) with continued fraction convergent \(p/q\). Consider the rotation sequence \(r_x = \{x\alpha\}\in [0,1),\quad x=1,\dots,q\),

and let \(\sigma\) be the permutation that sorts \(r_x\) in increasing order: \(r_{\sigma(1)} < r_{\sigma(2)} < \cdots < r_{\sigma(q)}\).

Then:

(Index cycle) \(\sigma\) is an arithmetic progression modulo \(q\): \(\sigma(k) \equiv k\cdot p^{-1} \pmod q\),

where \(p^{-1}\) is the multiplicative inverse of \(p\) modulo \(q\).


(Overflow cycle / floor terms) Writing \(x\alpha = y_x + r_x\) with \(y_x=\lfloor x\alpha\rfloor\), the sequence \(y_{\sigma(k)}\) (as \(k=1,\dots,q\)) takes exactly two adjacent values that differ by 1 and forms \(q\) samples from a cycle in \(\mathbb{Z}/p\mathbb{Z}\) whose step is \(q^{-1}\pmod p\).


(Gap control) The consecutive differences \(r_{\sigma(k+1)}-r_{\sigma(k)}\) take two values (the “short” and “long” gaps) determined by \(p/q\); this is the Three Gap Theorem specialized at a convergent, where only two gaps appear across the first \(q\) points.

 
Proof sketch

Because \(p/q\) is a convergent, \(\|q\alpha-p\|\) is minimal in its range. The return map of the rotation by \(\alpha\) to the set of \(q\) points partitions the circle into two gap lengths. (TGT gives gap sizes.)


The order of the points is governed by the congruence \(x\alpha \approx x\frac{p}{q}\) modulo \(1\), so sorting by \(x\alpha\) matches sorting by \(xp/q\) modulo \(1\). The residues \(xp \bmod q\) run through \(\mathbb{Z}/q\mathbb{Z}\) in steps of \(p\), hence the sorting permutation is
\(\sigma(k)\equiv k\cdot p^{-1}\ (\bmod q)\). (This gives gap order.)


The floor/overflow terms satisfy \(y_{\sigma(k+1)}-y_{\sigma(k)} \in \{\lfloor p/q\rfloor, \lceil p/q\rceil\}\),

and, tracked modulo \(p\), they advance by \(q^{-1}\) because
\(q\alpha\approx p\) forces \(p\) steps in \(\alpha\)-space to coincide with \(q\) wraps. This yields the dual \(\mathbb{Z}/p\mathbb{Z}\) cycle.
 
(Logarithmic case via an isomorphism)

Let \(a,b>1\) and set \(\beta=\log_b{a}\). Define the multiplicative sequence \(R_x \;=\; a^x\, b^{-y_x} \in [1,b),\qquad y_x=\big\lfloor x\beta\big\rfloor\).

Then \(R_x = b^{\{x\beta\}}\). Hence ordering the \(R_x\) is the same as ordering \(\{x\beta\}\), and all claims of the Theorem transfer with \(\alpha=\beta\):

Sorting indices are \(\sigma(k)\equiv k\cdot p^{-1}\ (\bmod q)\) for any convergent \(p/q\) of \(\beta\).

The overflow exponents \(y_{\sigma(k)}\) form \(q\) samples from a \(\mathbb{Z}/p\mathbb{Z}\) cycle with step \(q^{-1}\ (\bmod p)\).

The MLA’s “stack-and-fold” is just rotation on the circle in log-coordinates, so its consecutive outputs are convergents whenever you use windows aligned with denominators \(q\).


Every Diophantine approximation problem generates a dual pair of cyclic group structures, one indexed by the convergent’s denominator, one by its numerator. a lattice of relationships between \((p,q)\) and their inverses modulo each other. It's not just about inequalities, but about explicit dynamical group actions tied to each irrational. For irrational \(\alpha\), from the overflow sequence of its natural dynamical action produces exactly the convergents of its continued fraction.





From the MLA(Mesopotamian Logarithm Algorithm) for logarithmic convergents, a similar property appears in other irrationals when analyzed in their corresponding space.


Logarithm Case Recap:

Irrational: \(\alpha = \log_b(a)\)

Convergent: \(p/q \approx \log_b(a) \Rightarrow q \times log_b(a) \approx p \Rightarrow a^q \approx b^p\)

Sequence: \(r_x = a^x \times b^{y_x}\) reduced to \([1, b)\). This is like looking at \(a^x\) "modulo \(b\)" multiplicatively. \(y_x\) tracks the 'overflow' exponent of \(b\). (This highlights the absence of a standard shorthand notation for multiplicative modulus; see link)

Sorted Sequence: Sorting \(r_x\) for \(x=1\ldots q\) gives indices \(x_k\).

Structure: \(x_k\) forms \(\mathbb{Z}/q\mathbb{Z}\) (gen \(p^{-1} \mod q\)), \(y_{x_k}\) forms \(q\) terms of \(\mathbb{Z}/p\mathbb{Z}\) (gen \(q^{-1} \mod p\)).






Trigonometric Case (Angle)

Irrational: We need an irrational quantity related to the angle. Let's use \(\alpha = \theta / (2\pi)\). (assuming \(\theta\) is not a rational multiple of \(2\pi\)).

Convergent: \(p/q \approx \theta / (2\pi) \Rightarrow q \times \theta / (2\pi) \approx p \Rightarrow q\theta ≈ 2\pi p\). This means \(q\) rotations by \(\theta\) is close to \(p\) full \(2\pi\) rotations.

Sequence: What's the equivalent of \(a^x \mod 1:b\)? The natural analogue for angles is \(x\theta \mod 2\pi\). Let \(r_x = (x\theta) \pmod{2\pi}\). This sequence lives in \([0, 2\pi)\).

What is \(y_x\) ? It's the number of full rotations removed: \(xθ = y_x \times 2\pi + r_x\). So, \(y_x = \lfloor x\theta / (2\pi)\rfloor\).

Sorted Sequence: Sort \(r_x\) for \(x=1\ldots q\) to get indices \(x_k\).

Structure: \(x_k\) forms \(\mathbb{Z}/q\mathbb{Z}\) (gen \(p^{-1} \mod q\)), \(y_{x_k}\) forms \(q\) terms of \(\mathbb{Z}/p\mathbb{Z}\) (gen \(q^{-1} \mod p\)).




This directly mimics the log case by replacing the multiplicative group \((\mathbb{R}^+, \cdot)\) modulo \(b\) with the additive group \(\mathbb{R} \mod 2\pi\) (the circle group \(S^1\)). The relationship \(q\theta \approx 2\pi p\) is the direct analogue of \(a^q \approx b^p\). The Three Gap Theorem describes the structure of the sorted \(r_x\) values (the points \(x\theta \mod 2\pi\) on the circle), and their ordering is intimately linked to the continued fraction convergents \(p/q\). The generators likely arise from the relationship \(q(p'/q') - p(q'/q') = \pm \)1 between consecutive convergents.


(Need to test which inverse/element works. The structure \(p_{n-1} q_n - p_n q_{n-1} = (-1)^n\) from continued fractions is key here, likely determining the specific generators.)


TGT Details:
The three gap theorem says that for any irrational number \(\alpha\) and any positive integer \(n\), the set of fractional parts \(\{k\alpha\}\) (that is, \(k\alpha \bmod 1\)) for \(k = 1, 2, ..., n\), when arranged on the unit interval \([0,1)\), divides it into at most three distinct gap sizes. It also connects \(\alpha\) and the k values where gap changes relates to its continued fraction expansion. This algorithm uses a logarithmic isomorphism instead of \(k\alpha \bmod 1\); for instance, with \(\log_b(a)\), it looks at the modular remainders of exponential sequences \(a^x \cdot b^{y_x} \in (1, b]\), effectively rotating the \(a^x\) sequence within a modular space of \(b^{y_x}\). It detects when gaps change, marking that a convergent of the continued fraction expansion has been found. As the location of the change narrows, it focuses on comparing only the most recent gaps in that area.

This live demo calculates a few example convergents and renders one of the earlier ones. By default, it’s set to log2(3), showing a list of 10 convergents up to 1054/665, with the graphic displaying the fractional part of 19/12. (7/12)




Tonal Cognition and Quantum Search Analogies

The analogy to quantum amplitude amplification should be understood as a conceptual description of global hypothesis redistribution, closely...