Part I: The Phenomenon
1: An Introduction to Tonal Constancy
For such reconstruction to be possible, the brain requires a stable framework. This reveals a deeper process: the perceptual categorization of cycles. Just as we naturally group spatial patterns into shapes or rhythms into meters, we instinctively categorize frequencies into closed, circular structures—with the octave being the most dominant. This circular mapping is not merely cultural convenience. It is perceptually anchored, strongly supported by the harmonic series (and the missing fundamental phenomenon), which offers the octave as an early perceptual prior.
-Resolution — the minimum perceivable difference between two pitches (JND).
-Cycle — the perceptual tactic of wrapping pitch into repeating units to create a structure.
-Why random or uniformly spaced tones often produce surprisingly familiar tonal structures.
-Why even trained musicians, when exposed to unfamiliar scales (e.g. 13-EDO), often fail to notice that they’ve left the 12-tone universe—so long as timbre and motion remain within certain thresholds.
2: From 'Xenharmonic' to 'Induodecimable': The Need for a Precise Lexicon.
Over time, however, the term’s scope has broadened. Today, xenharmonic may be applied to any music using non-standard tunings, alternate instruments, or unfamiliar timbres. As its use has expanded, its precision has diminished. A piece might now be labeled xenharmonic even if it maps closely onto 12-EDO, or if it retains gestures that remain tonally functional within familiar paradigms. In this diluted form, the term no longer guarantees that the music is truly untranslatable into the 12-tone system.
To address this ambiguity, we use a more semantically precise term: induodecimable—from Latin roots meaning not reducible to twelve. It describes musical structures, scales, or timbres that cannot be effectively translated into 12-EDO without a perceptual or functional loss. Unlike xenharmonic, this term emphasizes irreducibility, not just unfamiliarity. Moreover, its morphology is cross-linguistically stable (e.g., induodecimable reads identically in English and Spanish), and it admits extensions for greater specificity, such as indiatonizable, referring to pitch content incompatible with diatonic function.
It is important to note that this property is not binary. Whether a given musical structure is duodecimable (that is, whether it is approximable by 12-EDO) is ultimately a perceptual judgment. Some cases are obvious: inharmonic spectra perceived as noise, or very specific microtonal systems with step sizes that fall far outside typical pitch categories. But others lie in a gray zone where perceptual context, cultural exposure, and learned listening habits strongly shape what we "hear."
This gray zone is precisely where Tonal Constancy becomes critical. Even when a melodic or harmonic structure defies analytical reduction to diatonic scales, listeners often project familiar tonal frameworks onto it, constructing implied functions, modes, or centers through context and inference. The ability to “make tonal sense” of unfamiliar material is not evidence of universality in the structure itself, but of the perceptual elasticity of the listener.
As this study will show, the diatonic scale functions as a tonal attractor—a kind of perceptual sink into which ambiguous or approximate materials are pulled. The 12-tone system serves as its most stable host, offering a resolution of the scale’s unequal steps into evenly spaced units that align (imperfectly, but reliably) with physical redundancies in the harmonic series.
The question then arises: why twelve? It's not the timbre, the Pythagorean algorithm or the more compelling harmonic semi-group, which, while seemingly more ontologically robust, doesn't necessarily relate to or reflect our perception. Why does this specific internal subdivision act as the dominant attractor, rather than systems based on ten, fifteen, or nineteen tones? Why does duodecimability seem to represent a perceptual threshold?
The answer is not solely historical. Nor is it purely acoustic. Instead, we find ourselves in the deeper terrain of categorical emergence: how perceptual systems construct stable reference frames from continuous data. Just as we learn to divide the color spectrum into culturally specific “basic colors,” so too do we divide the pitch continuum into categories that are both learned and constrained, by cognition, biology, and acoustics.
This chapter lays the foundation for a more precise taxonomy of perceptual translatability in music. The aim is not only to explore how Tonal Constancy works, but to examine the deeper question: where do musical categories come from at all? At what point do categories cease to form, or become unstable? And when do they dissolve into pure context-dependence; when Tonal Constancy, in effect, runs out?
3: Tonal Reconstruction in 7-EDO and the Elasticity of Pitch Meaning
This raises a crucial question:
How does a scale composed entirely of uniform steps give rise to perceived modes, cadences, and tonal direction?
The answer lies in trajectory, rhythm, and expectation. When notes are arranged in a musically structured sequence—when certain tones fall on emphasized beats, when contours mimic known harmonic arcs—the perceptual system imposes meaning where none exists acoustically. That is tonal constancy at work: the brain reconstructs pitch relationships that aren't literally present in the signal.
The Case of 7-EDO: Structure Without Hierarchy
Let us consider a concrete example. The full 7-EDO scale, in cents, is:
[0, 171, 342, 513, 684, 855, 1026]
[missing]
Audio Evidence and Cognitive Confirmation
Audio examples accompanying this section allow the reader to identify this phenomenon. In the absence of harmonic overtones (e.g., using pure sine waves), the effect still holds. The listener perceives functional pitch relationships despite the absence of spectral cues.
While the evidence here is intersubjective rather than quantitative, it aligns with existing empirical findings in psychoacoustics and pitch perception. Later sections will explore more controlled studies that provide data-driven support for this auditory elasticity.
Trajectory, Momentum, and Torsor Space
These examples demonstrate another key mechanism: pitch momentum. As a melody unfolds, successive steps form a trajectory, a kind of auditory vector through pitch space. Even when the underlying tuning lacks functional landmarks, the shape of the melodic path builds expectation and invites closure. This perceptual momentum allows the brain to infer tonality where none exists acoustically.
We might say that musical space behaves more like a torsor than a vector space: there is no fixed origin, no absolute zero point, only relationships. A pitch acquires meaning not from its frequency but from its relational chroma: its position within a trajectory, its placement in rhythm, and its alignment with culturally learned tonal archetypes.
When Does Meaning Collapse?
This leads to deeper questions that this study will return to again and again:
How much can an interval be stretched or displaced before it ceases to evoke its expected function?
At what point does a “minor third” cease to be heard as such?
How much cognitive reinterpretation is too much before tonal constancy fails?
Such boundaries are neither fixed nor purely acoustic. They are context-dependent, modulated by familiarity, cultural exposure, and moment-to-moment listening conditions.
We might ask:
Have we learned categories so deeply that we now impose them reflexively? Have we trained ourselves into hearing functions that are not there? And if so, can we ever unlearn them? (The microtonalist asks: where are my new notes? are there?)
These questions will be addressed further in upcoming chapters, where we explore tonal attractors, cultural priors, and the emergence of categorization itself.
In summary, tonal constancy in 7-EDO reveals just how much pitch meaning is a construct—a dynamic synthesis of trajectory, rhythm, relational placement, and cognitive inference. The auditory system is not a passive receptor of interval sizes. It is an active interpreter, constantly reconstructing tonal function from even the most alien input.
Where conventional theory sees symmetry and ambiguity, perception hears hierarchy and direction. And in this, we find the essence of tonal constancy, not as a property of sound, but as a property of mind.
The following audio examples present melodies in both 7-EDO and their 12-tone equivalents. Despite the equal step sizes in 7-EDO, the melodies evoke a sense of familiar tonal functions. For example, the second step in 7-EDO, a neutral third at 342 cents (approximately halfway between a major and minor third), is often perceived as having either a "major" or "minor" quality depending on the surrounding musical context, such as the implied harmony or the melodic contour. This effect, which persists even with pure sine waves (thereby eliminating harmonic artifacts), demonstrates tonal constancy: the listener's brain interprets the neutral intervals within a tonal framework, resolving them into functionally familiar pitches. When the same progression is rendered in 12-tone equal temperament, the listener/performer naturally resolves each step into the "correct" functional pitch to satisfy the implied cadence.
Part II: The Mechanism
4: The Four Layers of Duodecimability
This question leads us directly to the core assumption embedded in most musical thinking: that pitch exists within some kind of lattice, and that we can—or must—choose a finite subset of it. Whether this subset is the Western 12-tone equal temperament (12-EDO), or any other system, the act of pre-selection implies a structure. And the moment we impose a structure, we are already translating sound into something else—a symbolic language, a tool of control, a perceptual grid.
But what exactly does that structure point toward?
This chapter presents the concept of duodecimability, a way to classify how various tuning systems interact with, adapt to, or diverge from the gravitational pull of the 12-tone system. It doesn’t argue for an ontological “truth” in tuning but offers a practical framework, a method to discuss tonal constancy using familiar musical ideas, while recognizing that these ideas are culturally specific and not universal.
The Four Layers of Duodecimability
A tuning system’s “duodecimability” refers to the degree to which its pitches, functions, or perceptual structures can be interpreted (or misinterpreted) through the lens of the 12-tone system.
Each layer below marks a progressively deeper departure from 12-EDO as both perceptual default and theoretical grammar:
- Layer 1: Perceptual Forcing
("Duodecimability by Proximity and Momentum")
Definition:
A system that is mathematically unrelated to 12-EDO, but is perceptually coerced into a 12-tone framework by the listener’s tonal expectations.
Mechanism:
Tonal constancy combined with melodic trajectory. The listener’s brain fills in “missing” functions based on contour, rhythm, and cultural conditioning.
Example:
7-EDO. Despite its equal-step structure (~171 cents per step), melodies played in 7-EDO can imply tonic-dominant relationships, cadential closure, and even modal coloration. The actual intervals don’t match—but the function does. The translation happens inside the brain, not the score. This is tonal constancy at its most extreme: imposed, not earned. - Layer 2: Relational Reference
("Duodecimability by Analogy and/or Refinement")
Definition:
A system that contains more than 12 pitches, but still describes itself in terms derived from the 12-tone world, thus retaining it as a conceptual anchor, even if not a physical one.
Mechanism:
Category refinement. These systems acknowledge and orbit around 12-EDO concepts like major/minor, fifths, thirds, etc., often subdividing them or redefining their boundaries.
Examples:
Arabic Maqamat: The maqam system, while utilizing microtonal intervals, maintains its identity through scalar steps and modal gravity. These intervals are often compared to their 12-EDO counterparts, such as neutral seconds and sub-minor thirds.
19-EDO, 22-EDO: These temperaments expand the lattice without abandoning diatonic principles. Concepts like “tonics,” “leading tones,” and “dominants” persist, even with altered sizes. These systems enhance rather than alienate. (It's worth noting that these highly divided systems enrich the 12-EDO framework and also accommodate other unfamiliar structures. However, they are designed, both theoretically and practically, as better approximations of just intervals and known categories. They can also feature indiatonizable structures but are not typically used in that way, see next layer on macrotonality.)
The language used in pedagogy and notation often betrays this link: if you’re still calling something a “flattened third,” you’re still playing the 12-tone game, just with extra tiles. - Layer 3: Structural Alienation
("True Induodecimability with Harmonic Timbre")
Definition
A tuning system whose internal logic (how it generates scales, harmonies, and motion) has no connection to diatonic or 12-tone principles.
Mechanism:
Constructed from different mathematical or harmonic seeds. Even when using harmonic instruments (e.g., with overtone series), the structural rules cannot be mapped onto 12-EDO categories.
Examples:
The Bohlen-Pierce Scale: Built on the tritave (3:1) instead of the octave (2:1), and divided into 13 steps. Its chords are derived from odd harmonics (3, 5, 7...), and its “fifths” and “thirds” have no analogues in 12-EDO. It creates functional progressions, just not those functions.
8/10-EDO: This system, while octave-repeating, generates arpeggios with highly ambiguous internal logic. A sequence might include chords that, individually, can fit into 12-EDO, but collectively shift too much, requiring multiple conflicting translations. The system resists stable mapping.
A clear example of structural divergence but local translatability is seen in stacking fourths in 10 vs 12edo. In 10edo, the "subfourth" is at 480 cents, while in 12edo it is at 500. For some listeners, they are perceptually indistinguishable, and for trained musicians, they are expressive variations of the same categorical function. Both versions of the "perfect fourth" remain consonant in traditional instruments with harmonic timbres like the guitar. However, as fourths continue to stack, the resulting "chord" becomes noticeably different after a few steps. In 10edo, the double octave is reached at 2400 cents, creating a periodic, stable "pure" categorical fourths overlay, whereas 12edo reaches 2500 cents, destabilizing overall consonance while remaining locally consonant. In modern music theory terms, these arpeggios are "equivoques" relative to one another.
Certain Just Intonation Lattices: Especially those incorporating the 11th or 13th harmonics. These systems often sound “smooth” or “consonant” when played with harmonic timbres, but their intervallic logic has no fixed counterpart in 12-EDO. Trying to approximate them with standard pitches is like translating poetry using only rhymes, it misses the meaning entirely.
This is the tipping point: these systems can sound traditionally good, but their logic is alien. You can enjoy them, but you can't name them with your old words.
- Layer 4: Timbral Dissolution
("Induodecimability by Substrate")
Definition:
A musical system whose inharmonic timbres prevent any pitch structure from aligning with 12-EDO attractors, rendering not only tuning, but pitch itself, unstable or secondary.
Mechanism:
The overtone series is no longer harmonic. With no integer multiples, the usual anchors, octave, fifth, third, do not emerge naturally in perception. The substrate itself erodes tonal identity.
Example:
Gamelan music. The metallophones used in Balinese and Javanese ensembles produce inharmonic spectra. Their tuning systems (e.g., slendro, pelog) are not “approximations” of 12-EDO; they are entirely separate epistemologies. A tone's pitch is defined by its timbral fingerprint, not by harmonic ratios. Even the concept of an interval can dissolve into a cloud of color and resonance.
Attempting to analyze this using Western theory is like applying Latin grammar to birdcalls: the medium does not support the metaphor. These musics are not microtonal, they are extratonal.
Closing Reflection
These four layers don’t represent value judgments. They describe degrees of translatability, not superiority or purity. Each layer tells us more about how listeners (trained and untrained) perceive, categorize, and force-fit sound into symbolic boxes.
They also suggest that duodecimability is not a binary. It is a gradient, and perhaps a contested one: where you place a system may depend not only on its design, but on your listening history, your training, and your linguistic tools.
We have learned to hear with twelve ears. But what happens when we encounter a sound that won’t sit still on any of those notes?
In the next chapter, we turn from classification to emergence: how tonal categories form in the first place, and what kinds of mental scaffolding make tonal constancy (and its resistance) possible.
5: The Spectrum of Familiarity — Microtonal Flavor vs. Functional Break
Music doesn’t become "otherworldly" just because it uses strange intervals. Not all microtonality is revolutionary. In many cases, it is ornamental, expressive, a kind of seasoning—a flavor layered atop an underlying structure that is still resolutely tonal.
The difference between microtonal flavor and functional departure is a spectrum, not a binary. But it’s crucial—because it defines whether a piece of music is interpretable, translatable, or cognitively disorienting. And that distinction hinges on tonal constancy: whether the listener can still rely on familiar perceptual anchors—tonic, cadence, resolution—even as the tuning system mutates around them.
Historic Flavors: Chopin and Meantone Coloration
A famous example: Chopin referred to D minor as the “saddest” key.
At first glance, this seems metaphysical or poetic. But in fact, there was a physical reason. During his time, many pianos were tuned in meantone temperament, a system optimized for certain intervals using simple integer ratios. While 12-tone equal temperament (12-EDO) was theoretically known—and even in use—it was still rare for instruments to be tuned to it precisely. Ear-based tuning methods favored rational approximations. Algorithmically, meantone was simply more practical before electronic tuners.
The result: each key had a unique color, a subtle deviation in interval sizes that made D minor sound distinctly different from, say, B minor. These were microtonal inflections, not fundamental departures. The harmonic framework remained diatonic. What changed was the flavor profile of each key.
Modern Examples of Flavor: Bends, Blues, and Maqamat
Today, the idea persists in many styles:
Blues music bends between notes of the pentatonic and chromatic scales, sliding into pitches that don't "exist" in 12-EDO notation. These expressive bends act as stylistic inflections, not harmonic challenges. The tonic remains the tonic.
Arabic Maqamat and Persian Dastgah systems incorporate quarter-tones and nuanced scalar steps, often creating pitches "between the keys." Yet these systems still rely on cadential logic and tonal gravitation. The microtones serve as ornaments, bridges, flavors. They rarely seek to dissolve the entire structure—they aim to enrich it.
In both cases, duodecimability remains possible, even if imperfect. A skilled listener can still find the center of gravity. These are flavored tonalities, not alternative logics.
Functional Break: When Tonal Constancy Fails
What happens, though, when the system no longer submits to interpretation?
Below is an example from 8-EDO—a tuning system that divides the octave into eight equal steps (150 cents each). It contains two maximally symmetric diminished scales, and enough pitch density to form chords and melodies. However, the logic of this system is non-diatonic by design.
Try to map its harmonic progressions to 12-EDO, and tonal constancy breaks. No amount of perceptual coercion or melodic expectation can fully translate its motion. The listener doesn’t "mishear" it as tonal—they simply hear it as strange.
Why?
Because 8-EDO sits out of phase with 12-EDO. There are no simple ratios shared between their step sizes. Their intervals don’t approximate one another; they contradict each other.(Except for the diminished scale, or 4EDO) This is the threshold at which duodecimability fails entirely. Translation is not fuzzy—it is impossible.
Octave Retention vs. Structural Alienation
Interestingly, 8-EDO still uses the octave as a repeating unit. This gives it a slight advantage in group performance and instrument design: parts can be transposed, ranges can be shared.
Compare that to the very similar scale, Bohlen-Pierce (13-ED3)—a system that replaces the octave (2:1) with the tritave (3:1). While rich in harmonic possibilities (especially with odd harmonics), it loses the universal reference point that the octave provides. The result: true structural alienation, especially in chordal writing. Melodies still function, but harmonies drift into perceptual limbo. An approximate 1.96 ratio, close to the octave, exists—but it is harmonically incoherent with traditional instruments.
This is why 8-EDO, though less famous, can feel more playable. Its symmetrical design makes it excellent for exploring alien harmonic functions while maintaining just enough structure for ensemble use.
The Takeaway: The Diatonic Ghost is Hard to Kill
Even in highly divided systems like 19, 22, or 31-EDO—often used for their greater consonance or intonation precision—diatonic templates resurface. Musicians use them to better approximate known categories, not to invent new ones. In fact, the higher the division, the more tempting it becomes to overfit microtonal pitch space to traditional harmonic roles.
By contrast, systems like 8-EDO or 10-EDO—low-subdivision tunings that avoid rational alignment with 12-EDO—offer fewer handholds. Their symmetry, spacing, and internal logic prevent easy mapping. They don't flavor tonal music—they replace it.
These systems are functionally distinct, and their progressions defy tonal constancy. This is the boundary line: where the mind stops hearing “altered chords” and starts hearing new grammar.
Closing Note
The difference between flavor and functional break is not merely theoretical. It defines whether music can still operate within a shared perceptual vocabulary, or whether it demands the invention of a new one.
In the chapters ahead, we’ll explore this boundary more formally: how tonal categories form, and what kinds of cognitive attractors allow—or prevent—the perception of coherence when pitch structures drift too far.
Or put more provocatively: when does a microtone become a mutiny?
6: Anchor Density — Diatonic Memory and the Illusion of Familiarity
Tonal constancy does not act on systems uniformly. Some tunings invite diatonic reinterpretation easily. Others resist it entirely. The key difference is not simply the number of steps per octave or the presence of harmonic intervals, but what we call Anchor Density—the frequency and distribution of elements that are close enough to recognizable tonal functions that the brain tries to interpret them as such.
This is not a binary switch. Induodecimability, as introduced earlier, isn’t “on” or “off.” It’s a gradient of perceptual traction—the ease with which the listener's cognitive machinery can hallucinate tonal relevance from non-diatonic material.
Anchors: The Seeds of Tonal Illusion
We define an anchor as a moment—a pitch, a dyad, a short progression—that approximates a recognizable function within the 12-EDO system. These are not structural absolutes; they are perceptual affordances. A chord that sounds like a major triad—even if it's technically off—is an anchor. A cadence that feels like resolution is an anchor, regardless of its tuning origin.
The brain uses these moments to project a familiar tonal grid over unfamiliar territory. It’s a form of perceptual compression, and it’s why even radically mistuned systems can feel “not quite right” instead of “completely alien.”
Case Study: The Diatonic Categorization Experiment
A striking example of this phenomenon appears in Jason Yust’s paper, "Diatonic Categorization in the Perception of Melodies." In the study, ~30 participants—primarily musicians or audio professionals—were asked to categorize melodies played in an unusual subscale of 13-EDO: a seven-note scale with step intervals [0, 2, 4, 8, 10, 11,12](notes selected pattern notation).
Despite the scale’s deep structural departure from 12-EDO, participants consistently used diatonic terms to describe what they heard—“major third,” “perfect fourth,” “leading tone,” etc. Not a single subject identified the system as non-12-EDO. Even with highly trained ears, the internal tonal schema overrode the signal.
This was not 13-EDO as a novel tuning. This was 12-EDO imposed on a foreign substrate.
The Anchor Density Spectrum
Let’s break down two critical points on the anchor density gradient.
1. High-Density Anchors: Partial Duodecimability
Systems: 11-EDO, 13-EDO, high-fidelity Just Intonation subsets (e.g., 13edo roughly approximates \((11^x \times 2^{y_x}) \in (1,2] \, x \in [0,12]\))—a chain of thirteen 11th harmonics folded by octaves 1:2. For example: \(2^{6/13} \approx 1.377 \approx 11/8 = 1.375\)
Perceptual Experience: Slippery, ambiguous, “almost tonal”
Mechanism: Local phrases strongly resemble 12-EDO intervals, triggering familiar categories
These systems produce local illusions of tonality. You might hear a chord that “feels” like a major triad—your brain engages tonal constancy, and you momentarily experience a key center. But when the next chord arrives, the illusion fails. The logic collapses. The system can't sustain a consistent diatonic mapping over time.
We call this the Shifting Grid Problem: tonal constancy can win a battle, but it loses the structural war. The mind would have to rebuild the entire tonal scaffold for each new event—a computation it can't maintain over time. (Or can be used to exploit the less familiar categories)
This explains why many listeners describe such music as “drifting,” “haunting,” or “unstable.” It’s not unfamiliar in total—it’s familiar in fragments, and that inconsistency is deeply disorienting.
2. Low-Density Anchors: Global Induodecimability
Systems: Bohlen-Pierce, 8-EDO, 10-EDO, some dissonant Just Intonation networks
Perceptual Experience: Profound alienation, unfamiliarity, or unclassifiability
Mechanism: Few (or zero) interval categories resemble 12-EDO constructs
Here, tonal constancy doesn't just fail intermittently—it fails completely. There are no islands of recognition. The harmonic series is differently parsed, the scale is divided in unfamiliar ratios, and resolution itself might not even be meaningful.
These are truly induodecimable systems. Even approximate mappings to 12-EDO don’t make sense. The brain has to make a choice: either accept this new musical logic on its own terms, or reject it as noise.
Implications: When Hallucination Meets Constraint
Anchor density reveals a deep cognitive tradeoff. Tonal constancy can only operate within a limited domain of error. Systems like 19-EDO or 22-EDO can stretch that domain without breaking it; systems like 8-EDO snap it in half.
Flavor becomes function when too many anchors accumulate. Microtonality starts as expression—but once enough structural anchors reappear, the listener’s mind imposes full tonality onto the scale.
Historically, the octave inherits its importance from multiple overlapping domains:
Physically, it manifests as half the length of a string or air column.
Musically, it frames the diatonic scale and tonal structures.
Mathematically, it represents a 2:1 frequency ratio—elegantly simple, multiplicative, recursive.
It’s tempting to accept the octave as self-evident, especially under Pythagorean or harmonic frameworks. However, perceptual mechanisms introduce complexities that challenge this physical simplicity. In the brain, the octave is not merely heard—it's constructed.
Even if their pitch resolution is equalized, the topology of pitch space is not. One spans 10 octaves; the other spans less than one semitone. The experience of distance—of musical identity, direction, and hierarchy—is rooted in cyclical frameworks, not in raw resolution.
This demonstrates that perceptual distance is not simply a product of range or JND. It emerges relative to cycles, and is shaped by shared statistical and structural expectations—like those derived from the harmonic series.
So what gives rise to perceptual constants like pitch equivalence, or intervals like the octave?
The answer is statistical redundancy in the harmonic series. When you double a frequency, all overtones align perfectly—no new information is introduced. Whether simultaneous (consonance) or sequential (temporal prediction), the brain notices this and treats such transformations as perceptual "returns."
Even with pure sine waves—devoid of harmonic cues—the octave retains salience. This suggests that the octave is a learned statistical attractor, reinforced by both acoustic regularities and internal models.
Importantly, if you stretch this cycle—say, to 1100 or 1300 cents—and still use recognizable melodies within it, perception holds. The melodic identity persists, even as the physical cycle distorts. This shows that tonal constancy is momentum-based: the mind keeps cycling where it's been taught to.
The perceptual flexibility of the octave and its role as a framework for monophonic melodic structure are demonstrated through a series of audio examples. Each example features a 12-EDO diatonic major scale subjected to proportional stretching. The notes of the scale are presented sequentially, followed by a short melody, to illustrate the preservation of tonal meaning and relative intervallic distances despite the stretching. This process results in a relative error distribution of less than 10 cents between adjacent notes. Specifically, audio example 1 features a stretching of the octave from 1200 cents to 1150 cents, while audio example 2 features a stretching from 1200 cents to 1250 cents.
(Audio.01) 12-EDO diatonic stretched to 1150cents.
(Audio.02) 12-EDO diatonic stretched to 1250cents.
Let’s propose a basic abstraction:
Perceptual Cyclicity: A phenomenon in continuous, one-dimensional perceptual space where sustained motion in a single direction yields a recurring encounter with regularly spaced, qualitatively similar percepts—producing a sense of cyclical sameness despite uninterrupted progression.
This is distinct from raw discriminability. The cycle introduces relational structure: every point within it has a maximum antipode (e.g., 600 cents from a reference tone in a 1200-cent cycle), and recursive midpoint divisions (50%, 25%, etc.) define internal geometry.
Even with degraded range or resolution, this internal geometry remains. That’s what makes tonal constancy constant.
An interesting experiment is to consider whether this system could be artificially manipulated. Suppose we design timbres where "sameness" is defined by a 2.5:1 ratio. Could we train a new perceptual cycle?
Maybe. But statistical coherence breaks down. Combination tones, missing fundamentals, and overtone interactions all push the brain back toward the harmonic series. You can stretch the cycle, but not endlessly. Eventually, you lose pitch identity—and with it, the cycle.
This suggests the octave is not a physical given but rather a 'habit' of perception..., anchored in physics but shaped by cognition. Without a cycle, there is no tonal identity—only pitch. Tonal constancy, diatonizability, and musical distance all require some cycle to act as a reference structure.
Neural Mechanisms and Predictive Models of Tonal Constancy
Neuroimaging and electrophysiological studies reveal that specific regions of the auditory cortex selectively respond to structured sound patterns such as speech, melody, and harmonic sequences, but remain relatively inactive during exposure to unstructured noise. Notably, areas such as the planum temporale, located posterior to the primary auditory cortex, appear to engage dynamically when pitch structures exhibit internal regularities, even when those regularities are statistically subtle or culturally learned.
These regions are not merely passively decoding incoming sound—they participate in an active predictive process. The brain constructs internal models of melodic or harmonic progression, and generates expectations for future events. When a pitch contour unfolds predictably, it minimizes error between the expected and actual input, triggering dopaminergic reward responses in associated circuits such as the nucleus accumbens. These reward-linked responses, observed even in anticipation of musical climaxes, suggest that successful pattern prediction is inherently satisfying, reinforcing the learned tonal templates over time.
Such findings align well with a Bayesian perspective: the brain updates internal priors based on the statistical structure of the sound environment, forming what we might call tonal basins—perceptual attractors that stabilize around culturally salient pitch configurations. These basins guide pitch interpretation even when the physical signal is ambiguous, distorted, or derived from non-standard tunings (e.g., 7-EDO or inharmonic timbres). The result is tonal constancy, rooted in statistical expectation, contextual prediction, and hierarchical sensory processing.
Neural/Perceptual Concept || Physical/Mathematical Analog:
Contextual pitch trajectory || Particle path with momentum
Dopaminergic reinforcement || Entropy minimization with energy input
Such analogies are speculative, but not without foundation. Perception is inherently path-dependent, sensitive to both immediate context and long-term learning. The momentum of pitch trajectories, as previously discussed, may correspond to cumulative Bayesian updating or even to forms of inertial processing, where prior motion in tonal space biases future interpretations.
draft
Exhibit A: The Anti-Randomness Engine
These musical examples belong to a larger work in which I explored the role of randomness in pitch selection—beginning with the question of what randomness even means in a musical context. For the purposes of that study, I treated randomness as non-intentional design.
The research (see [link]) investigates in depth why apparently random pitch sets remain musically functional, often without producing the sense of “oddity” one might expect. While this section does not include all of those examples—some pitch systems are far less straightforward to analyze than the 7-, 8-, or 10-EDO cases presented here—the broader study incorporates sets derived from planetary data, mathematical functions, noise distributions, and other sources.
What unites these varied systems is that, despite escaping the 12-tone grid in every permutation and defying conventional tuning logic, they still exhibit traditional musical utility. In practice, many of them do not even sound “microtonal.” Instead, musical intent, cognitive expectation, and perceptual organization stabilize them, allowing listeners to hear them as familiar or “normal.”
The key finding is simple:
Uniform subdivisions of the perceptual cycle—even allowing for clustering or irregular spacing—still guarantee tonal functions.
This “anti-randomness engine” illustrates how music perception is less about strict mathematical grids and more about how the mind organizes trajectories into meaningful tonal categories.
No comments:
Post a Comment