Xeneize

Tuesday, May 6, 2025

Diophantine Limits in Quantum Search: A Three Gap Theorem Perspective

The Rhythms of Chance

The application of Grover's quantum search algorithm to solve specific Diophantine equations within bounded integer ranges is a known demonstration of its utility, hinting at an intrinsic link between quantum computation and number theory. However, this article posits a far more profound connection: the probability amplification mechanism central to Grover's algorithm inherently shares qualitative limitations and structural parallels with concepts from Diophantine approximation, particularly illuminated by the Three Gap Theorem (TGT).

The Three Gap Theorem (also known as the Steinhaus Conjecture) is a remarkable result in number theory. It states that for any irrational number \(\alpha\) and any positive integer \(n\), the set of fractional parts \(\{k\alpha\}\) (i.e., \(k\alpha \bmod 1\)) for \(k = 1, 2, ..., n\), when ordered on the unit interval \([0,1)\), partitions this interval into subintervals of at most three distinct lengths. If exactly three lengths occur, the largest is always the sum of the other two. This theorem reveals an astonishing regularity in a seemingly simple iterative process.

Crucially, the TGT is deeply intertwined with the theory of continued fractions. The continued fraction expansion of \(\alpha\) provides the key to understanding the sequence of gap lengths and their evolution as \(n\) increases. Specifically, the denominators of the convergents of \(\alpha\)'s continued fraction mark the values of \(n\) where the structure of these gaps undergoes significant reorganization. Thus, the "approximability" of \(\alpha\) by rational numbers—a central concern of Diophantine approximation and characterized by its continued fraction—directly governs the pattern of gaps.

Grover's algorithm, when viewed geometrically, performs a series of rotations within a two-dimensional Hilbert space spanned by the initial uniform superposition state \(|s⟩\) and the marked (target) state \(|w⟩\). Each "Grover iteration," composed of an oracle call followed by a diffusion operation (which can be seen as an inversion about the mean of amplitudes), effectively rotates the quantum state vector by a specific angle \(\theta\) towards \(|w⟩\). This rotation angle is given by \(\theta = 2\arcsin(\sqrt{M/N})\), where \(N\) is the total number of states in the search space and \(M\) is the number of marked states. After \(r\) iterations, the cumulative rotation is \(r\theta\).

The core analogy proposed here is between this rotational dynamic in Grover's algorithm and the sequential principle of the TGT. The sequence of angular positions \(r\theta\) (modulo \(2π\)) on the unit circle (representing the quantum state's phase relative to \(|s⟩\) and \(|w⟩\)) mirrors the sequence \(n\alpha\) (modulo \(1\)) on the unit interval in the TGT. Consequently, the number-theoretic implications governing the distribution of \(n\alpha\) can be inherited to understand the behavior of \(r\theta\). We are not "approximating" the angle \(\theta\) itself in the Diophantine sense, but rather the quality of how \(r\theta\) "approximates" \(π/2\) (the angle required to align the state vector with \(|w⟩\) for maximal success probability) is subject to number-theoretic influences.

While there's an optimal number of iterations \(r_{opt} \approx \frac{\pi}{4} \sqrt{N/M}\) for Grover's algorithm, continued iteration leads to the state vector rotating past \(|w⟩\), decreasing the success probability, only to approach it again later. The TGT, with its complex patterns of gap restructuring, suggests that subsequent near-alignments with \(|w⟩\) will not necessarily be progressively better or occur at simply predictable intervals. The precise quality of these subsequent "good" iteration counts could be dictated by the Diophantine properties of the angle \(\theta\).

This implies that the continued fraction convergents of \(\theta\) (which is itself a function of \(N\) and \(M\)) might reveal not just \(r_{opt}\), but also subsequent, potentially less optimal but still significant, iteration numbers where the state vector comes close to \(|w⟩\). The "approximability" of \(\theta\) plays a critical role:

If \(N\) and \(M\) are such that \(\theta\) is a "badly approximable" number (like the golden ratio, characterized by small, bounded partial quotients in its CF), the sequence \(r\theta \bmod 2\pi\) will be very evenly distributed. This might mean achieving extremely high precision (very close alignment to \(|w⟩\)) is "harder," or that the probability of success degrades more slowly around \(r_{opt}\), or that subsequent good alignments are more spread out. This suggests a fundamental limit on the "quality" of amplification achievable for a given number of iterations, dictated by \(\theta\)'s Diophantine nature.
Conversely, if \(\theta\) is very well-approximated by a rational \(p/q\) with a small denominator \(q\), then after \(q\) iterations, \(q\theta\) might be very close to a multiple of \(\pi\), leading to either a very good or very poor alignment, depending on the numerator \(p\).

Therefore, the choice of \(N\) (the search space size, related to qubit count) becomes paramount, as it directly influences \(\theta\) and thus its Diophantine character. Selecting an \(N\) that results in a \(\theta\) with a "favorable" first CF convergent might yield the fastest high-probability result. However, an \(N\) leading to a badly approximable \(\theta\) (e.g., if \(\sqrt{M/N}\) is related to the golden ratio) might represent a scenario where the algorithm is robust but achieves its peak probability more "gently" and might offer fewer opportunities for significantly better alignments with further iterations.

This perspective doesn't claim to find algorithms faster than Grover's \(O(\sqrt{N})\) for unstructured search, as that bound is proven optimal. Instead, it suggests that the intricate dance of probabilities in Grover's algorithm is choreographed by deep number-theoretic principles. Understanding these principles could lead to a more nuanced comprehension of the algorithm's behavior across different problem sizes and solution densities, potentially informing choices of \(N\) or strategies for problems where multiple near-optimal iteration counts are relevant. The intertwined nature of quantum mechanics, search, and number theory suggests a rich tapestry of connections still waiting to be fully explored.

Sunday, March 30, 2025

Dual Group Structures in Diophantine Approximations

As revealed by the MLA(Mesopotamian Logarithm Algorithm) for logarithmic convergents, a similar property appears in other irrationals when analyzed in their corresponding space.

Logarithm Case Recap:

Irrational: \(\alpha = \log_b(a)\)

Convergent: \(p/q \approx \log_b(a) \Rightarrow q \times log_b(a) \approx p \Rightarrow a^q \approx b^p\)

Sequence: \(r_x = a^x \times b^{y_x}\) reduced to \([1, b)\). This is like looking at \(a^x\) "modulo \(b\)" multiplicatively. \(y_x\) tracks the 'overflow' exponent of \(b\). (This highlights the absence of a standard shorthand notation for multiplicative modulus; see link)

Sorted Sequence: Sorting \(r_x\) for \(x=1\ldots q\) gives indices \(x_k\).

Structure: \(x_k\) forms \(\mathbb{Z}/q\mathbb{Z}\) (gen \(p^{-1} \mod q\)), \(y_{x_k}\) forms \(q\) terms of \(\mathbb{Z}/p\mathbb{Z}\) (gen \(q^{-1} \mod p\)).

Trigonometric Case (Angle)

Irrational: We need an irrational quantity related to the angle. Let's use \(\alpha = \theta / (2\pi)\). (assuming \(\theta\) is not a rational multiple of \(2\pi\)).

Convergent: \(p/q \approx \theta / (2\pi) \Rightarrow q \times \theta / (2\pi) \approx p \Rightarrow q\theta ≈ 2\pi p\). This means \(q\) rotations by \(\theta\) is close to \(p\) full \(2\pi\) rotations.

Sequence: What's the equivalent of \(a^x \mod 1:b\)? The natural analogue for angles is \(x\theta \mod 2\pi\). Let \(r_x = (x\theta) \pmod{2\pi}\). This sequence lives in \([0, 2\pi)\).

What is \(y_x\) ? It's the number of full rotations removed: \(xθ = y_x \times 2\pi + r_x\). So, \(y_x = \lfloor x\theta / (2\pi)\rfloor\).

Sorted Sequence: Sort \(r_x\) for \(x=1\ldots q\) to get indices \(x_k\).

Structure: \(x_k\) forms \(\mathbb{Z}/q\mathbb{Z}\) (gen \(p^{-1} \mod q\)), \(y_{x_k}\) forms \(q\) terms of \(\mathbb{Z}/p\mathbb{Z}\) (gen \(q^{-1} \mod p\)).

This directly mimics the log case by replacing the multiplicative group \((\mathbb{R}^+, \cdot)\) modulo \(b\) with the additive group \(\mathbb{R} \mod 2\pi\) (the circle group \(S^1\)). The relationship \(q\theta \approx 2\pi p\) is the direct analogue of \(a^q \approx b^p\). The Three Gap Theorem describes the structure of the sorted \(r_x\) values (the points \(x\theta \mod 2\pi\) on the circle), and their ordering is intimately linked to the continued fraction convergents \(p/q\). The generators likely arise from the relationship \(q(p'/q') - p(q'/q') = \pm \)1 between consecutive convergents.

(Need to test which inverse/element works. The structure \(p_{n-1} q_n - p_n q_{n-1} = (-1)^n\) from continued fractions is key here, likely determining the specific generators.)

Tuesday, November 5, 2024

Interval Reduction (Multiplicative mod)

This page is dedicated to the interval reduction operation, a foundational concept in music theory that I’ve explained briefly in other articles where it often plays a key role. Interval reduction is a universal yet frequently misunderstood process, with "octave reduction" as one specific example. Here, I offer a formal definition and consistent mathematical notation for interval reduction, aligning it with the notation developed for the interval matrix. This approach provides a basis for broader generalization in music analysis and tuning theory.

Essentially, this is a shorthand notation for the multiplicative counterpart of the regular (additive) modulus.

Note on Ratio Notation: In music theory literature, ratios and fractions are often used interchangeably, particularly when discussing relationships like string length, frequencies, harmonics, or subharmonics. This overlap can lead to ambiguity, especially when referring to intervals without specifying direction. For instance, while octave equivalence remains clear across notations (1:2, 2:1, 1/2, 2/1), we may lose the specific octave reference—whether above or below—if this is not explicitly noted. Even worse with 2:3, 3:2, 2/3, 3/2, without specifying we can't be sure if it's a fifth or a fourth.

To address this, we follow the convention that treats ratios like 4:5:6 as indicative of pitch relationships, where each number represents a multiple of a fundamental (1). For example, in this format, 4:5:6 represents a major chord as the intervals 1/1, 5/4 and 6/4=3/2.

Examples:
- Octave (second harmonic): Ratio 1:2, Fraction 2/1, Decimal 2
- Second Subharmonic: Ratio 2:1, Fraction 1/2, Decimal 0.5
- Fifth: Ratio 2:3, Fraction 3/2, Decimal 1.5
- Fourth (below unison): Ratio 3:2, Fraction 2/3, Decimal 0.666...

This notation standard helps clarify both direction and pitch relationships, reducing ambiguity when discussing intervals and chords across different contexts.

Interval Definition:
To avoid ambiguity, interval here refers specifically to a musical interval, where values represent multiples of a fundamental frequency, typically normalized to a unison. When referring to a mathematical range, such as \((1,2]\), we will use the term space.
This distinction helps clarify references to musical intervals, such as the octave (a frequency multiplier with a value of 2), as distinct from the octave space, defined as any interval \((a,2a]\) for \(a \in \mathbb{R^+}\).

Interval Reduction

Definition:
Interval Reduction is a scaling transformation that maps a positive real number \(x\) within the bounds of a specified space, denoted \((1,b]\), by repeatedly multiplying or dividing \(x\) by \(b\), until \(x\) falls within \((1,b]\). Here, \(b,x \in \mathbb{R}^+ \) with \(x \notin (1,b]\).

Notation:
The interval reduction operation can be represented as a mapping into a modular "interval space." We use the following notation to generalize this process concisely:

Using the mod operator:
\[x \bmod 1:b = [x] \] where \([x]\) represents the equivalence class or representative of \(x\) within the \((1,b]\) space.

This notation mirrors modular arithmetic while specifying that it applies to interval reduction:
\[ x \equiv [x] \pmod {1:b} \] To capture all equivalence classes in a similar form:
\[ xb^n \equiv [x] \pmod {1:b},\, n \in \mathbb{Z} \] This expression shows that the fifth 2:3 is the class representative in octave space of the third harmonic (1:3), the sixth (1:6), twelfth (1:12), (1:24), (1:48), and so on, with respect to unison.

Example Application:

For \( x=48, b=2\):
\[48 \bmod 1:2 = 1.5 = 3/2 \text{ (Fifth)}\] Normalization for Other Ranges:

To perform interval reduction in ranges other than \((1,b]\), normalize the space as follows:

\(x \bmod a:b ⇒ x \bmod 1:b/a \)

For example, with \(x=7, a=4, b=5\):
\[7 \bmod 4:5 = 7 \bmod 1:5/4 = 1.174\ldots\]This approach is particularly clear for rational values in the space. We avoid directly writing \(7 \bmod 5/4\) to prevent confusion with traditional modular arithmetic; using ratio notation, \(7 \bmod 4:5\) explicitly denotes interval modularity, simplifying it as \(7 \bmod 1:5/4\). For spaces involving irrational values, we write the ratio starting from 1, such as \(5 \bmod 1:\sqrt{2}\), to maintain clarity.

The process can also be expressed explicitly using logarithms and the classic mod operator:
\[ x \bmod a:b = (b/a)^{log_{b/a}(x) \bmod 1} \] Example:
\[5 \bmod 1:2 = 2^{log_2(5) \bmod 1} = 5/4 = 1.25 \text{ (Major third)}\] In this case, we map the fractional part of the logarithm (base 2) of 5 into the octave cycle.

(The following clarifies the notation I used earlier, which appears in videos and some texts that may be found on the web, but is now deprecated)

While interval reduction is not a strict mathematical function—relying on iterative scaling rather than a single closed-form—it can be expressed as a function, especially within octave space. This space is particularly relevant to music theory as it relates to chroma equivalence and pitch grouping in human perception. Here, the "octave reduction" is the chroma function:
\[\Xi(x) = x \bmod 1:2 = 2^{log_2(x) \bmod 1}\] This function is a special case of our interval reduction process, where b = 2. This "octave reduction" maps any positive number to its equivalent within the octave cycle, representing its chroma.
We can extend this idea to define a general chroma function \(\Xi_{1:b}(x)\), where 'b' defines the specific interval space: \[\Xi_{1:b}(x) = x \bmod 1:b\] Example, for \( x = 7, b = 5/4\): \[\Xi_{4:5}(7) = 1.17440512\] Therefore, \(\Xi_{1:b}(x)\) effectively represents interval reduction within a given space. The chroma function \(\Xi(x)\), previously used in my work, is a specific case of this more generalized interval reduction function that without parameters, defaults to octave space.

The choice of notation can depend on context. For example, if one simply needs to find the chroma of a pitch within a tuning system, the compact chroma function provides a straightforward approach. However, the modular notation for interval reduction clarifies the process when building tuning systems and can also be applied in other mathematical contexts. For instance, in \( x \bmod 1:b = [x]\), the modularity is evident by viewing the operation as \(x \bmod 1:b = x/b^n\) , where \(n\) is the integer that scales \(x\) into the space \((1,b]\). While our primary interest may be in the result, here [x], the value of \(n\) and the sequences it generates with various inputs form the basis of the logarithm algorithm I introduced in [link].

Coding:

Most programming languages support logarithmic functions, so the chroma function can be implemented concisely. For example, in Python:

import math

def chroma(x):
    return 2 ** (math.log2(x) % 1)

This implementation uses logarithmic reduction to map x to its equivalent within the octave space (1,2]. However, for a more generalized approach that applies to any interval space (1,b], here’s the Python code to perform interval reduction for x mod 1:b :

def interval_reduction(x, b):
    # Ensure inputs are positive real numbers and x is outside (1, b]
    if x <= 0 or b <= 1 or (1 < x <= b):
        raise ValueError("Ensure x > 0, b > 1, and x is not within (1, b].")

    # Apply interval reduction by scaling x within the (1, b] range
    while x > b:
        x /= b
    while x <= 1:
        x *= b
        
    return x

Tuesday, August 20, 2024

The Interval Matrix

DRAFT
This article introduces the concept of the interval matrix from a traditional music theory perspective, alongside a software tool designed to create and visualize these matrices. In this context, intervals refer to proportions or ratios between numbers.

The interval matrix is built from all possible representations of a set's values under an equivalence relation, using each element as a base, resulting in a numerical or geometrical table—a matrix—that represents this expansion.

These matrices are not initially intended for conventional matrix operations; the focus lies in the geometric structure that emerges from different sets and their elements' relationships.

Interval Matrix software. Prime numbers up to 19(set to periodic), with equivalence 1:2 (octave-space)
\(\mathbf{Ä}_{1:2}(P_{19})\)

For an infinite set, the matrix cannot be fully generated. However, if the set has a repeating pattern (period), a minimal generating set can be identified. The matrix is then built and completed using this minimal set, (n-by-n) as seen in a common musical tuning system (a set of pitches or rhythms used to create or perform music).

This period typically becomes the primary equivalence relation (equave) parameter in the set's function for constructing the matrix and analyzing the intervals within.

The matrix can be constructed for a finite set that isn't meant to repeat. For example, in music, this approach can be used to analyze notes on an instrument where there's no indication to continue calculating additional pitches. This method applies to any finite set. In a finite matrix, each row contains one element less than the previous row.

Set and matrix construction:

For analyzing a set \(S\) that is already normalized and within the desired range—such as in any pre-calculated musical tuning system—the set remains unaltered, and the matrix is built directly \(\mathbf{A}(S)\). The only required parameter is its periodicity: Is the given set a minimal generating set of an infinite set, or does it represent a fixed, finite number of elements?

Most examples here will use periodic matrices. To denote matrix periodicity or non-periodicity, we might use different notation, such as \(\mathbf{Ä}\) for periodic matrices and \(\mathbf{A}\) for non-periodic ones.

The generalization of the interval matrix construction allows us to relate different sets and reductions, enabling us to find congruences between systems. The reduction function (which corresponds to the chroma function when the space is the octave, 1:2) for a real matrix, where the set consists of any real numbers, operates as follows:

The absolute value of each element is taken, and the function then returns this value, reduced or remapped (if necessary) by an equivalence relation:

For a value \(s_x\) larger or smaller than the chosen equivalence relation \(r\), it is reduced to a new element \(\tilde{s}_x\) by applying the operation:

\(\tilde{s}_x = |s_x| \bmod 1:r\)

(This uses the mod symbol because it effectively returns the intervallic remainder. This process involves repeatedly multiplying or dividing \(s_x\) by \(r\) until \(s_x\) falls into \((1, r]\) space. This page has details about interval reduction.)

Since the matrix is defined by reinterpreting the set values with each element as the base, all rows inherently start with 1. Consequently, the reduction, or normalization, is consistently performed as \(\bmod 1:r\)

Optional: A constant \(\delta\) may be applied to each element of the set before performing the base change.(this in relevant for other uses explained in other article)

The reduction can be notated and performed for sets \(S_{1:r}\) without considering any matrix. It can also be used in constructing the matrix, \(\mathbf{A}_{1:r}(S)\), which implies both reduction and base shifting.

Example: If \(S\) = {1, 2, 3, 5}, then \(S_{1:2}\) = {1, 3/2, 5/4, 2}, and \(\mathbf{Ä}_{1:2}(S)\) would yield [{...},{...},{...},{...}]. (reduced and periodic)

Interval Matrix Definitions:

Full Interval Matrix: \(\mathbf{A} = \mathbf{A}_{s_n}^{\delta}\)
This matrix uses the last or largest element of its generating set as the equivalence relation.

Local Interval Matrix: \(\mathbf{A}_{s_i}^{\delta}\)
This matrix uses any element within the generating set as the equivalence relation, except for the largest one.

External Interval Matrix: \(\mathbf{A}_{x}^{\delta}\)
This matrix uses a value outside the generating set as the equivalence relation.

A full interval matrix built from a periodic set is inherently a symmetric matrix.

A full or local interval matrix is not "useful" for isotropic sets (where the chosen period or relation is a member of the set). This leads to identical and overlapping shifts of the elements.

Musical Interpretation:
For example, the 12-tone equal temperament \(\text{12ed2}\) guitar is an interval matrix (incomplete) representing the infinite set generated by the constant \(2^{1/12}\). Each row is shifted by five elements from the previous row (except between \(\text{G}\) and \(\text{B}\), where the shift is four). The matrix is trivial for this set's intervallic analysis, as columns (frets) are always aligned regardless of the shift or element taken as base.

Interval matrices are tipically shifted by one element until they are complete.

Consider this group: \(\langle 2, 3 \mid 3^2 = 1 \rangle \). This represents a set of infinite fifths and octaves. One of its minimal generating sets is \(S\) = (1, 3/2, 2]. The resulting matrix \(\mathbf{Ä}_{1:2}(S)\) has only two rows:

(1, 3/2, 2]
(1, 4/3, 2]

Interval Matrix Accumulation: \(\text{Acc}(\mathbf{A}(S))\)

This is a new set with all the representations of the elements under the set equivalence relation, which unfiltered, might repeat values, helping to find prevalent proportions. Isotropic sets always have an accumulation identical to any of their matrix rows. (The accumulation is a vectorization or flattening of the matrix)

In this case, the infinite set generated by \(\langle 2, 3 \mid 3^2 = 1\rangle\) = { ..., 1/2, 2/3, 1, 3/2, 2, ...} has an interval accumulation (under the equivalence 1:2): (1, 4/3, 3/2, 2].

The distinction between full, local and external interval accumulations reflects the matrix type.

For example, consider a local matrix \(\mathbf{A}_{1:2}(S)\) constructed from the set {1, 2, 3, 4} in octave space (with an equivalence relation of 1:2). The local accumulation would be:

\(\text{Acc}(\mathbf{A}_{1:2}(S))\) = {1, 4/3, 3/2, 2} (filtered, with non-repeated values)

To obtain the global or full accumulation, the space is set to the largest element in the set. Thus, the matrix built from the set {1, 2, 3, 4} under the equivalence relation 1:4 would yield:

\(\text{Acc}(\mathbf{A}(S))\) = {1, 4/3, 3/2, 2, 3, 8/3, 4} (filtered)

For larger and more complex sets, the accumulation also provides a method for finding a possible natural mode of the set, if any.

Let’s take the pentatonic \(\langle 2, 3 \mid 3^5 = 1 \rangle\)
a minimal generating set is { (1, 9/8, 81/64, 3/2, 27/16, 2/1] }, its full matrix (omitting 1):

{9/8, 81/64, 3/2, 27/16, 2/1}
{9/8, 4/3, 3/2, 27/16, 2/1}
{9/8, 4/3, 3/2, 16/9, 2/1} Natural Mode
{32/27, 4/3, 3/2, 16/9, 2/1}
{32/27, 4/3, 128/81, 16/9, 2/1}

The natural mode of any set is the particular representation that includes the most frequent values appearing after shifts; it is the most faithful or weighted representation of the set.

How the Interval Matrix App Works

It accepts a list of numbers, treating them always as a minimal generating set(for now).

If the list/set is an already a reduced tuning system, the matrix is full and the equave(period, interval of equivalence) parameter should initially be set to match that of the set, typically the last and largest value. It does not adjust it automatically.

The matrix displays for each element in each row: the original value inserted, the reduced value(if it was reduced), a delta value(if it was displaced), and a rational approximation of the value.

The delta value comes from the delta parameter, usually 0. This value is added to every element in the original set before the rest of the calculations. This is useful for understanding how a minimal set, while maintaining its original absolute difference between members, shapes through this change.

For example, you can start with period/equave 1:2, and this set {1,2,3} reduces to {1, 3/2, 2}, but with delta = 3, it becomes {4, 5, 6}, and reduced, {1, 5/4, 3/2, (2)}.

Prime numbers up to 19. Delta = -1, octave-space.
\(\mathbf{Ä}_{1:2}^{-1}(P_{19})\)

The rational approximation has an adjustable tolerance value.

On top of the interval matrix, there is a configurable equal division ruler that helps with intervallic/ratio measures.

The chroma matrix has a fixed equivalence relation of an octave and, by default, starts at red. You can select whether the chromas displayed are absolute or relative to each row. When selecting relative, the full spectrum located in the bottom UI expands to display all the possible chroma shifts. (The full spectrum isn’t really "full"—you set a maximum space to occupy, with a logical maximum of the human hearing range.)

This last part is the most important when dealing with musical tuning systems; practical tuning systems have a simpler chroma matrix.

Unlike Scala files, the 1 must be inserted (remove it to understand what happens). You can, if you want, omit the equave in this list; it will be added (invisibly) from the equave parameter. However, it’s useful to keep it too, for example, when analyzing a non-octave tuning using an equave 3 (tritave). You can omit it, but if you want to inspect these intervals reduced to an octave, you might want to keep it and track it. So if when the set has an element equal to the equave, you will find two identical rows in the matrix.

Future Development

If you paid close attention to the code of this app and the SFINX app, you may have noticed that they use the same engine. That’s because, as I have pointed out, a guitar is essentially an interval matrix by string length.

My goal is to finally reunite both apps—SFINX was developed to aid in the graphic and diagram generation of scales for microtonal guitars, while the Interval Matrix was developed ideally for geometric analysis of sets and chromas.

(DRAFT)

Link to the apps:

jbcristian.github.io/xeneize/

Monday, August 5, 2024

Chroma: A Unifying Principle of Auditory and Visual Perception

Abstract:

This manuscript presents a critique of established paradigms in color vision science and proposes a novel theoretical framework grounded in the fundamental correspondence between the perceptual processing of musical chroma and visual hue. Extending beyond superficial analogy, this work posits a shared, underlying cyclical mechanism governing both sensory domains. Current models of color vision, particularly those emphasizing photoreceptor contributions (trichromacy) and opponent processing, are argued to be incomplete, often neglecting the inherent interdependence of color percepts and the crucial role of post-receptoral processing stages. Specifically, the concept of independent "unique hues" and unsubstantiated claims regarding "new colors" in non-standard vision (e.g., tetrachromacy) are critically re-evaluated. Drawing upon mathematical principles derived from music theory—specifically the cyclical nature of chroma—this study analyzes empirical data from color perception, including color constancy, afterimages, and binocular chromatic fusion. Preliminary findings reveal significant, mathematically describable symmetries within perceptual color space, offering enhanced predictive power for various visual phenomena and challenging the prevailing photoreceptor-centric dogma.

Introduction:

The research detailed in the accompanying comprehensive draft (currently on hold) initiates a fundamental reassessment of contemporary color theory. It challenges the prevailing view that color percepts arise independently, arguing instead for an inherent interdependence structured around a cyclical continuum, analogous to musical chroma. This perspective suggests that many widely accepted tenets and recent research directions, particularly those concerning tetrachromacy, animal color vision, and the notion of "expanded" or "new" colors, may be based on flawed foundational assumptions regarding color independence—a property yet to be empirically demonstrated.

The initial sections of the comprehensive work establish a rigorous understanding of musical chroma, not merely as an analogy, but as a necessary conceptual tool. It addresses the often-overlooked implications of octave equivalence, the cyclical nature of the pitch continuum, its mathematical representation (logarithmic and unit circle mappings), and the inherent properties derived from this cyclicity. This exploration delves into fundamental questions regarding perceptual substitution, cycle flexibility, the ontological status of intervals and scales (e.g., major chords, diatonic structures), and the nature of boundaries between discrete perceptual categories (notes). While not providing definitive answers to all, this musical foundation elucidates the core concept: perception can be organized cyclically, and the properties of elements within that cycle are relationally defined by their position within it. This framework is subsequently applied to the analysis of visual hue.

The analysis of color vision begins by critically examining and reconceptualizing established concepts, notably "unique hues," which are redefined as environmentally and linguistically influenced "color attractors" within an interdependent color cycle. Two complementary models are proposed:

The Spectral Octave Model: This model addresses the physical (wave nature) and early perceptual stages. Utilizing a logarithmic octave representation (analogous to the musical octave) of empirical color perception data, it reveals statistically significant symmetrical arrangements of color attractors around a central green hue. These symmetries predict complementary color relationships, often involving consistent wavelength ratios (approximating √2), which align with observations from color constancy, afterimage studies, and stereoscopic color mixing, frequently deviating from standard additive (RGB) model predictions. Potential underlying mechanisms involving wave interference and neural frequency synchronization are explored.
The Axiomatic (Cyclical Neural) Model: This model focuses on the abstract, neural representation of hue. Derived from mathematical principles governing the unit circle and incorporating a proposed "mix" function, it models color space as an interdependent cycle. This abstract model demonstrates consistent predictive capability for phenomena heavily reliant on post-receptoral processing, particularly binocular color fusion. It successfully accounts for color constancy and afterimage generation, suggesting they share a common underlying neural mechanism operating within this cyclical framework.

A significant portion of the analysis concentrates on stereoscopic vision and its interaction with color constancy and afterimages. The axiomatic model accurately predicts the observed complex interactions. During this investigation, specific mechanisms governing binocular fusion emerged, leading to the concept termed "Subspace Dominant Chromatic Information" (SDCI). This observation suggests that the visual field can contain multiple regions undergoing independent binocular fusion strategies; however, each region (SDCI) acts as a unit where the color mixing instruction applies globally within that subspace, preventing internal sub-regions or "holes" from adopting conflicting strategies. Conflicts are not always resolved locally, even when identical conflicts are resolved in adjacent regions operating under a different SDCI. Several stereoscopic images were constructed to demonstrate these specific properties.

Results and Current Status:

The primary results presented are qualitative, supported by statistical analysis of the spectral octave symmetry and numerous visual demonstrations (including stereoscopic images) validating the predictive power of the proposed cyclical models for color constancy, afterimages, and binocular fusion dynamics (including SDCI). While the project is currently on hold, the compiled observations and theoretical framework offer a foundation for researchers seeking alternative perspectives to current color science paradigms, particularly those questioning inflated claims regarding tetrachromatic vision that often minimize or ignore post-receptoral processing stages. This synthesis serves as a guide to the detailed arguments and evidence presented in the comprehensive draft.

ORIGINAL DRAFT:

This study investigates the relationship between musical and visual chroma, proposing a deeper, shared organizational principle beyond mere analogy. Leveraging the cyclical and continuous nature of both chromas, the study employs musical theory concepts—specifically interval matrices and chroma analysis—to analyze hue data from color perception research. A logarithmic representation of the visible spectrum, termed the "spectral octave," reveals symmetrical patterns in the distribution of color attractors (re-concenptualized "unique hues") in both trichromats and tetrachromats. This symmetrical arrangement, centered around green, predicts complementary color relationships consistent with observations from color constancy, stereoscopic color mixing, and afterimage studies, often deviating from traditional RGB representations. Furthermore, the study explores perceptual parallels between the auditory and visual domains, including tonal/color constancy, afterimages/aftersounds, and sensory conflict, suggesting shared processing mechanisms. These findings challenge current color vision models, offering a novel framework for understanding color perception grounded in mathematical principles of musical harmony and raising broader philosophical questions about the nature of sensory experience.

The initial music theory section provides a concise definition of chroma, its mathematical representation, and its application in modern music and tuning theory, including microtonalism, interval matrices, chroma analysis software, and color-coding. These aspects of tuning theory are often under-documented, making this section a valuable resource in itself. These concepts are then applied to the analysis of visual chroma.

1. Musical Chroma: Definitions and Mathematical Representation
1.1 The Nature of Chroma and Octave Flexibility
1.2 Mathematical Representation of Chroma
1.3 The Impact of Non-Octave Tunings
1.4 Color-Coded Octaves, Interval Matrices, and Chroma Analysis
2. Chromas and Color Perception: Beyond Analogy
2.1 Rethinking "Unique Hues": Introducing Color Attractors and Co-unique Hues
2.2. A Logical Foundation for the Hue Continuum
2.3. The Spectral Octave
3. Spectral Analysis of Color Attractor Locations
3.1. Logarithmic Representation Reveals Underlying Symmetry
3.2. Consistent Internal Ratios Despite Individual Variation
3.3. Mathematical Expressions of Symmetry
3.4 Statistical Analysis
3.5. Millioctaves for Color Measurement
4. Wavelength-Derived Musical Scales
4.1. Mathematical Process Summary
4.2. Examples of Unique Hue-Based Scales
4.3. Musical Properties and the Role of Uniform Distribution
5. Prediction Mechanism and Perceptual Parallels
5.1. Binocular Rivalry and Stereoscopic Color Mixing
5.2. Color Constancy and Tonal Constancy
5.3. Sensory Adaptation and Aftereffects
5.3.1. Complexity of Afterimage Perception
5.3.2. Types of Negative Afterimages
5.3.3. The Octave Hue Wheel and Afterimage Prediction
5.4. Afterimages and Stereoscopic Mixing Interactions
6. The Central Role of Green
7. Deconstructing the "New Color" Misconception
8. Applying the Axioms to Visual Experience: A Logically Constrained Color Space
9. Probabilistic Color
9.1 Retinal Structure and Photon's self-interference
9.2 Quantum Biology

Chromas and Mathematical Representation: (DRAFT)

Chroma, as a perceptual attribute of sound, is rooted in the categorization of pitches. This concept, often subject to misinterpretations due to conflation with synesthesia, represents a pitch class under octave equivalence. Chroma is distinct from the broader term "pitch class," which denotes a class within a system of note names. Chroma concerns the identification of perceptually similar pitches, irrespective of absolute frequency differences. This recognition of similarity is attributed to various aspects of auditory perception, including the logarithmic nature of hearing, a preference for harmonic timbres, and the physical interaction of sound waves, encompassing resonance and interference.

Octave (Definition): In music, an octave defines the interval between a reference pitch P and another pitch with twice the frequency. This corresponds to the second harmonic. Octaves, in general, from a reference pitch P are defined by all frequencies \(P \times 2^n\), where n is an integer. Specific designations are employed for related intervals: the double octave (corresponding to the fourth harmonic) and the sub-octave (corresponding to the second subharmonic), etc. Octaves are perceptually identified as similar, as exemplified by all C notes on a piano. This perceptual similarity is rooted in the simple mathematical relationship of multiplying or dividing frequency (or wavelength) by 2. This relationship can be considered a perceptually grounded equivalence class of pitches.
Chroma (Definition): Chroma denotes the relative perceptual quality of a musical pitch, representing its position within a reference octave space. Chromas are conventionally designated with names such as "fifth," "major third," and "augmented unison." Chroma is not an inherent property of an isolated pitch. Rather, chroma emerges from the relationship between a pitch and a reference point. Specifically, an isolated pitch P does not possess a chroma. When P is considered in relation to a reference pitch A, P—and all of its octave duplicates (\(P \times 2^n\), where n is an integer)—are considered to share the same chroma, C. This chroma C represents the perceptual quality of that specific pitch relationship with the reference root. It constitutes the "color" of the interval formed between the reference pitch and the pitch in question, irrespective of octave displacement, this implies that chromas are cyclic. While Western music theory may differentiate between a 2nd and a 9th (intervals separated by an octave), these intervals share the same chroma with respect to the root. In tuning systems with finite periodic chromas, this octave equivalence enables the transposition of chords and voicings while retaining their function. It also facilitates musical transcription between ranges, such as from piano (with approximately 80 notes) to guitar (requiring fewer than 30), and it allows for the perceptual substitution of a note with its octave duplicates without loss of tonal meaning and function.
Pitch Class (Definition): In modern music theory, "pitch class" (or simply "class") is a more abstract and generalized concept, analogous to an equivalence class in algebra. It denotes a set of pitches related by a specific, defined equivalence relation. This relation is not inherently perceptual and can be arbitrarily defined according to the requirements of a particular musical system or analytical context. While in octave-based systems such as 12-tone equal temperament (12-TET) the most common equivalence relation is octave equivalence (resulting in the frequent coincidence of pitch class and chroma), this constitutes only one possibility. A "pitch class" represents an assigned label or identifier given to a specific pitch. This assignment is absolute for that particular pitch within a defined system but does not inherently convey information regarding the pitch's relationship to other pitches. The class identifies the pitch but does not inherently reveal its intervallic relationship with other pitches or its function within a tonal structure.

The Nature of Chroma and Octave's Flexibility:

While the octave is explained through its mathematical and harmonic basis as a framework for pitch organization, certain perceptual aspects allow for flexibility within the strict mathematical definition of multiplying or dividing by 2. Human perception identifies pitches with a 1:2 frequency ratio as equivalent, sharing the same "color." This perceptual phenomenon aligns with the logarithmic nature of hearing. For example, the perceived interval between pitches with fundamental frequencies at 200 Hz and 400 Hz is equivalent to the perceived interval between pitches at 1500 Hz and 3000 Hz. This principle is exemplified by the practice of tonal music, in which the tension and resolution of chord progressions, such as \(\text{V}_7 \to \text{I}\), remain unaffected by octave displacements of chord members, though the addition of unrelated pitches can disrupt cadence and functional meaning.

One source of octave flexibility is timbre. Timbre influences consonance perception. Techniques of partial manipulation can render otherwise dissonant intervals more consonant, including octaves with frequency ratios deviating slightly from 2 (e.g., 2.1). However, human pitch grouping, in terms of simultaneously presented notes, persists even when listening to pure sine waves devoid of timbral complexity.

(Video.01 - Color-coded octave equivalence)

Video.01: Octave equivalence is demonstrated through a common chord progression exhibiting a known tension-resolution characteristic: \(\text{V}_7 \to \text{I}\). Within a 12-tone equal temperament (12-EDO) framework, with middle C standardized at 261 Hz, the progression \(\text{G}_7 \to \text{C}\) is employed. An initial sequence, represented in MIDI format, comprises approximately one pitch class per chord. Subsequent sequences introduce randomized octave doublings of chord members, illustrating the preservation of harmonic function and tonal meaning. Introduction of other random intervals in further sequences results in the loss of this harmonic function. While the octave's significance may appear self-evident within certain modern consonance models and given the observed perceptual flexibilities, such examples serve to reaffirm its fundamental role. The synthesized sounds in these examples utilize sine waves, thus eliminating timbral complexity and ensuring that the observed pitch grouping is independent of partials.

Another aspect, independent of consonance, is the sequential and melodic use of notes. Monophonically, the octave can be stretched even within a ±100 cent range without loss of tonal meaning within pentatonic or diatonic scales.

The perceptual flexibility of the octave and its role as a framework for monophonic melodic structure are demonstrated through a series of audio examples. Each example features a 12-EDO diatonic major scale subjected to proportional stretching. The notes of the scale are presented sequentially, followed by a short melody, to illustrate the preservation of tonal meaning and relative intervallic distances despite the stretching. This process results in a relative error distribution of less than 10 cents between adjacent notes. Specifically, audio example 1 features a stretching of the octave from 1200 cents to 1150 cents, while audio example 2 features a stretching from 1200 cents to 1250 cents.

(Audio.01) 12-EDO diatonic stretched to 1150cents.

(Audio.02) 12-EDO diatonic stretched to 1250cents.

(Audio.03) Auditory stimulus used in pitch distance estimation tests. A sequence of randomly generated pitches with constant, randomized step sizes is presented. Participants estimate the overall interval between the first and last pitches. Step sizes and number of notes are withheld from participants to prevent calculation-based responses.

The missing fundamental effect and combination tones exemplify our preference for harmonic timbres. Both phenomena involve the perception of phantom pitches, often explained by mathematical and physical principles involving integer multiples between frequencies.

Missing Fundamental Effect: A perceptual phenomenon where the fundamental frequency of a harmonic series is perceived even when physically absent, due to neural processing of the harmonic overtones and the brain's sensitivity to octave relationships.
Combination Tones: Additional tones perceived when two or more tones are sounded simultaneously, arising from nonlinearities in the auditory system and resulting in the generation of new frequencies.

Thus, despite its inherent perceptual flexibility, the octave constitutes a natural reference for human perception and physical phenomena such as wave behavior. This relationship is well-established musically and mathematically.

Mathematical Representation of Chroma

For readers familiar with tuning theory, the following mathematical treatment of chroma provides a rigorous foundation. For those less mathematically inclined, the core concept is that chroma represents the fractional part of a frequency ratio within an octave. This section introduces the equations and concepts necessary for a precise analysis of chroma and its relationship to musical intervals.

Octave equivalence is mathematically captured by defining chroma as the fractional part of the base-2 logarithm of a pitch frequency ratio, expressed in terms of the octave cycle (1:1 represented as a power of 2):

\( \text{chroma}(x)=2^{\log_2(x)\mod 1} \)

Alternatively, expressed in terms of a normalized ratio modulo operation:

\( \Xi(x) = x \mod 1:2 \)

This signifies that the chroma of a pitch is invariant under octave multiplication or division (scaling by \(2^n\), where \(n \in \mathbb{Z}\)). For instance, relative to 1:1, the chroma of 3, 6, 12, 24, etc. (representing a fifth) is 1.5, corresponding to the frequency ratio 2:3. This approach identifies their equivalent "color" regardless of absolute frequency.

Using \(\log\) and \( \bmod 1\) notation makes the process explicit for coding. For example, the chroma function can be implemented as 2**(math.log2(x) % 1). This method bypasses the need for manual interval reduction (such as repeated division by 2 for values greater than 2 or multiplication for values less than 1).

The following mathematical expressions, formally defining an equivalence class and an isomorphism of topological groups, are familiar in principle to musicians. These equations, which define structure preservation, enable the construction of pitch class diagrams, such as the well-known "circle of fifths." These same principles are subsequently employed in the development of a spectral representation and a hue wheel (mapping the visual spectrum to the unit circle in the complex plane)..

Chroma can be formalized in terms of ratio equivalence relations. For \( x, y \in (0, \infty) \)

\( x \sim y \Leftrightarrow x = 2^n \times y \, \) for some \( n \in \mathbb{Z} \)

The following mapping is established:

\( \frac{(0, \infty)}{\sim} \xrightarrow{\log_2(\bullet)} \mathbb{\frac{R}{Z}} \xrightarrow{\exp(2\pi i \bullet)} \mathbb{S^1} \subseteq \mathbb{C} \)

In general, the mapping can be expressed as:

\( [x] \mapsto \log_2(x) + \mathbb{Z} \mapsto e^{2\pi i \log_2(x)} \)

The mathematical nature of chromas reveals that melodies and chords necessitate more than octaves alone; other "colors" or fractional parts of the log₂ scale are essential. The implications of this mathematical understanding of chroma for different tuning systems, particularly those deviating from the familiar octave-based structure, are now considered.

The Impact of Non-Octave Tunings on Music:

In the analysis of any tuning system, an understanding of its chroma content is of paramount importance. The finiteness or infinitude of its chroma set, along with the precision of its octave approximation, constitutes a primary factor in assessing the system's inherent complexity, practical applicability, and potential for integration within established musical frameworks. Chroma content provides fundamental insight into the structural characteristics of the tuning system and the musical operations that are readily supported or significantly challenged. However, without specialized tools, such analysis can be computationally demanding and often impractical for general artistic endeavors. Nonetheless, certain fundamental principles can be grasped without extensive calculation.

A fundamental principle states that for finite generating sets (such as those represented in typical tuning files), a non-octave period implies an infinite chroma set. While such systems offer theoretical interest, their practical application in conventional music creation and workflows is generally limited.

Generating Sets (in the context of tuning): Within the context of musical instruments and software, a generating set constitutes a finite collection of pitches (or frequency ratios) employed to define a tuning system. This set provides the fundamental building blocks from which other pitches can be derived. While some tuning systems are designed to be periodic (repeating at octaves or other intervals), generating sets are utilized even for non-periodic systems. Software or instruments then map these pitches across the audible range. This practical usage should be distinguished from the more abstract mathematical definition of a generating set within group theory.

If a tuning is defined with an octave period (or powers of the octave, 2ᵏ/1, where k is an integer), the chroma set is finite. Conversely, systems in which the period—or any internal step—does not sum to an integer multiple of the octave yield infinite chroma sets.

Consider, for example, the Bohlen-Pierce tuning system, specifically its equal-tempered form known as 13-ED3 (13 equal divisions of the tritave). While this system is often described as having "13 classes," the presence of infinite chromas introduces complexities. On a standard six-string guitar tuned in 13-ED3 (with each string tuned to the 4th fret of the preceding string), 28 unique notes are generated across the fretboard (13 + 3 + 3 + 3 + 3 + 3). Each of these notes represents a distinct chroma relative to the open lowest string. Consequently, the guitarist encounters 28 unique chromas, significantly exceeding the 12 chromas of the conventional system and contradicting the initial assertion of 13 pitch classes.

In octave-based systems, collaboration among musicians is facilitated. Participants can perform within any register, matching pitch classes that share the same chroma (e.g., performing a C major chord across different octaves). In contrast, non-octave tunings present a significantly more challenging collaborative environment.

When participants perform within different "periods" of a non-octave tuning system, the established functional roles of harmony are disrupted. Unless all musicians possess mastery of the tuning's note content and intervallic relationships across all potential periods, coordinated performance becomes exceedingly difficult.

This highlights a significant limitation of non-octave tunings in collaborative musical contexts. While these tunings offer potential for unique sonic exploration, their applicability within shared, traditional musical practices is inherently constrained.

Color-Coded Octaves, Interval Matrices, and Chroma Analysis

The infinite, continuous, and cyclical nature of musical chroma reveals similarities with the visual spectrum, particularly when color-coding is introduced into musical notation for the visualization of pitch relationships, both in complex microtonal scenarios and in conventional 12-tone contexts.

Several perceptual questions then arise:

Does the visual spectrum provide sufficient distinct color categories to represent the nuanced distinctions between musical pitches?
Where are the perceptual boundaries between adjacent note/color pairings located?

‘the just confines of the colours are hard to be assigned,
because they pass into one another by insensible gradation’ (Newton).

When does one color end and another begin? This is strikingly similar to a fundamental question in music theory: when does one pitch function shift to the next? Functional music theory often resolves this by prioritizing contextual relationships over the precise, often ambiguous boundaries of intervals.

Various music notation systems are currently employed in conventional musical practice, including traditional Western staff notation, MIDI roll notation, and alternative systems incorporating color coding. Many of these systems utilize color to represent octave equivalence and pitch classes, as exemplified by tools such as the "colored piano" and color-coded staff notations.

Modern software facilitates the implementation of such color-coded systems, even within standard 12-tone workflows. These visual aids not only assist in working with microtones and unconventional scales but also enhance learning and comprehension of standard musical concepts.

For instance, the demonstration of octave equivalence presented earlier (Video.01) employs a color-coded MIDI roll. Within this 12-tone context, 12 uniformly distributed colors from the sRGB hue wheel are assigned, with an arbitrary origin (in this case, red is assigned to the pitch class C). This visualization enables rapid identification of the constituent notes within a chord. In one example, a nine-note chord is presented; however, a brief visual inspection reveals only three distinct colors, indicating three pitch classes corresponding to a major chord. Without color coding, individual note class analysis or intervallic calculation would be required—a significantly more time-consuming process. This system provides an efficient visual representation, irrespective of any theoretical connections between musical notes and the color spectrum.

The Spiral Harp: A Case for Color Coding

Color coding becomes particularly advantageous in more complex musical contexts, such as the Spiral Harp. This virtual musical instrument generates pitches by interpreting the lengths of spiral polygonal chain segments as string lengths. The instrument supports a wide range of configurations, enabling the creation of complex microtonal setups and interactive performance within intricate, web-like structures.

The Spiral Harp is designed to facilitate free exploration rather than impose a rigid theoretical or methodological framework. However, understanding relationships between notes significantly enhances learning and navigation. Traditional labeling of each string proves impractical due to the instrument's capacity to generate over 1,000 distinct pitches within the audible range. Furthermore, given the infinite number of possible configurations, enumeration or calculation of all string ratio relationships becomes both infeasible and of limited utility.

Color coding offers a practical solution. By assigning an arbitrary origin and denoting octave equivalence with consistent color assignments, performers can readily identify strings belonging to the same chroma or octave class. Strings of varying lengths sharing the same color will produce consonant sonorities, as they belong to the same octave class.

Within this software implementation, the sRGB hue wheel is utilized for color coding. This facilitates the recognition of octave equivalence and also reveals additional intervallic relationships. For instance, the RGB complementary colors (red-cyan, green-magenta, and blue-yellow) correspond to tritone relationships. This correspondence echoes parallels observed in art and music: both tritones and complementary colors are frequently associated with tension or dissonance.

In music theory, the tritone is defined as the geometric mean of the octave, represented by the square root of 2. Unlike intervals such as perfect fifths and fourths, which exhibit inverse mirroring, the tritone possesses symmetry—remaining invariant under inversion—reinforcing its ambiguous, achromatic quality.

Interval and Chroma Matrices

In music theory, interval matrices serve as analytical tools for tuning systems and instruments. While some systems exhibit octave-based periodicity, others employ alternative periods or lack periodicity altogether, potentially generating an infinite number of chromas. Comprehensive understanding of such systems necessitates the calculation of pitches beyond the minimal generating set, examining the resulting scale extensions and emergent musical possibilities.

For the visualization and analysis of these intervallic relationships, a chroma matrix can be constructed. This matrix constitutes an extended interval matrix with the octave as a fixed period.

Color coding can enhance both interval and chroma matrices, with color assignments based on octave equivalence and an arbitrary reference point.

In a tuning system whose interval or chroma matrix displays only one color, the system is comprised solely of octave duplications. Conversely, a non-octave tuning system yields a chroma matrix with a growing number of colors as pitches are added, demonstrating its infinite chroma nature.

Interval Matrix (Definition): An interval matrix is a tabular representation of the intervals between all pairs of pitches within a given tuning system or scale. It proves particularly useful for analyzing non-equal temperaments or scales characterized by non-uniform intervallic distances between scale degrees. In equal temperaments, the interval matrix exhibits redundant patterns, rendering it less informative. A simple example is the diatonic scale, whose interval matrix reveals the characteristic intervallic structures of its various modes (e.g., Ionian, Dorian, Phrygian).

(Image.02) sRGB color-coded interval matrix of the 3-limit diatonic scale (group presentation: <2, 3 | 3⁷≡1>). Each row represents a cyclic permutation of the scale. The matrix is displayed logarithmically, with a 12-tone equal temperament (12-EDO) ruler for reference.

Visualization Example

[Video.02]

This animation demonstrates the construction of a chroma matrix using specialized software. The demonstration comprises two examples:

Calculation of the 12-tone system, illustrating how the addition of pitches beyond its period does not introduce new chromas, resulting in a repeating pattern.
Analysis of a non-octave tuning, demonstrating how the addition of pitches reveals new chromas.

For further information on interval matrices and access to software tools for their creation, refer to [link].

Analogous to the limited utility of interval matrices in analyzing equal divisions of the octave (where all permutations yield the same relative set of pitches), chroma matrices also offer limited analytical value for octave-based tunings, even those with unequal divisions. In such systems, the addition of pitches beyond the octave period does not generate new chromas.

Chroma matrices find their primary application in the analysis of non-octave tunings. For example, an interval matrix of the 13-ED3 system (equally dividing the tritave) exhibits identical rows. Given that its period of repetition (the tritave) and its equivalence class are arbitrarily defined, and considering the system's equal division, any local interval matrix provides limited information. Specifically, employing any pitch as the equivalence class results in the same local intervallic relationships, which do not capture the global structure of the system. In this context, chroma analysis provides the most informative approach, in interval matrix terms the chroma matrix is an "external interval matrix", folding the set with an external element, in this case the octave.

Chromas and Color Perception:

Despite the disparate physical nature of sound and light, and the distinct sensory organs responsible for their processing, both phenomena share fundamental characteristics, including their wave-like and continuous-cyclic nature, as well as several related perceptual phenomena.

This analysis posits that the connection between chroma in music and color perception transcends mere associative analogy. A deeper, non-arbitrary relationship, rooted in shared perceptual principles and a common mathematical structure, is proposed, extending beyond simple visualizations.

Color perception research suffers from substantial ambiguities that are not just worth addressing but critical to any meaningful analysis. Many conventions are thus necessarily challenged.

"Unique hue," "non-spectral colors," and "new colors" all require revision.

Across numerous studies, ambiguity in both terminology and methodology complicates interpretation. While current color theory is sufficient for practical implementations and understanding, the very nature of color itself remains unclear. Not just from a philosophical standpoint, but its physical, biological, and neurological basis deserves revision. Notably, the simplistic interpretation of color qualia tied directly to cone activation, which effectively renders photoreceptors the arbiters of color awareness even before a fully formed percept is achieved, ignores crucial post-receptoral stages. This study focuses particularly on binocular chromatic fusion and its interaction with other phenomena, which directly demonstrates that color dimension is not a simple cone addition rule. The idea of three dimensions tied to three cones is overly convenient; color is continuous, and there are far more than "three colors."

Before continuing this extensive interdisciplinary analysis, a few definitions need to be adjusted for precision, aiming to minimize ambiguity and misinterpretation.

The Flawed Concept of "Unique Hues"

The traditional concept of "unique hues" is fundamentally flawed. The subjective definition as a hue "without a tint of another" is scientifically weak. It lacks a singular, universally accepted definition, mirroring the ambiguity surrounding "primary colors." While "unique hue" is often defined as a color without admixture of another hue, this definition is subject to further scrutiny. For example, green is not "unique" or "primary" in the sense of being irreducible, as it can be obtained through color mixing, like any other point in the color continuum.

A study that relies on the assertion that "red can't be described any other way" while simultaneously acknowledging that "red can be obtained with magenta and yellow" presents a clear contradiction. This highlights the need for more robust and objective definitions.

Throughout this text, the term "unique hues" will be used either when discussing research that used that term, or when ambiguity is not present.

Introducing Color Attractors and Co-unique Hues

To differentiate from "unique hues," we must introduce the concepts of "color attractors" and "co-unique hues," to then better define spectral, non-spectral and complex colors and hues. Later in this study, a more fundamental concept of "co-unique hues" is defined for developing a logical color space.
These concepts offer a more rigorous and objective framework for understanding color perception.

Color Attractors: Anchors for Language and Perception

"Unique hues" are often conflated with "primary colors" due to their prominence in language and culture. The traditional unique hues (red, yellow, green, blue in English) are simply discrete names that have gained currency through evolutionary and cultural pressures. We have a name for the sensation of red because it's important for survival (blood, ripe fruit); blue might originate from the sky, water important for navigation, and green from vegetation-food source. The ecological significance of these colors is well-established.

The mistake of equating attractors with "unique hues" arises from confusing the external influence with an internal, irreducible perceptual quality. We've taken the commonness and salience of certain colors in our environment and mistakenly interpreted them as fundamental building blocks of color perception.

However, it's crucial to distinguish between linguistic salience and fundamental perceptual importance. Consider this thought experiment: if our blood were a reddish-orange, would we have a discrete name for red, or would we describe it as a compound sensation? If vegetation were a greenish-blue, would we have a distinct category for green? Where, then, do these discrete sensations originate?

Unique hues are often confounded with primary colors, mistakenly considered more important perceptually. There is no conclusive evidence that these traditional unique hues hold any special status in brain processing; any perceived difference arises primarily in memory, language and environment.

Therefore, "color attractor" is a more appropriate term. Color attractors are the necessary sensations in our environment that serve as anchors for language and perception. "Primary colors" can then be used to describe the colors used by a given system (e.g., the primaries of a printing process, the RGB primaries), but they aren't fundamental in any deeper perceptual sense. The term "unique hue," with its inherent ambiguity, should be retired.

Beyond Hue: Brown and Non-Spectral Colors

The idea of a color attractor extends beyond the hue dimension. Colors like brown, which have discrete names and widespread use, can be represented as a region within the red-orange-yellow part of color space. Brown, commonly found in nature, is a complex color that involves more than just hue.

Brown is often incorrectly classified as a non-spectral color.

However, there is only one true non-spectral hue: magenta. We might more accurately call it a non-spectral hue.

Magenta is the only color sensation that cannot be elicited by a single wavelength of light. It's a combination of red and violet/purple, the two extremes of the visible spectrum. All other bichromatic mixtures within the visible spectrum result in either a spectral color, an achromatic gray, or magenta. (magenta is a consequence of the geometry of the neural hue wheel).

Thus brown, and other traditional colors like gold or copper, are non-spectral but it is not convenient to define them that way, as they involve more than the hue and lightness dimension and their sensation and distinction accounts for complex material properties.

Magenta, or "fuchsia" as it's known in some regions, highlights the ambiguity of color naming. The same hue might be called different names in different places. Even within a single language, there can be variations. In Spanish (specifically Argentinian Spanish), there are multiple terms for what English speakers might broadly call "blue" (e.g., azul, celeste), none of which directly correspond to cyan or RGB blue. RGB blue might be described as "dark blue" (azul oscuro) or "navy blue" (azul marino). So, even the seemingly straightforward "unique hue" of blue is subject to significant variation and interpretation.

The definition of co-unique hues will be further developed later in the study; consider them simply as the complementary colors, those that cancel each other out, mixes that achromatize.

In summary, there are no truly "unique hues" in the traditional, subjective sense. Instead, we have co-unique hues, defined by their relationships within the color space, and color attractors, which are perceptually salient and linguistically encoded color categories that serve as anchors for our experience and communication about color.

Throughout this text, when referring to particular hue attractors when discussing specific visual effects and image examples, the names: red, orange, yellow, green, cyan, blue, violet and magenta, will refer to specific RGB values unless specified.

------

The analysis reveals a consistent mathematical structure explaining spectral locations for "color attractors" (formerly "unique hues") based on empirical data from individual observers. This structure predicts the positions and relationships between complementary colors through a simple wavelength ratio, coinciding with observations of color constancy, stereo vision color mixing, afterimage hues and their complex interactions.

The analysis initially focuses on the proportional arrangement of color attractors within the visible spectrum, utilizing data from studies on trichromats' and tetrachromats' color wavelength matching. The analytical techniques employed for musical chroma and interval ratio analysis are applied to this data, resulting in the construction of musical scales that exhibit consistent results and extend the observed correspondences. This approach ultimately reveals a common color structure across individuals.

Subsequently, several parallels between vision and hearing are analyzed. Some parallels highlight direct analogies between chroma and complementary colors, while others suggest similar perceptual effects arising from distinct mechanisms.

Finally, a logical model of color, derived from observations of complementary colors, is introduced, aligning with established modern opponent process theory.

Demonstration: The Blue-Yellow "Non-Complementarity"

The emergent color mapping, as detailed subsequently, reveals complementary pairs reminiscent of canonical subtractive color models such as RYB.

(Image.03) Stereo-vision color mixing demonstration.

The presented image juxtaposes two modified copies of a landscape photograph. The left image is predominantly rendered in yellow (255, 255, 0), with some regions incorporating orange (255, 127, 0). The right image is predominantly rendered in blue (0, 0, 255), with some regions incorporating violet (127, 0, 255). Stereoscopic merging of these images results in a green percept where yellow and blue overlap, contradicting the complementary relationship posited by additive color models such as RGB. Instead, the orange regions are neutralized by the blue, and the violet regions by the yellow, resulting in a landscape dominated by green grasslands against achromatic rocks, mountains, and clouds.

Vision Instructions: While optimal viewing is achieved with stereoscopic equipment such as a VR headset (or even a simple cardboard viewer and mobile device), viewing via the cross-eye technique on a computer monitor is also possible. The image should be displayed at a comfortable size and viewing distance, with the viewer's head held straight and horizontal. By slowly converging the eyes, a focal point where the images merge can be found. Initial attempts may require some time due to potential binocular rivalry. Once the images are fused, the eyes will relax, and the resulting "true" colors will be perceived.

-----

The Spectral Octave

A striking initial parallel between sound and light lies in the scale of human perception. The visible spectrum spans a frequency range approximating one octave, defined as the interval (a, 2a], where a ∈ ℝ⁺. Observed frequencies range from approximately 400 THz to 800 THz, corresponding to wavelengths ranging from approximately 750 nm to 375 nm (a 1:2 ratio). Given the linear relationship between energy and frequency (E ∝ f), photon energy also doubles across the visible spectrum. This electromagnetic range is designated the "spectral octave," mirroring the musical octave and suggesting a shared organizational principle based on frequency ratios. Both the continuum of chromas and hues exist within a frequency range corresponding to a doubling in frequency.

Crucially, both sound and light perception exhibit logarithmic characteristics. This logarithmic nature is reflected in several aspects of light perception. The relationship between physical light intensity and perceived brightness is logarithmic, a well-established finding in psychophysics often described by power laws closely related to logarithms. The visual system's adaptation to changes in ambient light color also involves logarithmic processes. At the neural level, logarithmic transformations are common in sensory processing, with neurons often exhibiting a logarithmic or compressive response to stimuli, enabling the encoding of a wide range of input intensities.

Just-Noticeable-Difference of Chroma and Hue:

The just-noticeable difference (JND) for hue, defined as the smallest perceptible change in color, exhibits non-uniformity across the visible spectrum. Empirical studies have demonstrated that the JND is smaller in the blue region (approximately 2 nm at ~400 nm) and larger in the red region (approximately 6 nm at ~700 nm), exhibiting an increase with increasing wavelength. This non-linear distribution suggests a logarithmic relationship between wavelength and perceived hue. Similarly, the distribution of color attractors also presents a non-linear pattern. For instance, the spectral range occupied by violet, blue, and cyan is roughly equivalent to the entire red band, further indicating a logarithmic compression at higher frequencies (shorter wavelengths).

Considering the entire visible spectrum, the average JND approximates 1–2%, a threshold comparable to that of pitch, which is approximately 10–20 cents (equivalent to 1–2% of the musical octave). While these values are subject to variation depending on factors such as timbre, loudness, brightness, and saturation, this parallel suggests the possibility of shared perceptual processes underlying the perception of small changes in both sound and light.

(Image.04) Six pairs of colors are presented, each pair representing a distinct segment of the visible spectrum: red, yellow, green, cyan, blue, and magenta. The hue difference within each pair is set at 6 units on a 360-unit scale. Under the hypothetical assumption of displays emitting monochromatic light for each wavelength, a 6-unit step on a 360-unit scale corresponds to approximately 1–2% of the spectral octave (assuming a simplified visible range of 375 nm to 750 nm).

-----

Spectral Analysis of Color Attractor Locations: Unveiling Hidden Relationships

The following analysis of color attractor locations is based on data from existing studies on "unique hues" and color bands. While these studies offer valuable insights, they are not without methodological ambiguities and potential misinterpretations, discussed in detail in Appendix A. Despite these limitations, the data employed here exhibits sufficient internal consistency and falls within expected statistical variation, justifying its use for the present analysis.

It is important to acknowledge that the concept of relating color to wavelength proportions has been largely dismissed by many color science researchers. Numerous studies have reported only limited success in establishing meaningful correspondences, with some even asserting the independence of "unique hue" locations from spectral properties. This skepticism, however, often stems from a common methodological flaw: the use of aggregated and averaged data from multiple experiments with varying methodologies. Averaging across diverse datasets effectively obscures and destroys the very patterns this analysis seeks to reveal. Indeed, even within controlled experiments, significant individual variation in "unique hue" locations is widely acknowledged, with one individual's "best" red potentially overlapping with another's "best" orange.

The data used in this analysis, while from a relatively small sample (15 trichromats and 23 tetrachromats), offers a unique advantage. Participants were asked to mark the spectral locations of color attractors (red, orange, yellow, green, cyan, blue, and violet) without any visual stimuli. This approach directly reveals learned perceptual associations and preferences, minimizing the influence of immediate perceptual judgments. While individual responses varied, a clear pattern emerges when analyzed individually. To validate these findings, which are visually apparent in Figure [img01], several statistical techniques were employed. Crucially, this spectral representation, more closely aligned with the neural hue wheel concept, successfully predicts afterimage phenomena, color constancy effects, and stereoscopic mixing outcomes. This convergence of evidence suggests a deeper, underlying relationship between color perception and spectral properties, despite the challenges faced by previous averaging-based approaches.

Complementary Color Relationships Based on a Consistent Ratio

Musical analysis of color wavelengths, specifically by constructing musical scales from individual sets of color attractor wavelengths, has revealed a consistent relationship between proposed complementary color pairs: red-cyan, orange-blue, yellow-violet, and green-magenta. This relationship is characterized by a consistent ratio, deviating from the conventional RGB model of complementarity. This ratio predicts the pairings of inducer hues and afterimages, explains color mixing in stereoscopic vision, and accounts for hues produced by color constancy.

Logarithmic Representation Reveals Underlying Symmetry

Traditional linear representations of wavelength-hue data, often presented in tables within arbitrary horizontal ranges (Image.05), obscure these inherent relationships. While linear formats can show inter-subject alignment, they fail to highlight the internal symmetries present within individual datasets. Similarly, circular representations using standard 400–700 nm ranges and assigning complementary wavelengths based on various models can also obscure these relationships, especially when employing non-linear step sizes for perceptual uniformity.

(Image.05) Color Attractor ("Unique-Hues") 380-780nm, linear scale.

To overcome these limitations, a logarithmic scale within a spectral octave range (a, 2a) was employed to visualize individual color attractor data during scale construction. This approach unveiled a clear symmetrical pattern in color perception, approaching near-perfect symmetry in some individuals. Crucially, the symmetry point in this logarithmic, octave-based arrangement corresponds to the square root of 2. This consistent ratio, approximately √2, is observed across all complementary pairs. For example, if an individual's "orange" is located at wavelength x, their "blue" is predicted to be at approximately x × 1/√2. This ratio also precisely positions magenta opposite green, the central hue of the spectrum. While other color relationships might show some predictability, the repeated occurrence of the √2 ratio within each individual's data strongly supports the robustness of this specific relationship.

While some physiometric tests may extend the visible range beyond a doubling of wavelength (e.g., 380–820 nm), such instances are rare, and these extensions do not involve the perception of novel hues beyond red or violet. This extra range, and similarly, shorter ranges of vision, while still encompassing the full spectrum, exhibit the same flexibility observed in the musical octave, as discussed previously. These variations in range do not substantially affect the proportions of internal components, analogous to the stretched diatonic scales (Audio.01/02).

The selected range for the spectral octave here is 375–750 nm. These specific values serve both as commonly used ranges and for facilitating graphic representation and ruler markings with predictable subdivision increments (+5 nm).

(Image.06) Color attractor locations (red, orange, yellow, green, cyan, blue, violet; magenta is artificially mirrored across green) for trichromats (left) and tetrachromats (right), plotted on a logarithmic scale within the spectral octave of 375–750 nm. (Illustrative RGB values).

While consistency is observed in certain cases, individual variation is also present. Some subjects exhibit deviations from others, even lacking a defined color attractor for cyan, for example. These variations reflect not only potential differences in the semantic interpretation of color names arising from personal preferences or cultural influences but also patterns in the distribution of these "deviations."

The observed ambiguity in the violet/blue region, where overlap in color attractor positions occurs between individuals (up to a clearly defined red region), is characteristic of linearly sampling a logarithmic phenomenon. This is analogous to the distortions that arise in musical interval tempering when the logarithmic nature of pitch is not considered.

Consistent Internal Ratios Despite Individual Variation

This color structure reveals a crucial insight into the nature of individual differences in color perception. While perceptual thresholds (JNDs), subjective experiences (qualia), and learned color associations can vary between individuals, the underlying relationships between color categories remain consistent. This consistency underpins the predictive power of this spectral representation. Color perception, therefore, appears to be based not on absolute wavelength values, but on consistent internal relationships between color categories. The perception of "yellow," for example, is intrinsically linked to the perception of "violet," forming a coherent internal color system with defined relationships between complementary pairs. This framework naturally accommodates individual differences in cone sensitivities, lens pigmentation, and neural processing.

Symmetry Around Green and Mathematical Expressions

(Image.07) Individual Color Attractors Symmetries - The image presents the original tabular visualization of color attractor data for each observer. Adjacent to this table, data from selected subjects are represented within a logarithmic octave wheel visualization. This visualization reveals that the near-perfect symmetrical patterns observed in some individuals are not readily apparent in the original tabular format.

Analysis of color attractor data reveals a symmetry in the color spectrum’s organization, with green acting as a central point of reflection. This is visually evident in logarithmic octave wheel visualizations (Image.07), which are not apparent in tabular formats. This symmetry manifests as mirrored wavelength ratios on either side of green. Specifically:

(Cyan / Green) ≈ (Green / Yellow)

(Blue / Green) ≈ (Green / Orange)

(Violet / Green) ≈ (Green / Red)

Furthermore, the ratio between complementary pairs consistently approximates √2:

(Red / Cyan) ≈ (Orange / Blue) ≈ (Yellow / Violet) ≈ √2

These observations can be mathematically expressed as two key symmetries:

Symmetry around green: \(W_G / W_X = W_{X'} / W_G\), where \(W_X\) and \(W_{X'}\) are wavelengths of colors equidistant from green.

Complementary color ratio: \(W_X / W_{X'} ≈ \sqrt{2}\), where \(X\) and \(X'\) are complementary colors.

Both the symmetry and ratio can be expressed as: \(W_G^2 = W_X \times W_{X'}\)

Despite this cross-phenomenon consistency, these findings are preliminary. Future research should employ larger and more diverse participant samples and rigorous quantitative methods across all investigated phenomena to fully validate the approach, explore individual differences, and understand contextual influences.

Millioctaves for Color Measurement: For this analysis, frequency ratios were used for color values. However, for future research and standardization, millioctaves (mocts) are recommended over cents for color measurement. Millioctaves, dividing the octave into 1000 equal logarithmic units (with complementary colors separated by 500 mocts), offer consistent decimal scaling and map more intuitively to the fractional part of the base-2 logarithm (the musical chroma). This unit simplifies calculations, obviating wavelength/frequency inversions and direct use of √2.

Statistical Analysis (Concise Summary)

Objective: To analyze the symmetries and relationships within color attractor locations, comparing trichromats and tetrachromats.

Descriptive Statistics: Mean and standard deviation calculations support the hypotheses of symmetry around green and consistent individual ratios for complementary pairs (approximating √2) in both groups.

Correlation Analysis: Trichromats exhibit high inter-hue correlations, particularly with neighboring hues (+0.8 or greater), indicating strong mutual influence. Tetrachromats show a similar pattern except for green, which exhibits significantly lower correlations with other hues.

Principal Component Analysis: No single hue explains a majority of the variance in either group. At least three principal components are required to explain over 90% of the variance in both groups. However, the dominant principal components differ: cyan for trichromats (explaining ~60% of variance) and yellow for tetrachromats (explaining ~60% of variance).

Regression Analysis: Inconclusive due to extreme multicollinearity, reflecting the visual observation that the color attractor structure generally moves as a whole.

Upcoming Analysis

Prior to examining the predictive capacity of identified complementary pairs for afterimage hues and other phenomena, a forthcoming analysis will explore average values for each group, revealing potentially richer musical analogs beyond simple tritones (√2 ratios) in color relationships.

Color Attractor Spectral Location and Wavelength-Derived Musical Scales

Historically, attempts have been made to establish connections between the musical and visual domains. Isaac Newton famously associated the colors of the rainbow with musical notes. Despite the prevalence of equal temperaments, such as the 12-tone system, during his era, Newton's pitch calculations were rooted in Pythagorean metaphysics and rational harmony. However, the challenge of consistently aligning scales, intervals, and light wavelengths with musical octaves prevented the development of a definitive model.

This part of the study adopts a reverse approach, constructing musical scales based on the spectral locations of color attractors rather than imposing existing musical structures onto the light spectrum. These hues, identified as "best exemplars" in color science literature, exhibit notable consistency across studies. The derivation of scales from these data points reveals remarkably stable musical structures, distinct from the rational intervals sought by Newton, yet no less compelling.

This section presents short musical examples based on tuning systems derived from the wavelengths of color attractors reported in color science literature.

It is crucial to note that wavelengths, measured in nanometers, are part of a human-defined measurement system. The scales presented here are not constructed by directly mapping nanometers to frequencies (Hz). Instead, they are based on the proportional relationships between color attractors, abstracting away from specific unit systems.

For the creation of these musical scales, wavelengths are considered proportionally relative to a base color and adapted for practical implementation on specific instruments. For example, a synthesizer may map a central tone to 261 Hz (middle C), with subsequent scale values expressed as frequency multiples to establish a periodic system. Within this framework, the perceptual spectrum functions as a torsor, where relative relationships are of primary importance.

Torsor (in the context of color): A torsor describes a set lacking a distinguished origin or zero point, yet possessing a well-defined notion of relative position or displacement. In the context of color, the set of all possible hues constitutes a torsor. The difference between two hues can be defined (e.g., "this hue is 30 degrees clockwise from that hue"), but there is no absolute "zero hue." In this context, the hues form a torsor relative to the scales (nm, Hz, cents, mocts, etc.), meaning that the relationships between hues are preserved regardless of the measurement units employed.

Mathematical Process Summary:

The concept of a torsor within the context of hues and the spectral octave can be illustrated through an example.

While color science typically employs wavelength measurements (nm) within the electromagnetic spectrum, music utilizes audio frequencies (Hz). These quantities are inversely related. Analogous to musical frequency ratio calculation from string lengths (or wavelengths), where the specific frequency value is less important than the ratio itself (assuming constant string tension), the precise terahertz values or photon energy are not directly employed here. Wavelength units (nm) are sufficient for determining proportional frequencies, calculated as inverses of the wavelengths. For example, the frequency ratio from red (700 nm) to cyan (495 nm) is calculated as follows:

Red (base): 700/700 = 1

Cyan frequency ratio: 1 × (700/495) ≈ 1.414

In the generated scales, ratios are calculated relative to red. However, given the cyclical nature of the system, the choice of base color is arbitrary; the proportional intervals remain invariant regardless of which color is chosen as the root or unison. This invariance exemplifies the torsor nature of hues.

The position, wavelength, and corresponding musical note assigned to magenta are derived from the observed complementary relationships. Specifically, the frequency ratio assigned to magenta is the frequency ratio of green multiplied by √2. This methodology accounts for individual variations in the spectral octave range (e.g., 370–740 nm, 405–810 nm), which are dependent on the location of the green attractor. While the graphics presented here utilize a constant 375–750 nm range for illustrative purposes, this choice reflects the torsor nature of hues.

Examples of Unique Hue-Based Scales:

Modern Trichromat Research: This scale utilizes median unique hue data from contemporary color vision studies on normal trichromats.
Tetrachromat Data: This scale is derived from studies on individuals with genetic predispositions to a fourth photopigment.

Auditory Examples:

The following auditory examples demonstrate the translation of unique hues into musical scales, revealing perceptual and structural parallels between light and sound.

Musical Properties of Hue-Derived Scales and the Role of Uniform Distribution

If strikingly unusual or exotic microtonal sonorities are anticipated from these hue-derived scales, their relative conventionality may be surprising. While subtle microtonal inflections may be perceptible to trained listeners, the overall impression is often surprisingly consonant with established musical practice. As previously mentioned, not only the tritone is frequently approximated by frequency ratios derived from hue data, but also other stable musical intervals, such as the major third and perfect fifth, emerge from various color combinations. The resulting scales exhibit major and minor chords, and each scale features varying degrees of consonance with other traditional intervallic relationships, corresponding to intervals such as sixths and sevenths. However, a single diatonic scale is not derived from a single root; multiple intervals are present, but their non-uniform distribution prevents direct transposition of chords derived from one color to another. The fact that these scales exhibit musical usability with common timbres, as demonstrated by the piano example in Audio:Trichromats01, is notable.

This observation raises the question of whether this musical usability is merely coincidental. To address this, the implications of randomness in tuning systems are considered. Prior research ([link]) explored the musical properties of randomly generated tunings, examining various interpretations of randomness, order, and predictability. A key finding was that uniform distribution of pitches within the octave space—even with some allowance for clustering—facilitates conventional musical usage, including tonicization and consonance on standard instruments. This arises from the inherent tendency of random subdivisions of the octave to approximate low-integer rational values, regardless of timbre (within certain tolerances). Constructing a scale with ten unusable pitches proves more challenging than constructing a usable one.

"Octave space" is defined here as any pitch range of the form (a, 2a], where a ∈ ℝ⁺. Uniformity of pitch distribution is considered within this space, ensuring that any octave-equivalent range within the audible spectrum contains a reasonable density of pitches (approximately 5 to 20). This definition excludes trivial cases such as uniformly distributed pitches concentrated within a narrow frequency range or sparsely distributed across the audible spectrum without regard for octave equivalence.

This conclusion is further supported by analysis of the Scala archive, a database of over 5,000 world tunings. Interval matrix analysis revealed that approximately 80% of the database exhibits congruence, indicating that many scales share the same intervallic content but with different starting points (modal transpositions/cyclic permutations, thus exhibiting the torsor property). Furthermore, randomly generated numbers, even from pseudo-random number generators (PRNGs), often approximate existing scales within a tolerance of approximately 5 cents. This suggests that tunings resembling established, structurally organized systems can emerge from seemingly random values. This observation led to the development of an "Average Tuning System," a 14-note system capable of approximating at least five notes from any of the 5,000 tunings in the archive within a 10-cent tolerance.

As demonstrated in the aforementioned study, music created with numbers derived from diverse sources, including planetary sizes, temperatures, mountain heights, and subatomic particle energies, consistently exhibits musical usability due to the emergence of stable, familiar intervallic relationships. This reinforces the principle that uniform distribution within the octave is a primary factor in creating musically usable scales.

Therefore, the relative conventionality of the hue-derived scales is not entirely unexpected. The color attractors themselves are well-distributed across the "color octave," naturally facilitating traditional tonal and modal usage.

However, this statistical predictability does not diminish the significance of these findings. While the musical usability of these scales may be statistically probable, their origin in physical reality and human perception imbues them with additional meaning. These are not merely arbitrary numerical values; they are rooted in the fundamental properties of light and its perception.

If the visible spectrum spanned a significantly different range—either much smaller (e.g., 400–430 nm) or spanning multiple "spectral octaves" (e.g., 400–3500 nm)—the relationship between color and chroma would become less compelling. The fact that colors exist within a single spectral octave strengthens the perceptual analogy.

This limited range also addresses the question of whether sufficient color distinctions exist to represent functional harmonies. The answer is affirmative. The fine distinctions made in color perception are analogous to the subtle distinctions made in musical intervals. Just as musicians may debate whether an interval is a "super major second" or a "sub minor third," distinctions are made between colors such as "yellowish orange" and "orangish yellow." This shared phenomenon highlights the fine granularity of both auditory and visual perception.

Color Wheel Construction: Addressing Color Space Transformations and Limitations

The construction of the color wheels presented in this analysis requires careful consideration of color space transformations and the inherent limitations of representing the visible spectrum within the RGB color space. Converting a specific wavelength to RGB values involves several factors that can influence the final color representation:

CIE XYZ Model Version: Different versions of the CIE XYZ color space (e.g., 1931, 1964, 2012) have slightly different color matching functions, leading to variations in the resulting XYZ coordinates for a given wavelength.
Illuminant: The choice of standard illuminant (e.g., D65, A, C) affects the white point of the color space and, consequently, the mapping of wavelengths to XYZ coordinates.
Gamma Correction: Gamma correction is a non-linear transformation applied to RGB values to account for the non-linear response of display devices. Different gamma values will result in different RGB representations for the same XYZ coordinates.

Consequently, obtaining a specific RGB value like (0, 255, 255) for cyan from a wavelength requires careful selection of the CIE XYZ model, illuminant, and gamma. Furthermore, achieving fully saturated RGB values for all spectral hues is often impossible. If a median render of the spectrum with equal power distribution is used, for example, the perceived saturation of red tends to decrease at longer wavelengths, making it difficult to accurately represent individual "best red" values at wavelengths like 710 nm.

It is crucial to emphasize that the color wheels presented here are primarily concerned with the hue/chroma dimension of color, not with precise representations of luminance or gamma. The goal is to accurately represent the relative positions of hues within the spectrum and their complementary relationships, rather than to create a photometrically accurate rendering of the spectrum.

Therefore the final color attractor representations in the wheel are ultimately based on standard RGB values, chosen to represent the perceived hue as accurately as possible within the limitations of the RGB color space. The choice of RGB values for the attractors is done with a focus on maximizing saturation and perceptual distinctiveness, with the understanding that this might not perfectly align with a strict radiometric conversion. (a pseudo-color)

This approach acknowledges the inherent limitations of representing the full spectrum in RGB screens while prioritizing the accurate representation of hue relationships, which are central to the analysis presented.

Ultimately, it is important to note that this color wheel, constructed through a combination of standard color matching functions (CIE XYZ 2006) and individual perceptual data (color attractors and bands), possesses significant predictive power. (Stereo color mixing, color constancy-afterimages)

_______

Prediction Mechanism and Perceptual Parallels

Beyond the general correspondence implied by the "spectral octave," several other parallels exist between the auditory and visual sensory domains. These parallels manifest in two distinct ways: (1) through shared complementary structures, as revealed by the octave-based color model; and (2) through shared descriptive characteristics or related conceptual frameworks.

For example, within the context of sensory conflict, binocular rivalry is often compared to the phenomenon of binaural beats. While this comparison highlights similarities in sensory conflict resolution, it does not typically propose a shared model based on chroma or color. Binocular rivalry is analyzed in the context of stereoscopic color vision, where the octave and chroma models are essential for predicting color mixing outcomes.

The auditory analog of afterimages, the "aftersound" effect, is briefly mentioned as a further example of sensory adaptation. While not directly related through chroma or octave models, the afterimage phenomenon is analyzed to demonstrate its adherence to the same complementary pairs predicted by the octave chroma model.

Color constancy is also addressed, drawing a parallel with the auditory phenomenon of tonal constancy. These phenomena are not directly linked by the octave model but share similar resolution mechanisms for interpreting "neutral" or ambiguous sensory information to maintain a coherent perceptual experience, particularly in the context of musical scales and color perception. Subsequently, color constancy is analyzed using the octave model, demonstrating that the same complementary color pairs emerge when the brain interprets a physically achromatic object under colored illumination. The perceived hue of the object under colored light corresponds to the complementary of the illuminating light's hue.

Image Presentation and Viewing Instructions

The images presented in this study serve as carefully constructed visual demonstrations of various color perception phenomena. Due to the precision required for accurate depiction of these effects, image compression, re-encoding, or resizing can introduce significant noise and artifacts that compromise the integrity of the demonstrations. Standard image formats, such as JPEG, even at high quality settings, employ compression algorithms that introduce RGB artifacts and unwanted perceptual filtering. These artifacts are particularly problematic for demonstrations requiring precise control of color manipulation, such as those illustrating color constancy.

For optimal viewing and accurate reproduction of the intended effects, it is strongly recommended to access the original, uncompressed image files. These are provided in PNG (16-bit) format for most examples, or as uncompressed bitmaps. While the thumbnail images included in the article are functional for illustrative purposes, they have been resampled and interpolated, potentially introducing subtle but significant alterations that may affect the perception of the intended effects. This is especially critical for the color constancy demonstrations, where the objective neutrality of gray areas is paramount.

For instance, in the color constancy demonstrations focusing on complementary color pairs, the "constancied color" (the color perceived in a region objectively defined as grayscale RGB 127, 127, 127) can be altered by the color filtering inherent in JPEG compression. This can introduce unintended color casts into the nominally gray areas. While the general color constancy effect may still be discernible, the crucial demonstration of a color emerging solely from a region of objective gray due to contextual interpretation (scene, object, and light/material reflection) is compromised. A detailed explanation of this effect and its nuances is provided in the corresponding section.

The stereoscopic images presented in this study employ two techniques: true 3D rendering with depth information and dichoptic presentation of duplicated images to demonstrate the interaction of complementary colors with afterimages and color constancy. All stereoscopic images are designed for cross-eye viewing. While all images can be viewed using VR headsets with appropriate scaling and settings, certain effects and induced artifacts are best observed using the cross-eye technique. Specifically, the cross-eye method is essential for demonstrating the interaction of afterimages in stereoscopic vision, allowing for analytical observation of the residual images along the mixed percept. Similarly, the cross-eye technique is valuable for directly demonstrating the nuances of color conflict resolution and its interaction with other regions of the visual field. For cross-eye viewing, the left image is intended for the right eye, and vice versa.

All images adhere to standardized RGB values for the color attractors (see Table01 for specific values). Even with a perfectly calibrated display, minor individual variations in color perception are expected. These variations may arise from factors such as lens pigmentation, macular pigment density, cone sensitivity shifts, and individual perceptual and subjective differences. Consequently, some observers may perceive slight variations in the precise point of achromatic cancellation in stereoscopic mixing experiments, or subtle differences in afterimage and color constancy complementary mappings. Such individual differences are inherent in color perception research.

(Image.07) - Stereo vision demo. Orange/Yellow snake

Stereoscopic Color Mixing and the Integration of Binocular Information

The integration of binocular information in stereoscopic vision raises important questions about the role of trichromacy and opponent processing at various stages of visual processing, from the initial encoding in the retina and early visual pathways (retinal and post-retinal opponency) to the formation of a unified perceptual experience. The fact that color information undergoes at least one further transformation during stereoscopic processing before reaching conscious awareness suggests that opponent mechanisms may operate at later stages of visual processing. This also highlights the binary nature of color opponency in achieving achromatization (the perception of gray or white).

While opponent processing is well-established in the retina and early visual areas of the brain, the phenomenon of stereoscopic color mixing suggests the possibility of a further stage of opponent processing specifically dedicated to integrating color information from the two eyes. This proposed "final" opponent process could be responsible for the observed cancellation of complementary colors when presented to the two eyes in a stereoscopic configuration. This hypothesis aligns with known mechanisms involved in binocular rivalry and stereoscopic depth perception, both of which require the integration and resolution of potentially conflicting signals from the two eyes. Further research is necessary to fully elucidate the neural basis of this proposed "final" opponent process.

The propagation of color information in the brain, originating from discrete photoreceptors and culminating in continuous image perception, necessitates interpolation of the discrete signals. This interpolation, evident in the filling-in of the blind spot and the perceived continuity of peripheral vision despite decreasing resolution, represents a point where the limits of qualia become apparent, merging with a lack of conscious experience. This interpolation may occur concurrently with or prior to stereoscopic color mixing, which exhibits complementary relationships resembling subtractive models.

This suggests that color mixing occurs prior to the formation of unified qualia but interacts with other phenomena, such as color constancy, in complex and not entirely predictable ways, as will be discussed.

Binocular Rivalry and Resolution

Binocular rivalry occurs when two different images are presented to each eye. The brain is unable to fuse these disparate images into a single coherent percept, resulting in an alternating perception of the two images, with each image intermittently dominating conscious awareness.

This rivalry can be directly observed with colored stimuli. When viewing stereoscopically merged blue and yellow squares, for example, rivalry ensues. However, color alone does not fully account for this conflict. Introducing contextual cues, such as the outline of an object, facilitates fusion and resolves the rivalry. In the landscape example, where one image is tinted blue and the other yellow, stereoscopic viewing successfully merges the images, and the colors are no longer perceived as conflicting, but rather mix, exposing the non-complementary nature of blue and yellow, mixing into green. (The blue-yellow "problem" will be addressed later.)

Demonstrating the Influence of Luminance on Stereoscopic Color Mixing:

The following image pair (Image.01-Conflict and Image.02-Resolution) is designed to isolate the influence of luminance variations on stereoscopic color mixing, specifically addressing the non-complementary mixing of RGB yellow (255, 255, 0) and blue (0, 0, 255). Previous examples, containing more complex luminance information, demonstrated that these colors combine to produce green, rather than the expected achromatic (gray) percept observed with true complementary pairs. This deviation from expected achromatic mixing can be attributed, at least in part, to the influence of varying luminance cues present in those images.

Image-01-Conflict

To directly examine the interaction of saturated yellow and blue patches in the absence of confounding luminance variations, the subsequent image (Image.02-Resolution) is constructed with minimal luminance differences. Shadows and luminance variations are removed, leaving only subtle object edges to facilitate binocular fusion. This contrasts with the first image (Image.01-Conflict), which presents the same saturated yellow and blue patches without any object cues, in which rivalry is more likely to prevent color fusion.

Image-02-Resolution

Image-02b-Stereo Mix Result

Results and Interpretation:

In Image.01-Conflict, the absence of visual cues prevents (in most subjects) binocular fusion, resulting in binocular rivalry – the alternating perception of the yellow and blue patches. However, in Image.02-Resolution, the addition of minimal outline details enables binocular fusion. Critically, despite fusion, the combined percept is green, not gray. This confirms that RGB yellow and blue do not behave as true complementary colors in stereoscopic mixing. Unlike true complementary pairs, such as RGB red (255, 0, 0) and cyan (0, 255, 255), which do achromatize (mix to gray) in stereoscopic vision, yellow requires violet for achromatic mixing, and blue requires orange.

This observed deviation from expected complementary mixing demonstrates a discrepancy between the standard RGB color space and the brain's internalized, neurally represented color space. The brain's internal color space appears to be organized around a more uniform distribution of color attractors and canonical complementaries, which do not perfectly align with the RGB primaries. The use of near-saturated and spectrally extreme stimuli (yellow and blue) highlights this divergence. The results suggest that luminance information plays a significant role in how the brain resolves color conflicts in stereoscopic vision, and that the brain's internal representation of color may be more consistent with a model of truly complementary opponent processes.

(It's important to note that monocular rivalry also exists, where the alternating perception occurs even when only one eye is stimulated with two different images presented in rapid succession. This further emphasizes the brain's role in resolving sensory conflict, which isn't inherent of stereo inputs)

The following stereoscopic images are designed to demonstrate two key aspects: (1) binocular complementaries, defined as those opposed in the logarithmic wheel mapping; and (2) color mixtures exhibiting subtractive-like characteristics. For clarity, these demonstrations focus on pairs of color attractors.

Visual Processing Hierarchy in Stereoscopic Vision

To investigate the precedence, order, and interactions of various visual processes and effects, several stereoscopic images were designed to elucidate the conditions necessary for a unified perceptual experience and to isolate specific perceptual conflicts. The goal was to create stereoscopic image pairs that fuse naturally while introducing controlled conflicts in specific visual attributes. The analysis of these experiments suggests a hierarchical organization of visual processing, where depth information derived from binocular disparity exerts a dominant influence, often resolving conflicts arising from color and luminance information.These findings also enable the creation of images where perceptual conflicts can be induced in otherwise harmonious stereoscopic image pairs.

Summary of Observations:

1. Scene Influence on Color Mixing:

Image (a): Colored Background, Colored Ball: This image depicts a soccer ball positioned against a uniform background. The left eye's view has a blue (0, 0, 255) filter applied to the entire image, while the right eye's view has a yellow (255, 255, 0) filter. The soccer ball is rendered in orange (255, 127, 0) in the right eye's view. When these images are fused stereoscopically, the observer perceives a black and white (achromatic) soccer ball against a green background. The disparity information from the ball, combined with the luminance cues, facilitates stable binocular fusion. The disparate color information from the backgrounds is integrated through stereoscopic color mixing, resulting in the perception of green.
Image (b): Colored Ball, Monochromatic Background: This image uses the same yellow and blue filters applied only to the soccer ball in each eye's view (yellow for the left eye, blue for the right). The background is rendered as monochromatic gray in both views. When these images are fused stereoscopically, the blue-yellow conflict present in the ball is not resolved into green. Despite the ball being the primary focus of attention and providing depth cues, the consistent achromatic information from the background and the matching depth information facilitate binocular fusion. The color mixing "instruction" is likely interpreted as "deliver the color information to qualia as is," preventing the typical blue-yellow mixing seen in other contexts. Depth information continues to dominate the perceptual strategy. To further highlight the binocular color conflict, small blue and yellow patches are introduced in the respective images, positioned so as not to overlap with the ball (top-right). These patches are perceived as floating, distinct colored regions within the 3D scene, demonstrating clear binocular rivalry. In contrast to Image (a), where the same color information was integrated into a unified green percept, these patches remain distinct due to the lack of a global color mixing instruction.
Image (c): Color Inversions: This image pair explores the effects of inverting complementary color filters between the two eyes. The right eye's view features an orange background and a blue ball. The left eye's view inverts these filters, presenting a blue (0, 0, 255) background and an orange (255, 127, 0) ball. When these images are fused stereoscopically, the global color conflict created by the complementary backgrounds is resolved towards a near-achromatic (gray) percept, driven by the depth and luminance information. However, both the ball and the small colored patches (also using the same orange-blue color pair) exhibit pronounced binocular rivalry. This setup demonstrates that the color mixing strategy is determined globally. Despite using the same colors, the intended balance of the global mixing scheme influences the entire visual field. The right eye's view exerts a "push" towards blue, while the left eye's view exerts a "push" towards orange, resulting in the gray background. Crucially, this global influence extends to the local color information as well. The colors of the ball and patches are pushed along in the same direction as their respective backgrounds, amplifying their chromatic contrast and resulting in a more intense perceptual conflict. This amplification manifests as a "brighter" or more saturated rivalry. Close observation of the ball's edges reveals the conflicting colors "bleeding" into the nearby achromatic (gray) grass. This observation directly demonstrates that color interpolation is processed independently of the luminance channel, which retains sharp detail without interference from the color conflict.

Stereoscopic Color Mixing and the Subspace Mixing Strategy

The following images demonstrate a crucial aspect of stereoscopic color mixing: the concept of a subspace mixing strategy/instruction This refers to the phenomenon where, once the visual system identifies a region of hue interaction and determines a resolution for color conflict (using disparity or luminance information), it applies this resolution to the entire subspace. This occurs even if internal inconsistencies remain within the region and regardless of whether some colors are, in fact, identical. (This principle was previously illustrated in the soccer ball examples).

Critically, the hue shifts induced by stereoscopic mixing do not propagate outside each defined region.

Image.07 illustrates this principle. It uses red and cyan as stereoscopic complementaries, applied as filters to an image of a snake. Specific areas within the snake image are designed for additional color mixing demonstrations. When the image is successfully fused via cross-eye viewing, the snake is perceived primarily in grayscale with yellow/orange details. Simultaneously, the original red and cyan images are still visible at the periphery of the fused image. Importantly, the hue shifts necessary for achieving the achromatic (gray) state in the fused region do not extend beyond this region. The observer's perception of the surrounding environment (the room, the computer screen background) remains unaffected.

However, within the defined region of stereoscopic mixing, the mixing instruction does apply globally. Objects or details within the red or cyan filtered areas are "dragged" along by the forces that are uniting the parent colors into the achromatic state.

In this specific example, two additional color mixtures are observable:

Orange: Created by the combination of red areas in the red-filtered image and yellow details in the cyan-filtered image.
Yellow-Green: Created by the combination of orange areas in the red-filtered image and the cyan areas in the cyan-filtered image.

These resulting color mixtures are consistent with predictions based on the spectral octave and the principle of the subspace binocular mixing strategy. The sharp edges of the images within the computer screen window likely define the boundaries of the region to which the mixing instruction is applied. The brain may interpret this as viewing the scene through a window, effectively isolating the stereoscopic mixing effects to the defined area. This subspace is analyzed in depth later.

Out-of-Gamut Color Shifts in Stereoscopic Mixing

This image demonstrates how the subspace mixing strategy in stereoscopic vision can "drag" colors out of the standard color gamut. A stereoscopic image depicting a landscape with true depth information is employed. The right-eye image is filtered with yellow, containing some orange areas, while the left-eye image is filtered with blue-violet. When fused stereoscopically, the landscape is perceived with green trees and near-gray areas (while the rest of the observer's visual field remains unaffected).

Crucially, two red patches, identical in both the left- and right-eye views, are included in the image. These patches are slightly displaced vertically (as opposed to horizontally, which would be interpreted as a depth cue). This vertical displacement ensures that the patches are perceived as separate entities superimposed on the fused, near-gray background.

Upon stereoscopic fusion, these red patches undergo a dramatic transformation. One patch is perceived as a dark, purplish hue, while the other appears as a lighter, orange hue. Neither of these perceived colors corresponds to the original red of the patches. Furthermore, the overlapping regions of the patches create a highly saturated orange, often perceived as being out of gamut. This demonstrates how the mixing strategy can not only shift colors but also push them beyond the boundaries of typical color representation. (The question of the afterimage of this complex, out-of-gamut hue is explored in a later section.)

Stereoscopic color mixing and its interaction with other effects such as color constancy and afterimages provide more details about the nature of color phenomena. Each next section introduces Color Constancy and Afterimages and integrates with binocular color mixing.

Contextual Perception: Tonal and Color Constancy

In music, tonal constancy describes a phenomenon analogous to color constancy in vision. While these phenomena are not necessarily mediated by identical adaptive mechanisms, familiarity may contribute to their enhancement. In music, tonal constancy, as analyzed subsequently, refers to the brain's ability to interpret musical scales with deviating step sizes and/or neutral intervals (from a diatonic perspective) as exhibiting resolved intervals when required by the musical context. This can be illustrated with more extreme examples than the stretched diatonic scales discussed earlier. Given any melody in any tuning system, each note can be subjected to a degree of pitch variation without losing its tonal meaning. A specific interval that functions as a minor third in one chord or cadence may be perceived as a major third in a different context or melodic trajectory. Similarly, other intervals can exhibit a superposition of functional roles. A sharpened second, for instance, may function as a melodic minor third but, when transposed an octave higher and combined with a suitable fifth or seventh, can function as a ninth. Tonal constancy is further elucidated with auditory examples later in this analysis.

Color constancy is related to chroma through the octave color wheel and complementary colors. This visual phenomenon, intimately linked to afterimages, refers to the brain's capacity to interpret the color of objects under varying illumination conditions. The visual system adapts to changes in illumination and takes into account both illumination and material properties to discriminate colors.

As a consequence of color constancy, when an object is illuminated with light of its complementary color, it is perceived as achromatic (gray or white). Conversely, objectively achromatic objects are perceived as tinted with the complementary color of the illuminating light. This effect can be readily demonstrated on computer screens, further confirming the objective nature of gray and its susceptibility to perceptual adaptation.

As previously mentioned, familiarity plays a role in shaping these effects. Research has shown that color constancy is more pronounced when the shape and actual color of the object are known; in the absence of such prior knowledge, the perceived hue is less salient.

(Image.08) The "orange" guitar.

Another factor indicating the active participation of the brain in this effect is the difficulty in simply simulating it. For color constancy to occur, sufficient contextual cues must be present for the brain to interpret a scene, rather than merely an image. This is analogous to binocular rivalry, where contextual cues resolve perceptual conflict. For example, a pure blue image with a small gray square at its center is typically perceived as a blue background with a gray square; the color constancy effect is not elicited by simple color-gray contrast alone. However, a more realistic scene (Image.08) generates a vivid effect, even with a less saturated "simulated" blue light. The image depicts an "orange" classical guitar, which is objectively gray, with the rest of the scene rendered using a pure RGB blue filter (0, 0, 255).

Given the influence of familiarity on the strength of color constancy, the subsequent images employ Rubik's Cubes within a scene. Rubik's Cubes are commonly used in color constancy demonstrations because, while viewers may associate them with color, they do not typically associate them with a single, fixed color. This object provides sufficient cues to establish a "natural" scene and elicit the color constancy effect across various complementary color settings.

(Image.09) Color Constancy and Complementary Colors. This image demonstrates eight configurations of illuminating light and the corresponding perceived complementary color on the gray cube. The effect is sufficiently pronounced that discerning the objectively gray regions of the cubes may require careful observation. Some viewers may even be inclined to download the images to verify that the target areas are indeed gray (RGB 85, 85, 85).

(Image.10)

Image Pair (image.cc-10-11) Demonstrating Color Constancy and the Role of Familiarity

The following image pair (image.cc-10-11) is designed to demonstrate several key aspects of color constancy, including the influence of object recognition and familiarity. While previous research has convincingly shown that color constancy can be enhanced by object recognition and memory, the primary goal of these images is to establish a baseline for subsequent demonstrations. Specifically, these images illustrate that carefully controlled illumination and filtering can produce perceptually compelling color experiences that are solely attributable to color constancy mechanisms, independent of object familiarity. In all these images, tiger's colored fur is objectively the same monochromatic RBG values.

Image cc-10 presents four copies of a tiger art pencil drawing, each at a different luminance level. This image demonstrates how the "constancied" orange hue remains relatively consistent across varying intensities of blue illumination. Critically, the regions perceived as orange in this image are objectively grayscale (RGB 127, 127, 127). While object familiarity (the knowledge that tigers typically have orange fur) might contribute to the perception of orange in this image, it is essential to establish that the effect can be achieved with objectively neutral gray areas.

Image cc-11 explores the limits of familiarity's influence. It presents four variations of the same scene, each with different illumination settings. In these variations, the tigers are perceived as orange, red, yellow, and green, respectively. Remarkably, the corresponding fur areas in each tiger image are also objectively grayscale (RGB 127, 127, 127). The perception of these diverse hues arises because the brain interprets the varying illumination as realistic, and color constancy mechanisms then generate the corresponding complementary colors. In essence, we "trust" our interpretation of the light source more than our prior knowledge that tigers are not typically green, yellow, or red. More concretely, the opponent processing cells likely respond to the blue light across the entire region, leading to the emergence of the complementary orange, red, magenta, yellow or green sensation within the grayscale areas.

Constancy and Stereoscopic Mixing

The color constancy effect can be predictably manipulated and mixed stereoscopically. This complex scenario is illustrated in images [cc09-10-11].

Two identical grayscale tiger images are used, each filtered with a different color: cyan for one and violet for the other. The cyan-filtered image elicits a reddish percept of the tiger due to color constancy mechanisms operating on the objectively gray areas. Conversely, the violet-filtered image elicits a yellowish percept. These induced colors are perceived as "normal" due to the brain's compensation for the filtering.

Upon binocular fusion (using the cross-eye technique), the background, now perceived as a subtle blue light, appears nearly achromatic (gray) compared to the saturated orange percept of the tiger. Remarkably, in this arrangement, where perceived colors are effectively "rebuilt" through constancy and stereoscopic vision, the only areas lacking direct monochromatic information in both eyes (the objectively gray areas) are the only areas exhibiting color after binocular fusion (the orange percept). A diagram [cc11] below the images illustrates this color mixing process.

Diagram [cc11] Labels:

M = Monocular: Indicates information presented to a single eye.
B = Binocular: Indicates information resulting from binocular fusion.
L = Left: Refers to information presented to the left eye.
R = Right: Refers to information presented to the right eye.
SDCI = Subspace Dominant Chromatic Information: Represents the chromatic information presented to each eye before constancy effects.
CDCI = Constancy Driven Chromatic Information: Represents the chromatic information resulting from the color constancy mechanism (the "constancied" colors).

The strength of the color constancy effect in this demonstration is notable. All tiger images are objectively grayscale. The first image [cc09] is designed for cross-eye viewing, and image [cc10] illustrates the resulting percept. The subtle saturation difference in the blue background is sufficient to elicit a strong orange percept in the tiger, demonstrating the robustness of color constancy.

The CDCI (Constancy Driven Chromatic Information) determined for each SDCI (Subspace Dominant Chromatic Information) can be mixed stereoscopically, validating its function as genuine chromatic information that influences perception.

The binocular chromatic fusion strategy is determined for each SDCI. The visual field can contain multiple SDCI(object detection of stereoscopic images on the screen). When fusing images using the cross-eye technique, colors mix predictably, while the surrounding visual field (level 0) remains unaffected. Each object's size and salience influence the fusion strategy within its respective SDCI. This means that nested images inherit, rather than create, their fusion strategies from their superspace. Consequently, while multiple simultaneous stereo images can be mixed independently, color conflicts within nested images are not resolved independently and are "dragged" by the fusion strategy of their superspace. Even with identical images that don't inherently require "mixing," color conflicts can still occur, as demonstrated in subsequent examples.

Image Series (image.cc-[12-19]): Color Constancy Demonstrations with an Electric Guitar

This series of eight images utilizes an electric guitar to demonstrate color constancy under varying color attractor illumination settings. These demonstrations highlight the complementary relationships between illumination and perceived color, as well as the varying salience of different "constancied" hues. The images also illustrate how the perception of these hues, derived from objectively grayscale regions, can be influenced by the surrounding color context.

The images were carefully adjusted to equalize the average salience of the perceived colors. It is observed that some hues are more readily elucidated (perceived as saturated) than others. For example, "constancied" cyan is typically perceived as more salient than "constancied" red. This difference in salience mirrors the ranking of afterimage hues, where red afterimages are often the least vivid. One possible explanation for this phenomenon is the larger size of the neural "blobs" representing red in V1 compared to other canonical hues. If afterimage and color constancy are active, feedback-driven processes involving V4 and subsequent visual areas looping back to V1, the weaker signal reaching the initial loop stage for red (due to the larger V1 blob size) might explain its lower salience. However, it is likely that multiple factors contribute to this difference. It is important to note that afterimages produced with natural pigments and daylight exhibit higher saturation for red, suggesting that the lower saturation observed with RGB displays might be a limitation of the display technology or the additive color mixing process.

Each guitar image features two grayscale areas, one slightly darker than the other, with RGB values near (100, 100, 100) and (150, 150, 150), respectively. These areas contain subtle variations in gray details and shadows. The rest of the guitar is "illuminated" with different colored lights, inducing the perception of the corresponding "constancied" color within the grayscale regions.

Adjacent to the guitar is a circular arrangement of eight guitar picks, each representing a canonical color attractor: red, orange, yellow, green, cyan, blue, violet, and magenta. These picks serve two purposes. First, they demonstrate the interaction of the colored illumination with other hues beyond the grayscale areas. Second, and more importantly, one of the picks in the arrangement is also gray, matching the grayscale areas on the guitar. This gray pick, along with the spectrally ordered arrangement of the other picks, resolves potential ambiguities in hue perception.

For example, an isolated guitar under blue (0, 0, 255) illumination might be perceived as either orange or yellow. However, the presence of the gray pick and the surrounding picks (yellow and red) clarifies that the guitar's "constancied" hue falls between yellow and red, confirming it as orange. Similarly, a guitar under yellow light might be reported as either violet or blue; the presence of the gray pick and the surrounding picks (blue and magenta) helps the observer correctly categorize the "constancied" hue as violet/purple. Constructing these images is analogous to performing a spectral ordering task with "imaginary" hues, a conceptually challenging but revealing process.

Tonal Constancy

This auditory phenomenon shares conceptual similarities with color constancy, although the underlying mechanisms and models differ. Analogously, if one considers a set of pitches (musical chromas), such as the diatonic scale, as analogous to a set of colors, these pitches can be substantially shifted and retuned without losing their tonal meaning, just as a set of colors can remain identifiable under varying illumination conditions. The stretching or alteration of notes can be considered analogous to different "illuminations" of the set of musical chromas, which nonetheless retain their identifiable relationships.

As discussed previously, a key difference exists between musical and visual chromas. The color of light can be perceived based on a single frequency or wavelength. In music, however, chroma is relative; a single isolated note does not possess an inherent "color" but acquires a contextual chroma within a chord or melody. In this sense, the color spectrum functions as a torsor relative to sound. Once a note is incorporated into a harmonic or melodic context, its role and degree are defined by its relative chroma. Each note, therefore, possesses multiple chromas relative to the other notes within the musical context. Consequently, different intervallic configurations of the diatonic scale not only remain functionally viable but can also generate additional chromas through transposition. There is no single "yellow" in this analogy; there are multiple "roots," each with its own set of relative chromas.

In general, tonal constancy refers to the brain's tendency to interpret musical intervals and progressions within a tonal context, even when the actual intervals deviate from standard tunings. A clear demonstration can be provided using 7-tone equal division of the octave (7-EDO), a tuning system in which no interval perfectly corresponds to those of 12-tone equal temperament.

Audio Examples

The following audio examples present melodies in both 7-EDO and their 12-tone equivalents. Despite the equal step sizes in 7-EDO, the melodies evoke a sense of familiar tonal functions. For example, the second step in 7-EDO, a neutral third at 342 cents (approximately halfway between a major and minor third), is often perceived as having either a "major" or "minor" quality depending on the surrounding musical context, such as the implied harmony or the melodic contour. This effect, which persists even with pure sine waves (thereby eliminating harmonic artifacts), demonstrates tonal constancy: the listener's brain interprets the neutral intervals within a tonal framework, resolving them into functionally familiar pitches. When the same progression is rendered in 12-tone equal temperament, the listener/performer naturally resolves each step into the "correct" functional pitch to satisfy the implied cadence.

Ξ Example A - 7edo

Ξ Example A - 12edo

Ξ Example B - 7edo

Ξ Example B - 12edo

(Further examples exploring this phenomenon in other tuning systems, including more complex modulations, can be found on my YouTube channel.)

(Image.11) This geometric visualization compares 7-EDO with the diatonic scale in 12-tone equal temperament on a logarithmic scale. Transposition of the 7-EDO structure yields identical intervallic relationships, whereas transposition of the diatonic scale reveals the seven familiar modes of 12-tone music.

This phenomenon raises questions regarding the limits of tonal functions. How much can these intervals be shifted or stretched before they lose their tonal meaning? This is a complex question involving individual perceptual variations and the continuous nature of pitch space. Color, chroma, and hue play a central role in exploring this question within the visual domain.

While the examples demonstrate how a single interval can serve different functions depending on context—particularly within melodic sequences or trajectories—two additional factors warrant consideration. First, familiarity, while difficult to define and quantify precisely, intuitively influences perception and reinforces tonal constancy. Second, the non-Euclidean nature of pitch space, where the cumulative perception of small intervals can lead to an wrong estimation of the total perceived distance, contributes to the effect. These factors, combined with the influence of surrounding notes on the perception of otherwise neutral intervals, provide a comprehensive explanation for tonal constancy.

Sense Adaptation and Aftereffects:

Both vision and audition exhibit phenomena related to sensory adaptation, termed aftereffects. In vision, this is known as the afterimage, while in audition, it is the aftersound. Although both involve sensory adaptation, their underlying mechanisms differ significantly.

Auditory Aftereffects (Aftersounds):

Auditory aftereffects manifest as the perception of a residual pitch following exposure to broadband noise with a rejected frequency band. The perceived pitch corresponds to the logarithmic center of the rejected band. This phenomenon is distinct from visual afterimages and will not be discussed further in this section, which focuses exclusively on visual afterimages.

Visual Aftereffects (Afterimages):

Visual afterimages are perceptual phenomena in which a residual color sensation persists after the removal of an initial stimulus (the inducer). While the initial trigger is attributed to temporal adaptation of photoreceptor cells (cone fatigue), the phenomenon is more complex than a simple depletion of cone sensitivity.

Complexity of Afterimage Perception:

Several observations highlight the complexity of afterimage perception:

Temporal Integration: The perceived afterimage hue is determined by the integrated exposure time to the inducer, even if the inducer color changes rapidly. For instance, a stimulus alternating rapidly between dark red and yellow will produce a similar afterimage to a spatially mixed red-yellow stimulus, provided the total exposure times are equal. This demonstrates a temporal integration process in adaptation.
Edge and Object Influence: Afterimages can be perceived in regions where the inducing color was not directly present. For example, if a sharp-edged shape (e.g., a star) is presented with only its corners tinted red, the afterimage will encompass the entire shape, including the gray central area. This suggests that post-retinal processing, including edge and object detection, influences the perceived afterimage. This influence occurs pre-stereoscopically, as evidenced by the persistence of individual afterimages in each eye during stereoscopically fused images (as demonstrated in previous sections).

These observations indicate that while cone fatigue initiates afterimage formation, subsequent visual processing stages modulate their final appearance.

Types of Negative Afterimages:

The common conception of afterimages as simple RGB complementary colors is an oversimplification. Modern research has revealed more nuanced relationships between inducer and afterimage hues. It is crucial to distinguish between two types of negative afterimages:

Instant Afterimages: These appear immediately upon removal of the inducer.
Delayed/Conflict Afterimages: These require longer exposure times and are best observed in a dark environment. They are often described as "negative images" that exhibit oscillation (with a frequency similar to binocular rivalry) and gradual fading. When multiple inducer colors are used sequentially, delayed afterimages can appear sequentially as well.

Early studies primarily focused on instant afterimages, attempting to explain them solely through cone fatigue and a simple subtraction of the inducer color from the background. These explanations failed to account for delayed afterimages, which are a common experience (e.g., the afterimage seen after staring at a bright light and then closing the eyes). Delayed afterimages are more readily elicited with natural pigments and daylight and are optimally observed in complete darkness (by covering the eyes).

The Octave Hue Wheel and Afterimage Prediction:

While empirical data on inducer-afterimage hue relationships exhibit some variation across studies (due to individual differences and methodological variations), a consistent pattern emerges when compared to the logarithmic octave hue wheel proposed in this study. This wheel demonstrates remarkable predictive power for afterimage hues.

Specific Complementary Pairs:

Red and Cyan: Red and cyan are conventionally considered complementary and exhibit reciprocal afterimage relationships. Empirical studies consistently report afterimages within the red and cyan bands, corresponding to their directly opposing positions on both the sRGB color wheel and the logarithmic octave hue wheel.
Green and Magenta: Green and magenta also exhibit reciprocal afterimage relationships, with afterimages consistently falling within their respective color bands. These colors also occupy opposing positions on both color representations.
Blue and Yellow: Blue and yellow present a crucial deviation from simple reciprocity. Blue induces an orange afterimage, while yellow induces a purple afterimage. Conversely, orange induces a blue afterimage, and purple induces a yellow afterimage. This non-reciprocal relationship is accurately predicted by the logarithmic octave hue wheel, where blue is opposite orange and yellow is opposite purple, unlike the sRGB wheel, where blue and yellow are directly opposed.

This accurate prediction of afterimage relationships provides strong support for the validity of the logarithmic octave hue wheel and its underlying logarithmic representation of the visible spectrum.

GIF Animations (Gif.01-02): Edge Detection and Afterimage Perception

The following GIF animations (Gif.01-02) demonstrate the influence of edge and object detection on afterimage perception. Each animation, consisting of three frames played in a continuous loop, comprises a 2-second inducer presentation followed by 1-second presentations of two different edge guides. These animations illustrate how afterimages are selectively perceived only where edges are present, with colors disappearing in areas lacking defined boundaries. This observation strongly supports the active construction of afterimage perception, suggesting that while cone fatigue may play a role in triggering the effect, higher-level visual processing, including edge detection, is crucial in shaping the afterimage's appearance. Gif.01 utilizes the classic example of two superimposed four-point stars, while Gif.02 employs differently colored circles.

Analysis of Afterimage Research Data:

Analysis of existing afterimage research data reveals evidence of a color organization consistent with the octave hue model proposed in this study.

Reciprocity and Stereoscopic Confirmation:

Numerous studies report a statistical reciprocity between inducer and afterimage hues, often modeled by iterative functions that map inducer hues to their perceptual inverses. This reciprocity is also qualitatively evident in stereoscopic color vision. For instance, when two complementary colors (e.g., red and cyan, precisely determined for a given individual using minimal intensity adjustments to achieve perceptual achromatization) are presented dichoptically (one color per eye), the resulting afterimages are also complementary. The fused, binocular afterimage appears achromatic, demonstrating mutual cancellation of the opponent afterimage hues.

Conversely, when colors are fused stereoscopically to create a chromatic mixture (rather than complementary cancellation), the afterimage reflects both the fused color and the individual afterimage components. For example, if blue and yellow are fused to create a perceived green, the subsequent afterimage will be magenta/purple. This resulting color percept is conceptually "doubly justified": it arises both as the afterimage of the fused green percept and as a stereoscopic combination of the individual afterimages induced by each eye (blue inducing orange, yellow inducing violet). The stereoscopic fusion of these individual afterimages (orange and violet) results in the perceived magenta afterimage. The following image is designed to illustrate these phenomena. (Viewing instructions are provided at the beginning of the stereoscopic vision section.)

Deviation from RGB Complementarity and Hue Distribution:

However, a more significant pattern emerges from the distribution of afterimage hues, specifically a clear deviation from simple RGB complementarity. Studies employing 24 uniformly spaced inducer samples from an RGB color space, including pure RGB and CMY primaries, reveal a non-uniform distribution of afterimage hues.

These studies identify three distinct clusters of inducers, each mapping to a disproportionately large region of afterimage hues compared to their corresponding region on the RGB color wheel. For example, a relatively small 20° region of green in the inducer set can map to nearly 100° of red and magenta hues in the afterimage set. Other clusters, while smaller in angular extent, exhibit the same "stretching" or non-linear mapping.

Connection to the Octave Hue Model:

This non-linear mapping demonstrates that the RGB color wheel does not accurately represent the relationship between inducer and afterimage hues. If the afterimage mapping is "unstretched" to achieve a more uniform distribution of hues, the resulting arrangement closely resembles the spectrum generated by the octave hue method proposed in this study. Specifically, this "unstretching" reveals the key non-reciprocal relationships predicted by the octave model, such as blue mapping to orange and yellow mapping to violet/purple, which are not directly opposite on the RGB wheel.

Afterimages and Stereoscopic Mixing: Evidence for a Shared Mechanism

These experiments explore the interaction between afterimages and stereoscopic mixing, revealing a direct relationship between these two phenomena and providing further support for the proposed model of color perception. The observation that afterimage hues interact stereoscopically in the same manner as colors in stereoscopic mixing experiments strongly suggests a shared underlying mechanism.

Reciprocity of Afterimages and Dichoptic Complementaries

The reciprocity of afterimages and dichoptic complementaries, where an inducer hue triggers its complementary afterimage and vice versa, has been previously discussed and is well-supported by modern research. While minor discrepancies in hue matching may arise due to sampling methods or the use of median sample values, the general reciprocal relationship is clear. Furthermore, even when perfect reciprocity is not observed, the afterimage hue typically falls within the expected color category, representing a minor variation rather than a fundamental deviation. Stereoscopic mixing experiments provide even more direct evidence for this reciprocity.

Stereoscopic Mixing of Afterimages

The following series of images is designed to minimize luminance and depth cues to isolate the interaction between afterimages and stereoscopic mixing. Minimizing these cues helps to avoid binocular rivalry and allows for a clearer observation of the color mixing effects.

Image [] employs red and cyan filters for the right and left eyes, respectively. Upon stereoscopic fusion, the image is perceived as gray, with the original red and cyan images still visible at the periphery. After adaptation (inducing afterimages) and subsequent replacement of the red/cyan images with grayscale or outlined versions (while maintaining cross-eye viewing), the central fused image remains gray. This suggests that the afterimages, like the original colors, are also subject to stereoscopic cancellation.

This is confirmed by the following observation: Slight head movements while fixating on the fused image, or covering one eye, reveals that each eye has its own corresponding afterimage (cyan for the red-adapted eye, red for the cyan-adapted eye). These afterimages, generated before the point of binocular combination, are then mixed stereoscopically, just as real chromatic information would be.

Stereoscopic Mixing of Afterimages: Complex Interactions

The interaction becomes more complex when considering other color mixtures. The selection method for color conflict resolution can influence the outcome, potentially "dragging" in other colors that would normally be expected to cancel out or produce a predictable mix. However, this is not always the case, revealing a clear hierarchy in the operation of these effects.

The green percept created by the stereoscopic mix of blue (0, 0, 255) and yellow (255, 255, 0) was previously demonstrated with the inclusion of orange areas to provide an achromatic reference. In the following image, the minimal luminance information simplifies the interaction.

This blue-yellow stereoscopic pair produces a cyan-green percept. After adaptation (inducing afterimages), the blue stimulus generates an orange afterimage, and the yellow stimulus generates a violet afterimage. The stereoscopic mix of these afterimages results in a red-magenta percept. This doubly justified color (red-magenta) is also the afterimage of the cyan-green percept, demonstrating an algebraic-like nature of hue interaction. (A more detailed exploration of this concept is presented later in this study.) As an initial analogy, consider a unit group where each element has an inverse. Multiplying an element by its inverse results in the identity. Multiplying other pairs results in a different element, whose inverse is the result of multiplying the inverses of the initial pair. Hue interactions in the brain appear to follow a similar cyclic multiplication scheme.

Complementary Colors and the Central Role of Green:

As previously discussed, the symmetrical arrangement of color attractors on the logarithmic octave hue wheel allows for a degree of predictability. Knowing the positions of red, orange, and yellow for a given individual allows for the prediction of cyan, blue, and violet by locating the points directly opposite them. This corresponds to a wavelength ratio of approximately 1:√2 (or √2:1 when mapping back into the visible range).

However, this raises a crucial question: can any color attractor be predicted independently, without relying on the positions of other hues? Green emerges as a strong candidate.

Green as the Logarithmic Center:

Green's position on the octave hue wheel is defined by the logarithmic midpoint of the visible spectrum. Because green lies opposite the non-spectral color magenta, its location is effectively determined by the extreme edges of the visible spectrum. In other words, the perceived limits of the visible spectrum, where red and violet are no longer perceived, define the central point of green. This makes green's position independent of other hue positions within the spectrum.

Empirical Support for Green's Central Role:

This independent predictability of green is supported by empirical observations. The perceived transitions between green and its adjacent hues (yellowish-green and bluish-green) often correspond closely to the perceived limits of the individual's visible spectrum—the points where red and violet disappear.

Predicting Spectral Limits from Green:

This relationship can be demonstrated using approximate wavelength ranges for Newton's "principal" hues (which, as discussed earlier, are modern interpretations of his qualitative descriptions). Taking the approximate range for green as 500-570 nm, red band limit as 700nm and violet band limit as 400nm:

Multiplying the short wavelength edge of green (500 nm) by √2 (approximately 1.414) yields approximately 707 nm, close to the perceived limit of red.
Dividing the long wavelength edge of green (570 nm) by √2 (or multiplying by 1/√2, approximately 0.707) yields approximately 403 nm, very close to the perceived limit of violet.

This demonstrates that the perceived edges of the green band can effectively predict the perceived limits of the visible spectrum, reinforcing its central and independently defined position on the hue wheel.

(draft)

The Logical Foundation of the Color Continuum

The structure of distinct color appearances can be logically deduced from fundamental axioms defining color as a phenomenon, specifically cyclicity-continuity and achromatism-symmetry. This deduction is independent of the physical, biological, or perceptual specifics of light and vision, focusing on the abstract essence of color differentiation. Color is envisioned as a foundational, primal space, manifesting across various realms of reality—from abstract color spaces to physical, biological, neurological, subjective, and perceptual layers. These layers represent transformational points in color information processing, forming a continuum where boundaries inevitably blur. This abstract definition allows for color to exist even without strictly defined, discrete layers of manifestation.

The model focuses on the hue dimension of color, not directly addressing the potential number of "unique" components but together with the limits of their distinctiveness. It analyzes the potentially discrete hue categories within the continuum and argues against the notion of "new colors," which is often associated with tetrachromacy—a point that will be elaborated upon further.

Within this continuous phenomenon, we naturally categorize regions, assigning discrete names as change along the continuum becomes perceptually significant. This categorization is influenced by perceptual thresholds and evolutionary pressures, highlighting certain "attractor" hues as reference points, like red, green, and blue (see first section for full definition of color attractor). We name other appearances as mixtures, such as yellowish-orange. This leads to the intuitive, yet ultimately misleading, concept of "unique" or "primary" colors. In reality, colors are not truly irreducible; they arise from transformations and mixtures within the continuous underlying space.

To clarify, we must refine the notion of "unique hues," distinguishing it from the idea of color attractors. The common definition of a unique hue as "without any tint of another" is subjective and scientifically weak.Instead, a relational definition of co-unique hues is proposed.

Definition of Co-Unique Hues: A pair of hues (A and A⁻¹) are co-unique if and only if neither hue contributes to the perceptual experience of the other, representing opposing tendencies relative to the neutral point of achromatism. This definition is independent of qualia, physical properties, spectral location, and naming conventions, focusing on their fundamental relational opposition.

Fundamental Axioms of Color

Continuous and Cylic: Hues transition smoothly and cyclically into one another, forming a continuous spectrum without discontinuities or gaps. This cyclical nature is essential to the concept of hue.
Achromatism (Complementarity): Colors exist in at least complementary pairs, co-unique hues (A and A⁻¹) that, when combined, produce a neutral achromatic sensation (grey). This process is termed achromatization and is fundamental to color perception.

---

The following mathematical definitions, while implying algebraic characteristics and a group-like structure, are intended to be concise and focus on the essential properties of the hue dimension.

Concepts & Definitions

- \( H \): The set of hues (points in hue space)

\( H \cong S^1 \), a compact, cyclic continuum isomorphic to the unit circle.

- \( N \): The achromatic state

- \( \ast \): Abstract binary mixing operation

The 'mix' operation represents how hues perceptually combine or interact, not necessarily a direct physical or computational process. It encompasses additive, subtractive, and, fundamentally, stereoscopic color mixing, emphasizing the binary nature of hue interaction.

For hues \( h_1, h_2 \in H \):

\[
\ast(h_1, h_2) =
\begin{cases}
N, & \text{if } \Delta \theta = \pi \text{ (complementary)} \\
h_1 + \frac{\Delta \theta}{2} \mod 2\pi, & \text{otherwise}
\end{cases}
\]
where \( \Delta \theta = \min(|h_2 - h_1|, 2\pi - |h_2 - h_1|) \).

- \( d \): A perceptual distance metric \( d: H \times H \to \mathbb{R} \)

- Core Axioms

1. Cyclicity:
\( \forall h_1, h_2 \in H, \, h_1 \ast h_2 \in H \cup \{N\} \)
Mixing is closed and cyclic.

2. Continuity:
\( \forall h_1, h_2 \in H, \, \exists f_1, f_2: [0, 1] \to H \text{ such that } f_1(0) = f_2(0) = h_1, f_1(1) = f_2(1) = h_2 \)
Two disctinct continuous path connects any two hues.

3. Co-Uniqueness:
\( \forall A \in H, \, \exists A^{-1} \in H : d(A, A^{-1}) = \max \)
Every hue has a maximally distant complement.

4. Achromatism:
\( \forall A \in H, \, A \ast A^{-1} = N \iff \Delta \theta = \pi \)
Mixing complements yields achromaticity.

5. Symmetry:
\( \forall A, B \in H, \, d(A, B) = d(B, A) = \min(\Delta \theta, 2\pi - \Delta \theta) \)
Perceptual distance is symmetric and respects path equivalence.

Derivation of the Four-Hue Structure (Theorem):
\[
\exists A, B \in H : A \neq B \land A^{-1} \neq B^{-1} \land A^{-1} \neq B
\]
At least four fundamental hues are required to satisfy continuity and achromatism.

Proof Sketch:

1. Existence of a Co-Unique Pair: By Axiom 3, let \( A \) and \( A^{-1} \) exist.
2. Achromatic Discontinuity: \( A \ast A^{-1} = N \), creating a perceptual gap.
3. Bridging the Gap: Continuity (Axiom 2) necessitates intermediate hues \( B \) and \( B^{-1} \).
4. Symmetry & Closure: Placement of \( B \) and \( B^{-1} \) equidistant from \( A/A^{-1} \) restores cyclicity.

Result:
A closed, continuous cycle \( A \rightarrow B \rightarrow A^{-1} \rightarrow B^{-1} \rightarrow A \).

Emergent Properties

- Transitional Hues: Midpoints between fundamentals (e.g., \( AB \)) arise from mixtures.
- Perceptual Categorization: Evolutionary and perceptual thresholds create "attractor" hues (e.g., red, green).
- Illusion of Primary Colors: "Unique hues" are relational, not absolute, emerging from symmetry.

-------------

Distinctiveness Decay Theorem

In a continuous, cyclic hue space \( H \cong S^1 \), recursive subdivision of the hue continuum yields hues with exponentially diminishing perceptual distinctiveness. Specifically, after \( n \) subdivisions, the distinctiveness \( D(n) \) of new hues relative to their neighbors is given by:

\[
D(n) = \frac{D_{\text{max}}}{2^n}
\]
where \( D_{\text{max}} = \pi \) is the maximal distinctiveness in \( H \).

Proof Sketch

1. Base Case: For \( n = 0 \), the co-unique pair \( A \) and \( A^{-1} \) has \( D(0) = D_{\text{max}} = \pi \).
2. Inductive Hypothesis: Assume that after \( k \) subdivisions, the distinctiveness of new hues is \( D(k) = \frac{D_{\text{max}}}{2^k} \).
3.Inductive Step: For \( k + 1 \) subdivisions, each new hue is inserted at the midpoint of an interval with distinctiveness \( D(k) \). By definition, the new distinctiveness is:
\[
D(k+1) = \frac{D(k)}{2} = \frac{D_{\text{max}}}{2^{k+1}}
\]
4. Conclusion: By induction, \( D(n) = \frac{D_{\text{max}}}{2^n} \) holds for all \( n \geq 0 \).

Base Case

- Initial Pair: Let \( A \) and \( A^{-1} \) be a co-unique pair of hues, with \( d(A, A^{-1}) = D_{\text{max}} = \pi \).
- Distinctiveness: \( A \) and \( A^{-1} \) are maximally distinct (100% different).

Inductive Step
- First Subdivision: Insert hue \( B \) at the midpoint between \( A \) and \( A^{-1} \).
- By definition, \( d(A, B) = d(B, A^{-1}) = \frac{D_{\text{max}}}{2} = \frac{\pi}{2} \).
- Distinctiveness: \( B \) is 50% distinct from \( A \) and \( A^{-1} \).
- Second Subdivision: Insert hues \( C \) and \( D \) at the midpoints between \( A \) and \( B \), and \( B \) and \( A^{-1} \), respectively.
- \( d(A, C) = d(C, B) = d(B, D) = d(D, A^{-1}) = \frac{D_{\text{max}}}{4} = \frac{\pi}{4} \).
- Distinctiveness: \( C \) and \( D \) are 25% distinct from their neighbors.
- General Case: After \( n \) subdivisions, the distinctiveness of new hues is:
\[
D(n) = \frac{D_{\text{max}}}{2^n} = \frac{\pi}{2^n}
\]
- Distinctiveness Decay: Each subdivision halves the distinctiveness of new hues relative to their neighbors.

Implications

- Limits of Distinctiveness: As \( n \to \infty \), \( D(n) \to 0 \). New hues become indistinguishable from their neighbors.
- No New Categories: Subdivision refines resolution but does not create fundamentally new distinct hues.
- Tetrachromacy: Even with additional photoreceptors, the theorem predicts diminishing returns in perceptual distinctiveness.

Interpretation

Necessity of a Four-Hue Foundation:

Four-Component Basis: Due to the inherent symmetry and continuity of hue space \(H\), it can be logically decomposed into four quadrants. This decomposition is naturally defined by the four fundamental hues \( {A, A^{-1}, B, B^{-1}} \), strategically positioned to ensure continuity and avoid any discontinuity( the neutral state\(N\)

The principle of achromatism necessitates at least one co-unique pair (A and A⁻¹). A rudimentary two-hue spectrum (A – N – A⁻¹) emerges, but this structure violates continuity due to the achromatic point creating a discontinuity between A and A⁻¹. To restore continuity, at least one additional co-unique pair (B and B⁻¹) is required, positioned to bridge the achromatic gap and, crucially, create a balanced and symmetrical structure within the hue circle. This logically necessitates a minimum four-hue structure: A – B – A⁻¹ – B⁻¹ – A, forming a closed, continuous cycle.

For continuity, B and B⁻¹ cannot be placed arbitrarily. They must "bridge" the gap between A and A⁻¹. The most symmetrical and continuous way to do this is to place B and B⁻¹ such that they are also maximally distant from each other and equidistant from A and A⁻¹. This creates a balanced and symmetrical structure.

Emergence of Transitional Hues:

The transitions between these four fundamental hues are continuous, generating further hue categories. Points of equal contribution from adjacent fundamental hues (midpoints in the continuum) represent distinct transitional hues (e.g., between A and B). These transitional hues are doubly justified: they arise from the mixture of two adjacent fundamental hues (e.g., AB) and as complements of the opposite mixture (e.g., A⁻¹B⁻¹), reinforcing the symmetrical and relational nature of color space.

Logical Limit to "Most Distinct Hues":

Starting with any co-unique pair and applying continuity and achromatism inevitably leads to this four-fundamental-hue structure, subsequently populated by transitional hues to fill the continuum. Further subdivision within this continuum yields finer variations of these categories, not fundamentally "new" distinct hues. Imagine a cycle where co-unique pairs represent maximal perceptual difference (100%). The maximum difference from one hue to another within this cycle is limited. The second co-unique pair (B/B⁻¹) can only be maximally 50% distinct from the first (A/A⁻¹). Recursive subdivision creates hues of diminishing distinctiveness (25%, 12.5%, etc.), highlighting that the continuum is filled by variations, not an endless proliferation of fundamentally new categories.

Logical Implications and the Four-Component Model

This framework logically predicts the existence of four fundamentally different unique hues as a necessary condition for a cyclic and continuous spectrum of appearances. This prediction is independent of specific color spaces or physical implementations and provides a fundamental, axiomatic explanation for the discrete nature of our core color categories. Starting with any arbitrary co-unique pair will inevitably generate the same fundamental four-hue structure to satisfy achromatism and continuity. These relationally defined hues form the basis of our conventional color categories, one iteration or subdivision from the Distinctiveness Decay Theorem renders, roughly, as color attractor names: red, orange, yellow, green, cyan, blue, violet and magenta.

This model aims to explain why we perceive the hues we do, not just how we represent them. The model logically derives the structure of the hue cycle from first principles, therefore, the notion of independently arising "new hues" becomes unnecessary within this framework. The perceived color space is thus not an arbitrary construct, but a logically constrained interdependent system.

-------------

Practical Implications and Limitations

It is important to consider that the presented model, particularly the S¹ hue wheel representation and the logical framework derived from it, describes an idealized case. This ideal assumes a perfectly uniform perceptual color space and a hypothetical observer with maximally fine just-noticeable differences (JNDs). Even under such idealized conditions, when presented with a cyclical continuum of uniformly distributed, yet initially unknown, color components, the model predicts a maximum perceptual distance between co-unique hues and the fundamental logical structure remains consistent.

Consider a scenario where the physical spectrum is drastically non-uniform, for example, heavily skewed towards yellow, with only a minimal representation of other colors and transitions (e.g., 99% yellow and 1% encompassing all other spectral appearances). While a linear mapping of such a spectrum to the hue wheel would distort the simple angular and symmetry relationships, the core principle of complementarity remains valid perceptually. Every hue point, regardless of its physical prevalence in the spectrum, will still possess a complementary, or 'neutralizer,' point that, when appropriately combined, leads to an achromatic sensation. This fundamental symmetry, the existence of complementary pairs, does not necessitate a physically uniform spectrum; the maximal perceptual distance between co-unique hues is a perceptual, not a strictly physical, attribute.

Furthermore, when considering real-world color mixing, such as with paints, the idealized hue dimension must be understood in the context of saturation and lightness. Imagine mixing spectrally narrow, 'pure' colors like a highly saturated red and a highly saturated green paint. While the idealized S¹ hue wheel might predict an orange or yellowish hue as the midpoint, in practice, paint mixing often yields a darker, desaturated orange-brown. This deviation arises because real-world paints introduce changes in saturation and lightness, dimensions not explicitly represented in the simplified S¹ hue wheel. However, the hue component of the mixture is still directionally consistent with the model's prediction: the mixture is indeed perceptually intermediate between red and green in hue, even if modified by other color dimensions.

Even in cases where visual intuition might be misleading, such as mixing a slightly orange-leaning red and a slightly green-leaning cyan paint, the S¹ model remains analytically valuable. While a cursory visual assessment of the paint mixture might suggest a near-achromatic gray, spectral analysis could reveal a subtle, yet measurable, spectral distribution peak corresponding to the yellowish hue predicted by the S¹ model for the idealized mix. This highlights that while real-world color mixing is complex and multi-dimensional, the hue dimension, as structured by the S¹ model and the logical framework, provides a consistent and accurate underlying principle for understanding color relationships, even when those relationships are not immediately apparent in everyday color experiences.

Co-Uniques(Complementaries) as generating sets

Deconstructing the "New Color" Misconception in Color Vision Research

The notion of "new colors" or "unique hues" frequently emerges in discussions about tetrachromacy and animal color vision, often lacking precise definitions and rigorous justification. This ambiguity fosters misinterpretations, particularly regarding the perceptual experiences of organisms with more than three cone types, leading to overstated claims. This section argues that the concept of discrete, standalone "new hues" is fundamentally incompatible with a logically consistent understanding of color perception, which is grounded in continuity, complementarity (achromatism), and relational structure.

The Ambiguity of "Unique Hues" and "New Colors"

A primary issue is the absence of a clear, universally accepted scientific definition for "unique hue." The traditional subjective definition – a color "without any tint of another" – lacks scientific rigor and contributes to overinterpretations, especially in studies of animal and human tetrachromatic vision. A more robust, relational approach defines "co-unique hues" based on mutual cancellation: two hues are co-unique if neither contributes to the perceptual experience of the other, representing opposing tendencies towards achromatic neutrality. The very idea of a "new color" or "new hue" is problematic as it often implies entirely discrete, qualitatively different sensations outside of human experience. Without a precise definition of "hue" anchored to physical, neural, and abstract color models, claims of "new colors" are difficult to validate and can be misleading.

The Flawed Analogy of Additional Cones and the Dichromat-Trichromat Transition

A common, yet flawed, analogy attempts to explain tetrachromacy by comparing it to the transition from dichromacy to trichromacy, suggesting that each additional cone simply adds a new dimension to color perception. This analogy often proceeds without a clear definition of "color," assuming a direct, quantifiable link between cone activation and perceptual experience. While it's true that some animals transition from dichromacy to trichromacy with a third cone mutation, this analogy overlooks a crucial distinction: dichromatic vision inherently includes a neutral gray point formed by mixing their two hues, creating a discontinuity in their perceptual space. Trichromatic vision, conversely, is continuous, lacking a spectral gray (except through complementary color mixing). This fundamental difference is key: dichromats require this gray point to differentiate their two hues because two hues alone cannot reconstitute white light. Trichromats, however, can reconstitute white light from any complementary pair, making their spectrum continuous. A trichromat can conceptually approximate dichromatic vision by replacing parts of the spectrum with gray, but a dichromat cannot conceive of the trichromat's continuous spectral experience. The addition of a cone, therefore, fundamentally alters the structure of color space by making it continuous, not just expanding its dimensionality in a simple additive manner.

Misinterpretations of Tetrachromacy and the Overemphasis on Photoreceptors

Claims of tetrachromats perceiving "millions of unimaginable colors" are often based on flawed reasoning and an overemphasis on the number of photoreceptors. Studies claiming tetrachromacy often fail to investigate "complementary behavior"—a crucial aspect of color perception. If a genuinely "new hue" exists, there must be a corresponding complementary hue that, when mixed, produces a neutral sensation. This has not been demonstrated in tetrachromacy research. Furthermore, these claims often misinterpret Just Noticeable Difference (JND) data. Enhanced JNDs, indicating finer discrimination between similar wavelengths, are frequently mistaken for the perception of new hues. However, increased JNDs simply suggest finer discrimination within existing color categories, not the emergence of new qualia. Phrases like "satisfies the criteria for behavioral tetrachromacy" are often used vaguely, contributing to confusion. The ability of a tetrachromat to distinguish between similar oranges, for example, can be explained by variations within the normal trichromatic range of cone sensitivities and doesn't necessitate the perception of a novel color sensation. Many such studies also lack crucial qualitative data – interviews and descriptions of perceptual experiences – and comparative JND tests against trichromats.

Similar misconceptions extend to claims about mantis shrimp vision. The assertion that mantis shrimp, with their 15 photoreceptors, experience colors "beyond human imagination" is equally flawed. Having more photoreceptors does not inherently equate to perceiving more distinct hues or finer spectral distinctions. Mantis shrimp vision, like human trichromatic vision, operates on the principle of relative receptor responses, creating a continuous mapping between wavelength and receptor stimulation. Their "colors" are determined by the relative responses across their photoreceptor array, not by discrete, otherworldly hues. The analogy of spiders having more legs but not moving in more spatial dimensions aptly illustrates the flawed logic of equating photoreceptor number directly with the dimensionality of perceptual experience.

While the mantis shrimp's 15 photoreceptors are often cited as an example of "super color vision," an alternative hypothesis suggests that their proliferation might be better explained by the principle of evolutionary efficiency. Rather than evolving a single, broadly tuned photoreceptor, it might have been evolutionarily simpler to duplicate existing photoreceptor types and subtly shift their spectral sensitivities. This "instantiation" of similar receptors at different wavelengths would effectively cover the visible spectrum without requiring drastic alterations to the underlying biochemistry of the photoreceptors themselves. This hypothesis emphasizes the fundamental need for broad spectral coverage in any visual system and offers a plausible explanation for the mantis shrimp's unique visual apparatus, even if it doesn't necessarily imply an extraordinary capacity for color discrimination.

Logical Constraints and the Structure of Color Space

The prevailing simplistic view of color dimensionality, directly translating cone types to independent color channels, overlooks the crucial role of post-receptoral processing, particularly stereoscopic color mixing. Color perception's dimensionality is fundamentally dual, not simply determined by the number of cone types. The third cone acts as a reference point for the other two; the intensity response from two cones is meaningless without this reference. Retinal neurons encode proportional responses, not pre-formed color qualia. It's illogical for fully formed color percepts to be encoded at the retinal level when subsequent processes like stereoscopic color mixing operate on distinct principles, with binary input.

Critique of Current Color Vision Models and Cone-Level Qualia

This analysis challenges current color vision models that overemphasize photoreceptors and the concept of unique hues. The assumption that each cone type has a pre-assigned, fundamental color at the cellular level is illogical, implying fully formed color percepts at the retinal level, prior to conscious awareness. This inappropriately relocates the objectivity of color from wavelength or the mind to the cones themselves. The widespread dissemination of information about tetrachromacy, often with exaggerated claims of "millions" of additional colors, highlights a significant misunderstanding. Studies presented as evidence for novel color experiences in tetrachromats, often relying solely on metameric failure and lacking qualitative data, are insufficient to establish the perception of new color sensations. The conventional understanding of color dimensionality, based on a simplistic photoreceptor-centric view, needs revision.

Conclusion

In conclusion, claims of "new colors" in tetrachromacy and other forms of non-standard color vision often overemphasize the role of photoreceptors while neglecting the crucial roles of post-receptoral processing and the inherent logical constraints of continuity, complementarity, and relational structure within color space. The color space, as we understand it, leaves no room for discrete, standalone "new hues" without fundamentally disrupting its organization, symmetry, and balance. Color is an interdependent system. The dimensionality of color perception is not simply dictated by the number of cone types but by the complex interplay of receptor signals and their integration by the brain, particularly within the context of stereoscopic vision. Therefore, a critical re-evaluation of the "new color" concept and a more nuanced understanding of color perception beyond a simplistic photoreceptor-centric view are necessary.

Applying the Axioms to Visual Experience: A Logically Constrained Color Space

This abstract model, grounded in the principles of co-unique pairs, achromatism, and continuity, provides a powerful framework for understanding the organization of human color experience. It demonstrates how the existence and approximate arrangement of primary unique hues can be logically deduced, starting from fundamental principles.

Within this model, the specific wavelengths or range of the visible spectrum are not primary. Instead, the focus is on the phenomenon of continuous color perception itself. The model posits that the visual system detects a finite but continuous range of colors from the infinite electromagnetic spectrum. This detection is not an on/off switch but a gradual transition from non-visible to visible, a continuity inherent in the phenomenon itself.

Given this continuous and finite range, and adhering to the axioms of cyclicity and complementarity, the following logical structure emerges:

Hue A and its Complement: A hue sensation must appear first at one end of the visible range. Call this hue A (e.g., red). By the principle of complementarity, its complement, A⁻¹ (cyan), must also be present within the visible range.

Hue B and its Complement: Similarly, at the other end of the visible range, another hue sensation, B (e.g., violet), must appear, and its complement, B⁻¹ (yellow), must also be present within the visible range.

Arrangement of Hues: To maintain continuity within the hue circle, these four hues must be arranged in the order A, B, A⁻¹, B⁻¹. This arrangement is not arbitrary; it's a logical consequence of the axioms.

Completing the Cycle: No additional hues are needed to satisfy the axioms. The mixes and transitions between these four fundamental hues (red, violet, cyan, yellow) naturally complete the cycle, giving rise to the full spectrum of perceived colors attractors.

Orange and blue emerge as a direct consequence of both complementarity and the continuity axiom. Green by continuity, and Magenta by its complementarity. These transitions, while maintaining the complementary relationships and following the same distinctiveness decay observed elsewhere in the hue circle, do not lead to achromatization because each hue arises logically in its correct order with its opponent.

Non-Spectral Magenta: Tthe non-spectral color magenta emerges as a perceptual construct due to the brain's interpolation between red and violet along the closed loop of the hue circle. It's a consequence of the brain "completing" the cycle, not a direct response to a specific wavelength.

This demonstrates how, starting with the independently defined boundaries of the visible spectrum, the axioms of complementarity and continuity necessitate the existence and approximate positions of all primary unique hues. No further unique hues can be added without violating these fundamental principles.

This abstract color model, based on the logical necessity of co-unique pairs and a continuous spectrum, demonstrates a remarkable alignment with physical reality, revealing a consistent mathematical structure rooted in logarithmic perception and octave-like cycles. This strongly suggests that color, at a fundamental level, possesses an objective basis, even though its ultimate experience is subjective.

Connecting to Cone Function and Dichromacy:

The model predicts four relative primaries (two co-unique pairs) from which all other hues can be derived. However, any single point on the color continuum is composed of a mixture of at most three, not all four, of these relative primaries. This follows directly from how these primaries were logically derived: each transition involves only two adjacent hues.

This observation leads to a hypothesis about the function of the three cone photoreceptor types in the human retina. Rather than directly encoding opponent color channels (red-green, blue-yellow), each cone type primarily records the intensity of light it absorbs across a broad range of wavelengths. Opponent processing, crucial for color perception, occurs at a later stage, where neural circuits calculate the proportions of stimulation between the different cone types. This proportional encoding allows the visual system to "retrieve" or infer the approximate wavelength or spectral characteristic of the incoming light, which is then processed to create a specific visual experience.

In dichromats, the absence of one cone type limits the system to just two dimensions of color information. They experience only two unique hues and their mixtures, which blend into gray. In terms of our model, they are essentially trapped in a two-hue cycle, unable to access the transitions that give rise to the full color continuum. This is the paradox of breaking the continuity axiom: they have no way to distinguish the different hues and their transitions.

The role of the third cone is thus not merely to add a third color dimension but to provide the necessary reference point for calculating proportions. Just as the proportion between two mountain heights changes drastically if the sea level changes, the relative stimulation of two cone types is meaningless without a third reference point. Each cone pair needs a third cone as a "sea level" to determine the relative proportions and, therefore, the perceived color. This proportional encoding mechanism elegantly explains how the brain extracts rich color information from the limited and overlapping spectral sensitivities of the cones.

The previous part of this study, while directed towards the hypothesis of a shared conceptual and perceptual chroma framework, with several key pieces of evidence, and assuming the color and musical phenomena are fully understood mechanically, addresses why the octave is a cycle, why colors behave that way, and how every phenomenon can be predicted and manipulated with ultimate precision. Still, none of these theoretical explanations can elucidate the specifics and the reasons behind the specifics color. Therefore, what is next is to address these questions: Why the octave both in light and neural levels? Based on a few known facts like the frequency-specific communication mechanisms of the brain and derived strategies like resonant frequency alignments, the "neural octave" as a frequency range might hint at valid explanations. Additionally, coincidence (which is not) of the "spectral octave" for the retinal level of color coding also has valid roots both in the wave nature of light and possible independent decoder mechanisms. This means our brain knows stereoscopically how to mix gray from different points of the pre-hue cycle (pre because it is only unaware information) from a consistent frequency ratio. Similarly, the eye must have a similar method to perceive gray. Color must share a simple cyclic framework to compute pre and post-retinal white light. Therefore, what is next are possible theories, all including the octave as a sweet spot for free interference frequency range (1,2].

The Octave Principle: A Neural and Physical Basis for Color Perception

Our perception of color may be optimized by a principle analogous to the musical octave, wherein the visible spectrum spans a specific, brain-preferred range—from a base frequency \( \lambda \) to its double \( 2\lambda \). This range appears to be a “sweet spot” for information processing, providing both efficiency and clarity in how color is decoded and represented by our visual system.

The Spectral Octave and Information Processing

The hypothesis begins with the observation that the visible spectrum covers roughly one octave in electromagnetic frequency. This is significant because, much like an octave in music where the first and second harmonics resonate in a clearly defined relationship, the visible range might be naturally selected for its role in efficient information processing. The analogy extends to phenomena such as the double-slit experiment, where only certain wavelengths—those falling within a particular harmonic range—interfere constructively to produce clear, visible patterns(see next section). In a similar vein, wavelengths within the \( (\lambda, 2\lambda) \) range can be processed by the visual system without the ambiguity or interference that might arise if a broader spectrum were used.

If light were to contain wavelengths far beyond this octave (for example, spanning from 200 nm to 3000 nm), the resulting temporal integration could become exceedingly complex. Different wavelengths might interfere with each other, creating overlapping “orders” or patterns that confound the neural decoding of color. Thus, by confining the effective range to a single octave, the visual system minimizes potential interference, ensuring that color signals are extracted in a clear and unambiguous manner.

Neural Representation: The Hue Wheel

Neurophysiological studies support this octave-based view through the organization of color-responsive cells in the visual cortex—specifically in areas such as V1, V2, and V4. These cells are arranged in a circular fashion, forming what is often referred to as a neural hue wheel. Unlike a linear spectrum, this circular mapping accommodates the cyclical nature of color perception, providing an elegant explanation for phenomena like the existence of magenta—a hue that does not correspond to any single wavelength.

Magenta arises not as a direct sensory input but as an emergent property of the brain’s interpolation between the spectral extremes of red and violet. In other words, the brain completes the color circle by generating magenta from the overlapping neural representations of these ends of the spectrum. This internal construction aligns with the octave principle by highlighting how the visual system organizes information: by using harmonic relationships to create a stable, redundant, and efficient representation of color.

The Octave Principle in Neural Encoding

The concept of an octave is not limited to the physical properties of light; it also reflects the brain’s strategies for encoding and processing sensory information. By confining color representation to a limited octave, the brain can prevent unwanted frequency synchronization among neural populations. For instance, if two colors were encoded by firing rates that differed by a factor of two (spanning an octave), the risk is that these signals might synchronize excessively, leading to redundancy or even mutual cancellation. Keeping the representation within a controlled octave range avoids such pitfalls, ensuring that each hue maintains a distinct and non-interfering identity.

This confined range also establishes a crucial “stability point”—the achromatic state. When colors are logarithmically encoded within a range (from \( f \) to \( 2f \)), the brain can reliably compute gray (achromatism) by balancing complementary colors. Since complementary hues within this octave naturally oppose each other, their balanced neural representations effectively cancel out, resulting in a stable perception of gray. This mechanism is particularly important for stereoscopic vision, where the brain must integrate slightly different inputs from each eye into a cohesive, stable percept.

Mapping the Spectral Octave to Neural Frequencies

One intriguing aspect of the octave principle is its potential linkage to the brain’s frequency-based processing. Neural circuits communicate via oscillatory signals, often operating in distinct frequency bands to minimize interference. This frequency-based organization suggests that the visual system might map the electromagnetic frequency of light directly onto a corresponding range of neural frequencies. In doing so, the hues we perceive would be represented by separate, non-overlapping channels within the brain’s processing architecture.

Such a mapping offers several advantages. First, it directly connects the physical properties of light with the internal, frequency-specific operations of the brain. Second, it aligns with well-established principles of neural processing—such as resonance and frequency-specific communication—thereby offering a plausible mechanism for the emergence of the octave-like structure in hue perception. Moreover, if the final color percept is determined by the ratio of neural firing frequencies from each eye, then fixed ratios (such as the square root of 2) might be linked to the perception of achromatic colors. This idea could help explain why diverse color combinations can result in the same gray, highlighting the brain’s reliance on stable frequency ratios for accurate color decoding.

Implications and Open Questions

While the octave principle provides a compelling framework for understanding color perception, several challenges and open questions remain. For instance:

- Mapping Specific Hues to Neural Frequencies: Is the relationship between wavelengths and neural frequencies strictly linear, or does it involve a more complex mapping?

- Mechanisms of Frequency Comparison: What precise neural mechanisms detect and compare these frequency ratios, particularly across inputs from both eyes?

- Integration with Other Explanations: How does this frequency-based hypothesis coexist with other factors—such as evolutionary adaptations or developmental influences—that might shape our perception of color?

These questions underscore that, although the octave principle is a promising and conceptually elegant model, further empirical research is needed. Neurophysiological experiments and computational modeling will be crucial to test whether the brain indeed uses an octave-based strategy to optimize color representation.

In summary, the octave principle offers a unified explanation for several aspects of color perception—from the spectral limits of the visible range to the neural organization of color information in the brain. By constraining the effective range of light to an octave, the visual system minimizes interference, prevents unwanted frequency synchronization, and achieves a stable point for achromatic cancellation. Moreover, the circular arrangement of color-responsive cells and the emergent properties of hues like magenta lend further support to this model.

While the mapping between electromagnetic frequencies and neural processing remains an open area of inquiry, the hypothesis that our perception of color is optimized by an octave-like structure is both compelling and consistent with known principles of neural communication. As further research unfolds, it may reveal that the octave is not merely a characteristic of musical harmony but a fundamental organizing principle of our sensory and perceptual systems.

Probabilistic Color and the Spectral Octave

The limitations of the human visual spectrum, confined to approximately an octave (a 1:2 frequency ratio), are not arbitrary but are rooted in the fundamental physics of light and wave interference, as demonstrated by the double-slit experiment.

In the double-slit experiment, even single photons passing through two narrow slits behave as waves, interfering with themselves to create a pattern of bright and dark fringes on a screen. When white light (a mixture of wavelengths) is used, each wavelength creates its own interference pattern. Because the position of the bright fringes depends on the wavelength – a phenomenon called dispersion – the white light is separated into its constituent colors.

The position of these bright fringes (maxima) is described by the diffraction grating equation (also applicable to the double-slit experiment):

d sin(θ) = nλ

where:

`d` is the distance between the slits.

`θ` is the angle of the bright fringe.

`n` is the order of the maximum (0, 1, 2...).

`λ` is the wavelength.

This equation shows that shorter wavelengths (blue) have fringes closer to the central maximum (smaller θ), while longer wavelengths (red) have fringes farther out. The central maximum (n=0) is white because all wavelengths interfere constructively there.

The crucial point is that the fringes for different wavelengths and different orders (n) can overlap. Let 'a' be the shortest visible wavelength. Its first-order maximum (n=1) is at angle θ₁:

d sin(θ₁) = a

A wavelength 2a (double the shortest) has its first-order maximum (n=1) at θ₂:

d sin(θ₂) = 2a

But the second-order maximum (n=2) for the shortest wavelength 'a' is also at θ₂:

d sin(θ₃) = 2a => θ₂ = θ₃

This means the first-order maximum of 2a overlaps with the second-order maximum of 'a'. Wavelengths longer than 2a would overlap with even higher-order maxima of shorter wavelengths.

This physical overlap leads to perceptual mixing. Our visual system, with its broadly sensitive and overlapping cone types (S, M, L), cannot disentangle these overlapping wavelengths. The ratios of cone responses, which determine perceived hue, become less distinct, resulting in desaturated, whitish, or ambiguous color sensations.

Therefore, the "spectral octave" is a physical constraint, ensuring that the interference patterns of different wavelengths are sufficiently separated to produce distinct cone response ratios and, consequently, distinct hues. It's also biologically efficient, avoiding the need to process wavelengths that wouldn't contribute to useful color information.

From Deterministic Waves to Probabilistic Waves:

The example implicitly uses the classical wave description of light to derive the diffraction grating equation. While it mentions single photons, it should be highlighted that this doesn't fully embrace the probabilistic interpretation of the wave function.

In quantum mechanics, the wave associated with a photon (or any particle) is not a wave of physical displacement like a water wave. It's a probability wave. The square of the wave's amplitude at a given point represents the probability density of finding the photon at that point.

Shifting to this probabilistic view of the light wave makes the analogy more fundamental. It's not just about waves interfering; it's about probability distributions interfering.

Cone Response as Probabilistic Absorption:

For simplicity the study interpreted cone response as a relatively straightforward function of wavelength. While it mentions "overlapping sensitivities," doesn't explicitly frame it as a probability.

The absorption of a photon by a photopigment molecule in a cone cell is a quantum event. It's not guaranteed. Each cone type has a probability of absorbing a photon of a given wavelength. This probability is described by the cone's spectral sensitivity curve. This curve is not just a measure of "how much" light is absorbed; it's a measure of the probability of absorption.

Framing cone response as probabilistic absorption aligns it perfectly with the probabilistic nature of light detection. Both the arrival of a photon at a location and its absorption by a cone are governed by probabilities.

Time Integration of Probabilities:

The study already discussed "temporal integration" in the context of afterimages, but it must also be explicitly connected to the probabilistic nature of light and cone response.

Our perception of a continuous, stable color arises from the time integration of countless probabilistic events. We don't see individual photon arrivals or individual cone absorptions. Our visual system integrates these events over time, averaging out the probabilistic fluctuations.

Each cone is essentially acting as a probabilistic "photon counter." The rate of photon absorption (which is proportional to the probability of absorption multiplied by the intensity of the light) determines the cone's output signal. The visual system then compares the rates of absorption from the different cone types.

A Unified Probabilistic Framework:

Light arrives at the retina as a spatial distribution of photon arrival probabilities.

Each cone type has a wavelength-dependent probability of absorbing a photon.

The visual system integrates these probabilistic events over time, effectively "sampling" from the probability distributions.

The perceived color is a result of comparing the time-averaged rates of photon absorption by the different cone types.

This framework places both the physics of light and the initial stage of visual processing on the same fundamental footing: probability. It's not just an analogy between waves; it's an analogy between probabilistic processes.

Conclusion: Parallels and Open Questions in the Chroma of Color and Music

This study, focusing on the relationship between visual and musical chroma, has revealed numerous parallels while also raising profound questions about the fundamental nature of both perceptual domains.

Continuity, Cyclicity, and Discretization:

Both visual and musical chroma are continuous and cyclic. However, the discretization of these cycles into named categories (colors, notes) is primarily a product of cultural and individual experience. This discretization is limited by the inherent ambiguity that arises at the boundaries between categories.

The Octave as a Framework:

The octave, as a frequency range that doubles, provides a suitable, though not strict, framework for containing the discretized components of both pitch and light. While physically distinct (sound waves vs. electromagnetic radiation), both can be described in wave terms. Ultimately, however, the brain processes both as neural activity – firing neurons, not air rarefactions or photon absorption.

Complementarity and Achromatism:

Within both cycles, each point has an inverse, an "achromatic" counterpart. Just as gray can be created by numerous complementary color pairs, in music, the tritone acts as a kind of "achromatic" interval. Stacking tritones can effectively mask the fundamental pitch, making it difficult to discern individual notes within the complex sound. (It's important to note that the term "complementary notes" in music has a different meaning than in color. Musical "complementary" often refers to the additive inverse of pitch classes, a definition that varies across musical traditions. The tritone, however, exhibits a more direct analogy to color complementarity, as it tends to obscure or "cancel" the fundamental.)

The Subjectivity of Harmony:

Throughout this study, the concept of "color harmony" has been deliberately avoided. While inspiring and thought-provoking, much of what is written about color harmony is ultimately subjective. In music, while harmony also involves subjective elements, there are more objective, measurable aspects. This difference made it difficult to offer meaningful insights into color harmony beyond subjective experience.

Musical Scales: Coincidence or Inevitability?

The musical scales derived from hue-wavelength matchings are musically valid and practically useful. However, it's likely that their correspondence to recognizable musical ratios is more a coincidence arising from the distribution of hues within the spectral octave than a strict metaphysical principle.

Musical scales, including the pentatonic, do not have a purely scientific origin. They are likely a complex product of cultural development and learned preferences for specific timbral combinations that resonate with the brain's processing mechanisms. There are no "unique notes" in any fundamental sense. Instead, there's a differentiable subset of the pitch cycle that allows for increasingly complex musical creation and performance. This subset can be altered substantially without loss of tonal meaning, just as colors don't need to be spectrally pure to be identifiable.

The Missing Fundamental and Perceptual Preferences:

The missing fundamental effect highlights the brain's tendency to seek out and respond to harmonic timbral relationships. This preference may be related to the aesthetic pleasure we derive from music, which could share a common neural mechanism with color perception. When we play or listen to music, our brains might be engaging similar processes to those involved in combining and experiencing colors.

Universality in Music:

The strongest evidence for some degree of universality in music lies in musical form. Studies have shown that people from isolated cultures can often recognize the function of music from other cultures (lullabies, dance music, ritual music, love songs).

The diatonic scale, while seemingly ubiquitous, is another area of debate. While some theorists downplay its significance, attributing it to cultural borrowing, the diatonic scale appears in numerous cultures independently. It's a naturally occurring, manageable, and uniformly distributed subset of the pitch cycle. The idea that it was "stolen" from one culture to another likely oversimplifies a complex, multi-faceted development. Many musical concepts, like instruments themselves, likely have multiple origins, not a single linear path of invention.

Pitch Attractors and Evolutionary Influences:

While seemingly universal, the prevalence of the diatonic scale raises the question: why? Are there "pitch attractors" analogous to color attractors? While pitch attractors are less clear than color attractors, they might exist. It's possible that they arise from speech patterns or, perhaps even more likely, from the sounds of animals (birdsong, etc.), which could have unconsciously influenced our musical development over evolutionary time, or as this study proposes, a color reflection.

The Interconnectedness of Color and Music Perception:

Neither discrete colors nor discrete pitches are fundamental in any deep perceptual sense. This shared characteristic is what links them most profoundly.

The Octave and Color Space:

The logarithmic color representation within the octave, while requiring further research with larger sample sizes, offers a compelling explanation for deviations in RGB complementaries, afterimage effects, color constancy, and stereoscopic vision. However, the question remains: why the octave? Why the √2 ratio for true complementaries?

The Dimensionality of Color Space:

The dimensionality of color space is not adequately described by the number of photoreceptor types. Equating photoreceptors with dimensions ignores the complex neural transformations, including dichoptic (two-eye) color mixing. This simplified view undermines claims about tetrachromacy and "four-dimensional" color space, especially given the difficulties in even defining "unique hue."

The Logical Color Model and Pitch Space:

The logical color model, based on simple, accepted axioms (cyclicity-continuity, achromatism-symmetry), can be applied with similar success to pitch space. Both domains exhibit a limited number of fundamentally different sensations, with further subdivisions contributing to ambiguity and blurred boundaries.

Conclusion:

Just as there are no truly "new colors," there are no truly "new notes." The cycles are complete. That's all there is.

Practical Applications:

One practical application of this study is the development of more accurate color coding for musical notation, avoiding the distortions inherent in RGB-based systems. The canonical subtractive model presented here offers a more perceptually consistent alternative.

This study, while providing answers to some questions, has opened up many more avenues for future exploration. The journey into the fascinating intersection of color and music perception continues.

(draft)

Methodological Framework for Investigating Color Perception (Integrated with Web Application)

Research into color perception is fraught with conceptual ambiguities, particularly concerning terms like "unique hues," "color bands," and "richer color experience." These terms often lack precise definitions, leading to potential misinterpretations and methodological challenges. A rigorous approach is crucial to disentangle genuine perceptual phenomena from terminological confusion and methodological artifacts. This framework outlines a structured methodology, integrating insights from both discrete and continuous spectral presentations, to investigate color perception and address claims of expanded chromatic experiences, such as those associated with tetrachromacy. A valuable tool for implementing these methodologies is a web application designed for exploring perceptual color thresholds within an RGB color space.

Clarifying Key Terminology:

Before outlining the experimental procedures, it is essential to define key terms that are central to color perception research and often used ambiguously:

Color Attractors (Unique Hues): These are perceptually "primary" color sensations associated with discrete names (e.g., red, green, blue, yellow). They represent focal points in color space around which other hues are perceived to cluster.
Color Bands: Discrete, distinguishable regions within a continuous spectrum, analogous to perceptual "steps". The term is often used to describe segmented portions of a spectrum perceived as different hues. It's crucial to note the ambiguity of this term, as it can be confused with "unique hues" or simply describe distinguishable spectral appearances.
Just-Noticeable Difference (JND): The smallest detectable change in a sensory stimulus (in this context, hue). JNDs are a quantitative measure of perceptual discrimination.
Richer Color Experience: This is a polysemous term encompassing several potential aspects of vision, including:

Increased Precision/Resolution: Enhanced ability to distinguish fine details and resolve spatial information.
Increased Number of Hues: Perception of a wider range of distinct hues, representing qualitatively new color sensations.
Increased Saturation/Chroma: Perception of more intense and pure colors.

It is critical to differentiate "finer discrimination" (quantified by JNDs) from a genuinely "richer color experience," especially when investigating claims of novel color perception.

It is conceivable that, in some studies, the conclusions regarding a "richer" color experience could be rephrased in terms of a "poorer" experience without altering the consistency of the reported observations, highlighting the ambiguity of the terminology, and lack of robust data.

Continuous vs. Discrete Spectra:
The method of spectral presentation significantly influences perceptual discrimination:
Discrete Presentation: When the spectrum is presented as distinct, separate bands, observers can typically distinguish over 60 regions due to the clear boundaries between them (corresponding to an average JND of 1–2% of the visible range).
Continuous Presentation: In a smooth, continuous gradient, the number of distinguishable regions decreases significantly, averaging around 11, as the transitions between hues become less distinct.

Experimental Stages for Investigating Color Perception:

A progressive, multi-stage approach is recommended to rigorously assess color perception, starting with foundational validity checks and moving towards complex investigations of hue discrimination and categorization. A web application, as described below, can facilitate many of these stages.

Web Application for Perceptual Color Threshold Exploration:

This web application provides a robust and versatile platform for exploring perceptual color thresholds, particularly Just Noticeable Differences (JNDs), within a metameric RGB color space. While focused on RGB metamers, the fine-grained control over various visual parameters allows for comparison with similar experimental setups. Its design directly addresses the fundamental differences between continuous and discrete spectral representations, offering insights into the perception of color bands and color attractors, aligning perfectly with the methodologies outlined in this framework.

Key Features and Functionality of the Web Application:

CIE XYZ Color Model Selection: Users can select from various CIE XYZ color models (1931, 1964, 2006) and convert wavelengths to RGB, ensuring accurate colorimetric calculations.
Adjustable White Point and Gamma: Flexibility in setting white point and gamma allows for approximating real-world lighting conditions or using purely perceptual gradients.
Four-Row Interface: The application's interface is structured into four rows, designed to systematically investigate different aspects of color perception:

Continuous Spectrum: Displays a continuous (pixel-sized) spectral gradient defined by a user-selected wavelength range.
Discretized Spectrum: Presents a discretized version of the same spectral range, with the number of discrete samples controlled by a "step" parameter.
Adjusted Discretized Spectrum: Renders the same discrete samples with user-adjustable noise and blur parameters to simulate individual visual noise and acuity thresholds.
Separated Discrete Samples: Displays the same adjusted discrete samples but with spacing between color patches to minimize contrast effects and isolate hue perception.

Stage 1: Validation of Screen-Based Spectral Representation:

Before utilizing screen-based displays, it is crucial to assess their validity. While the web application itself doesn't directly validate displays, its accurate RGB conversion and CIE XYZ model integration are essential for ensuring that screen-based experiments are meaningful and minimize metameric failures. The flexibility to adjust white point and gamma allows for better control over the displayed colors.

Procedure: Present participants with both:

A real spectrum generated by a prism or diffraction grating.
A simulated spectrum displayed on an RGB screen, carefully calibrated and accounting for gamut limitations.

Expected Outcomes:

Identical Perception (within saturation limits): If participants perceive the real and simulated spectra as identical (apart from potential saturation differences due to RGB gamut limitations), it suggests that standard RGB-based color matching is applicable for these individuals.
Perceived Differences Beyond Saturation: If participants report differences that cannot be attributed to saturation, it indicates regions of metameric failure for that observer with the RGB display. For individuals claiming tetrachromacy, if no difference is perceived, it challenges the notion that their purported "new colors" are outside the standard tristimulus framework.

Stage 2: Assessing Perceived Color Bands in a Continuous Spectrum (Utilizing Web Application):

This stage investigates color band perception using the "Continuous Spectrum" row of the web application.

Procedure (Web Application Integration):

Use the "Continuous Spectrum" row of the web application to display a continuous spectrum across a chosen wavelength range (e.g., 400-700nm for the visible spectrum, or a narrower range like 400-500nm).
Instruct participants to mark perceived color band transitions directly on the displayed continuous spectrum within the application interface. The application can be designed to allow users to click or draw on the spectrum to indicate boundaries.

Expected Results (Web Application Context):

Trichromatic Norm: Using the application, expect most individuals to identify approximately six to seven color bands, corresponding to the major unique hues, consistent with typical trichromatic vision.
Atypical Color Perception: The application allows for easy recording of the number and location of boundaries marked by participants, facilitating the identification of individuals with potentially atypical color perception (reporting more or fewer bands).
Color Band Identification Task (Web Application Feature): The web application directly facilitates the "Color Band Identification" experimental paradigm. Users can be asked to identify and mark "color band transitions" directly on the continuous spectrum displayed by the application. For example, with a 400-500nm range, users might identify regions like "violet fade," "violet peak," "blue/indigo," "blue/cyan," and "cyan" within the application.

Stage 3: Discrete Color Step Grouping Test (Utilizing Web Application):

This stage leverages the "Discretized Spectrum" and "Separated Discrete Samples" rows to investigate color grouping in discrete presentations, mitigating ambiguities of continuous spectra.

Procedure (Web Application Integration):

Utilize the "Discretized Spectrum" row and the "Separated Discrete Samples" row of the web application.
Display a spectral range (e.g., 400-500nm) in both rows simultaneously, varying the "step" parameter in the "Discretized Spectrum" row to present different numbers of discrete color chips. The "Separated Discrete Samples" row can be used to minimize contrast effects.
Instruct participants to group the color chips in the "Discretized Spectrum" (and "Separated Discrete Samples") rows into sets based on perceived color similarity, potentially using drag-and-drop features or labeling functionalities within the application interface.

Expected Outcomes (Web Application Context):

Convergence of Categorization: The application allows for direct comparison of results from the "Discretized Spectrum" and "Separated Discrete Samples" conditions. Participants should ideally converge to a similar number of color groups in both conditions as they did for color bands in the continuous spectrum (Stage 2), reinforcing the consistency of categorical color perception even when using discrete stimuli and minimizing contrast effects.
Color Grouping by Similarity Task (Web Application Feature): The application is designed to directly implement the "Color Grouping by Similarity" paradigm. By presenting a discretized spectrum, the question of color bands is reframed as grouping discrete chips by perceived similarity, which the application facilitates through its visual presentation and potential interactive grouping features.

Stage 4: Stepwise Color Differentiation Task (Discrete JND Testing and Spectral Ordering) (Utilizing Web Application):

This stage utilizes the "Discretized Spectrum" row and the "step" parameter of the web application to systematically increase the number of discrete color steps and investigate the perception of "all distinguishable colors."

Procedure (Web Application Integration):

Use the "Discretized Spectrum" row of the web application.
Set the wavelength range to span a broad portion of the visible spectrum (e.g., 375-750nm).
Start with a small "step" value (e.g., 2) and progressively increase the "step" parameter, displaying a new sequence of evenly distributed color chips at each step. The application automatically generates and displays these sequences based on the chosen spectral range and step count.
Instruct participants to indicate when they believe the "Discretized Spectrum" row displays "all the colors" they can distinguish within the chosen spectral range, mirroring the "Identifying 'All the Colors'" paradigm.
Record the "step" value at which the participant indicates they perceive "all the colors." This step count serves as a proxy for the number of distinguishable hues.

Expected Outcomes (Web Application Context):

Trichromatic Threshold: Using the application, typical trichromatic observers are expected to stop at a "step" value around 6 or 7, consistent with the canonical color categories (red, orange, yellow, green, blue, violet), even as the specific colors presented at each step change. The application directly facilitates the "Identifying 'All the Colors'" paradigm and allows for easy recording of this "trichromatic threshold."
Tetrachromacy Assessment and "New Color" Emergence: Individuals claiming tetrachromacy might stop at a higher "step" value in the application, potentially suggesting finer discrimination. The application allows for recording this "stepwise color differentiation" data, which can be analyzed to compare thresholds between groups and investigate claims of expanded color perception. While the application itself doesn't directly assess "new color" sensations, the systematically varied color sequences it presents, coupled with participant feedback on "all colors" being represented, provides valuable data for analyzing such claims in conjunction with qualitative reports.

Stage 5: Evaluating Claims of "New Colors" and Color Attractors:

Data gathered from the web application in Stages 2, 3, and 4, particularly the step counts, color band markings, and grouping data, can be analyzed to evaluate claims of expanded color perception and investigate the underlying perceptual mechanisms. The application facilitates the collection of quantitative data (step counts, band boundaries) and qualitative observations (participant reports) necessary for this evaluation.

This final stage focuses on interpreting the data from previous stages, particularly in the context of claims regarding "new colors" and expanded color perception, such as in tetrachromacy. It also addresses the identification of color attractors and their "best exemplars."

Conceptual Framework for "New Colors":

A "new color" must be perceptually distinct from existing known colors to a degree comparable to the difference between established unique hues (e.g., red vs. orange).
The existence of a truly novel hue would challenge the established isomorphism of hue space with a simple circular topology.
Increased discrimination capacity (finer JNDs) does not automatically equate to the perception of genuinely new colors.
If "new colors" are claimed, it is crucial to investigate if these colors can be synthesized or matched using standard RGB primaries. If so, it challenges the notion that they represent truly novel perceptual dimensions beyond the tristimulus framework.

Procedure and Analysis:

Analyze data from Stage 4 Stepwise Color Differentiation: Examine the step counts at which participants indicate all colors are represented. Compare step counts between groups (e.g., trichromats vs. potential tetrachromats).
Analyze reported "new color" sensations: If participants report "new colors," meticulously analyze the specific colors presented in the sequence at the step where these sensations emerged. Investigate if these colors can be characterized as mixtures of existing hues or if they represent something genuinely novel.
Color Attractor Investigation (following JND establishment): After establishing JND thresholds (especially in discrete presentations), investigate color attractors and their "best exemplars." This stage should be separated from initial discrimination tasks to minimize the influence of learned color associations and terminology on perceptual judgments. Variations in color choices for "best exemplars" (e.g., different shades of "blue") are more likely to reflect individual preferences and learned associations than fundamental differences in spectral perception itself.

Open-Source Availability and Further Application:

The web application's open-source nature is a significant advantage. It can be freely used and adapted for research purposes, allowing for standardized and easily reproducible experiments across different labs. Its simple interface and direct data recording capabilities make it a valuable tool for routine use and for collecting comparative data alongside more complex experimental setups. The application can be further extended to incorporate features for JND measurements using adaptive procedures, and for more sophisticated analysis of color grouping and categorization data.

Conclusion:

By integrating this web application into our methodological framework, we provide a practical and powerful tool for researchers to investigate color perception rigorously. The application directly supports key experimental paradigms for studying color bands, discrete color discrimination, and stepwise differentiation, addressing the critical methodological challenges and conceptual ambiguities in the field. Its features are specifically designed to facilitate the implementation of the proposed multi-stage approach, enabling a more systematic and data-driven exploration of color perception, including claims of expanded chromatic experiences.

This integrated framework, incorporating the web application, now offers both a conceptual roadmap and a tangible toolset for advancing research in color perception

Physically-Based, Octave-Modeled, Logarithmic Hue Wheel

The following sequence of graphics illustrates the construction of this hue wheel, providing a more intuitive visualization than the abstract group mappings presented in the mathematical section. It also clearly demonstrates how and which region is assigned to non-spectral magenta, which, interestingly, has ample "space" to close the wheel smoothly.

While electromagnetic waves are often described in terms of wavelengths in color science, music theory typically focuses on frequency ratios. To bridge these perspectives, the graphics and accompanying explanations incorporate both metrics, referencing each value as appropriate for clarity and context.

Key Graphics and Steps:

1. Linear Spectrum (200–1600 THz): A linear representation of the electromagnetic spectrum, highlighting the visible range (~400–750 THz) and including black regions beyond visible light for reference.

2. Octave Doubling: Frequencies are repeated at \(2^k\) generating three "rainbows" separated by black gaps. Non-spectral magenta, which does not exist in the physical spectrum, will be placed within these gaps.

3. Logarithmic Scale: The three rainbows are equalized in size by compressing the scale logarithmically.

4. Magenta Addition: A normal distribution curve fills the gaps, smoothly blending red and blue without altering their intensity.

5. Hue Wheel: A single rainbow from magenta to magenta forms a continuous logarithmic hue wheel.

Chroma Shift: Abney Effect and Stretched Tunings

While some perceptual parallels between vision and hearing are more abstract, others, such as the chroma shift, can be modeled or at least compared more directly. The Abney effect in vision and the phenomenon that led to stretched tunings for pianos both involve a shift in perceived chroma.

Abney Effect (Visual Chroma Shift):

The Abney effect describes a visual phenomenon where adding white light to a monochromatic light (a "unique hue") causes a shift in its perceived hue. For example, adding white light to red light makes the red appear more purplish, even though the spectral composition of the original red light remains unchanged. Similarly, adding white light to green makes it appear blueish.

[Image]

Stretched Tunings (Auditory Chroma Shift):

Stretched tunings for pianos address a similar perceptual phenomenon in the auditory domain. In theory, perfectly tuned octaves should have a precise 1:2 frequency ratio. However, in practice, pianos sound more "in tune" when the higher octaves are stretched slightly—meaning the intervals are made slightly wider than the theoretical 1:2 ratio. This compensates for a perceptual chroma shift: higher frequencies are perceived as slightly flatter than their theoretical equivalents, while lower frequencies are perceived as slightly sharper.

Connecting the Effects:

The key to understanding the connection between the Abney effect and stretched tunings lies in considering the spectral composition of white light and the harmonic series of musical tones. We can approximate standard white light as a combination of red, green, and blue light in a ratio similar to a major chord (approximately 4:5:6 in frequency ratios). This means that adding white light to a monochromatic light is analogous to adding multiple harmonics (or partials) to a fundamental tone in music.

For instance, if red is selected as the origin, a {1:1, 4:5, 2:3} or 4:5:6 ratio could be mapped to 435:543:652 THz. Therefore a sound featuring only red, green, and blue would visually represent a major chord..

Red ~ 432 THz = f(1)
Green ~ 543 THz = f(Red * 5/4)
Blue ~ 652 THz = f(Red * 3/2)

In a piano timbre, the higher octaves contain more partials, and these partials are more evenly distributed across the frequency spectrum, making it sound more "complex" or "rich". This addition of higher partials is analogous to the addition of white light which changes the hue. Because of the logarithmic nature of pitch perception the higher partials are percieved as flatter than they are in a linear frequency space, so stretching the octaves is a way to compensate this perceptual effect.

Just as the addition of white light shifts the perceived hue of a monochromatic light towards purple (in the case of red) or cyan (in the case of green), the addition of higher partials in a piano timbre shifts the perceived pitch of higher octaves downwards. This is why stretching the octaves makes them sound more "in tune" to our ears.

It's important to note that the degree of stretching in piano tunings is relatively small and often imperceptible to untrained listeners, much like the subtle hue shifts in the Abney effect are not always consciously noticed.

(DRAFT)