Annals of Mathematics and Physics
Independent Researcher, Chitwan, Nepal
Cite this as
Pathak M. Prime Clustering and Prime Gaps via the Pathak Continuum Compression Hypothesis (PCCH). Ann Math Phys. 2026;9(1):027-033. Available from: 10.17352/amp.000177
Copyright License
© 2026 Pathak M. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.Prime numbers have fascinated mathematicians due to their seemingly random distribution and mysterious clustering behavior. We propose a new approach based on the Pathak Continuum Compression Hypothesis (PCCH), which views the number line as a continuum affected by arithmetic interaction patterns among numbers. Instead of a generative or causal model of primes, PCCH provides a descriptive and structural reframing of prime number phenomena that captures regularities in prime number clustering and prime number gaps via interaction-weighted compression effects.
Prime numbers are like the fundamental building blocks of all numbers, basic atoms of the number world. But despite centuries of study, their exact pattern and distribution still baffle mathematicians. We know roughly how many primes there are up to a point and how they tend to thin out, but why they cluster in certain ways or why gaps between primes appear the way they do is still a big mystery.
Traditional number theory mainly counts primes or estimates their density but doesn’t really explain the mechanics behind these patterns-the forces or reasons causing primes to clump or spread apart. That’s where Pathak Continuum Compression Hypothesis (PCCH) steps in, built on the foundation of Pathak’s Theory of Number Interaction (PTNI) [1].
PTNI proposes a concept: every number isn’t just floating alone, but interacts with other numbers through a kind of interactive strength. Likewise, in accordance with PTNI, PCCH says, this force either compresses (pulls numbers closer) or stretches (pushes them apart) the continuum (the “number line”) itself between them.
According to PTNI, this interaction force, which we call Interact, depends on two things:
The net strength or compression score at a point x on the number line is:
Where
We define S(x, y) as:
Rationale: Rather than listing all the specific operations, we can say that arithmetic accordance is any significant numerical relationship between x and y that can be derived from basic arithmetic operations. This can be additive, subtractive, multiplicative, divisive, modular, or any other structural relationship that suggests a numeric alignment. In this way, S(x, y) is now completely arithmetic-based and does not involve primality in any way, thus avoiding circularity while still retaining the structural significance for PCCH analysis.
For example, numbers such as 6 and 12 will have strong interaction due to multiplicative arithmetic influence (12 being a multiple of 6), while numbers such as 7 and 12 have weak interaction, even though they are numerically close due to not having any meaningful arithmetic correspondence across standard operations.
This demonstrates that the observed compression is a consequence of arithmetic structure and numerical proximity rather than just an explicit encoding of primality, thereby addressing the circularity concern.
This formulation is intentionally simple and demonstrates the key compression patterns without relying on primality itself.
The function C(x) is called the Pathak Compression Function. It measures the degree of compression (negative values) or stretching (positive values) in the continuum at x.
Empirical calculations of C(x) for integers x reveal a remarkable pattern:
This suggests that primes tend to exist in regions where the continuum is locally compressed due to strong interactive forces from neighboring primes, resulting in prime clustering [2-5].
Conversely, regions with positive compression scores represent stretched continuum zones correlating with larger prime gaps.
Here, we have interpreted a data set for compression scores with primes and composites with the help of the PCCH net compression formula:
Where R = 10 (Standard value) and, n = 1 (standard value)
Below is a sample of the calculated compression scores C(x) for integers x from 2 to 200, alongside their primality status (Table 1 and Figure 1):
To quantify the observed pattern, we compute the fraction of primes and composites exhibiting negative and positive compression scores, respectively, in the range 2 ≤ x ≤ 200: Thus, excluding 2,3,4 and 6, every prime number has C(x) < 0 and every composite number has C(x) > 0.
This confirms that negative compression zones predominantly correspond to primes, while positive zones correspond to composites, supporting the PCCH hypothesis.
As we observe the whole compression scores in accordance with primes and composites, we can see three major patterns:
Let n > 0 be fixed. In the Pathak Compression Function
The dominant contribution to C(x) arises from terms corresponding to small values of i (i.e., nearest neighbors), while the contribution from distant terms decays rapidly as i increases.
Justification: Since |x − (x ± i)| = i, the weighting factor in each term of C(x) is proportional to 1/in. For n > 0, we have
Hence, as I grow larger, the contribution of the corresponding interaction term becomes negligible. This implies that the nearest neighboring integers exert the strongest influence on the compression score C(x).
This local dominance property explains why prime clustering and gap structures are primarily governed by nearby interactions and why the qualitative pattern of compression scores remains stable under variations of R and n.
Graphical illustrations supporting this lemma are provided below.
Graphical Representation (Figure 2):
Changes in R don’t destroy the pattern; they just make it smoother, preserving the pattern. Prime clusters and gaps are clearly visible.
Graphical Representation (Figure 3):
Changes in n don’t destroy the pattern; it just makes it smoother, with the pattern preserved. Prime clusters and gaps are clearly visible.
Graphical Representation (Figure 4):
Hence, despite changing both R and n, the essential “compression fingerprint” of numbers didn’t vanish. The sign structure of C(x) remains stable.
The PCCH framework provides a physical analogy:
Thus, PCCH is more accurately described as a structural and heuristic tool for understanding prime clustering and prime gaps. The compression and stretching regions offer a descriptive reframing of known distributional properties of primes rather than a causal or generative explanation of prime formation.
Some exceptions can be seen while doing calculations using PCCH. The exceptions and their possible explanation are given below:
For 2, the reason for the asymmetry in its neighborhood is that it does not have any numbers smaller than itself, hence no negative interactions. For 3, the reason why its left neighbor alone cannot produce a balanced compression effect is that the left neighbor is 2.
In the case of 4, the neighboring numbers’ contribution to the compression is uneven because of the lack of small-number interactions. In the case of 6, the interactions of the neighboring smaller numbers with strong arithmetic ties (such as 2 and 3) result in a negative contribution, typically deviating from the positive pattern.
The current expression of S(x, y) is grounded in coarse arithmetic relationships and structures, rather than explicit primality. This approach prevents circular dependencies on primality and is sufficient to show the development of compression and stretching patterns in the PCCH framework.
”I conjecture that, except 2 and 3, every prime number exhibits negative compression scores, whereas, except for 4 and 6, every composite number demonstrates positive compression scores.”
PCCH offers a structural framework for understanding prime clustering and prime gaps as patterns that arise within an interaction-weighted numerical continuum. Negative compression scores indicate regions of the continuum that are locally compressed, where primes are likely to cluster, while positive compression scores indicate regions of the continuum that are stretched, corresponding to prime gaps.
In this way, PCCH reinterprets traditional knowledge of prime number distribution in terms of numerical continuum compression driven by interaction.

PTZ: We're glad you're here. Please click "create a new query" if you are a new visitor to our website and need further information from us.
If you are already a member of our network and need to keep track of any developments regarding a question you have already submitted, click "take me to my Query."