9

Not too long ago, I studied a modified Collatz rule where

$$f(x)= \begin{cases} 3x+5, & \text{if $x$ is odd} \\ x/2, & \text{if $x$ is even} \end{cases}$$

by observing the trajectories of $n$ with some code I wrote. The code would calculate the trajectory of each seed or starting number $n$ beginning with $1$ until the trajectory reached a loop. The code will then dump the loop into a spreadsheet and then repeat the process for $n+1$ until some defined limit for $n$ was reached. The resulting spreadsheet contains every starting number and the loops each of those numbers ended up in. I did not record the original trajectories in the spreadsheet.

In this Google Document, I created pie charts for the sample sizes 100, 1,000, 10,000, 100,000, and 1,000,000.

The results were made by defining some sample size up to some number, sorting all of the numbers based on what loop their trajectories entered, and then creating ratios for those relationships.

Here is a link to the raw data my code generated:

https://drive.google.com/drive/folders/0BzfYa_--3heeNkVpd1NPd090aDA?usp=sharing

(note: Viewing the 10,000 sample size worked just fine for me, however you would need to download the sample sizes 100,000 and 1,000,000 to view them)

The results show the percentages vary quite a bit from sample to sample, however in the general scheme of things the data seems to be somewhat consistent. For example, my data shows the 19 loop is the end of roughly half the trajectories of the numbers in the samples. Only one percentage never changed from sample to sample; unsurprisingly, the 20-10-5 loop consisted of 1/5 of all tested values.

I am unsure if this “loop bias” I observed is a consequence for relying on a sample size to begin with, human/code error, or if there is a mathematical explanation for what makes certain loops more popular than others. I have a few ideas for why some bias occurs, however I am not confident in them, mostly because my ideas heavily rely on speculation I do not know how to prove formally.

EDIT: Here are the loops in order of appearance:

  • [1, 8, 4, 2, 1]
  • [19, 62, 31, 98, 49, 152, 76, 38, 19]
  • [5, 20, 10, 5]
  • [23, 74, 37, 116, 58, 29, 92, 46, 23]
  • [187, 566, 283, 854, 427, 1286, 643, 1934, 967, 2906, 1453, 4364, 2182, 1091, 3278, 1639, 4922, 2461, 7388, 3694, 1847, 5546, 2773, 8324, 4162, 2081, 6248, 3124, 1562, 781, 2348, 1174, 587, 1766, 883, 2654, 1327, 3986, 1993, 5984, 2992, 1496, 748, 374, 187]
  • [347, 1046, 523, 1574, 787, 2366, 1183, 3554, 1777, 5336, 2668, 1334, 667, 2006, 1003, 3014, 1507, 4526, 2263, 6794, 3397, 10196, 5098, 2549, 7652, 3826, 1913, 5744, 2872, 1436, 718, 359, 1082, 541, 1628, 814, 407, 1226, 613, 1844, 922, 461, 1388, 694, 347]

EDIT 2:

I agree that smaller numbers may be responsible for skewing the data. Therefore, I picked the sample size 100,000 to 1,000,000 to test this theory. I uploaded the results to the original Google Doc with the other pie charts.

I was surprised to find, well the same chart. The ratios were slightly different as usual, but aside from that I am unsure to conclude if this test debunks the hypothesis or iterates the small number problem. I could try different sample sizes, however I do not know if that is a good idea or not.

To provide some insight on what I think is going on, I will show you a digital version of some notes I sketched and explain where my speculations came from.

In May, I drew some sketches of trees and made some speculations about what I observed. I assumed if a loop had a branch or a tail coming from the original even numbers in the loop, then the loop would connect to more numbers. I also assumed smaller even multiples (if $n$ is odd, then an even multiple is $n*2^a$, where $a$ is any value) branching to multiples of three "restricted" the size of the loops.

Of course, none of these statements are objective, much less provable. I wanted to share them in case there were any interesting mathematical patterns occurring or if this information shed light on anything...

Here is a digital version of my sketches.

Note: the trees are built using the "reverse Collatz method" or "${(n-1)}/{3}$, or in this case, an adapted version of that method. To divide $n$ by 2, go one number left. To multiply $n$ by 3 and add 5, find the bottom of the "T", which points to the next even number.

Warning: I showed this to a friend and the tree sketch confused them. If you find this sketch confusing, let me know and I will re-draw the entire thing with arrows instead.

Key:

  • If an even number branches, It will have a "T" above it. The first odd number on the "T" is the resulting odd number after applying ${(n-5)}/{3}$. The following even numbers are the even multiples of the odd number. (ex. In the 19 loop, 38 will have a "T" over it. 11 is the resulting odd number, and the even numbers after 11 are $11*2^1$, $11*2^2$, $11*2^3$, ...
  • Blue numbers are members of a loop.
  • Red numbers followed by a "no" sign are multiples of 3.
  • Purple "T"s connect the loop.
  • Green "T"s emphasize the extra "tail" or branch.
  • Orange "T"s emphasize where a tail could have been, but the number branched to a multiple of 3 instead.
  • Arrows connect the separated ends of the loop.
  • "..." are used to convey numbers not shown.

I color-coded the sketch to draw attention to certain properties. I figured it would make it easier to understand.

  • Two comments. Firstly I found $6$ loops, each marked by one of the values containing $1,5,19,29,187,541$. Secondly, I think the relative frequency of the loops depends possibly on the length $N$ of the loops (measured in count of odd numbers, generating a $N$'th power of 3 in its "characteristic" formula. But I'm not completely sure about this, didn't expand really the formulae yet... – Gottfried Helms Aug 05 '17 at 11:17
  • Thank you for pointing that out, I labeled the last two loops incorrectly and fixed it. Secondly, I looked into that in the past, and surprisingly the longer loops seem to contain less numbers overall. I plotted the loops in a tree and found most of the odd numbers in the larger loops lead to dead ends. I also suspect if using a sample size or having more small numbers is skewing the results. – Griffon Theorist697 Aug 05 '17 at 14:12
  • Just out of curiosity: what is a "dead end" here? – Gottfried Helms Aug 06 '17 at 23:45
  • I am sorry for taking a while to respond. I double checked some of my notes and realized I over speculated what I found, and most importantly, needed to show the trees I drew. I need more time to draw out the larger two loops, however I will leave a link to my illustrations. To answer your question, a "dead end" is simply a number branching to a multiple of 3, however I had the wrong idea in mind when I said "...most of the odd numbers in the large loops lead to dead ends..." – Griffon Theorist697 Aug 07 '17 at 22:29
  • "... the dead end..." - well this were obvious if I knew your trees were "backward"-transformation. I speculated about this but was not sure according to your explanations. Thanks for the clarification and compliments for your careful and serious manoevering in the chaos... :-) – Gottfried Helms Aug 08 '17 at 06:40
  • I have done a view into the generalization to $3x+r$ the occurences and relative frequencies of cycles. It is at the moment a draft, but perhaps helps to "open eyes" - at least for me it that study has done this job. If you're interested see http://go.helms-net.de/math/collatz/Collatz_3x_r.pdf I'll work more on it (also to make it more readable) I think I can come back to the specific frequencies-problem better now after being armed with that new insights. – Gottfried Helms Aug 08 '17 at 21:51
  • Looking at the pdf, I think it is really interesting that prime numbers contribute to the formation of loops ($a/5$ = more loops?). Once I get my far less impressive code running, I will continue to study the different loops of the modified $3x+a$ rules to see if I can figure out why $3x+5$ has more loops than $3x+7$, or why these loops form in the first place. I already figured out a few things such as one of the causes for loop length psssst it's a secret!, but aside from that, it will take me much longer to find anything new. I look forward to what we find! – Griffon Theorist697 Aug 08 '17 at 23:17
  • btw.: small update of the text – Gottfried Helms Aug 08 '17 at 23:21
  • One argument, why $3x+5$ has more cycles than $3x+7$ becomes visible in formulae 6.b) and 6.c). Assume $2^S-3^N=p$ is prime. Since it is in the denominator the numerator or $r$ must contain this primefactor so that $a_1$ can be integer. So let $S=5,N=3$ then we have $32-27=p=5$. Now let $r=5$ Then all vectors $E_{N,S}$ which are not simply rotations give cycles: $[1,1,3]$,$[1,2,2]$ This happens with $2^S-3^N=p=5=r$ . $2^S-3^N=p=7$ gives $N=2,S=4$ and the only combinations of $E_{N,S}$ are $[1,3]$,$[2,2]$. But $[2,2]$ gives the trivial cycle, so we have one cycle less.One can go deeper in this – Gottfried Helms Aug 08 '17 at 23:39
  • somewhat related : https://math.stackexchange.com/questions/2241758/on-a-collatz-like-problem-with-two-end-cycles – mercio Aug 09 '17 at 14:39
  • @GriffonTheorist697, I have done quite a bit of work on the Collatz problem from this angle, and I believe I can answer some of your questions. I'd like to compare notes with you. Would you be willing to correspond off of MSE? – G Tony Jacobs Aug 22 '17 at 03:08
  • @GottfriedHelms, if you want to compare notes, too, I'd be happy to correspond with anyone who is investigating this aspect of the Collatz problem. – G Tony Jacobs Aug 22 '17 at 03:19
  • @GTonyJacobs: I'd like to communicate about this, too - but up to september I'm on holiday with a hotel-account only and can only think and write about math absolutely randomly. But after that, I think we can establish some communication-line. – Gottfried Helms Aug 22 '17 at 07:14
  • @GTonyJacobs I am happy to share my notes on Meta, but I don't know the best way to do so. Also, you may find my notes disappointing, only because my lack of formal mathematical training will become extremely evident. – Griffon Theorist697 Aug 22 '17 at 16:02
  • Well, I have a conjecture regarding the answer to this question, but it will take some coding to verify. I have some code in Python that does this kind of computation, but I'll be the first to admit, I'm not an expert programmer.

    I'm not sure the best way to communicate. I'd give you my email address, if you're comfortable working that way.

    – G Tony Jacobs Aug 22 '17 at 16:11
  • Don't worry about lack of formal training. I was getting results on Collatz before I had any formal training, either. If you're on the scent, you're on the scent, and I'm happy to hunt with you. – G Tony Jacobs Aug 22 '17 at 16:12
  • @GTonyJacobs I checked with meta support and they said we cannot use meta to share notes. However, they did suggest you can open a chatroom. Would you like to share a Google drive and add notes to that instead? I can invite people to the drive folder via Gmail. – Griffon Theorist697 Aug 23 '17 at 00:32
  • Whatever works. I've got some stuff in a pdf that I could upload somewhere. – G Tony Jacobs Aug 23 '17 at 00:34
  • @GTonyJacobs I created a Google folder to share notes. To access it, I need your Gmail to share it with you. Please send it to catgriffon3@gmail.com – Griffon Theorist697 Aug 23 '17 at 00:58

2 Answers2

5

this is not yet an answer, just a comment for more illustration

Part 1 - Table of some properties of the $6$ known cycles.

update - a somewhat longer exposition and a longer table at my homepage

  • First column: $a_\min$ as the smallest element of a cycle
  • second column: relative frequency of the type of cycle as tail of a trajectory. Only odd numbers $a_1=1$ to $a_1=999 999$ were tested.
  • Third column: $N$ is length of the cycle (odd numbers $a_k$ only are counted)
  • Foruth column: $S$ is sum-of-exponents $A_k$ (see below) or "number-of-halving-steps"
  • Fifth column: let the transferfunction be defined between odd numbers $a \to b$. Then $b = (3a+5)/2^A$. The given vector is the vector of exponents $A_k$ of a trajectory of length $N$.
    For a certain length $N$ more than one vector can be possible - not only by rotation (which gives cyclically the members of one cycle) but also besides of the pure rotations by other combinations having the same $(N,S)$ which gives then true different cycles.

Table 3x+5:

a_min   rel freq% N  S  vector of exponents
----------------------------------------
  5     20.0000   1  2  [2]            "trivial cycle"
  - - - - - - - - - - - - - - - - - - - 
  1     14.0844   1  3  [3]           because 3*1+r=8=2^3 -> 1

 19     49.6780   3  5  [1, 1, 3]
 29      9.2606   3  5  [2, 1, 2]

187      3.2618  17 27  [1, 1, 1, 1, 1, 2, 1, 1, 2, 1, 2, 3, 2, 1, 1, 1, 5]
347      3.7152  17 27  [1, 1, 1, 1, 3, 1, 1, 1, 1, 2, 2, 4, 1, 2, 1, 2, 2]

Note that a very similar structure occurs for $3x+7$ and $3x+13$ and etc. problem. For instance for the $3x+13$ we get the following table

Table 3x+13:

  a min   relfreq%  N    S   vector                     
  --------------------------------------------------------------------- 
    13  7.692000    1    2  [2]               "trivial cycle"
  ---------------------------------------------------------- 
     1 47.550000    1    4  [4]               // 3*1 + r = 2^4 -> 1 
   -   -   -   -   -   -   -   -   -   -   -   -   -   -   -   -   -   - 
   211  3.334000    5    8  [1, 1, 1, 1, 4]   // 2^8 - 3^5 = 13 = r 
   259  3.934000    5    8  [1, 1, 1, 3, 2] 
   227  1.880000    5    8  [1, 1, 1, 2, 3] 
   287  4.380000    5    8  [1, 2, 1, 1, 3]  
   251  1.958000    5    8  [1, 1, 2, 1, 3] 
   283  2.506000    5    8  [1, 1, 2, 2, 2] 
   319  1.424000    5    8  [1, 2, 1, 2, 2] 

   131 25.342000   15   24  [1, 1, 1, 3, 1, 1, 1, 1, 1, 2, 1, 2, 1, 2, 5] 


Part 2 - approching the problem of different relative frequencies

I'm looking at the backwards-tree starting at the cycle-element $19$ vs. that starting at the cycle-element $29$ Here the key for the greater frequency of the occurence of the $19$-cycle seems to be that the backwards-transformations cover the smaller (odd) numbers compared with that of the $29$-cycle - which means, that the smaller numbers transform to the $19$-cycle compared with the $29$-cycle by the $(3x+5)/2^A$-map. I cannot really formalize this for the relative frequencies below some fixed upper bound $N$ at the moment, but it might give some good intuition...

The representation in a line is as follows:

a_parent   [vector A]

vector $A$ is here the (infinite) vector of all numbers $a_k$ going down to $a_\text{parent}$ by one transformation: $ a_\text{parent}=(3a_k+5)/2^B$ . Of course each element of $A$ (except that which are divisible by $3$) are parents of another vector $AA$. The first couple of this entries are documented then i the following line, indented by some more spaces.

I printed that recursive tree, which has also a cyclic subtree of recursion-cycle of $3$, to a depth to $5$ . To focus the aspect of containing many small numbers, I omitted parents larger than $1000$ and also their subtrees although this might not be perfectly correct, because they can have parents themselves which asre smaller than $1000$ - but I left this aside.

The tree based on cycle-element $19$ has many more small numbers than the tree bases on cycle-element $29$.


mytree(19,5)

 31 [19, 81, 329, 1321, 5289, 21161, "..."]
    19 [11, 49, 201, 809, 3241, 12969, "..."]
        11 [13, 57, 233, 937, 3753, 15017, "..."]
            13 [7, 33, 137, 553, 2217, 8873, "..."]
                7 [3, 17, 73, 297, 1193, 4777, "..."]
                    17 [21, 89, 361, 1449, 5801, 23209, "..."]
                    73 [47, 193, 777, 3113, 12457, 49833, "..."]
                    ... [ ...]
                137 [181, 729, 2921, 11689, 46761, 187049, "..."]
                    181 [119, 481, 1929, 7721, 30889, 123561, "..."]
                    ... [ ...]
                553 [367, 1473, 5897, 23593, 94377, 377513, "..."]
                    367 [243, 977, 3913, 15657, 62633, 250537, "..."]
                    ... [ ...]
                ... [ ...]
            233 [309, 1241, 4969, 19881, 79529, 318121, "..."]
                ... [ ...]
            937 [623, 2497, 9993, 39977, 159913, 639657, "..."]
                623 [829, 3321, 13289, 53161, 212649, 850601, "..."]
                    829 [551, 2209, 8841, 35369, 141481, 565929, "..."]
                    ... [ ...]
                ... [ ...]
            ... [ ...]
        49 [31, 129, 521, 2089, 8361, 33449, "..."]
            31 [19, 81, 329, 1321, 5289, 21161, "..."]
                19 [11, 49, 201, 809, 3241, 12969, "..."]
                    11 [13, 57, 233, 937, 3753, 15017, "..."]
                    49 [31, 129, 521, 2089, 8361, 33449, "..."]
                    809 [1077, 4313, 17257, 69033, 276137, 1104553, "..."]
                    ... [ ...]
                329 [437, 1753, 7017, 28073, 112297, 449193, "..."]
                    437 [581, 2329, 9321, 37289, 149161, 596649, "..."]
                    ... [ ...]
                ... [ ...]
            521 [693, 2777, 11113, 44457, 177833, 711337, "..."]
                ... [ ...]
            ... [ ...]
        809 [1077, 4313, 17257, 69033, 276137, 1104553, "..."]
            ... [ ...]
        ... [ ...]
    329 [437, 1753, 7017, 28073, 112297, 449193, "..."]
        437 [581, 2329, 9321, 37289, 149161, 596649, "..."]
            581 [773, 3097, 12393, 49577, 198313, 793257, "..."]
                773 [1029, 4121, 16489, 65961, 263849, 1055401, "..."]
                    ... [ ...]
                ... [ ...]
            ... [ ...]
        ... [ ...]
    ... [ ...]

mytree(29,5)

 23 [29, 121, 489, 1961, 7849, 31401, "..."]
    29 [37, 153, 617, 2473, 9897, 39593, "..."]
        37 [23, 97, 393, 1577, 6313, 25257, "..."]
            23 [29, 121, 489, 1961, 7849, 31401, "..."]
                29 [37, 153, 617, 2473, 9897, 39593, "..."]
                    37 [23, 97, 393, 1577, 6313, 25257, "..."]
                    617 [821, 3289, 13161, 52649, 210601, 842409, "..."]
                    ... [ ...]
                121 [79, 321, 1289, 5161, 20649, 82601, "..."]
                    79 [51, 209, 841, 3369, 13481, 53929, "..."]
                    ... [ ...]
                ... [ ...]
            97 [63, 257, 1033, 4137, 16553, 66217, "..."]
                257 [341, 1369, 5481, 21929, 87721, 350889, "..."]
                    341 [453, 1817, 7273, 29097, 116393, 465577, "..."]
                    ... [ ...]
                ... [ ...]
            ... [ ...]
        617 [821, 3289, 13161, 52649, 210601, 842409, "..."]
            821 [1093, 4377, 17513, 70057, 280233, 1120937, "..."]
                ... [ ...]
            ... [ ...]
        ... [ ...]
    121 [79, 321, 1289, 5161, 20649, 82601, "..."]
        79 [51, 209, 841, 3369, 13481, 53929, "..."]
            209 [277, 1113, 4457, 17833, 71337, 285353, "..."]
                277 [183, 737, 2953, 11817, 47273, 189097, "..."]
                    737 [981, 3929, 15721, 62889, 251561, 1006249, "..."]
                    ... [ ...]
                ... [ ...]
            841 [559, 2241, 8969, 35881, 143529, 574121, "..."]
                559 [371, 1489, 5961, 23849, 95401, 381609, "..."]
                    371 [493, 1977, 7913, 31657, 126633, 506537, "..."]
                    ... [ ...]
                ... [ ...]
            ... [ ...]
        ... [ ...]
    ... [ ...]


An even more intuitive tree is the same tree but values taken to $\log_2()$. The progression in the vectors in then nearly linear, and the values of the $19$-tree seem a bit more dense than that of the $29$-tree if we select a window of values with a fixed lower and upper bound. But I don't want to suggest that this impression is already objective and could already answer your question!

mytreelog(19,5)

 4.95 [4.25, 6.34, 8.36, 10.4, 12.4, 14.4]
    4.25 [3.46, 5.61, 7.65, 9.66, 11.7, 13.7]
        3.46 [3.70, 5.83, 7.86, 9.87, 11.9, 13.9]
            3.70 [2.81, 5.04, 7.10, 9.11, 11.1, 13.1]
                2.81 [1.58, 4.09, 6.19, 8.21, 10.2, 12.2]
                    4.09 [4.39, 6.48, 8.50, 10.5, 12.5, 14.5]
                    6.19 [5.55, 7.59, 9.60, 11.6, 13.6, 15.6]
                7.10 [7.50, 9.51, 11.5, 13.5, 15.5, 17.5]
                    7.50 [6.89, 8.91, 10.9, 12.9, 14.9, 16.9]
                9.11 [8.52, 10.5, 12.5, 14.5, 16.5, 18.5]
                    8.52 [7.92, 9.93, 11.9, 13.9, 15.9, 17.9]
            7.86 [8.27, 10.3, 12.3, 14.3, 16.3, 18.3]
            9.87 [9.28, 11.3, 13.3, 15.3, 17.3, 19.3]
                9.28 [9.70, 11.7, 13.7, 15.7, 17.7, 19.7]
                    9.70 [9.11, 11.1, 13.1, 15.1, 17.1, 19.1]
        5.61 [4.95, 7.01, 9.03, 11.0, 13.0, 15.0]
            4.95 [4.25, 6.34, 8.36, 10.4, 12.4, 14.4]
                4.25 [3.46, 5.61, 7.65, 9.66, 11.7, 13.7]
                    3.46 [3.70, 5.83, 7.86, 9.87, 11.9, 13.9]
                    5.61 [4.95, 7.01, 9.03, 11.0, 13.0, 15.0]
                    9.66 [10.1, 12.1, 14.1, 16.1, 18.1, 20.1]
                8.36 [8.77, 10.8, 12.8, 14.8, 16.8, 18.8]
                    8.77 [9.18, 11.2, 13.2, 15.2, 17.2, 19.2]
            9.03 [9.44, 11.4, 13.4, 15.4, 17.4, 19.4]
        9.66 [10.1, 12.1, 14.1, 16.1, 18.1, 20.1]
    8.36 [8.77, 10.8, 12.8, 14.8, 16.8, 18.8]
        8.77 [9.18, 11.2, 13.2, 15.2, 17.2, 19.2]
            9.18 [9.59, 11.6, 13.6, 15.6, 17.6, 19.6]
                9.59 [10.0, 12.0, 14.0, 16.0, 18.0, 20.0]

mytreelog(29,5)

 4.52 [4.86, 6.92, 8.93, 10.9, 12.9, 14.9]
    4.86 [5.21, 7.26, 9.27, 11.3, 13.3, 15.3]
        5.21 [4.52, 6.60, 8.62, 10.6, 12.6, 14.6]
            4.52 [4.86, 6.92, 8.93, 10.9, 12.9, 14.9]
                4.86 [5.21, 7.26, 9.27, 11.3, 13.3, 15.3]
                    5.21 [4.52, 6.60, 8.62, 10.6, 12.6, 14.6]
                    9.27 [9.68, 11.7, 13.7, 15.7, 17.7, 19.7]
                6.92 [6.30, 8.33, 10.3, 12.3, 14.3, 16.3]
                    6.30 [5.67, 7.71, 9.72, 11.7, 13.7, 15.7]
            6.60 [5.98, 8.01, 10.0, 12.0, 14.0, 16.0]
                8.01 [8.41, 10.4, 12.4, 14.4, 16.4, 18.4]
                    8.41 [8.82, 10.8, 12.8, 14.8, 16.8, 18.8]
        9.27 [9.68, 11.7, 13.7, 15.7, 17.7, 19.7]
            9.68 [10.1, 12.1, 14.1, 16.1, 18.1, 20.1]
    6.92 [6.30, 8.33, 10.3, 12.3, 14.3, 16.3]
        6.30 [5.67, 7.71, 9.72, 11.7, 13.7, 15.7]
            7.71 [8.11, 10.1, 12.1, 14.1, 16.1, 18.1]
                8.11 [7.52, 9.53, 11.5, 13.5, 15.5, 17.5]
                    9.53 [9.94, 11.9, 13.9, 15.9, 17.9, 19.9]
            9.72 [9.13, 11.1, 13.1, 15.1, 17.1, 19.1]
                9.13 [8.54, 10.5, 12.5, 14.5, 16.5, 18.5]
                    8.54 [8.95, 10.9, 13.0, 15.0, 17.0, 19.0]
0

The reason you see the same proportions converging to each cycle, irrespective of the portion of the natural numbers you sample is as follows:

Each cycle has a fixed number of tributaries or branches which is enumerated by the odd numbers in that loop. For example $1,8,4,2,1$ has one branch. Although each branch is a component of a loop, if followed in reverse it also extends upwards infinitely and receives infinitely many further incoming branches.

Every odd number $x$ can be viewed as the root of a branch extending upwards containing the numbers $x,2x,4x,8x,16x,32x\ldots$. It is the number of those branches extending upwards through the integers, and the frequency in which they in turn have further branches, which govern the proportion of integers in any given range which converge to the given cycle.

Each of those branches, depending on its value $\equiv\mod 3$, will receive incoming branches at certain points. I'm not familiar with the $3x+5$ structure but the rules are generally the same as the conventional conjecture, in which all the odd numbers which are $\equiv0\mod 3$ receive no such incoming branches, those $x\equiv 1\mod 3$ receive incoming branches to every even power of $2$ (i.e. $4x,16x,64x,\ldots$) and those $x\equiv 2\mod 3$ receive branches on every odd power of $2$.

The sub-branches of every branch are distributed in exactly equal proportions between 0,1,2 mod 3 and therefore the sub-branches of every branch within a loop increase in equal proportions as you ascend through the integers.

Once you rise to a level within the integers where you are above all of the cycles, you should therefore find that the proportion of branches converging to each cycle is determined by a) the order of that cycle measured by the number of odd numbers it contains and b) the proportions of the numbers converging to it, which receive branches (as per the example I have given above for the conventional CC).

Robert Frost
  • 9,620
  • I am unsure if the number of odd numbers determines loop proportions. For instance, The data suggests the 19 loop takes up 49% of all natural numbers more or less while the 23 loop takes up around 9.5%, even though both loops share the same number of odd numbers. As for the proportions of numbers converging to various loops, it is hard to know. The speculation that simply more "dead ends" connect to certain loops over others would explain this, however it is also possible that not enough branches for the larger loops were found due to using a sample size. – Griffon Theorist697 Aug 21 '17 at 12:50
  • @GriffonTheorist697 it's a combination of a) the number of odd members of the cycle, and b) the proportion of those which receive sub-branches. In the conventional conjecture, every branch receives sub-branches to exactly $2/3$ of its immediate odd predecessors. e.g. immediate predecessors of $1$ are $1,5,21,85,341,1365...$ Exactly $2/3$ of those receive incoming sub-branches, i.e. $1,5,85,341,...$ If your $3x+5$ is different then that must be because different classes of numbers will receive incoming branches on a varying proportion of their immediate odd predecessors. – Robert Frost Aug 21 '17 at 13:28
  • @GriffonTheorist697 Check the immediate odd predecessors of $19$ and $23$ and see what they are mod $3$ and then check what proportion of those are in the range of the function $3x+5$ – Robert Frost Aug 21 '17 at 13:30
  • This answer is certainly looking in the right direction. The predominance of the 19-loop over the 23-loop is explained to a large degree because $19\equiv 1\pmod 3$ and $23\equiv 2$. The import of that fact depends on which value of $q$ you're working with, as far as the $3n+q$ function. Too see the regularities, you need to look at all possible $q$ values, not just one or two. I have a lot of results in this area. – G Tony Jacobs Aug 22 '17 at 14:17
  • @RobertFrost I agree that the "$2/3$" effect would explain that these loops grow proportionally. However, since both loops seem to grow simultaneously, it seems to be the $19$ loop is bigger. It is possible the sample size biases smaller numbers such as those in the $19$ & $1$ loops because there are always more multiples of smaller numbers than larger values when there is a limit. However, this does not explain why the $1$ loop is smaller than the $19$ loop (for all natural numbers). Therefore, the problem is if a sample sizes is the wrong approach or not for collecting data on this problem. – Griffon Theorist697 Aug 22 '17 at 16:24
  • @GTonyJacobs In my digital sketches, I drew the consequence of the extra "tail" connected to $38$, however this is because 19 $\equiv$ $1$ (mod $9$) and the second branch happens to be part of the loop and not the first, unlike $37$ in the $23$ loop. I failed to draw this out, however the $187$ and $347$ loops also have this extra "tail" effect along with their other tails. The problem is if the sample size draws more numbers from the tails of the $19$ loop vs. those of the larger loops. – Griffon Theorist697 Aug 22 '17 at 16:41
  • I see what you mean about the tails. I guess its more about which loops have extra tails, and how small the numbers are in those. – G Tony Jacobs Aug 22 '17 at 16:51
  • Any cycle with an element larger than $2$ in its shape vector will have extra tails, with the number of extra tails corresponding an a vector element $a$ given by $\lfloor(a-1)/2 \rfloor$. – G Tony Jacobs Aug 22 '17 at 16:54