Converting Base 10 Integers To Base B

Bases By The Numbers - Base 10 to Base B

When discussing the process of base conversion, a logical place to begin is by learning how to first convert our every-day, standard base 10 integers into their corresponding equivalents in other bases.  We will develop a simple PHP function to do this base 10 integer to base B conversion in arbitrary-precision to any number of digits we wish.

The process of integer base conversion is actually very simple and consists of nothing more than collecting up the remainders of a series of reductive integer divisions, in reverse order of their generation (in right-to-left order), and then fetching their corresponding base B digit symbols from within the digits spectrum used for the conversion.

Here, a reductive integer division means that the resulting integer quotient of the division by the base value replaces the original dividend in preparation for the next cycle, if any.  Each division cycle thus reduces the subsequent numerator (X value) in the sequence of divisions.  For example, the integer division of 13/2 results in the integer quotient 6 (Q=6) and a remainder of 1 (R=1).

The process of converting the base 10 integer 13 into binary (base 2):

Let:
X = Base 10 integer to be converted
B = Alternative base into which to convert X
Q = Integer quotient of integer division X/B
R = Integer remainder of integer division X/B

  1. Starting with base 10 integer X, we divide it by the base (B), keeping only the integer quotient (Q) and the remainder (R) as demonstrated below.

    Starting with ... X=13 and B=2:

    $Q = bcdiv($X, $B);
    $R = bcmod($X, $B);
    $X = $Q;



 X / B = Q, R
13 / 2 = 6, 1
 6 / 2 = 3, 0
 3 / 2 = 1, 1
 1 / 2 = 0, 1

The following random example shows the base 10 integer,  X=960843282267206,  expressed in its corresponding equivalents in all bases from 2 to 36.

Base 10 value X = 960843282267206

B   The same numerical value expressed in base B
2   11011010011110000111000010100001100011110001000110
3   11200000001201110101121000011202
4   3122132013002201203301012
5   2001414424020230022311
6   13243313151000502502
7   406246431153136652
8   33236070241436106
9   4600051411530152
10  960843282267206
11  259177390786228
12  8B921A4717AA32
13  3231A195802990
14  12D3B0C4C11A62
15  7613B13606E3B
16  369E1C2863C46
17  1B0A65GA300F9
18  EH1H31F66DG2
19  84DBGIF6CH71
20  4DGCIG5E3806
21  2FCEG464GHI2
22  1E3J8EDE6618
23  104ADA9A35BK
24  F3GN28G5KDE
25  A1LOE22F2D6
26  6KP3IPBA2B0
27  4I01JCAG04K
28  36N6OG3FR32
29  286LD96HL47
30  1IOE95OC3GQ
31  15AHMPKD1AM
32  R9S718CF26
33  KN683UHB88
34  FS1LQBTPOQ
35  C6O048F09G
36  9GL9B60UH2

The above example table demonstrates how the same numerical value can be expressed in several base systems.  

The table below was derived by randomly selecting one of the conversions from within the table above, in this case base 5, to demonstrate how its individual digits were derived by converting the base 10 integer, X=960843282267206, into base 5, one digit at a time, using a series of successive integer divisions and reductions and converting the sequential remainders into base 5 digit symbols.  The same process can used with any base to render EXACT arbitrary-precision base 10 integer to base B conversions to any number of digits.

X                 /Base  Remainder  Symbol
960843282267206   /5     1          1
192168656453441   /5     1          1
38433731290688    /5     3          3
7686746258137     /5     2          2
1537349251627     /5     2          2
307469850325      /5     0          0
61493970065       /5     0          0
12298794013       /5     3          3
2459758802        /5     2          2
491951760         /5     0          0
98390352          /5     2          2
19678070          /5     0          0
3935614           /5     4          4
787122            /5     2          2
157424            /5     4          4
31484             /5     4          4
6296              /5     1          1
1259              /5     4          4
251               /5     1          1
50                /5     0          0
10                /5     0          0
2                 /5     2          2


960843282267206 in base 10
=
2001414424020230022311 in base 5



The Number of Digits in a Base B Integer

It is very interesting to note that we do not have to actually perform any conversion of X to base B to simply predict how many base B digits the result would have if we did!  For example, we may not know what the base 10 integer 960843282267206 looks like when expressed in base 5, but we can still predict that it consists of 22 base 5 digits by using this simple logarithmic formula:


Ydigits = 1 + floor(log(X) / log(B))

In PHP, this formula can be written in very similar form as:

$Ydigits = 1 + floor(log($X) / log($B));

In this case, the log(x) function refers to the natural logarithm.
However, base 10 or any other logarithmic base could be used.



EXAMPLE: For Base 10 Integer X Converted Into Base B Integer Y:

Given base 10 integer, X=960843282267206, to be converted to base B=15:
Base 15 integer equivalent Y = 7613B13606E3B

Base 15 digits in Y = 1 + floor(log(960843282267206) / log(15)) = 13

The above base conversion table for X = 960843282267206, is reproduced again below, with this version of the table including the counts of the digits (in parentheses).

Base 10 value X = 960843282267206

B   Same numerical value expressed in base B (with digits count)
2   11011010011110000111000010100001100011110001000110 (50)
3   11200000001201110101121000011202 (32)
4   3122132013002201203301012 (25)
5   2001414424020230022311 (22)
6   13243313151000502502 (20)
7   406246431153136652 (18)
8   33236070241436106 (17)
9   4600051411530152 (16)
10  960843282267206 (15)
11  259177390786228 (15)
12  8B921A4717AA32 (14)
13  3231A195802990 (14)
14  12D3B0C4C11A62 (14)
15  7613B13606E3B (13)
16  369E1C2863C46 (13)
17  1B0A65GA300F9 (13)
18  EH1H31F66DG2 (12)
19  84DBGIF6CH71 (12)
20  4DGCIG5E3806 (12)
21  2FCEG464GHI2 (12)
22  1E3J8EDE6618 (12)
23  104ADA9A35BK (12)
24  F3GN28G5KDE (11)
25  A1LOE22F2D6 (11)
26  6KP3IPBA2B0 (11)
27  4I01JCAG04K (11)
28  36N6OG3FR32 (11)
29  286LD96HL47 (11)
30  1IOE95OC3GQ (11)
31  15AHMPKD1AM (11)
32  R9S718CF26 (10)
33  KN683UHB88 (10)
34  FS1LQBTPOQ (10)
35  C6O048F09G (10)
36  9GL9B60UH2 (10)

For Higher Bases

For the higher bases, (greater than 10), we traditionally use the sequential (not case sensitive) letters of the English alphabet as extended digit symbols, such as the letters "A" to "F", as used in hexadecimal numbers.  Normally, there is no single digit with a value of 15, so we use the letter "F" to represent it.  When dividing by 16 and the remainder is 15, we use the symbol "F" to represent that digit value in hexadecimal and higher bases.  For numbers expressed in any base from 2 to 10, the digit symbols are the same as the digits themselves.  We only apply the sequential letters to the higher bases when we run out of normal digits to represent their values, that is, base 11 and above.


Below is a simple PHP function designed to perform the complete, exact arbitrary-precision base 10 integer to base B conversion process.  Note that since we are doing arbitrary-precision computations, the function can handle integer base conversions extending out to hundreds or even thousands of digits with EXACT accuracy.

 function bcBase_10_Int_To_Base_B ($X10, $B)
{
 $X = $X10;
 if (bccomp($X, 0) == 0) {return "0";}

 $DigitsSpectrum = substr("0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ",0,$B);

 $XB = "";

  while (bccomp($X, 0) > 0)
 {
  $XBDigitSymbol = substr($DigitsSpectrum, bcmod($X,$B),1);
  $XB = substr($DigitsSpectrum, $XBDigitSymbol . $XB;
  $X = bcdiv($X,$B);
 }
 return $XB;
}

In the above function, to generate the specific digits spectrum string for any given base from B=2 to B=36, we use the following code line:

$DigitsSpectrum = substr("0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ",0,$B);

Since our standard base 10 digits spectrum consists of all the numerical digits from 0 to 9, the base 10 digits spectrum string takes the form, "0123456789".

If we were converting into binary (B=2), then the binary digits spectrum string would only consist of the 2 digits, "01".

If we were converting into octal (B=8), the digits spectrum would take the form, "01234567".

If we were converting into hexadecimal (B=16), the hexadecimal digits spectrum string would take the form, "0123456789ABCDEF".

In the case of B=24, the digits spectrum would take the form, "0123456789ABCDEFGHIJKLMN", and so on.

The 0-based position index of a digit in the digits spectrum corresponds to the value of the remainder of the integer division of the current base 10 integer, X, by the current base (B).  This division is computed and the corresponding base B digit symbol extracted from the digits spectrum by the line:

$XBDigitSymbol = substr($DigitsSpectrum, bcmod($X, $B), 1);

An integer division here means that both dividend (X) and divisor (B) are integers and that any fractional part is dropped without rounding.  The division of 127/16 would result in exactly 7, and not 7.9375, as in floating point arithmetic nor rounded up to 8.  Any decimal fraction that results from the division is simply ignored and dropped without exception.

Within the function, the value of the remainder of the integer division, X/B, is computed by the statement:

bcmod($X,$B)

This value is the index of the base B digit symbol within the digits spectrum.

A simple test call to the above function might resemble the following:
$X = "960843282267206";
$B = 8;

print bcBase_10_Int_To_Base_B($X, $B);

// The printed result should be:
   33236070241436106

Even Higher Bases

For Base 2 to Base 36

As previously defined above, the case insensitive digits spectrum (up to base 36) is:
"0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ"
=
"0123456789abcdefghijklmnopqrstuvwxyz"

For Bases > 36

If we allow case sensitivity, where lowercase "a" is NOT identical to uppercase "A", and both symbols represent entirely different values, then we can use the uppercase letters to extend the digits spectrum by yet another 26 letters, giving us a CASE SENSITIVE digits spectrum consisting of 62 unique digits.

The case sensitive base 62 digits spectrum is:
"0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ"

Note that the lowercase letters from "a" to "f" must be used to represent hexadecimal values under this convention, since the uppercase letters, "A" to "F", are NOT equivalent symbols here.

Allowing case sensitivity this way, lets us handle bases from 2 to 62.  Of course, we could extend the digit symbols indefinitely to represent even higher bases.

SPECIAL NOTE:
  • When displaying numbers in higher bases, we need to be careful which font we choose to display or print the digit symbols.  For example, care must be taken so that digits like "0" (zero) and "1" (one) are not mistaken for letters, like "O" or the lower case "L" or uppercase "I".

    Since in some font styles, certain numerical digits can closely resemble certain letters, we need to read such higher-base numbers carefully, and a well-chosen alpha-numeric font can help reduce this kind of confusion. 

    To avoid this problem, either the 'Consolas' or 'Inconsolata' font is recomended because they show a slash through the digit zero (0) which easily distinguishes the digit from the letter 'O' when higher bases are used.  For those who do not have either font or something similar, they are freely available on the Internet.




Jay Tanner - 2020