Microsoft Binary Format


In computing, Microsoft Binary Format was a format for floating-point numbers used in Microsoft's BASIC language products, including MBASIC, GW-BASIC and QuickBASIC prior to version 4.00.
There are two main versions of the format. The original version was designed for memory-constrained systems and stored numbers in 32 bits, with a 23-bit mantissa, 1-bit sign, and an 8-bit exponent. Extended BASIC included a double-precision type with 64 bits.
During the period when it was being ported to the MOS 6502, computers were beginning to ship with more memory as a standard feature. This version was offered with the original 32-bit format or an optional expanded 40-bit format. The 40-bit format was used by most home computers of the 1970s and 1980s. These two versions are sometimes known as "6-digit" and "9-digit" respectively.
QuickBASIC, prior to version 4, reintroduced the double-precision format using a 55-bit mantissa in a 64-bit format. MBF was abandoned during the move to QuickBASIC 4, which used the standard IEEE 754 format, introduced a few years earlier.

History

and Paul Allen were working on Altair BASIC in 1975. They were developing the software at Harvard University on a DEC PDP-10 running their Altair emulator. One thing they lacked was code to handle floating-point numbers, required to support calculations with very big and very small numbers, which would be particularly useful for science and engineering. One of the proposed uses of the Altair was as a scientific calculator.
front panel
At a dinner at Currier House, an undergraduate residential house at Harvard, Gates and Allen complained to a dinner companions that they had to write this code and one of them, Monte Davidoff, told them that he had written floating-point routines before and convinced Gates and Allen that he was capable of writing the Altair BASIC floating-point code. At the time, while IBM had introduced their own programs, there was no standard for floating-point numbers, so Davidoff had to come up with his own. He decided that 32 bits would allow enough range and precision. When Allen had to demonstrate it to MITS, it was the first time it ran on an actual Altair. But it worked, and when he entered ‘PRINT 2+2’, Davidoff's adding routine gave the correct answer.
The source code for Altair BASIC was thought to have been lost to history, but resurfaced in 2000. It had been sitting behind Gates's former tutor and dean Harry Lewis's file cabinet, who rediscovered it. A comment in the source credits Davidoff as the writer of Altair BASIC's math package.
Altair BASIC took off, and soon most early home computers ran some form of Microsoft BASIC. The BASIC port for the 6502 CPU, such as used in the Commodore PET, took up more space due to the lower code density of the 6502. Because of this it would likely not fit in a single ROM chip together with the machine-specific input and output code. Since an extra chip was necessary, extra space was available, and this was used in part to extend the floating-point format from 32 to 40 bits. This extended format was not only provided by Commodore BASIC 1 & 2, but was also supported by Applesoft BASIC I & II since version 1.1, KIM-1 BASIC since version 1.1a, and MicroTAN BASIC since version 2b. Not long afterwards the Z80 ports, such as Level II BASIC for the TRS-80, introduced the 64-bit, double-precision format as a separate data type from 32-bit, single-precision. Microsoft used the same floating-point formats in their implementation of Fortran and for their macro assembler MASM, although their spreadsheet Multiplan and their COBOL implementation used binary-coded decimal floating point. Even so, for a while MBF became the de facto floating-point format on home computers, to the point where people still occasionally encounter legacy files and file formats using it.
minicomputer

IEEE 754

As early as in 1976, Intel was starting the development of a floating-point coprocessor. Intel hoped to be able to sell a chip containing good implementations of all the operations found in the widely varying maths software libraries.
John Palmer, who managed the project, contacted William Kahan of the University of California, who suggested that Intel use the floating point of Digital Equipment Corporation's VAX. The first VAX, the VAX-11/780 had just come out in late 1977, and its floating point was highly regarded. However, seeking to market their chip to the broadest possible market, Intel wanted the best floating point possible, and Kahan went on to draw up specifications.
When rumours of Intel's new chip reached its competitors, they started a standardization effort, called IEEE 754, to prevent Intel from gaining too much ground. Kahan got Palmer's permission to participate; he was allowed to explain Intel's design decisions and their underlying reasoning, but not anything related to Intel's implementation architecture. VAX's floating-point formats differed from MBF only in that it had the sign in the most significant bit.
floating-point coprocessor
It turned out that an 8-bit exponent was not wide enough for some operations desired for double-precision numbers, e.g. to store the product of two 32-bit numbers. Both Kahan's proposal and a counter-proposal by DEC therefore used 11 bits, like the time-tested 60-bit floating-point format of the CDC 6600 from 1965. Kahan's proposal also provided for infinities, which are useful when dealing with division-by-zero conditions; not-a-number values, which are useful when dealing with invalid operations; denormal numbers, which help mitigate problems caused by underflow; and a better balanced exponent bias, which can help avoid overflow and underflow when taking the reciprocal of a number.
In 1980 the Intel 8087 chip was already released, but DEC remained opposed, to denormal numbers in particular, because of performance concerns and since it would give DEC a competitive advantage to standardise on DEC's format. The next year DEC had a study done in order to demonstrate that gradual underflow was a bad idea, but the study concluded the opposite, and DEC gave in. In 1985 the standard was ratified, but it had already become the de facto standard a year earlier, implemented by many manufacturers.
By the time QuickBASIC 4.00 was released, the IEEE 754 standard had become widely adopted—for example, it was incorporated into Intel's 387 coprocessor and every x86 processor from the 486 on. QuickBASIC versions 4.0 and 4.5 use IEEE 754 floating-point variables by default, but there is a command-line option /MBF for the IDE and the compiler that switches from IEEE to MBF floating-point numbers, to support earlier-written programs that rely on details of the MBF data formats. Visual Basic also uses the IEEE 754 format instead of MBF.

Technical details

MBF numbers consist of an 8-bit base-2 exponent with a bias of 128, so that exponents −127…−1 are represented by x = 1…127, exponents 0…127 are represented by x = 128…255, with a special case for x = 0 representing the whole number being zero, a sign bit and a 23-, 31- or 55-bit mantissa of the significand. There is always a 1-bit implied to the left of the explicit mantissa, and the radix point is located before this assumed bit.
The MBF double-precision format provides less scale than the IEEE 754 format, and although the format itself provides almost one extra decimal digit of precision, in practice the stored values are less accurate because IEEE calculations use 80-bit intermediate results, and MBF doesn't. Unlike IEEE floating point, MBF doesn't support denormal numbers, infinities or NaNs.
MBF single-precision format :
ExponentSignSignificand
8 bits,
bit 31–24
1 bit,
bit 23
23 bits,
bit 22–0
xxxxxxxxsmmmmmmmmmmmmmmmmmmmmmmm

MBF extended-precision format :
ExponentSignSignificand
8 bits,
bit 39–32
1 bit,
bit 31
31 bits,
bit 30–0
xxxxxxxxsmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm

MBF double-precision format :
ExponentSignSignificand
8 bits,
bit 63–56
1 bit,
bit 55
55 bits,
bit 54–0
xxxxxxxxsmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm

Examples