Cross Reference: /freebsd-11-stable/contrib/gcc/config/rs6000/darwin-ldouble-format

Long double format
==================

  Each long double is made up of two IEEE doubles.  The value of the
long double is the sum of the values of the two parts (except for
-0.0).  The most significant part is required to be the value of the
long double rounded to the nearest double, as specified by IEEE.  For
Inf values, the least significant part is required to be one of +0.0
or -0.0.  No other requirements are made; so, for example, 1.0 may be
represented as (1.0, +0.0) or (1.0, -0.0), and the low part of a NaN
is don't-care.
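As an illustration of the layout described above, the following C
sketch models a long double as a pair of doubles and checks the
rounding requirement on the high part.  The type 'dd_pair' and the
function name are hypothetical (they do not appear in GCC), and the
check assumes finite inputs, round-to-nearest mode, and double
arithmetic that is not carried out in extended precision.

```c
#include <assert.h>
#include <math.h>
#include <stdbool.h>

/* Hypothetical type: the two IEEE doubles that make up one long double. */
typedef struct {
    double hi;  /* most significant part */
    double lo;  /* least significant part */
} dd_pair;

/* Returns true if hi equals hi + lo rounded to the nearest double,
   which is the requirement placed on the most significant part.
   Assumption: finite inputs and the default rounding mode. */
static bool dd_high_part_is_rounded(dd_pair x)
{
    assert(isfinite(x.hi) && isfinite(x.lo));
    volatile double sum = x.hi + x.lo;  /* store forces rounding to double */
    return sum == x.hi;
}
```

Under these assumptions (1.0, 2^-80) passes the check, while
(1.0, 1.0) does not, because its value 2.0 rounds to 2.0 rather than
to the stored high part.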

Classification
--------------

A long double can represent any value of the form
  s * 2^e * sum(k=0...105: f_k * 2^(-k))
where 's' is +1 or -1, 'e' is between 1022 and -968 inclusive, f_0 is
1, and f_k for k>0 is 0 or 1.  These are the 'normal' long doubles.

A long double can also represent any value of the form
  s * 2^-968 * sum(k=0...105: f_k * 2^(-k))
where 's' is +1 or -1, f_0 is 0, and f_k for k>0 is 0 or 1.  These are
the 'subnormal' long doubles.

There are four long doubles that represent zero, two that represent
+0.0 and two that represent -0.0.  The sign of the high part is the
sign of the long double, and the sign of the low part is ignored.

Likewise, there are four long doubles that represent infinities, two
for +Inf and two for -Inf.

Each NaN, quiet or signalling, that can be represented as a 'double'
can be represented as a 'long double'.  In fact, there are 2^64
equivalent representations for each one.
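The rules for zeros, infinities and NaNs can be sketched in C as
follows.  The names are hypothetical and the code is only an
illustration of the text above, not the classification code used by
GCC or its runtime.  The 2^64 equivalent NaN representations arise
because the 64 bits of the low part are don't-care.

```c
#include <assert.h>
#include <math.h>
#include <stdbool.h>

/* Hypothetical pair type: high and low IEEE doubles. */
typedef struct { double hi; double lo; } dd_pair;

/* NaN: decided by the high part alone; the low part is don't-care. */
static bool dd_is_nan(dd_pair x)  { return isnan(x.hi); }

/* Inf: high part is infinite and the low part is +0.0 or -0.0. */
static bool dd_is_inf(dd_pair x)  { return isinf(x.hi) && x.lo == 0.0; }

/* Zero: both parts are zero, with any combination of signs. */
static bool dd_is_zero(dd_pair x) { return x.hi == 0.0 && x.lo == 0.0; }

/* Sign of a non-NaN long double: taken from the high part only,
   so (+0.0, -0.0) is +0.0 and (-0.0, +0.0) is -0.0. */
static bool dd_is_negative(dd_pair x)
{
    assert(!dd_is_nan(x));
    return signbit(x.hi) != 0;
}
```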

There are certain other valid long doubles where both parts are
nonzero but the low part represents a value which has a bit set below
2^(e-105).  These, together with the subnormal long doubles, make up
the denormal long doubles.

Many possible long double bit patterns are not valid long doubles.
These do not represent any value.

Limits
------

The maximum representable long double is 2^1024-2^918.  The smallest
*normal* positive long double is 2^-968.  The smallest denormalised
positive long double is 2^-1074 (this is the same as for 'double').

Conversions
-----------

A double can be converted to a long double by adding a zero low part.

A long double can be converted to a double by removing the low part.
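A minimal C sketch of the two conversions, using a hypothetical
'dd_pair' type and hypothetical function names.  The choice of +0.0
for the new low part is an assumption; the format also accepts -0.0.
Dropping the low part is sufficient because the high part is already
the value rounded to the nearest double.

```c
#include <assert.h>

/* Hypothetical pair type: high and low IEEE doubles. */
typedef struct { double hi; double lo; } dd_pair;

/* double -> long double: add a zero low part (+0.0 chosen here). */
static dd_pair dd_from_double(double d)
{
    return (dd_pair){ d, 0.0 };
}

/* long double -> double: remove the low part. */
static double dd_to_double(dd_pair x)
{
    return x.hi;
}

/* Usage sketch: a double survives the round trip unchanged. */
static void dd_convert_example(void)
{
    dd_pair x = dd_from_double(1.5);
    assert(x.lo == 0.0);
    assert(dd_to_double(x) == 1.5);
}
```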

Comparisons
-----------

Two long doubles can be compared by comparing the high parts, and if
those compare equal, comparing the low parts.
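The comparison rule can be sketched in C as a three-way compare.  The
type and function names are hypothetical, and the sketch assumes that
neither operand is a NaN.  Because +0.0 and -0.0 compare equal as
doubles, the equivalent encodings of zero and of each infinity
compare equal here too.

```c
#include <assert.h>
#include <math.h>

/* Hypothetical pair type: high and low IEEE doubles. */
typedef struct { double hi; double lo; } dd_pair;

/* Returns -1, 0 or +1 for a < b, a == b, a > b.
   Assumption: neither operand is a NaN. */
static int dd_compare(dd_pair a, dd_pair b)
{
    assert(!isnan(a.hi) && !isnan(b.hi));
    if (a.hi != b.hi)               /* high parts decide first */
        return a.hi < b.hi ? -1 : 1;
    if (a.lo != b.lo)               /* tie: low parts decide */
        return a.lo < b.lo ? -1 : 1;
    return 0;
}
```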

Arithmetic
----------

The unary negate operation negates both the low and high parts.

An absolute or absolute-negate operation must be done by comparing
against zero and negating if necessary.
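A C sketch of negate and absolute value, with hypothetical names.  One
reason a compare is needed is that the low part may have the opposite
sign to the high part, so clearing the sign of each part separately
would change the value.  As written, this sketch leaves -0.0 and NaN
inputs unchanged; that is a simplification of the illustration, not a
statement about the GCC implementation.

```c
#include <assert.h>

/* Hypothetical pair type: high and low IEEE doubles. */
typedef struct { double hi; double lo; } dd_pair;

/* Negate: flip the sign of both parts. */
static dd_pair dd_neg(dd_pair x)
{
    return (dd_pair){ -x.hi, -x.lo };
}

/* Absolute value: compare against zero and negate if necessary. */
static dd_pair dd_abs(dd_pair x)
{
    return x.hi < 0.0 ? dd_neg(x) : x;
}

/* Usage sketch: (-1.0, 2^-80) has the value -(1 - 2^-80), so its
   absolute value is (1.0, -2^-80), not (1.0, 2^-80). */
static void dd_abs_example(void)
{
    dd_pair x = { -1.0, 0x1p-80 };
    dd_pair a = dd_abs(x);
    assert(a.hi == 1.0 && a.lo == -0x1p-80);
}
```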

Addition and subtraction are performed using library routines.  They
are not at present performed perfectly accurately; the result produced
will be within 1ulp of the range generated by adding or subtracting
1ulp from the input values, where a 'ulp' is 2^(e-106) given the
exponent 'e'.  In the presence of cancellation, this may be
arbitrarily inaccurate.  Subtraction is done by negation and addition.
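To make the shape of such a routine concrete, here is a textbook
pair-addition sketch in C built on the error-free TwoSum step.  It is
not the library routine referred to above and makes no claim to match
its error bounds; all names are hypothetical.  It assumes finite
inputs, no overflow, round-to-nearest mode, and a compiler that
neither reassociates floating-point expressions nor evaluates them in
extended precision.  Only the final line, subtraction as negation
followed by addition, is taken directly from the text.

```c
#include <assert.h>
#include <math.h>

/* Hypothetical pair type: high and low IEEE doubles. */
typedef struct { double hi; double lo; } dd_pair;

static dd_pair dd_neg(dd_pair x)
{
    return (dd_pair){ -x.hi, -x.lo };
}

/* Textbook sketch only; finite inputs, overflow not handled. */
static dd_pair dd_add(dd_pair a, dd_pair b)
{
    assert(isfinite(a.hi) && isfinite(b.hi));

    /* TwoSum: s is the rounded sum of the high parts and err is the
       rounding error, so s + err equals a.hi + b.hi exactly. */
    double s   = a.hi + b.hi;
    double bb  = s - a.hi;
    double err = (a.hi - (s - bb)) + (b.hi - bb);

    /* Fold in the low parts; this step is itself rounded. */
    err += a.lo + b.lo;

    /* Renormalise into a new (hi, lo) pair. */
    double hi = s + err;
    double lo = err - (hi - s);
    return (dd_pair){ hi, lo };
}

/* Subtraction is done by negation and addition. */
static dd_pair dd_sub(dd_pair a, dd_pair b)
{
    return dd_add(a, dd_neg(b));
}
```

For example, adding (1.0, 0.0) and (2^-80, 0.0) yields (1.0, 2^-80),
a sum that a single double cannot hold.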

Multiplication is also performed using a library routine.  Its result
will be within 2ulp of the correct result.

Division is also performed using a library routine.  Its result will
be within 3ulp of the correct result.