Cross Reference: /seL4-camkes-master/projects/musllibc/src/math/i386

History log of /seL4-camkes-master/projects/musllibc/src/math/i386_sel4/scalbn.s
Revision		Date	Author	Comments
# baa43bca		19-Mar-2012	Rich Felker <dalias@aerifal.cx>	optimize scalbn family the fscale instruction is slow everywhere, probably because it involves a costly and unnecessary integer truncation operation that ends up being a no-op in common usages. instead, construct a floating point scale value with integer arithmetic and simply multiply by it, when possible. for float and double, this is always possible by going to the next-larger type. we use some cheap but effective saturating arithmetic tricks to make sure even very large-magnitude exponents fit. for long double, if the scaling exponent is too large to fit in the exponent of a long double value, we simply fallback to the expensive fscale method. on atom cpu, these changes speed up scalbn by over 30%. (min rdtsc timing dropped from 110 cycles to 70 cycles.)
# 58ff9e8e		19-Mar-2012	Rich Felker <dalias@aerifal.cx>	asm for scalbn family unlike some implementations, these functions perform the equivalent of gcc's -ffloat-store on the result before returning. this is necessary to raise underflow/overflow/inexact exceptions, perform the correct rounding with denormals, etc.