• Home
  • History
  • Annotate
  • only in this directory
NameDateSize

..24-Oct-201431

AsmParser/H24-Oct-20147

CMakeLists.txtH A D24-Oct-20142 KiB

Disassembler/H24-Oct-201410

InstPrinter/H24-Oct-201411

LLVMBuild.txtH A D24-Oct-20141 KiB

MakefileH A D24-Oct-2014861

MCTargetDesc/H24-Oct-201416

README-FPStack.txtH A D24-Oct-20142.7 KiB

README-MMX.txtH A D24-Oct-20141.5 KiB

README-SSE.txtH A D24-Oct-201427 KiB

README-UNIMPLEMENTED.txtH A D24-Oct-2014679

README-X86-64.txtH A D24-Oct-20146 KiB

README.txtH A D24-Oct-201453.6 KiB

TargetInfo/H24-Oct-20146

Utils/H24-Oct-20147

X86.hH A D24-Oct-20142.7 KiB

X86.tdH A D24-Oct-201415.1 KiB

X86AsmPrinter.cppH A D24-Oct-201428.3 KiB

X86AsmPrinter.hH A D24-Oct-20143 KiB

X86CallingConv.tdH A D24-Oct-201418.5 KiB

X86CodeEmitter.cppH A D24-Oct-201451.1 KiB

X86COFFMachineModuleInfo.cppH A D24-Oct-2014614

X86COFFMachineModuleInfo.hH A D24-Oct-20141.4 KiB

X86CompilationCallback_Win64.asmH A D24-Oct-20141.6 KiB

X86ELFWriterInfo.cppH A D24-Oct-20144.1 KiB

X86ELFWriterInfo.hH A D24-Oct-20142.2 KiB

X86FastISel.cppH A D24-Oct-201474.7 KiB

X86FloatingPoint.cppH A D24-Oct-201465.7 KiB

X86FrameLowering.cppH A D24-Oct-201457.2 KiB

X86FrameLowering.hH A D24-Oct-20142.5 KiB

X86Instr3DNow.tdH A D24-Oct-20144.3 KiB

X86InstrArithmetic.tdH A D24-Oct-201459 KiB

X86InstrBuilder.hH A D24-Oct-20146.6 KiB

X86InstrCMovSetCC.tdH A D24-Oct-20145.1 KiB

X86InstrCompiler.tdH A D24-Oct-201480.4 KiB

X86InstrControl.tdH A D24-Oct-201412.3 KiB

X86InstrExtension.tdH A D24-Oct-20148.7 KiB

X86InstrFMA.tdH A D24-Oct-201418 KiB

X86InstrFormats.tdH A D24-Oct-201428.1 KiB

X86InstrFPStack.tdH A D24-Oct-201433.9 KiB

X86InstrFragmentsSIMD.tdH A D24-Oct-201418.9 KiB

X86InstrInfo.cppH A D24-Oct-2014203.2 KiB

X86InstrInfo.hH A D24-Oct-201419.2 KiB

X86InstrInfo.tdH A D24-Oct-201495 KiB

X86InstrMMX.tdH A D24-Oct-201427.6 KiB

X86InstrShiftRotate.tdH A D24-Oct-201444.7 KiB

X86InstrSSE.tdH A D24-Oct-2014392.7 KiB

X86InstrSVM.tdH A D24-Oct-20142.1 KiB

X86InstrSystem.tdH A D24-Oct-201424.2 KiB

X86InstrVMX.tdH A D24-Oct-20143.2 KiB

X86InstrXOP.tdH A D24-Oct-201414.9 KiB

X86ISelDAGToDAG.cppH A D24-Oct-2014102.6 KiB

X86ISelLowering.cppH A D24-Oct-2014642.7 KiB

X86ISelLowering.hH A D24-Oct-201437.4 KiB

X86JITInfo.cppH A D24-Oct-201419.3 KiB

X86JITInfo.hH A D24-Oct-20143 KiB

X86MachineFunctionInfo.cppH A D24-Oct-2014444

X86MachineFunctionInfo.hH A D24-Oct-20145.6 KiB

X86MCInstLower.cppH A D24-Oct-201429.1 KiB

X86MCInstLower.hH A D24-Oct-20141.3 KiB

X86RegisterInfo.cppH A D24-Oct-201429.4 KiB

X86RegisterInfo.hH A D24-Oct-20145.2 KiB

X86RegisterInfo.tdH A D24-Oct-201417.8 KiB

X86Relocations.hH A D24-Oct-20142 KiB

X86Schedule.tdH A D24-Oct-201415.6 KiB

X86ScheduleAtom.tdH A D24-Oct-201427.7 KiB

X86SelectionDAGInfo.cppH A D24-Oct-20149.9 KiB

X86SelectionDAGInfo.hH A D24-Oct-20141.9 KiB

X86Subtarget.cppH A D24-Oct-201412.9 KiB

X86Subtarget.hH A D24-Oct-201411.2 KiB

X86TargetMachine.cppH A D24-Oct-20147.1 KiB

X86TargetMachine.hH A D24-Oct-20144.5 KiB

X86TargetObjectFile.cppH A D24-Oct-20141.9 KiB

X86TargetObjectFile.hH A D24-Oct-20141.5 KiB

X86VZeroUpper.cppH A D24-Oct-20149.3 KiB

README-FPStack.txt

1//===---------------------------------------------------------------------===//
2// Random ideas for the X86 backend: FP stack related stuff
3//===---------------------------------------------------------------------===//
4
5//===---------------------------------------------------------------------===//
6
7Some targets (e.g. athlons) prefer freep to fstp ST(0):
8http://gcc.gnu.org/ml/gcc-patches/2004-04/msg00659.html
9
10//===---------------------------------------------------------------------===//
11
12This should use fiadd on chips where it is profitable:
13double foo(double P, int *I) { return P+*I; }
14
15We have fiadd patterns now but the following have the same cost and
16complexity. We need a way to specify the latter is more profitable.
17
18def FpADD32m  : FpI<(ops RFP:$dst, RFP:$src1, f32mem:$src2), OneArgFPRW,
19                    [(set RFP:$dst, (fadd RFP:$src1,
20                                     (extloadf64f32 addr:$src2)))]>;
21                // ST(0) = ST(0) + [mem32]
22
23def FpIADD32m : FpI<(ops RFP:$dst, RFP:$src1, i32mem:$src2), OneArgFPRW,
24                    [(set RFP:$dst, (fadd RFP:$src1,
25                                     (X86fild addr:$src2, i32)))]>;
26                // ST(0) = ST(0) + [mem32int]
27
28//===---------------------------------------------------------------------===//
29
30The FP stackifier should handle simple permutations to reduce the number of shuffle
31instructions, e.g. turning:
32
33fld P	->		fld Q
34fld Q			fld P
35fxch
36
37or:
38
39fxch	->		fucomi
40fucomi			jl X
41jg X
42
43Ideas:
44http://gcc.gnu.org/ml/gcc-patches/2004-11/msg02410.html
45
46
47//===---------------------------------------------------------------------===//
48
49Add a target specific hook to DAG combiner to handle SINT_TO_FP and
50FP_TO_SINT when the source operand is already in memory.
51
52//===---------------------------------------------------------------------===//
53
54Open code rint,floor,ceil,trunc:
55http://gcc.gnu.org/ml/gcc-patches/2004-08/msg02006.html
56http://gcc.gnu.org/ml/gcc-patches/2004-08/msg02011.html
57
58Opencode the sincos[f] libcall.
59
60//===---------------------------------------------------------------------===//
61
62None of the FPStack instructions are handled in
63X86RegisterInfo::foldMemoryOperand, which prevents the spiller from
64folding spill code into the instructions.
65
66//===---------------------------------------------------------------------===//
67
68Currently the x86 codegen isn't very good at mixing SSE and FPStack
69code:
70
71unsigned int foo(double x) { return x; }
72
73foo:
74	subl $20, %esp
75	movsd 24(%esp), %xmm0
76	movsd %xmm0, 8(%esp)
77	fldl 8(%esp)
78	fisttpll (%esp)
79	movl (%esp), %eax
80	addl $20, %esp
81	ret
82
83This just requires being smarter when custom expanding fptoui.
84
85//===---------------------------------------------------------------------===//
86

README-MMX.txt

1//===---------------------------------------------------------------------===//
2// Random ideas for the X86 backend: MMX-specific stuff.
3//===---------------------------------------------------------------------===//
4
5//===---------------------------------------------------------------------===//
6
7This:
8
9#include <mmintrin.h>
10
11__v2si qux(int A) {
12  return (__v2si){ 0, A };
13}
14
15is compiled into:
16
17_qux:
18        subl $28, %esp
19        movl 32(%esp), %eax
20        movd %eax, %mm0
21        movq %mm0, (%esp)
22        movl (%esp), %eax
23        movl %eax, 20(%esp)
24        movq %mm0, 8(%esp)
25        movl 12(%esp), %eax
26        movl %eax, 16(%esp)
27        movq 16(%esp), %mm0
28        addl $28, %esp
29        ret
30
31Yuck!
32
33GCC gives us:
34
35_qux:
36        subl    $12, %esp
37        movl    16(%esp), %eax
38        movl    20(%esp), %edx
39        movl    $0, (%eax)
40        movl    %edx, 4(%eax)
41        addl    $12, %esp
42        ret     $4
43
44//===---------------------------------------------------------------------===//
45
46We generate crappy code for this:
47
48__m64 t() {
49  return _mm_cvtsi32_si64(1);
50}
51
52_t:
53	subl	$12, %esp
54	movl	$1, %eax
55	movd	%eax, %mm0
56	movq	%mm0, (%esp)
57	movl	(%esp), %eax
58	movl	4(%esp), %edx
59	addl	$12, %esp
60	ret
61
62The extra stack traffic is covered in the previous entry. But the other reason
63is we are not smart about materializing constants in MMX registers. With -m64
64
65	movl	$1, %eax
66	movd	%eax, %mm0
67	movd	%mm0, %rax
68	ret
69
70We should be using a constantpool load instead:
71	movq	LC0(%rip), %rax
72

README-SSE.txt

1//===---------------------------------------------------------------------===//
2// Random ideas for the X86 backend: SSE-specific stuff.
3//===---------------------------------------------------------------------===//
4
5//===---------------------------------------------------------------------===//
6
7SSE Variable shift can be custom lowered to something like this, which uses a
8small table + unaligned load + shuffle instead of going through memory.
9
10__m128i_shift_right:
11	.byte	  0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15
12	.byte	 -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1
13
14...
15__m128i shift_right(__m128i value, unsigned long offset) {
16  return _mm_shuffle_epi8(value,
17               _mm_loadu_si128((__m128i *) (___m128i_shift_right + offset)));
18}
19
20//===---------------------------------------------------------------------===//
21
22SSE has instructions for doing operations on complex numbers, we should pattern
23match them.   For example, this should turn into a horizontal add:
24
25typedef float __attribute__((vector_size(16))) v4f32;
26float f32(v4f32 A) {
27  return A[0]+A[1]+A[2]+A[3];
28}
29
30Instead we get this:
31
32_f32:                                   ## @f32
33	pshufd	$1, %xmm0, %xmm1        ## xmm1 = xmm0[1,0,0,0]
34	addss	%xmm0, %xmm1
35	pshufd	$3, %xmm0, %xmm2        ## xmm2 = xmm0[3,0,0,0]
36	movhlps	%xmm0, %xmm0            ## xmm0 = xmm0[1,1]
37	movaps	%xmm0, %xmm3
38	addss	%xmm1, %xmm3
39	movdqa	%xmm2, %xmm0
40	addss	%xmm3, %xmm0
41	ret
42
43Also, there are cases where some simple local SLP would improve codegen a bit.
44compiling this:
45
46_Complex float f32(_Complex float A, _Complex float B) {
47  return A+B;
48}
49
50into:
51
52_f32:                                   ## @f32
53	movdqa	%xmm0, %xmm2
54	addss	%xmm1, %xmm2
55	pshufd	$1, %xmm1, %xmm1        ## xmm1 = xmm1[1,0,0,0]
56	pshufd	$1, %xmm0, %xmm3        ## xmm3 = xmm0[1,0,0,0]
57	addss	%xmm1, %xmm3
58	movaps	%xmm2, %xmm0
59	unpcklps	%xmm3, %xmm0    ## xmm0 = xmm0[0],xmm3[0],xmm0[1],xmm3[1]
60	ret
61
62seems silly when it could just be one addps.
63
64
65//===---------------------------------------------------------------------===//
66
67Expand libm rounding functions inline:  Significant speedups possible.
68http://gcc.gnu.org/ml/gcc-patches/2006-10/msg00909.html
69
70//===---------------------------------------------------------------------===//
71
72When compiled with unsafemath enabled, "main" should enable SSE DAZ mode and
73other fast SSE modes.
74
75//===---------------------------------------------------------------------===//
76
77Think about doing i64 math in SSE regs on x86-32.
78
79//===---------------------------------------------------------------------===//
80
81This testcase should have no SSE instructions in it, and only one load from
82a constant pool:
83
84double %test3(bool %B) {
85        %C = select bool %B, double 123.412, double 523.01123123
86        ret double %C
87}
88
89Currently, the select is being lowered, which prevents the dag combiner from
90turning 'select (load CPI1), (load CPI2)' -> 'load (select CPI1, CPI2)'
91
92The pattern isel got this one right.
93
94//===---------------------------------------------------------------------===//
95
96SSE should implement 'select_cc' using 'emulated conditional moves' that use
97pcmp/pand/pandn/por to do a selection instead of a conditional branch:
98
99double %X(double %Y, double %Z, double %A, double %B) {
100        %C = setlt double %A, %B
101        %z = fadd double %Z, 0.0    ;; select operand is not a load
102        %D = select bool %C, double %Y, double %z
103        ret double %D
104}
105
106We currently emit:
107
108_X:
109        subl $12, %esp
110        xorpd %xmm0, %xmm0
111        addsd 24(%esp), %xmm0
112        movsd 32(%esp), %xmm1
113        movsd 16(%esp), %xmm2
114        ucomisd 40(%esp), %xmm1
115        jb LBB_X_2
116LBB_X_1:
117        movsd %xmm0, %xmm2
118LBB_X_2:
119        movsd %xmm2, (%esp)
120        fldl (%esp)
121        addl $12, %esp
122        ret
123
124//===---------------------------------------------------------------------===//
125
126Lower memcpy / memset to a series of SSE 128 bit move instructions when it's
127feasible.
128
129//===---------------------------------------------------------------------===//
130
131Codegen:
132  if (copysign(1.0, x) == copysign(1.0, y))
133into:
134  if (x^y & mask)
135when using SSE.
136
137//===---------------------------------------------------------------------===//
138
139Use movhps to update upper 64-bits of a v4sf value. Also movlps on lower half
140of a v4sf value.
141
142//===---------------------------------------------------------------------===//
143
144Better codegen for vector_shuffles like this { x, 0, 0, 0 } or { x, 0, x, 0}.
145Perhaps use pxor / xorp* to clear a XMM register first?
146
147//===---------------------------------------------------------------------===//
148
149External test Nurbs exposed some problems. Look for
150__ZN15Nurbs_SSE_Cubic17TessellateSurfaceE, bb cond_next140. This is what icc
151emits:
152
153        movaps    (%edx), %xmm2                                 #59.21
154        movaps    (%edx), %xmm5                                 #60.21
155        movaps    (%edx), %xmm4                                 #61.21
156        movaps    (%edx), %xmm3                                 #62.21
157        movl      40(%ecx), %ebp                                #69.49
158        shufps    $0, %xmm2, %xmm5                              #60.21
159        movl      100(%esp), %ebx                               #69.20
160        movl      (%ebx), %edi                                  #69.20
161        imull     %ebp, %edi                                    #69.49
162        addl      (%eax), %edi                                  #70.33
163        shufps    $85, %xmm2, %xmm4                             #61.21
164        shufps    $170, %xmm2, %xmm3                            #62.21
165        shufps    $255, %xmm2, %xmm2                            #63.21
166        lea       (%ebp,%ebp,2), %ebx                           #69.49
167        negl      %ebx                                          #69.49
168        lea       -3(%edi,%ebx), %ebx                           #70.33
169        shll      $4, %ebx                                      #68.37
170        addl      32(%ecx), %ebx                                #68.37
171        testb     $15, %bl                                      #91.13
172        jne       L_B1.24       # Prob 5%                       #91.13
173
174This is the llvm code after instruction scheduling:
175
176cond_next140 (0xa910740, LLVM BB @0xa90beb0):
177	%reg1078 = MOV32ri -3
178	%reg1079 = ADD32rm %reg1078, %reg1068, 1, %NOREG, 0
179	%reg1037 = MOV32rm %reg1024, 1, %NOREG, 40
180	%reg1080 = IMUL32rr %reg1079, %reg1037
181	%reg1081 = MOV32rm %reg1058, 1, %NOREG, 0
182	%reg1038 = LEA32r %reg1081, 1, %reg1080, -3
183	%reg1036 = MOV32rm %reg1024, 1, %NOREG, 32
184	%reg1082 = SHL32ri %reg1038, 4
185	%reg1039 = ADD32rr %reg1036, %reg1082
186	%reg1083 = MOVAPSrm %reg1059, 1, %NOREG, 0
187	%reg1034 = SHUFPSrr %reg1083, %reg1083, 170
188	%reg1032 = SHUFPSrr %reg1083, %reg1083, 0
189	%reg1035 = SHUFPSrr %reg1083, %reg1083, 255
190	%reg1033 = SHUFPSrr %reg1083, %reg1083, 85
191	%reg1040 = MOV32rr %reg1039
192	%reg1084 = AND32ri8 %reg1039, 15
193	CMP32ri8 %reg1084, 0
194	JE mbb<cond_next204,0xa914d30>
195
196Still ok. After register allocation:
197
198cond_next140 (0xa910740, LLVM BB @0xa90beb0):
199	%EAX = MOV32ri -3
200	%EDX = MOV32rm <fi#3>, 1, %NOREG, 0
201	ADD32rm %EAX<def&use>, %EDX, 1, %NOREG, 0
202	%EDX = MOV32rm <fi#7>, 1, %NOREG, 0
203	%EDX = MOV32rm %EDX, 1, %NOREG, 40
204	IMUL32rr %EAX<def&use>, %EDX
205	%ESI = MOV32rm <fi#5>, 1, %NOREG, 0
206	%ESI = MOV32rm %ESI, 1, %NOREG, 0
207	MOV32mr <fi#4>, 1, %NOREG, 0, %ESI
208	%EAX = LEA32r %ESI, 1, %EAX, -3
209	%ESI = MOV32rm <fi#7>, 1, %NOREG, 0
210	%ESI = MOV32rm %ESI, 1, %NOREG, 32
211	%EDI = MOV32rr %EAX
212	SHL32ri %EDI<def&use>, 4
213	ADD32rr %EDI<def&use>, %ESI
214	%XMM0 = MOVAPSrm %ECX, 1, %NOREG, 0
215	%XMM1 = MOVAPSrr %XMM0
216	SHUFPSrr %XMM1<def&use>, %XMM1, 170
217	%XMM2 = MOVAPSrr %XMM0
218	SHUFPSrr %XMM2<def&use>, %XMM2, 0
219	%XMM3 = MOVAPSrr %XMM0
220	SHUFPSrr %XMM3<def&use>, %XMM3, 255
221	SHUFPSrr %XMM0<def&use>, %XMM0, 85
222	%EBX = MOV32rr %EDI
223	AND32ri8 %EBX<def&use>, 15
224	CMP32ri8 %EBX, 0
225	JE mbb<cond_next204,0xa914d30>
226
227This looks really bad. The problem is shufps is a destructive opcode. Since it
228appears as operand two in more than one shufps op, it resulted in a number of
229copies. Note icc also suffers from the same problem. Either the instruction
230selector should select pshufd or the register allocator can make the two-address
231to three-address transformation.
232
233It also exposes some other problems. See MOV32ri -3 and the spills.
234
235//===---------------------------------------------------------------------===//
236
237Consider:
238
239__m128 test(float a) {
240  return _mm_set_ps(0.0, 0.0, 0.0, a*a);
241}
242
243This compiles into:
244
245movss 4(%esp), %xmm1
246mulss %xmm1, %xmm1
247xorps %xmm0, %xmm0
248movss %xmm1, %xmm0
249ret
250
251Because mulss doesn't modify the top 3 elements, the top elements of 
252xmm1 are already zero'd.  We could compile this to:
253
254movss 4(%esp), %xmm0
255mulss %xmm0, %xmm0
256ret
257
258//===---------------------------------------------------------------------===//
259
260Here's a sick and twisted idea.  Consider code like this:
261
262__m128 test(__m128 a) {
263  float b = *(float*)&a;
264  ...
265  return _mm_set_ps(0.0, 0.0, 0.0, b);
266}
267
268This might compile to this code:
269
270movaps c(%esp), %xmm1
271xorps %xmm0, %xmm0
272movss %xmm1, %xmm0
273ret
274
275Now consider if the ... code caused xmm1 to get spilled.  This might produce
276this code:
277
278movaps c(%esp), %xmm1
279movaps %xmm1, c2(%esp)
280...
281
282xorps %xmm0, %xmm0
283movaps c2(%esp), %xmm1
284movss %xmm1, %xmm0
285ret
286
287However, since the reload is only used by these instructions, we could 
288"fold" it into the uses, producing something like this:
289
290movaps c(%esp), %xmm1
291movaps %xmm1, c2(%esp)
292...
293
294movss c2(%esp), %xmm0
295ret
296
297... saving two instructions.
298
299The basic idea is that a reload from a spill slot can, if only one 4-byte
300chunk is used, bring in 3 zeros and the one element instead of 4 elements.
301This can be used to simplify a variety of shuffle operations, where the
302elements are fixed zeros.
303
304//===---------------------------------------------------------------------===//
305
306This code generates ugly code, probably due to costs being off or something:
307
308define void @test(float* %P, <4 x float>* %P2 ) {
309        %xFloat0.688 = load float* %P
310        %tmp = load <4 x float>* %P2
311        %inFloat3.713 = insertelement <4 x float> %tmp, float 0.0, i32 3
312        store <4 x float> %inFloat3.713, <4 x float>* %P2
313        ret void
314}
315
316Generates:
317
318_test:
319	movl	8(%esp), %eax
320	movaps	(%eax), %xmm0
321	pxor	%xmm1, %xmm1
322	movaps	%xmm0, %xmm2
323	shufps	$50, %xmm1, %xmm2
324	shufps	$132, %xmm2, %xmm0
325	movaps	%xmm0, (%eax)
326	ret
327
328Would it be better to generate:
329
330_test:
331        movl 8(%esp), %ecx
332        movaps (%ecx), %xmm0
333	xor %eax, %eax
334        pinsrw $6, %eax, %xmm0
335        pinsrw $7, %eax, %xmm0
336        movaps %xmm0, (%ecx)
337        ret
338
339?
340
341//===---------------------------------------------------------------------===//
342
343Some useful information in the Apple Altivec / SSE Migration Guide:
344
345http://developer.apple.com/documentation/Performance/Conceptual/
346Accelerate_sse_migration/index.html
347
348e.g. SSE select using and, andnot, or. Various SSE compare translations.
349
350//===---------------------------------------------------------------------===//
351
352Add hooks to commute some CMPP operations.
353
354//===---------------------------------------------------------------------===//
355
356Apply the same transformation that merged four float into a single 128-bit load
357to loads from constant pool.
358
359//===---------------------------------------------------------------------===//
360
361Floating point max / min are commutable when -enable-unsafe-fp-path is
362specified. We should turn int_x86_sse_max_ss and X86ISD::FMIN etc. into other
363nodes which are selected to max / min instructions that are marked commutable.
364
365//===---------------------------------------------------------------------===//
366
367We should materialize vector constants like "all ones" and "signbit" with 
368code like:
369
370     cmpeqps xmm1, xmm1   ; xmm1 = all-ones
371
372and:
373     cmpeqps xmm1, xmm1   ; xmm1 = all-ones
374     psrlq   xmm1, 31     ; xmm1 = all 100000000000...
375
376instead of using a load from the constant pool.  The latter is important for
377ABS/NEG/copysign etc.
378
379//===---------------------------------------------------------------------===//
380
381These functions:
382
383#include <xmmintrin.h>
384__m128i a;
385void x(unsigned short n) {
386  a = _mm_slli_epi32 (a, n);
387}
388void y(unsigned n) {
389  a = _mm_slli_epi32 (a, n);
390}
391
392compile to ( -O3 -static -fomit-frame-pointer):
393_x:
394        movzwl  4(%esp), %eax
395        movd    %eax, %xmm0
396        movaps  _a, %xmm1
397        pslld   %xmm0, %xmm1
398        movaps  %xmm1, _a
399        ret
400_y:
401        movd    4(%esp), %xmm0
402        movaps  _a, %xmm1
403        pslld   %xmm0, %xmm1
404        movaps  %xmm1, _a
405        ret
406
407"y" looks good, but "x" does silly movzwl stuff around into a GPR.  It seems
408like movd would be sufficient in both cases as the value is already zero 
409extended in the 32-bit stack slot IIRC.  For signed short, it should also be
410safe, as a really-signed value would be undefined for pslld.
411
412
413//===---------------------------------------------------------------------===//
414
415#include <math.h>
416int t1(double d) { return signbit(d); }
417
418This currently compiles to:
419	subl	$12, %esp
420	movsd	16(%esp), %xmm0
421	movsd	%xmm0, (%esp)
422	movl	4(%esp), %eax
423	shrl	$31, %eax
424	addl	$12, %esp
425	ret
426
427We should use movmskp{s|d} instead.
428
429//===---------------------------------------------------------------------===//
430
431CodeGen/X86/vec_align.ll tests whether we can turn 4 scalar loads into a single
432(aligned) vector load.  This functionality has a couple of problems.
433
4341. The code to infer alignment from loads of globals is in the X86 backend,
435   not the dag combiner.  This is because dagcombine2 needs to be able to see
436   through the X86ISD::Wrapper node, which DAGCombine can't really do.
4372. The code for turning 4 x load into a single vector load is target 
438   independent and should be moved to the dag combiner.
4393. The code for turning 4 x load into a vector load can only handle a direct 
440   load from a global or a direct load from the stack.  It should be generalized
441   to handle any load from P, P+4, P+8, P+12, where P can be anything.
4424. The alignment inference code cannot handle loads from globals in non-static
443   mode because it doesn't look through the extra dyld stub load.  If you try
444   vec_align.ll without -relocation-model=static, you'll see what I mean.
445
446//===---------------------------------------------------------------------===//
447
448We should lower store(fneg(load p), q) into an integer load+xor+store, which
449eliminates a constant pool load.  For example, consider:
450
451define i64 @ccosf(float %z.0, float %z.1) nounwind readonly  {
452entry:
453 %tmp6 = fsub float -0.000000e+00, %z.1		; <float> [#uses=1]
454 %tmp20 = tail call i64 @ccoshf( float %tmp6, float %z.0 ) nounwind readonly
455 ret i64 %tmp20
456}
457declare i64 @ccoshf(float %z.0, float %z.1) nounwind readonly
458
459This currently compiles to:
460
461LCPI1_0:					#  <4 x float>
462	.long	2147483648	# float -0
463	.long	2147483648	# float -0
464	.long	2147483648	# float -0
465	.long	2147483648	# float -0
466_ccosf:
467	subl	$12, %esp
468	movss	16(%esp), %xmm0
469	movss	%xmm0, 4(%esp)
470	movss	20(%esp), %xmm0
471	xorps	LCPI1_0, %xmm0
472	movss	%xmm0, (%esp)
473	call	L_ccoshf$stub
474	addl	$12, %esp
475	ret
476
477Note the load into xmm0, then xor (to negate), then store.  In PIC mode,
478this code computes the pic base and does two loads to do the constant pool 
479load, so the improvement is much bigger.
480
481The tricky part about this xform is that the argument load/store isn't exposed
482until post-legalize, and at that point, the fneg has been custom expanded into 
483an X86 fxor.  This means that we need to handle this case in the x86 backend
484instead of in target independent code.
485
486//===---------------------------------------------------------------------===//
487
488Non-SSE4 insert into 16 x i8 is atrociously bad.
489
490//===---------------------------------------------------------------------===//
491
492<2 x i64> extract is substantially worse than <2 x f64>, even if the destination
493is memory.
494
495//===---------------------------------------------------------------------===//
496
497SSE4 extract-to-mem ops aren't being pattern matched because of the AssertZext
498sitting between the truncate and the extract.
499
500//===---------------------------------------------------------------------===//
501
502INSERTPS can match any insert (extract, imm1), imm2 for 4 x float, and insert
503any number of 0.0 simultaneously.  Currently we only use it for simple
504insertions.
505
506See comments in LowerINSERT_VECTOR_ELT_SSE4.
507
508//===---------------------------------------------------------------------===//
509
510On a random note, SSE2 should declare insert/extract of 2 x f64 as legal, not
511Custom.  All combinations of insert/extract reg-reg, reg-mem, and mem-reg are
512legal, it'll just take a few extra patterns written in the .td file.
513
514Note: this is not a code quality issue; the custom lowered code happens to be
515right, but we shouldn't have to custom lower anything.  This is probably related
516to <2 x i64> ops being so bad.
517
518//===---------------------------------------------------------------------===//
519
520'select' on vectors and scalars could be a whole lot better.  We currently 
521lower them to conditional branches.  On x86-64 for example, we compile this:
522
523double test(double a, double b, double c, double d) { return a<b ? c : d; }
524
525to:
526
527_test:
528	ucomisd	%xmm0, %xmm1
529	ja	LBB1_2	# entry
530LBB1_1:	# entry
531	movapd	%xmm3, %xmm2
532LBB1_2:	# entry
533	movapd	%xmm2, %xmm0
534	ret
535
536instead of:
537
538_test:
539	cmpltsd	%xmm1, %xmm0
540	andpd	%xmm0, %xmm2
541	andnpd	%xmm3, %xmm0
542	orpd	%xmm2, %xmm0
543	ret
544
545For unpredictable branches, the latter is much more efficient.  This should
546just be a matter of having scalar sse map to SELECT_CC and custom expanding
547or iseling it.
548
549//===---------------------------------------------------------------------===//
550
551LLVM currently generates stack realignment code, when it is not necessarily
552needed. The problem is that we need to know about stack alignment too early,
553before RA runs.
554
555At that point we don't know, whether there will be vector spill, or not.
556Stack realignment logic is overly conservative here, but otherwise we can
557produce unaligned loads/stores.
558
559Fixing this will require some huge RA changes.
560
561Testcase:
562#include <emmintrin.h>
563
564typedef short vSInt16 __attribute__ ((__vector_size__ (16)));
565
566static const vSInt16 a = {- 22725, - 12873, - 22725, - 12873, - 22725, - 12873,
567- 22725, - 12873};
568
569vSInt16 madd(vSInt16 b)
570{
571    return _mm_madd_epi16(a, b);
572}
573
574Generated code (x86-32, linux):
575madd:
576        pushl   %ebp
577        movl    %esp, %ebp
578        andl    $-16, %esp
579        movaps  .LCPI1_0, %xmm1
580        pmaddwd %xmm1, %xmm0
581        movl    %ebp, %esp
582        popl    %ebp
583        ret
584
585//===---------------------------------------------------------------------===//
586
587Consider:
588#include <emmintrin.h> 
589__m128 foo2 (float x) {
590 return _mm_set_ps (0, 0, x, 0);
591}
592
593In x86-32 mode, we generate this spiffy code:
594
595_foo2:
596	movss	4(%esp), %xmm0
597	pshufd	$81, %xmm0, %xmm0
598	ret
599
600in x86-64 mode, we generate this code, which could be better:
601
602_foo2:
603	xorps	%xmm1, %xmm1
604	movss	%xmm0, %xmm1
605	pshufd	$81, %xmm1, %xmm0
606	ret
607
608In sse4 mode, we could use insertps to make both better.
609
610Here's another testcase that could use insertps [mem]:
611
612#include <xmmintrin.h>
613extern float x2, x3;
614__m128 foo1 (float x1, float x4) {
615 return _mm_set_ps (x2, x1, x3, x4);
616}
617
618gcc mainline compiles it to:
619
620foo1:
621       insertps        $0x10, x2(%rip), %xmm0
622       insertps        $0x10, x3(%rip), %xmm1
623       movaps  %xmm1, %xmm2
624       movlhps %xmm0, %xmm2
625       movaps  %xmm2, %xmm0
626       ret
627
628//===---------------------------------------------------------------------===//
629
630We compile vector multiply-by-constant into poor code:
631
632define <4 x i32> @f(<4 x i32> %i) nounwind  {
633	%A = mul <4 x i32> %i, < i32 10, i32 10, i32 10, i32 10 >
634	ret <4 x i32> %A
635}
636
637On targets without SSE4.1, this compiles into:
638
639LCPI1_0:					##  <4 x i32>
640	.long	10
641	.long	10
642	.long	10
643	.long	10
644	.text
645	.align	4,0x90
646	.globl	_f
647_f:
648	pshufd	$3, %xmm0, %xmm1
649	movd	%xmm1, %eax
650	imull	LCPI1_0+12, %eax
651	movd	%eax, %xmm1
652	pshufd	$1, %xmm0, %xmm2
653	movd	%xmm2, %eax
654	imull	LCPI1_0+4, %eax
655	movd	%eax, %xmm2
656	punpckldq	%xmm1, %xmm2
657	movd	%xmm0, %eax
658	imull	LCPI1_0, %eax
659	movd	%eax, %xmm1
660	movhlps	%xmm0, %xmm0
661	movd	%xmm0, %eax
662	imull	LCPI1_0+8, %eax
663	movd	%eax, %xmm0
664	punpckldq	%xmm0, %xmm1
665	movaps	%xmm1, %xmm0
666	punpckldq	%xmm2, %xmm0
667	ret
668
669It would be better to synthesize integer vector multiplication by constants
670using shifts and adds, pslld and paddd here. And even on targets with SSE4.1,
671simple cases such as multiplication by powers of two would be better as
672vector shifts than as multiplications.
673
674//===---------------------------------------------------------------------===//
675
676We compile this:
677
678__m128i
679foo2 (char x)
680{
681  return _mm_set_epi8 (1, 0, 0, 0, 0, 0, 0, 0, 0, x, 0, 1, 0, 0, 0, 0);
682}
683
684into:
685	movl	$1, %eax
686	xorps	%xmm0, %xmm0
687	pinsrw	$2, %eax, %xmm0
688	movzbl	4(%esp), %eax
689	pinsrw	$3, %eax, %xmm0
690	movl	$256, %eax
691	pinsrw	$7, %eax, %xmm0
692	ret
693
694
695gcc-4.2:
696	subl	$12, %esp
697	movzbl	16(%esp), %eax
698	movdqa	LC0, %xmm0
699	pinsrw	$3, %eax, %xmm0
700	addl	$12, %esp
701	ret
702	.const
703	.align 4
704LC0:
705	.word	0
706	.word	0
707	.word	1
708	.word	0
709	.word	0
710	.word	0
711	.word	0
712	.word	256
713
714With SSE4, it should be
715      movdqa  .LC0(%rip), %xmm0
716      pinsrb  $6, %edi, %xmm0
717
718//===---------------------------------------------------------------------===//
719
720We should transform a shuffle of two vectors of constants into a single vector
721of constants. Also, insertelement of a constant into a vector of constants
722should also result in a vector of constants. e.g. 2008-06-25-VecISelBug.ll.
723
724We compiled it to something horrible:
725
726	.align	4
727LCPI1_1:					##  float
728	.long	1065353216	## float 1
729	.const
730
731	.align	4
732LCPI1_0:					##  <4 x float>
733	.space	4
734	.long	1065353216	## float 1
735	.space	4
736	.long	1065353216	## float 1
737	.text
738	.align	4,0x90
739	.globl	_t
740_t:
741	xorps	%xmm0, %xmm0
742	movhps	LCPI1_0, %xmm0
743	movss	LCPI1_1, %xmm1
744	movaps	%xmm0, %xmm2
745	shufps	$2, %xmm1, %xmm2
746	shufps	$132, %xmm2, %xmm0
747	movaps	%xmm0, 0
748
749//===---------------------------------------------------------------------===//
750rdar://5907648
751
752This function:
753
754float foo(unsigned char x) {
755  return x;
756}
757
758compiles to (x86-32):
759
760define float @foo(i8 zeroext  %x) nounwind  {
761	%tmp12 = uitofp i8 %x to float		; <float> [#uses=1]
762	ret float %tmp12
763}
764
765compiles to:
766
767_foo:
768	subl	$4, %esp
769	movzbl	8(%esp), %eax
770	cvtsi2ss	%eax, %xmm0
771	movss	%xmm0, (%esp)
772	flds	(%esp)
773	addl	$4, %esp
774	ret
775
776We should be able to use:
777  cvtsi2ss 8(%esp), %xmm0
778since we know the stack slot is already zext'd.
779
780//===---------------------------------------------------------------------===//
781
782Consider using movlps instead of movsd to implement (scalar_to_vector (loadf64))
783when code size is critical. movlps is slower than movsd on core2 but it's one
784byte shorter.
785
786//===---------------------------------------------------------------------===//
787
788We should use a dynamic programming based approach to tell when using FPStack
789operations is cheaper than SSE.  SciMark montecarlo contains code like this
790for example:
791
792double MonteCarlo_num_flops(int Num_samples) {
793    return ((double) Num_samples)* 4.0;
794}
795
796In fpstack mode, this compiles into:
797
798LCPI1_0:					
799	.long	1082130432	## float 4.000000e+00
800_MonteCarlo_num_flops:
801	subl	$4, %esp
802	movl	8(%esp), %eax
803	movl	%eax, (%esp)
804	fildl	(%esp)
805	fmuls	LCPI1_0
806	addl	$4, %esp
807	ret
808        
809in SSE mode, it compiles into significantly slower code:
810
811_MonteCarlo_num_flops:
812	subl	$12, %esp
813	cvtsi2sd	16(%esp), %xmm0
814	mulsd	LCPI1_0, %xmm0
815	movsd	%xmm0, (%esp)
816	fldl	(%esp)
817	addl	$12, %esp
818	ret
819
820There are also other cases in scimark where using fpstack is better; it is
821cheaper to do fld1 than to load from a constant pool, for example, so
822"load, add 1.0, store" is better done in the fp stack, etc.
823
824//===---------------------------------------------------------------------===//
825
826The X86 backend should be able to if-convert SSE comparisons like "ucomisd" to
827"cmpsd".  For example, this code:
828
829double d1(double x) { return x == x ? x : x + x; }
830
831Compiles into:
832
833_d1:
834	ucomisd	%xmm0, %xmm0
835	jnp	LBB1_2
836	addsd	%xmm0, %xmm0
837	ret
838LBB1_2:
839	ret
840
841Also, the 'ret's should be shared.  This is PR6032.
842
843//===---------------------------------------------------------------------===//
844
845These should compile into the same code (PR6214): Perhaps instcombine should
846canonicalize the former into the latter?
847
848define float @foo(float %x) nounwind {
849  %t = bitcast float %x to i32
850  %s = and i32 %t, 2147483647
851  %d = bitcast i32 %s to float
852  ret float %d
853}
854
855declare float @fabsf(float %n)
856define float @bar(float %x) nounwind {
857  %d = call float @fabsf(float %x)
858  ret float %d
859}
860
861//===---------------------------------------------------------------------===//
862
863This IR (from PR6194):
864
865target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
866target triple = "x86_64-apple-darwin10.0.0"
867
868%0 = type { double, double }
869%struct.float3 = type { float, float, float }
870
871define void @test(%0, %struct.float3* nocapture %res) nounwind noinline ssp {
872entry:
873  %tmp18 = extractvalue %0 %0, 0                  ; <double> [#uses=1]
874  %tmp19 = bitcast double %tmp18 to i64           ; <i64> [#uses=1]
875  %tmp20 = zext i64 %tmp19 to i128                ; <i128> [#uses=1]
876  %tmp10 = lshr i128 %tmp20, 32                   ; <i128> [#uses=1]
877  %tmp11 = trunc i128 %tmp10 to i32               ; <i32> [#uses=1]
878  %tmp12 = bitcast i32 %tmp11 to float            ; <float> [#uses=1]
879  %tmp5 = getelementptr inbounds %struct.float3* %res, i64 0, i32 1 ; <float*> [#uses=1]
880  store float %tmp12, float* %tmp5
881  ret void
882}
883
884Compiles to:
885
886_test:                                  ## @test
887	movd	%xmm0, %rax
888	shrq	$32, %rax
889	movl	%eax, 4(%rdi)
890	ret
891
892This would be better kept in the SSE unit by treating XMM0 as a 4xfloat and
893doing a shuffle from v[1] to v[0] then a float store.
894
895//===---------------------------------------------------------------------===//
896
897On SSE4 machines, we compile this code:
898
899define <2 x float> @test2(<2 x float> %Q, <2 x float> %R,
900       <2 x float> *%P) nounwind {
901  %Z = fadd <2 x float> %Q, %R
902
903  store <2 x float> %Z, <2 x float> *%P
904  ret <2 x float> %Z
905}
906
907into:
908
909_test2:                                 ## @test2
910## BB#0:
911	insertps	$0, %xmm2, %xmm2
912	insertps	$16, %xmm3, %xmm2
913	insertps	$0, %xmm0, %xmm3
914	insertps	$16, %xmm1, %xmm3
915	addps	%xmm2, %xmm3
916	movq	%xmm3, (%rdi)
917	movaps	%xmm3, %xmm0
918	pshufd	$1, %xmm3, %xmm1
919                                        ## kill: XMM1<def> XMM1<kill>
920	ret
921
922The insertps's of $0 are pointless complex copies.
923
924//===---------------------------------------------------------------------===//
925
926[UNSAFE FP]
927
928void foo(double, double, double);
929void norm(double x, double y, double z) {
930  double scale = __builtin_sqrt(x*x + y*y + z*z);
931  foo(x/scale, y/scale, z/scale);
932}
933
934We currently generate an sqrtsd and 3 divsd instructions. This is bad, fp div is
935slow and not pipelined. In -ffast-math mode we could compute "1.0/scale" first
936and emit 3 mulsd in place of the divs. This can be done as a target-independent
937transform.
938
939If we're dealing with floats instead of doubles we could even replace the sqrtss
940and inversion with an rsqrtss instruction, which computes 1/sqrt faster at the
941cost of reduced accuracy.
942
943//===---------------------------------------------------------------------===//
944
945This function should be matched to haddpd when the appropriate CPU is enabled:
946
947#include <x86intrin.h>
948double f (__m128d p) {
949  return p[0] + p[1];
950}
951
952similarly, v[0]-v[1] should match to hsubpd, and {v[0]-v[1], w[0]-w[1]} should
953turn into hsubpd also.
954
955//===---------------------------------------------------------------------===//
956

README-UNIMPLEMENTED.txt

1//===---------------------------------------------------------------------===//
2// Testcases that crash the X86 backend because they aren't implemented
3//===---------------------------------------------------------------------===//
4
5These are cases we know the X86 backend doesn't handle.  Patches are welcome
6and appreciated, because no one has signed up to implement these yet.
7Implementing these would allow elimination of the corresponding intrinsics,
8which would be great.
9
101) vector shifts
112) vector comparisons
123) vector fp<->int conversions: PR2683, PR2684, PR2685, PR2686, PR2688
134) bitcasts from vectors to scalars: PR2804
145) llvm.atomic.cmp.swap.i128.p0i128: PR3462
15

README-X86-64.txt

1//===- README_X86_64.txt - Notes for X86-64 code gen ----------------------===//
2
3AMD64 Optimization Manual 8.2 has some nice information about optimizing integer
4multiplication by a constant. How much of it applies to Intel's X86-64
5implementation? There are definite trade-offs to consider: latency vs. register
6pressure vs. code size.
7
8//===---------------------------------------------------------------------===//
9
10Are we better off using branches instead of cmove to implement FP to
11unsigned i64?
12
13_conv:
14	ucomiss	LC0(%rip), %xmm0
15	cvttss2siq	%xmm0, %rdx
16	jb	L3
17	subss	LC0(%rip), %xmm0
18	movabsq	$-9223372036854775808, %rax
19	cvttss2siq	%xmm0, %rdx
20	xorq	%rax, %rdx
21L3:
22	movq	%rdx, %rax
23	ret
24
25instead of
26
27_conv:
28	movss LCPI1_0(%rip), %xmm1
29	cvttss2siq %xmm0, %rcx
30	movaps %xmm0, %xmm2
31	subss %xmm1, %xmm2
32	cvttss2siq %xmm2, %rax
33	movabsq $-9223372036854775808, %rdx
34	xorq %rdx, %rax
35	ucomiss %xmm1, %xmm0
36	cmovb %rcx, %rax
37	ret
38
39Seems like the jb branch has high likelihood of being taken. It would have
40saved a few instructions.
41
42//===---------------------------------------------------------------------===//
43
44It's not possible to reference AH, BH, CH, and DH registers in an instruction
45requiring a REX prefix. However, divb and mulb both produce results in AH. If isel
46emits a CopyFromReg which gets turned into a movb, that movb could be allocated
47an r8b - r15b register, which cannot be encoded together with AH.
48
49To get around this, isel emits a CopyFromReg from AX and then right shift it
50down by 8 and truncate it. It's not pretty but it works. We need some register
51allocation magic to make the hack go away (e.g. putting additional constraints
52on the result of the movb).
53
54//===---------------------------------------------------------------------===//
55
56The x86-64 ABI for hidden-argument struct returns requires that the
57incoming value of %rdi be copied into %rax by the callee upon return.
58
59The idea is that it saves callers from having to remember this value,
60which would often require a callee-saved register. Callees usually
61need to keep this value live for most of their body anyway, so it
62doesn't add a significant burden on them.
63
64We currently implement this in codegen, however this is suboptimal
65because it means that it would be quite awkward to implement the
66optimization for callers.
67
68A better implementation would be to relax the LLVM IR rules for sret
69arguments to allow a function with an sret argument to have a non-void
70return type, and to have the front-end set up the sret argument value
71as the return value of the function. The front-end could more easily
72emit uses of the returned struct value to be in terms of the function's
73lowered return value, and it would free non-C frontends from a
74complication only required by a C-based ABI.
75
76//===---------------------------------------------------------------------===//
77
78We get a redundant zero extension for code like this:
79
80int mask[1000];
81int foo(unsigned x) {
82 if (x < 10)
83   x = x * 45;
84 else
85   x = x * 78;
86 return mask[x];
87}
88
89_foo:
90LBB1_0:	## entry
91	cmpl	$9, %edi
92	jbe	LBB1_3	## bb
93LBB1_1:	## bb1
94	imull	$78, %edi, %eax
95LBB1_2:	## bb2
96	movl	%eax, %eax                    <----
97	movq	_mask@GOTPCREL(%rip), %rcx
98	movl	(%rcx,%rax,4), %eax
99	ret
100LBB1_3:	## bb
101	imull	$45, %edi, %eax
102	jmp	LBB1_2	## bb2
103  
104Before regalloc, we have:
105
106        %reg1025<def> = IMUL32rri8 %reg1024, 45, %EFLAGS<imp-def>
107        JMP mbb<bb2,0x203afb0>
108    Successors according to CFG: 0x203afb0 (#3)
109
110bb1: 0x203af60, LLVM BB @0x1e02310, ID#2:
111    Predecessors according to CFG: 0x203aec0 (#0)
112        %reg1026<def> = IMUL32rri8 %reg1024, 78, %EFLAGS<imp-def>
113    Successors according to CFG: 0x203afb0 (#3)
114
115bb2: 0x203afb0, LLVM BB @0x1e02340, ID#3:
116    Predecessors according to CFG: 0x203af10 (#1) 0x203af60 (#2)
117        %reg1027<def> = PHI %reg1025, mbb<bb,0x203af10>,
118                            %reg1026, mbb<bb1,0x203af60>
119        %reg1029<def> = MOVZX64rr32 %reg1027
120
121so we'd have to know that IMUL32rri8 leaves the high word zero extended and to
122be able to recognize the zero extend.  This could also presumably be implemented
123if we have whole-function selectiondags.
124
125//===---------------------------------------------------------------------===//
126
127Take the following code
128(from http://gcc.gnu.org/bugzilla/show_bug.cgi?id=34653):
129extern unsigned long table[];
130unsigned long foo(unsigned char *p) {
131  unsigned long tag = *p;
132  return table[tag >> 4] + table[tag & 0xf];
133}
134
135Current code generated:
136	movzbl	(%rdi), %eax
137	movq	%rax, %rcx
138	andq	$240, %rcx
139	shrq	%rcx
140	andq	$15, %rax
141	movq	table(,%rax,8), %rax
142	addq	table(%rcx), %rax
143	ret
144
145Issues:
1461. First movq should be movl; saves a byte.
1472. Both andq's should be andl; saves another two bytes.  I think this was
148   implemented at one point, but subsequently regressed.
1493. shrq should be shrl; saves another byte.
1504. The first andq can be completely eliminated by using a slightly more
151   expensive addressing mode.
152
153//===---------------------------------------------------------------------===//
154
155Consider the following (contrived testcase, but contains common factors):
156
157#include <stdarg.h>
158int test(int x, ...) {
159  int sum, i;
160  va_list l;
161  va_start(l, x);
162  for (i = 0; i < x; i++)
163    sum += va_arg(l, int);
164  va_end(l);
165  return sum;
166}
167
168Testcase given in C because fixing it will likely involve changing the IR
169generated for it.  The primary issue with the result is that it doesn't do any
170of the optimizations which are possible if we know the address of a va_list
171in the current function is never taken:
1721. We shouldn't spill the XMM registers because we only call va_arg with "int".
1732. It would be nice if we could scalarrepl the va_list.
1743. Probably overkill, but it'd be cool if we could peel off the first five
175iterations of the loop.
176
177Other optimizations involving functions which use va_arg on floats which don't
178have the address of a va_list taken:
1791. Conversely to the above, we shouldn't spill general registers if we only
180   call va_arg on "double".
1812. If we know nothing more than 64 bits wide is read from the XMM registers,
182   we can change the spilling code to reduce the amount of stack used by half.
183
184//===---------------------------------------------------------------------===//
185

README.txt

1//===---------------------------------------------------------------------===//
2// Random ideas for the X86 backend.
3//===---------------------------------------------------------------------===//
4
5This should be one DIV/IDIV instruction, not a libcall:
6
7unsigned test(unsigned long long X, unsigned Y) {
8        return X/Y;
9}
10
11This can be done trivially with a custom legalizer.  What about overflow 
12though?  http://gcc.gnu.org/bugzilla/show_bug.cgi?id=14224
13
14//===---------------------------------------------------------------------===//
15
16Improvements to the multiply -> shift/add algorithm:
17http://gcc.gnu.org/ml/gcc-patches/2004-08/msg01590.html
18
19//===---------------------------------------------------------------------===//
20
21Improve code like this (occurs fairly frequently, e.g. in LLVM):
22long long foo(int x) { return 1LL << x; }
23
24http://gcc.gnu.org/ml/gcc-patches/2004-09/msg01109.html
25http://gcc.gnu.org/ml/gcc-patches/2004-09/msg01128.html
26http://gcc.gnu.org/ml/gcc-patches/2004-09/msg01136.html
27
28Another useful one would be  ~0ULL >> X and ~0ULL << X.
29
30One better solution for 1LL << x is:
31        xorl    %eax, %eax
32        xorl    %edx, %edx
33        testb   $32, %cl
34        sete    %al
35        setne   %dl
36        sall    %cl, %eax
37        sall    %cl, %edx
38
39But that requires good 8-bit subreg support.
40
41Also, this might be better.  It's an extra shift, but it's one instruction
42shorter, and doesn't stress 8-bit subreg support.
43(From http://gcc.gnu.org/ml/gcc-patches/2004-09/msg01148.html,
44but without the unnecessary and.)
45        movl %ecx, %eax
46        shrl $5, %eax
47        movl %eax, %edx
48        xorl $1, %edx
49        sall %cl, %eax
50        sall %cl, %edx
51
5264-bit shifts (in general) expand to really bad code.  Instead of using
53cmovs, we should expand to a conditional branch like GCC produces.
54
55//===---------------------------------------------------------------------===//
56
57Some isel ideas:
58
591. Dynamic programming based approach when compile time is not an
60   issue.
612. Code duplication (addressing mode) during isel.
623. Other ideas from "Register-Sensitive Selection, Duplication, and
63   Sequencing of Instructions".
644. Scheduling for reduced register pressure.  E.g. "Minimum Register 
65   Instruction Sequence Problem: Revisiting Optimal Code Generation for DAGs" 
66   and other related papers.
67   http://citeseer.ist.psu.edu/govindarajan01minimum.html
68
69//===---------------------------------------------------------------------===//
70
71Should we promote i16 to i32 to avoid partial register update stalls?
72
73//===---------------------------------------------------------------------===//
74
75Leave any_extend as pseudo instruction and hint to register
76allocator. Delay codegen until post register allocation.
77Note. any_extend is now turned into an INSERT_SUBREG. We still need to teach
78the coalescer how to deal with it though.
79
80//===---------------------------------------------------------------------===//
81
82It appears icc uses push for parameter passing. Need to investigate.
83
84//===---------------------------------------------------------------------===//
85
86This:
87
88void foo(void);
89void bar(int x, int *P) { 
90  x >>= 2;
91  if (x) 
92    foo();
93  *P = x;
94}
95
96compiles into:
97
98	movq	%rsi, %rbx
99	movl	%edi, %r14d
100	sarl	$2, %r14d
101	testl	%r14d, %r14d
102	je	LBB0_2
103
104Instead of doing an explicit test, we can use the flags off the sar.  This
105occurs in a bigger testcase like this, which is pretty common:
106
107#include <vector>
108int test1(std::vector<int> &X) {
109  int Sum = 0;
110  for (long i = 0, e = X.size(); i != e; ++i)
111    X[i] = 0;
112  return Sum;
113}
114
115//===---------------------------------------------------------------------===//
116
117Only use inc/neg/not instructions on processors where they are faster than
118add/sub/xor.  They are slower on the P4 due to only updating some processor
119flags.
120
121//===---------------------------------------------------------------------===//
122
123The instruction selector sometimes misses folding a load into a compare.  The
124pattern is written as (cmp reg, (load p)).  Because the compare isn't 
125commutative, it is not matched with the load on both sides.  The dag combiner
126should be made smart enough to canonicalize the load into the RHS of a compare
127when it can invert the result of the compare for free.
128
129//===---------------------------------------------------------------------===//
130
131In many cases, LLVM generates code like this:
132
133_test:
134        movl 8(%esp), %eax
135        cmpl %eax, 4(%esp)
136        setl %al
137        movzbl %al, %eax
138        ret
139
140on some processors (which ones?), it is more efficient to do this:
141
142_test:
143        movl 8(%esp), %ebx
144        xor  %eax, %eax
145        cmpl %ebx, 4(%esp)
146        setl %al
147        ret
148
149Doing this correctly is tricky though, as the xor clobbers the flags.
150
151//===---------------------------------------------------------------------===//
152
153We should generate bts/btr/etc instructions on targets where they are cheap or
154when codesize is important.  e.g., for:
155
156void setbit(int *target, int bit) {
157    *target |= (1 << bit);
158}
159void clearbit(int *target, int bit) {
160    *target &= ~(1 << bit);
161}
162
163//===---------------------------------------------------------------------===//
164
165Instead of the following for memset char*, 1, 10:
166
167	movl $16843009, 4(%edx)
168	movl $16843009, (%edx)
169	movw $257, 8(%edx)
170
171It might be better to generate
172
173	movl $16843009, %eax
174	movl %eax, 4(%edx)
175	movl %eax, (%edx)
176	movw %ax, 8(%edx)
177	
178when we can spare a register. It reduces code size.
179
180//===---------------------------------------------------------------------===//
181
182Evaluate what the best way to codegen sdiv X, (2^C) is.  For X/8, we currently
183get this:
184
185define i32 @test1(i32 %X) {
186    %Y = sdiv i32 %X, 8
187    ret i32 %Y
188}
189
190_test1:
191        movl 4(%esp), %eax
192        movl %eax, %ecx
193        sarl $31, %ecx
194        shrl $29, %ecx
195        addl %ecx, %eax
196        sarl $3, %eax
197        ret
198
199GCC knows several different ways to codegen it, one of which is this:
200
201_test1:
202        movl    4(%esp), %eax
203        cmpl    $-1, %eax
204        leal    7(%eax), %ecx
205        cmovle  %ecx, %eax
206        sarl    $3, %eax
207        ret
208
209which is probably slower, but it's interesting at least :)
210
211//===---------------------------------------------------------------------===//
212
213We are currently lowering large (1MB+) memmove/memcpy to rep/stosl and rep/movsl
214We should leave these as libcalls for everything over a much lower threshold,
215since libc is hand tuned for medium and large mem ops (avoiding RFO for large
216stores, TLB preheating, etc)
217
218//===---------------------------------------------------------------------===//
219
220Optimize this into something reasonable:
221 x * copysign(1.0, y) * copysign(1.0, z)
222
223//===---------------------------------------------------------------------===//
224
225Optimize copysign(x, *y) to use an integer load from y.
226
227//===---------------------------------------------------------------------===//
228
229The following tests perform worse with LSR:
230
231lambda, siod, optimizer-eval, ackermann, hash2, nestedloop, strcat, and Treesor.
232
233//===---------------------------------------------------------------------===//
234
235Adding to the list of cmp / test poor codegen issues:
236
237int test(__m128 *A, __m128 *B) {
238  if (_mm_comige_ss(*A, *B))
239    return 3;
240  else
241    return 4;
242}
243
244_test:
245	movl 8(%esp), %eax
246	movaps (%eax), %xmm0
247	movl 4(%esp), %eax
248	movaps (%eax), %xmm1
249	comiss %xmm0, %xmm1
250	setae %al
251	movzbl %al, %ecx
252	movl $3, %eax
253	movl $4, %edx
254	cmpl $0, %ecx
255	cmove %edx, %eax
256	ret
257
258Note the setae, movzbl, cmpl, cmove can be replaced with a single cmovae. There
259are a number of issues. 1) We are introducing a setcc between the result of the
260intrinsic call and select. 2) The intrinsic is expected to produce an i32 value
261so an any-extend (which becomes a zero extend) is added.
262
263We probably need some kind of target DAG combine hook to fix this.
264
265//===---------------------------------------------------------------------===//
266
267We generate significantly worse code for this than GCC:
268http://gcc.gnu.org/bugzilla/show_bug.cgi?id=21150
269http://gcc.gnu.org/bugzilla/attachment.cgi?id=8701
270
271There is also one case we do worse on PPC.
272
273//===---------------------------------------------------------------------===//
274
275For this:
276
277int test(int a)
278{
279  return a * 3;
280}
281
282We currently emit
283	imull $3, 4(%esp), %eax
284
285Perhaps this is what we really should generate? Is imull three or four
286cycles? Note: ICC generates this:
287	movl	4(%esp), %eax
288	leal	(%eax,%eax,2), %eax
289
290The current instruction priority is based on pattern complexity. The former is
291more "complex" because it folds a load so the latter will not be emitted.
292
293Perhaps we should use AddedComplexity to give LEA32r a higher priority? We
294should always try to match LEA first since the LEA matching code does some
295estimate to determine whether the match is profitable.
296
297However, if we care more about code size, then imull is better. It's two bytes
298shorter than movl + leal.
299
300On a Pentium M, both variants have the same characteristics with regard
301to throughput; however, the multiplication has a latency of four cycles, as
302opposed to two cycles for the movl+lea variant.
303
304//===---------------------------------------------------------------------===//
305
306__builtin_ffs codegen is messy.
307
308int ffs_(unsigned X) { return __builtin_ffs(X); }
309
310llvm produces:
311ffs_:
312        movl    4(%esp), %ecx
313        bsfl    %ecx, %eax
314        movl    $32, %edx
315        cmove   %edx, %eax
316        incl    %eax
317        xorl    %edx, %edx
318        testl   %ecx, %ecx
319        cmove   %edx, %eax
320        ret
321
322vs gcc:
323
324_ffs_:
325        movl    $-1, %edx
326        bsfl    4(%esp), %eax
327        cmove   %edx, %eax
328        addl    $1, %eax
329        ret
330
331Another example of __builtin_ffs (use predsimplify to eliminate a select):
332
333int foo (unsigned long j) {
334  if (j)
335    return __builtin_ffs (j) - 1;
336  else
337    return 0;
338}
339
340//===---------------------------------------------------------------------===//
341
342It appears gcc places string data with linkonce linkage in
343.section __TEXT,__const_coal,coalesced instead of
344.section __DATA,__const_coal,coalesced.
345Take a look at darwin.h, there are other Darwin assembler directives that we
346do not make use of.
347
348//===---------------------------------------------------------------------===//
349
350define i32 @foo(i32* %a, i32 %t) {
351entry:
352	br label %cond_true
353
354cond_true:		; preds = %cond_true, %entry
355	%x.0.0 = phi i32 [ 0, %entry ], [ %tmp9, %cond_true ]		; <i32> [#uses=3]
356	%t_addr.0.0 = phi i32 [ %t, %entry ], [ %tmp7, %cond_true ]		; <i32> [#uses=1]
357	%tmp2 = getelementptr i32* %a, i32 %x.0.0		; <i32*> [#uses=1]
358	%tmp3 = load i32* %tmp2		; <i32> [#uses=1]
359	%tmp5 = add i32 %t_addr.0.0, %x.0.0		; <i32> [#uses=1]
360	%tmp7 = add i32 %tmp5, %tmp3		; <i32> [#uses=2]
361	%tmp9 = add i32 %x.0.0, 1		; <i32> [#uses=2]
362	%tmp = icmp sgt i32 %tmp9, 39		; <i1> [#uses=1]
363	br i1 %tmp, label %bb12, label %cond_true
364
365bb12:		; preds = %cond_true
366	ret i32 %tmp7
367}
368is pessimized by -loop-reduce and -indvars
369
370//===---------------------------------------------------------------------===//
371
372u32 to float conversion improvement:
373
374float uint32_2_float( unsigned u ) {
375  float fl = (int) (u & 0xffff);
376  float fh = (int) (u >> 16);
377  fh *= 0x1.0p16f;
378  return fh + fl;
379}
380
38100000000        subl    $0x04,%esp
38200000003        movl    0x08(%esp,1),%eax
38300000007        movl    %eax,%ecx
38400000009        shrl    $0x10,%ecx
3850000000c        cvtsi2ss        %ecx,%xmm0
38600000010        andl    $0x0000ffff,%eax
38700000015        cvtsi2ss        %eax,%xmm1
38800000019        mulss   0x00000078,%xmm0
38900000021        addss   %xmm1,%xmm0
39000000025        movss   %xmm0,(%esp,1)
3910000002a        flds    (%esp,1)
3920000002d        addl    $0x04,%esp
39300000030        ret
394
395//===---------------------------------------------------------------------===//
396
397When using fastcc abi, align stack slot of argument of type double on 8 byte
398boundary to improve performance.
399
400//===---------------------------------------------------------------------===//
401
402GCC's ix86_expand_int_movcc function (in i386.c) has a ton of interesting
403simplifications for integer "x cmp y ? a : b".
404
405//===---------------------------------------------------------------------===//
406
407Consider the expansion of:
408
409define i32 @test3(i32 %X) {
410        %tmp1 = urem i32 %X, 255
411        ret i32 %tmp1
412}
413
414Currently it compiles to:
415
416...
417        movl $2155905153, %ecx
418        movl 8(%esp), %esi
419        movl %esi, %eax
420        mull %ecx
421...
422
423This could be "reassociated" into:
424
425        movl $2155905153, %eax
426        movl 8(%esp), %ecx
427        mull %ecx
428
429to avoid the copy.  In fact, the existing two-address stuff would do this
430except that mul isn't a commutative 2-addr instruction.  I guess this has
431to be done at isel time based on the #uses to mul?
432
433//===---------------------------------------------------------------------===//
434
435Make sure the instruction which starts a loop does not cross a cacheline
436boundary. This requires knowing the exact length of each machine instruction.
437That is somewhat complicated, but doable. Example 256.bzip2:
438
439In the new trace, the hot loop has an instruction which crosses a cacheline
440boundary.  In addition to potential cache misses, this can't help decoding as I
441imagine there has to be some kind of complicated decoder reset and realignment
442to grab the bytes from the next cacheline.
443
444532  532 0x3cfc movb     (1809(%esp, %esi), %bl   <<<--- spans 2 64 byte lines
445942  942 0x3d03 movl     %dh, (1809(%esp, %esi)
446937  937 0x3d0a incl     %esi
4473    3   0x3d0b cmpb     %bl, %dl
44827   27  0x3d0d jnz      0x000062db <main+11707>
449
450//===---------------------------------------------------------------------===//
451
452In c99 mode, the preprocessor doesn't like assembly comments like #TRUNCATE.
453
454//===---------------------------------------------------------------------===//
455
456This could be a single 16-bit load.
457
458int f(char *p) {
459    if ((p[0] == 1) & (p[1] == 2)) return 1;
460    return 0;
461}
462
463//===---------------------------------------------------------------------===//
464
465We should inline lrintf and probably other libc functions.
466
467//===---------------------------------------------------------------------===//
468
469Use the FLAGS values from arithmetic instructions more.  For example, compile:
470
471int add_zf(int *x, int y, int a, int b) {
472     if ((*x += y) == 0)
473          return a;
474     else
475          return b;
476}
477
478to:
479       addl    %esi, (%rdi)
480       movl    %edx, %eax
481       cmovne  %ecx, %eax
482       ret
483instead of:
484
485_add_zf:
486        addl (%rdi), %esi
487        movl %esi, (%rdi)
488        testl %esi, %esi
489        cmove %edx, %ecx
490        movl %ecx, %eax
491        ret
492
493As another example, compile function f2 in test/CodeGen/X86/cmp-test.ll
494without a test instruction.
495
496//===---------------------------------------------------------------------===//
497
498These two functions have identical effects:
499
500unsigned int f(unsigned int i, unsigned int n) {++i; if (i == n) ++i; return i;}
501unsigned int f2(unsigned int i, unsigned int n) {++i; i += i == n; return i;}
502
503We currently compile them to:
504
505_f:
506        movl 4(%esp), %eax
507        movl %eax, %ecx
508        incl %ecx
509        movl 8(%esp), %edx
510        cmpl %edx, %ecx
511        jne LBB1_2      #UnifiedReturnBlock
512LBB1_1: #cond_true
513        addl $2, %eax
514        ret
515LBB1_2: #UnifiedReturnBlock
516        movl %ecx, %eax
517        ret
518_f2:
519        movl 4(%esp), %eax
520        movl %eax, %ecx
521        incl %ecx
522        cmpl 8(%esp), %ecx
523        sete %cl
524        movzbl %cl, %ecx
525        leal 1(%ecx,%eax), %eax
526        ret
527
528both of which are inferior to GCC's:
529
530_f:
531        movl    4(%esp), %edx
532        leal    1(%edx), %eax
533        addl    $2, %edx
534        cmpl    8(%esp), %eax
535        cmove   %edx, %eax
536        ret
537_f2:
538        movl    4(%esp), %eax
539        addl    $1, %eax
540        xorl    %edx, %edx
541        cmpl    8(%esp), %eax
542        sete    %dl
543        addl    %edx, %eax
544        ret
545
546//===---------------------------------------------------------------------===//
547
548This code:
549
550void test(int X) {
551  if (X) abort();
552}
553
554is currently compiled to:
555
556_test:
557        subl $12, %esp
558        cmpl $0, 16(%esp)
559        jne LBB1_1
560        addl $12, %esp
561        ret
562LBB1_1:
563        call L_abort$stub
564
565It would be better to produce:
566
567_test:
568        subl $12, %esp
569        cmpl $0, 16(%esp)
570        jne L_abort$stub
571        addl $12, %esp
572        ret
573
574This can be applied to any no-return function call that takes no arguments etc.
575Alternatively, the stack save/restore logic could be shrink-wrapped, producing
576something like this:
577
578_test:
579        cmpl $0, 4(%esp)
580        jne LBB1_1
581        ret
582LBB1_1:
583        subl $12, %esp
584        call L_abort$stub
585
586Both are useful in different situations.  Finally, it could be shrink-wrapped
587and tail called, like this:
588
589_test:
590        cmpl $0, 4(%esp)
591        jne LBB1_1
592        ret
593LBB1_1:
594        pop %eax   # realign stack.
595        call L_abort$stub
596
597Though this probably isn't worth it.
598
599//===---------------------------------------------------------------------===//
600
601Sometimes it is better to codegen subtractions from a constant (e.g. 7-x) with
602a neg instead of a sub instruction.  Consider:
603
604int test(char X) { return 7-X; }
605
606we currently produce:
607_test:
608        movl $7, %eax
609        movsbl 4(%esp), %ecx
610        subl %ecx, %eax
611        ret
612
613We would use one fewer register if codegen'd as:
614
615        movsbl 4(%esp), %eax
616	neg %eax
617        add $7, %eax
618        ret
619
620Note that this isn't beneficial if the load can be folded into the sub.  In
621this case, we want a sub:
622
623int test(int X) { return 7-X; }
624_test:
625        movl $7, %eax
626        subl 4(%esp), %eax
627        ret
628
629//===---------------------------------------------------------------------===//
630
631Leaf functions that require one 4-byte spill slot have a prolog like this:
632
633_foo:
634        pushl   %esi
635        subl    $4, %esp
636...
637and an epilog like this:
638        addl    $4, %esp
639        popl    %esi
640        ret
641
642It would be smaller, and potentially faster, to push eax on entry and to
643pop into a dummy register instead of using addl/subl of esp.  Just don't pop 
644into any return registers :)
645
646//===---------------------------------------------------------------------===//
647
648The X86 backend should fold (branch (or (setcc, setcc))) into multiple 
649branches.  We generate really poor code for:
650
651double testf(double a) {
652       return a == 0.0 ? 0.0 : (a > 0.0 ? 1.0 : -1.0);
653}
654
655For example, the entry BB is:
656
657_testf:
658        subl    $20, %esp
659        pxor    %xmm0, %xmm0
660        movsd   24(%esp), %xmm1
661        ucomisd %xmm0, %xmm1
662        setnp   %al
663        sete    %cl
664        testb   %cl, %al
665        jne     LBB1_5  # UnifiedReturnBlock
666LBB1_1: # cond_true
667
668
669it would be better to replace the last four instructions with:
670
671	jp LBB1_1
672	je LBB1_5
673LBB1_1:
674
675We also codegen the inner ?: into a diamond:
676
677       cvtss2sd        LCPI1_0(%rip), %xmm2
678        cvtss2sd        LCPI1_1(%rip), %xmm3
679        ucomisd %xmm1, %xmm0
680        ja      LBB1_3  # cond_true
681LBB1_2: # cond_true
682        movapd  %xmm3, %xmm2
683LBB1_3: # cond_true
684        movapd  %xmm2, %xmm0
685        ret
686
687We should sink the load into xmm3 into the LBB1_2 block.  This should
688be pretty easy, and will nuke all the copies.
689
690//===---------------------------------------------------------------------===//
691
692This:
693        #include <algorithm>
694        inline std::pair<unsigned, bool> full_add(unsigned a, unsigned b)
695        { return std::make_pair(a + b, a + b < a); }
696        bool no_overflow(unsigned a, unsigned b)
697        { return !full_add(a, b).second; }
698
699Should compile to:
700	addl	%esi, %edi
701	setae	%al
702	movzbl	%al, %eax
703	ret
704
705on x86-64, instead of the rather stupid-looking:
706	addl	%esi, %edi
707	setb	%al
708	xorb	$1, %al
709	movzbl	%al, %eax
710	ret
711
712
713//===---------------------------------------------------------------------===//
714
715The following code:
716
717bb114.preheader:		; preds = %cond_next94
718	%tmp231232 = sext i16 %tmp62 to i32		; <i32> [#uses=1]
719	%tmp233 = sub i32 32, %tmp231232		; <i32> [#uses=1]
720	%tmp245246 = sext i16 %tmp65 to i32		; <i32> [#uses=1]
721	%tmp252253 = sext i16 %tmp68 to i32		; <i32> [#uses=1]
722	%tmp254 = sub i32 32, %tmp252253		; <i32> [#uses=1]
723	%tmp553554 = bitcast i16* %tmp37 to i8*		; <i8*> [#uses=2]
724	%tmp583584 = sext i16 %tmp98 to i32		; <i32> [#uses=1]
725	%tmp585 = sub i32 32, %tmp583584		; <i32> [#uses=1]
726	%tmp614615 = sext i16 %tmp101 to i32		; <i32> [#uses=1]
727	%tmp621622 = sext i16 %tmp104 to i32		; <i32> [#uses=1]
728	%tmp623 = sub i32 32, %tmp621622		; <i32> [#uses=1]
729	br label %bb114
730
731produces:
732
733LBB3_5:	# bb114.preheader
734	movswl	-68(%ebp), %eax
735	movl	$32, %ecx
736	movl	%ecx, -80(%ebp)
737	subl	%eax, -80(%ebp)
738	movswl	-52(%ebp), %eax
739	movl	%ecx, -84(%ebp)
740	subl	%eax, -84(%ebp)
741	movswl	-70(%ebp), %eax
742	movl	%ecx, -88(%ebp)
743	subl	%eax, -88(%ebp)
744	movswl	-50(%ebp), %eax
745	subl	%eax, %ecx
746	movl	%ecx, -76(%ebp)
747	movswl	-42(%ebp), %eax
748	movl	%eax, -92(%ebp)
749	movswl	-66(%ebp), %eax
750	movl	%eax, -96(%ebp)
751	movw	$0, -98(%ebp)
752
753This appears to be bad because the RA is not folding the store to the stack 
754slot into the movl.  The above instructions could be:
755	movl    $32, -80(%ebp)
756...
757	movl    $32, -84(%ebp)
758...
759This seems like a cross between remat and spill folding.
760
761This has redundant subtractions of %eax from a stack slot. However, %ecx doesn't
762change, so we could simply subtract %eax from %ecx first and then use %ecx (or
763vice-versa).
764
765//===---------------------------------------------------------------------===//
766
767This code:
768
769	%tmp659 = icmp slt i16 %tmp654, 0		; <i1> [#uses=1]
770	br i1 %tmp659, label %cond_true662, label %cond_next715
771
772produces this:
773
774	testw	%cx, %cx
775	movswl	%cx, %esi
776	jns	LBB4_109	# cond_next715
777
778Shark tells us that using %cx in the testw instruction is sub-optimal. It
779suggests using the 32-bit register (which is what ICC uses).
780
781//===---------------------------------------------------------------------===//
782
783We compile this:
784
785void compare (long long foo) {
786  if (foo < 4294967297LL)
787    abort();
788}
789
790to:
791
792compare:
793        subl    $4, %esp
794        cmpl    $0, 8(%esp)
795        setne   %al
796        movzbw  %al, %ax
797        cmpl    $1, 12(%esp)
798        setg    %cl
799        movzbw  %cl, %cx
800        cmove   %ax, %cx
801        testb   $1, %cl
802        jne     .LBB1_2 # UnifiedReturnBlock
803.LBB1_1:        # ifthen
804        call    abort
805.LBB1_2:        # UnifiedReturnBlock
806        addl    $4, %esp
807        ret
808
809(also really horrible code on ppc).  This is due to the expand code for 64-bit
810compares.  GCC produces multiple branches, which is much nicer:
811
812compare:
813        subl    $12, %esp
814        movl    20(%esp), %edx
815        movl    16(%esp), %eax
816        decl    %edx
817        jle     .L7
818.L5:
819        addl    $12, %esp
820        ret
821        .p2align 4,,7
822.L7:
823        jl      .L4
824        cmpl    $0, %eax
825        .p2align 4,,8
826        ja      .L5
827.L4:
828        .p2align 4,,9
829        call    abort
830
831//===---------------------------------------------------------------------===//
832
833Tail call optimization improvements: Tail call optimization currently
834pushes all arguments on the top of the stack (their normal place for
835non-tail call optimized calls) that source from the callers arguments
836or  that source from a virtual register (also possibly sourcing from
837callers arguments).
838This is done to prevent overwriting of parameters (see example
839below) that might be used later.
840
841example:  
842
843int callee(int32, int64); 
844int caller(int32 arg1, int32 arg2) { 
845  int64 local = arg2 * 2; 
846  return callee(arg2, (int64)local); 
847}
848
849[arg1]          [!arg2 no longer valid since we moved local onto it]
850[arg2]      ->  [(int64)
851[RETADDR]        local  ]
852
853Moving arg1 onto the stack slot of callee function would overwrite
854arg2 of the caller.
855
856Possible optimizations:
857
858
859 - Analyse the actual parameters of the callee to see which would
860   overwrite a caller parameter which is used by the callee and only
861   push them onto the top of the stack.
862
863   int callee (int32 arg1, int32 arg2);
864   int caller (int32 arg1, int32 arg2) {
865       return callee(arg1,arg2);
866   }
867
868   Here we don't need to write any variables to the top of the stack
869   since they don't overwrite each other.
870
871   int callee (int32 arg1, int32 arg2);
872   int caller (int32 arg1, int32 arg2) {
873       return callee(arg2,arg1);
874   }
875
876   Here we need to push the arguments because they overwrite each
877   other.
878
879//===---------------------------------------------------------------------===//
880
881main ()
882{
883  int i = 0;
884  unsigned long int z = 0;
885
886  do {
887    z -= 0x00004000;
888    i++;
889    if (i > 0x00040000)
890      abort ();
891  } while (z > 0);
892  exit (0);
893}
894
895gcc compiles this to:
896
897_main:
898	subl	$28, %esp
899	xorl	%eax, %eax
900	jmp	L2
901L3:
902	cmpl	$262144, %eax
903	je	L10
904L2:
905	addl	$1, %eax
906	cmpl	$262145, %eax
907	jne	L3
908	call	L_abort$stub
909L10:
910	movl	$0, (%esp)
911	call	L_exit$stub
912
913llvm:
914
915_main:
916	subl	$12, %esp
917	movl	$1, %eax
918	movl	$16384, %ecx
919LBB1_1:	# bb
920	cmpl	$262145, %eax
921	jge	LBB1_4	# cond_true
922LBB1_2:	# cond_next
923	incl	%eax
924	addl	$4294950912, %ecx
925	cmpl	$16384, %ecx
926	jne	LBB1_1	# bb
927LBB1_3:	# bb11
928	xorl	%eax, %eax
929	addl	$12, %esp
930	ret
931LBB1_4:	# cond_true
932	call	L_abort$stub
933
9341. LSR should rewrite the first cmp with induction variable %ecx.
9352. DAG combiner should fold
936        leal    1(%eax), %edx
937        cmpl    $262145, %edx
938   =>
939        cmpl    $262144, %eax
940
941//===---------------------------------------------------------------------===//
942
943define i64 @test(double %X) {
944	%Y = fptosi double %X to i64
945	ret i64 %Y
946}
947
948compiles to:
949
950_test:
951	subl	$20, %esp
952	movsd	24(%esp), %xmm0
953	movsd	%xmm0, 8(%esp)
954	fldl	8(%esp)
955	fisttpll	(%esp)
956	movl	4(%esp), %edx
957	movl	(%esp), %eax
958	addl	$20, %esp
959	#FP_REG_KILL
960	ret
961
962This should just fldl directly from the input stack slot.
963
964//===---------------------------------------------------------------------===//
965
966This code:
967int foo (int x) { return (x & 65535) | 255; }
968
969Should compile into:
970
971_foo:
972        movzwl  4(%esp), %eax
973        orl     $255, %eax
974        ret
975
976instead of:
977_foo:
978	movl	$65280, %eax
979	andl	4(%esp), %eax
980	orl	$255, %eax
981	ret
982
983//===---------------------------------------------------------------------===//
984
985We're codegen'ing multiply of long longs inefficiently:
986
987unsigned long long LLM(unsigned long long arg1, unsigned long long arg2) {
988  return arg1 *  arg2;
989}
990
991We compile to (fomit-frame-pointer):
992
993_LLM:
994	pushl	%esi
995	movl	8(%esp), %ecx
996	movl	16(%esp), %esi
997	movl	%esi, %eax
998	mull	%ecx
999	imull	12(%esp), %esi
1000	addl	%edx, %esi
1001	imull	20(%esp), %ecx
1002	movl	%esi, %edx
1003	addl	%ecx, %edx
1004	popl	%esi
1005	ret
1006
1007This looks like a scheduling deficiency and lack of remat of the load from
1008the argument area.  ICC apparently produces:
1009
1010        movl      8(%esp), %ecx
1011        imull     12(%esp), %ecx
1012        movl      16(%esp), %eax
1013        imull     4(%esp), %eax 
1014        addl      %eax, %ecx  
1015        movl      4(%esp), %eax
1016        mull      12(%esp) 
1017        addl      %ecx, %edx
1018        ret
1019
1020Note that it remat'd loads from 4(esp) and 12(esp).  See this GCC PR:
1021http://gcc.gnu.org/bugzilla/show_bug.cgi?id=17236
1022
1023//===---------------------------------------------------------------------===//
1024
1025We can fold a store into "zeroing a reg".  Instead of:
1026
1027xorl    %eax, %eax
1028movl    %eax, 124(%esp)
1029
1030we should get:
1031
1032movl    $0, 124(%esp)
1033
1034if the flags of the xor are dead.
1035
1036Likewise, we isel "x<<1" into "add reg,reg".  If reg is spilled, this should
1037be folded into: shl [mem], 1
1038
1039//===---------------------------------------------------------------------===//
1040
1041In SSE mode, we turn abs and neg into a load from the constant pool plus a xor
1042or and instruction, for example:
1043
1044	xorpd	LCPI1_0, %xmm2
1045
1046However, if xmm2 gets spilled, we end up with really ugly code like this:
1047
1048	movsd	(%esp), %xmm0
1049	xorpd	LCPI1_0, %xmm0
1050	movsd	%xmm0, (%esp)
1051
1052Since we 'know' that this is a 'neg', we can actually "fold" the spill into
1053the neg/abs instruction, turning it into an *integer* operation, like this:
1054
1055	xorl 2147483648, [mem+4]     ## 2147483648 = (1 << 31)
1056
1057you could also use xorb, but xorl is less likely to lead to a partial register
1058stall.  Here is a contrived testcase:
1059
1060double a, b, c;
1061void test(double *P) {
1062  double X = *P;
1063  a = X;
1064  bar();
1065  X = -X;
1066  b = X;
1067  bar();
1068  c = X;
1069}
1070
1071//===---------------------------------------------------------------------===//
1072
1073The generated code on x86 for checking for signed overflow on a multiply the
1074obvious way is much longer than it needs to be.
1075
1076int x(int a, int b) {
1077  long long prod = (long long)a*b;
1078  return  prod > 0x7FFFFFFF || prod < (-0x7FFFFFFF-1);
1079}
1080
1081See PR2053 for more details.
1082
1083//===---------------------------------------------------------------------===//
1084
1085We should investigate using cdq/cltd (effect: edx = sar eax, 31)
1086more aggressively; it should cost the same as a move+shift on any modern
1087processor, but it's a lot shorter. Downside is that it puts more
1088pressure on register allocation because it has fixed operands.
1089
1090Example:
1091int abs(int x) {return x < 0 ? -x : x;}
1092
1093gcc compiles this to the following when using march/mtune=pentium2/3/4/m/etc.:
1094abs:
1095        movl    4(%esp), %eax
1096        cltd
1097        xorl    %edx, %eax
1098        subl    %edx, %eax
1099        ret
1100
1101//===---------------------------------------------------------------------===//
1102
1103Take the following code (from 
1104http://gcc.gnu.org/bugzilla/show_bug.cgi?id=16541):
1105
1106extern unsigned char first_one[65536];
1107int FirstOnet(unsigned long long arg1)
1108{
1109  if (arg1 >> 48)
1110    return (first_one[arg1 >> 48]);
1111  return 0;
1112}
1113
1114
1115The following code is currently generated:
1116FirstOnet:
1117        movl    8(%esp), %eax
1118        cmpl    $65536, %eax
1119        movl    4(%esp), %ecx
1120        jb      .LBB1_2 # UnifiedReturnBlock
1121.LBB1_1:        # ifthen
1122        shrl    $16, %eax
1123        movzbl  first_one(%eax), %eax
1124        ret
1125.LBB1_2:        # UnifiedReturnBlock
1126        xorl    %eax, %eax
1127        ret
1128
1129We could change the "movl 8(%esp), %eax" into "movzwl 10(%esp), %eax"; this
1130lets us change the cmpl into a testl, which is shorter, and eliminate the shift.
1131
1132//===---------------------------------------------------------------------===//
1133
1134We compile this function:
1135
1136define i32 @foo(i32 %a, i32 %b, i32 %c, i8 zeroext  %d) nounwind  {
1137entry:
1138	%tmp2 = icmp eq i8 %d, 0		; <i1> [#uses=1]
1139	br i1 %tmp2, label %bb7, label %bb
1140
1141bb:		; preds = %entry
1142	%tmp6 = add i32 %b, %a		; <i32> [#uses=1]
1143	ret i32 %tmp6
1144
1145bb7:		; preds = %entry
1146	%tmp10 = sub i32 %a, %c		; <i32> [#uses=1]
1147	ret i32 %tmp10
1148}
1149
1150to:
1151
1152foo:                                    # @foo
1153# BB#0:                                 # %entry
1154	movl	4(%esp), %ecx
1155	cmpb	$0, 16(%esp)
1156	je	.LBB0_2
1157# BB#1:                                 # %bb
1158	movl	8(%esp), %eax
1159	addl	%ecx, %eax
1160	ret
1161.LBB0_2:                                # %bb7
1162	movl	12(%esp), %edx
1163	movl	%ecx, %eax
1164	subl	%edx, %eax
1165	ret
1166
1167There's an obviously unnecessary movl in .LBB0_2, and we could eliminate a
1168couple more movls by putting 4(%esp) into %eax instead of %ecx.
1169
1170//===---------------------------------------------------------------------===//
1171
1172See rdar://4653682.
1173
1174From flops:
1175
1176LBB1_15:        # bb310
1177        cvtss2sd        LCPI1_0, %xmm1
1178        addsd   %xmm1, %xmm0
1179        movsd   176(%esp), %xmm2
1180        mulsd   %xmm0, %xmm2
1181        movapd  %xmm2, %xmm3
1182        mulsd   %xmm3, %xmm3
1183        movapd  %xmm3, %xmm4
1184        mulsd   LCPI1_23, %xmm4
1185        addsd   LCPI1_24, %xmm4
1186        mulsd   %xmm3, %xmm4
1187        addsd   LCPI1_25, %xmm4
1188        mulsd   %xmm3, %xmm4
1189        addsd   LCPI1_26, %xmm4
1190        mulsd   %xmm3, %xmm4
1191        addsd   LCPI1_27, %xmm4
1192        mulsd   %xmm3, %xmm4
1193        addsd   LCPI1_28, %xmm4
1194        mulsd   %xmm3, %xmm4
1195        addsd   %xmm1, %xmm4
1196        mulsd   %xmm2, %xmm4
1197        movsd   152(%esp), %xmm1
1198        addsd   %xmm4, %xmm1
1199        movsd   %xmm1, 152(%esp)
1200        incl    %eax
1201        cmpl    %eax, %esi
1202        jge     LBB1_15 # bb310
1203LBB1_16:        # bb358.loopexit
1204        movsd   152(%esp), %xmm0
1205        addsd   %xmm0, %xmm0
1206        addsd   LCPI1_22, %xmm0
1207        movsd   %xmm0, 152(%esp)
1208
1209Rather than spilling the result of the last addsd in the loop, we should have
1210inserted a copy to split the interval (one for the duration of the loop, one
1211extending to the fall through). The register pressure in the loop isn't high
1212enough to warrant the spill.
1213
1214Also check why xmm7 is not used at all in the function.
1215
1216//===---------------------------------------------------------------------===//
1217
1218Take the following:
1219
1220target datalayout = "e-p:32:32:32-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-f32:32:32-f64:32:64-v64:64:64-v128:128:128-a0:0:64-f80:128:128-S128"
1221target triple = "i386-apple-darwin8"
1222@in_exit.4870.b = internal global i1 false		; <i1*> [#uses=2]
1223define fastcc void @abort_gzip() noreturn nounwind  {
1224entry:
1225	%tmp.b.i = load i1* @in_exit.4870.b		; <i1> [#uses=1]
1226	br i1 %tmp.b.i, label %bb.i, label %bb4.i
1227bb.i:		; preds = %entry
1228	tail call void @exit( i32 1 ) noreturn nounwind 
1229	unreachable
1230bb4.i:		; preds = %entry
1231	store i1 true, i1* @in_exit.4870.b
1232	tail call void @exit( i32 1 ) noreturn nounwind 
1233	unreachable
1234}
1235declare void @exit(i32) noreturn nounwind 
1236
1237This compiles into:
1238_abort_gzip:                            ## @abort_gzip
1239## BB#0:                                ## %entry
1240	subl	$12, %esp
1241	movb	_in_exit.4870.b, %al
1242	cmpb	$1, %al
1243	jne	LBB0_2
1244
1245We somehow miss folding the movb into the cmpb.
1246
1247//===---------------------------------------------------------------------===//
1248
1249We compile:
1250
1251int test(int x, int y) {
1252  return x-y-1;
1253}
1254
1255into (-m64):
1256
1257_test:
1258	decl	%edi
1259	movl	%edi, %eax
1260	subl	%esi, %eax
1261	ret
1262
1263it would be better to codegen as: x+~y  (notl+addl)
1264
1265//===---------------------------------------------------------------------===//
1266
1267This code:
1268
1269int foo(const char *str,...)
1270{
1271 __builtin_va_list a; int x;
1272 __builtin_va_start(a,str); x = __builtin_va_arg(a,int); __builtin_va_end(a);
1273 return x;
1274}
1275
1276gets compiled into this on x86-64:
1277	subq    $200, %rsp
1278        movaps  %xmm7, 160(%rsp)
1279        movaps  %xmm6, 144(%rsp)
1280        movaps  %xmm5, 128(%rsp)
1281        movaps  %xmm4, 112(%rsp)
1282        movaps  %xmm3, 96(%rsp)
1283        movaps  %xmm2, 80(%rsp)
1284        movaps  %xmm1, 64(%rsp)
1285        movaps  %xmm0, 48(%rsp)
1286        movq    %r9, 40(%rsp)
1287        movq    %r8, 32(%rsp)
1288        movq    %rcx, 24(%rsp)
1289        movq    %rdx, 16(%rsp)
1290        movq    %rsi, 8(%rsp)
1291        leaq    (%rsp), %rax
1292        movq    %rax, 192(%rsp)
1293        leaq    208(%rsp), %rax
1294        movq    %rax, 184(%rsp)
1295        movl    $48, 180(%rsp)
1296        movl    $8, 176(%rsp)
1297        movl    176(%rsp), %eax
1298        cmpl    $47, %eax
1299        jbe     .LBB1_3 # bb
1300.LBB1_1:        # bb3
1301        movq    184(%rsp), %rcx
1302        leaq    8(%rcx), %rax
1303        movq    %rax, 184(%rsp)
1304.LBB1_2:        # bb4
1305        movl    (%rcx), %eax
1306        addq    $200, %rsp
1307        ret
1308.LBB1_3:        # bb
1309        movl    %eax, %ecx
1310        addl    $8, %eax
1311        addq    192(%rsp), %rcx
1312        movl    %eax, 176(%rsp)
1313        jmp     .LBB1_2 # bb4
1314
1315gcc 4.3 generates:
1316	subq    $96, %rsp
1317.LCFI0:
1318        leaq    104(%rsp), %rax
1319        movq    %rsi, -80(%rsp)
1320        movl    $8, -120(%rsp)
1321        movq    %rax, -112(%rsp)
1322        leaq    -88(%rsp), %rax
1323        movq    %rax, -104(%rsp)
1324        movl    $8, %eax
1325        cmpl    $48, %eax
1326        jb      .L6
1327        movq    -112(%rsp), %rdx
1328        movl    (%rdx), %eax
1329        addq    $96, %rsp
1330        ret
1331        .p2align 4,,10
1332        .p2align 3
1333.L6:
1334        mov     %eax, %edx
1335        addq    -104(%rsp), %rdx
1336        addl    $8, %eax
1337        movl    %eax, -120(%rsp)
1338        movl    (%rdx), %eax
1339        addq    $96, %rsp
1340        ret
1341
1342and it gets compiled into this on x86:
1343	pushl   %ebp
1344        movl    %esp, %ebp
1345        subl    $4, %esp
1346        leal    12(%ebp), %eax
1347        movl    %eax, -4(%ebp)
1348        leal    16(%ebp), %eax
1349        movl    %eax, -4(%ebp)
1350        movl    12(%ebp), %eax
1351        addl    $4, %esp
1352        popl    %ebp
1353        ret
1354
1355gcc 4.3 generates:
1356	pushl   %ebp
1357        movl    %esp, %ebp
1358        movl    12(%ebp), %eax
1359        popl    %ebp
1360        ret
1361
1362//===---------------------------------------------------------------------===//
1363
1364Teach tblgen not to check bitconvert source type in some cases. This allows us
1365to consolidate the following patterns in X86InstrMMX.td:
1366
1367def : Pat<(v2i32 (bitconvert (i64 (vector_extract (v2i64 VR128:$src),
1368                                                  (iPTR 0))))),
1369          (v2i32 (MMX_MOVDQ2Qrr VR128:$src))>;
1370def : Pat<(v4i16 (bitconvert (i64 (vector_extract (v2i64 VR128:$src),
1371                                                  (iPTR 0))))),
1372          (v4i16 (MMX_MOVDQ2Qrr VR128:$src))>;
1373def : Pat<(v8i8 (bitconvert (i64 (vector_extract (v2i64 VR128:$src),
1374                                                  (iPTR 0))))),
1375          (v8i8 (MMX_MOVDQ2Qrr VR128:$src))>;
1376
1377There are other cases in various td files.
1378
1379//===---------------------------------------------------------------------===//
1380
1381Take something like the following on x86-32:
1382unsigned a(unsigned long long x, unsigned y) {return x % y;}
1383
1384We currently generate a libcall, but we really shouldn't: the expansion is
1385shorter and likely faster than the libcall.  The expected code is something
1386like the following:
1387
1388	movl	12(%ebp), %eax
1389	movl	16(%ebp), %ecx
1390	xorl	%edx, %edx
1391	divl	%ecx
1392	movl	8(%ebp), %eax
1393	divl	%ecx
1394	movl	%edx, %eax
1395	ret
1396
1397A similar code sequence works for division.
1398
1399//===---------------------------------------------------------------------===//
1400
1401These should compile to the same code, but the latter codegen's to useless
1402instructions on X86. This may be a trivial dag combine (GCC PR7061):
1403
1404struct s1 { unsigned char a, b; };
1405unsigned long f1(struct s1 x) {
1406    return x.a + x.b;
1407}
1408struct s2 { unsigned a: 8, b: 8; };
1409unsigned long f2(struct s2 x) {
1410    return x.a + x.b;
1411}
1412
1413//===---------------------------------------------------------------------===//
1414
1415We currently compile this:
1416
1417define i32 @func1(i32 %v1, i32 %v2) nounwind {
1418entry:
1419  %t = call {i32, i1} @llvm.sadd.with.overflow.i32(i32 %v1, i32 %v2)
1420  %sum = extractvalue {i32, i1} %t, 0
1421  %obit = extractvalue {i32, i1} %t, 1
1422  br i1 %obit, label %overflow, label %normal
1423normal:
1424  ret i32 %sum
1425overflow:
1426  call void @llvm.trap()
1427  unreachable
1428}
1429declare {i32, i1} @llvm.sadd.with.overflow.i32(i32, i32)
1430declare void @llvm.trap()
1431
1432to:
1433
1434_func1:
1435	movl	4(%esp), %eax
1436	addl	8(%esp), %eax
1437	jo	LBB1_2	## overflow
1438LBB1_1:	## normal
1439	ret
1440LBB1_2:	## overflow
1441	ud2
1442
1443it would be nice to produce "into" someday.
1444
1445//===---------------------------------------------------------------------===//
1446
1447This code:
1448
1449void vec_mpys1(int y[], const int x[], int scaler) {
1450int i;
1451for (i = 0; i < 150; i++)
1452 y[i] += (((long long)scaler * (long long)x[i]) >> 31);
1453}
1454
1455Compiles to this loop with GCC 3.x:
1456
1457.L5:
1458	movl	%ebx, %eax
1459	imull	(%edi,%ecx,4)
1460	shrdl	$31, %edx, %eax
1461	addl	%eax, (%esi,%ecx,4)
1462	incl	%ecx
1463	cmpl	$149, %ecx
1464	jle	.L5
1465
1466llvm-gcc compiles it to the much uglier:
1467
1468LBB1_1:	## bb1
1469	movl	24(%esp), %eax
1470	movl	(%eax,%edi,4), %ebx
1471	movl	%ebx, %ebp
1472	imull	%esi, %ebp
1473	movl	%ebx, %eax
1474	mull	%ecx
1475	addl	%ebp, %edx
1476	sarl	$31, %ebx
1477	imull	%ecx, %ebx
1478	addl	%edx, %ebx
1479	shldl	$1, %eax, %ebx
1480	movl	20(%esp), %eax
1481	addl	%ebx, (%eax,%edi,4)
1482	incl	%edi
1483	cmpl	$150, %edi
1484	jne	LBB1_1	## bb1
1485
1486The issue is that we hoist the cast of "scaler" to long long outside of the
1487loop, the value comes into the loop as two values, and
1488RegsForValue::getCopyFromRegs doesn't know how to put an AssertSext on the
1489constructed BUILD_PAIR which represents the cast value.
1490
1491This can be handled by making CodeGenPrepare sink the cast.
1492
1493//===---------------------------------------------------------------------===//
1494
1495Test instructions can be eliminated by using EFLAGS values from arithmetic
1496instructions. This is currently not done for mul, and, or, xor, neg, shl,
1497sra, srl, shld, shrd, atomic ops, and others. It is also currently not done
1498for read-modify-write instructions. It is also currently not done if the
1499OF or CF flags are needed.
1500
1501The shift operators have the complication that when the shift count is
1502zero, EFLAGS is not set, so they can only subsume a test instruction if
1503the shift count is known to be non-zero. Also, using the EFLAGS value
1504from a shift is apparently very slow on some x86 implementations.
1505
1506In read-modify-write instructions, the root node in the isel match is
1507the store, and isel has no way for the use of the EFLAGS result of the
1508arithmetic to be remapped to the new node.
1509
1510Add and subtract instructions set OF on signed overflow and CF on unsigned
1511overflow, while test instructions always clear OF and CF. In order to
1512replace a test with an add or subtract in a situation where OF or CF is
1513needed, codegen must be able to prove that the operation cannot see
1514signed or unsigned overflow, respectively.
1515
1516//===---------------------------------------------------------------------===//
1517
1518memcpy/memmove do not lower to SSE copies when possible.  A silly example is:
1519define <16 x float> @foo(<16 x float> %A) nounwind {
1520	%tmp = alloca <16 x float>, align 16
1521	%tmp2 = alloca <16 x float>, align 16
1522	store <16 x float> %A, <16 x float>* %tmp
1523	%s = bitcast <16 x float>* %tmp to i8*
1524	%s2 = bitcast <16 x float>* %tmp2 to i8*
1525	call void @llvm.memcpy.i64(i8* %s, i8* %s2, i64 64, i32 16)
1526	%R = load <16 x float>* %tmp2
1527	ret <16 x float> %R
1528}
1529
1530declare void @llvm.memcpy.i64(i8* nocapture, i8* nocapture, i64, i32) nounwind
1531
1532which compiles to:
1533
1534_foo:
1535	subl	$140, %esp
1536	movaps	%xmm3, 112(%esp)
1537	movaps	%xmm2, 96(%esp)
1538	movaps	%xmm1, 80(%esp)
1539	movaps	%xmm0, 64(%esp)
1540	movl	60(%esp), %eax
1541	movl	%eax, 124(%esp)
1542	movl	56(%esp), %eax
1543	movl	%eax, 120(%esp)
1544	movl	52(%esp), %eax
1545        <many many more 32-bit copies>
1546      	movaps	(%esp), %xmm0
1547	movaps	16(%esp), %xmm1
1548	movaps	32(%esp), %xmm2
1549	movaps	48(%esp), %xmm3
1550	addl	$140, %esp
1551	ret
1552
1553On Nehalem, it may even be cheaper to just use movups when unaligned than to
1554fall back to lower-granularity chunks.
1555
1556//===---------------------------------------------------------------------===//
1557
1558Implement processor-specific optimizations for parity with GCC on these
1559processors.  GCC does two optimizations:
1560
15611. ix86_pad_returns inserts a noop before ret instructions if immediately
1562   preceded by a conditional branch or is the target of a jump.
15632. ix86_avoid_jump_misspredicts inserts noops in cases where a 16-byte block of
1564   code contains more than 3 branches.
1565   
1566The first one is done for all AMDs, Core2, and "Generic"
1567The second one is done for: Atom, Pentium Pro, all AMDs, Pentium 4, Nocona,
1568  Core 2, and "Generic"
1569
1570//===---------------------------------------------------------------------===//
1571
1572Testcase:
1573int a(int x) { return (x & 127) > 31; }
1574
1575Current output:
1576	movl	4(%esp), %eax
1577	andl	$127, %eax
1578	cmpl	$31, %eax
1579	seta	%al
1580	movzbl	%al, %eax
1581	ret
1582
1583Ideal output:
1584	xorl	%eax, %eax
1585	testl	$96, 4(%esp)
1586	setne	%al
1587	ret
1588
1589This should definitely be done in instcombine, canonicalizing the range
1590condition into a != condition.  We get this IR:
1591
1592define i32 @a(i32 %x) nounwind readnone {
1593entry:
1594	%0 = and i32 %x, 127		; <i32> [#uses=1]
1595	%1 = icmp ugt i32 %0, 31		; <i1> [#uses=1]
1596	%2 = zext i1 %1 to i32		; <i32> [#uses=1]
1597	ret i32 %2
1598}
1599
1600Instcombine prefers to strength reduce relational comparisons to equality
1601comparisons when possible, this should be another case of that.  This could
1602be handled pretty easily in InstCombiner::visitICmpInstWithInstAndIntCst, but it
1603looks like InstCombiner::visitICmpInstWithInstAndIntCst should really already
1604be redesigned to use ComputeMaskedBits and friends.
1605
1606
1607//===---------------------------------------------------------------------===//
1608Testcase:
1609int x(int a) { return (a&0xf0)>>4; }
1610
1611Current output:
1612	movl	4(%esp), %eax
1613	shrl	$4, %eax
1614	andl	$15, %eax
1615	ret
1616
1617Ideal output:
1618	movzbl	4(%esp), %eax
1619	shrl	$4, %eax
1620	ret
1621
1622//===---------------------------------------------------------------------===//
1623
1624Re-implement atomic builtins __sync_add_and_fetch() and __sync_sub_and_fetch
1625properly.
1626
1627When the return value is not used (i.e. only care about the value in the
1628memory), x86 does not have to use add to implement these. Instead, it can use
1629add, sub, inc, dec instructions with the "lock" prefix.
1630
1631This is currently implemented using a bit of instruction selection trick. The
1632issue is the target independent pattern produces one output and a chain and we
1633want to map it into one that just output a chain. The current trick is to select
1634it into a MERGE_VALUES with the first definition being an implicit_def. The
1635proper solution is to add new ISD opcodes for the no-output variant. DAG
1636combiner can then transform the node before it gets to target node selection.
1637
1638Problem #2 is we are adding a whole bunch of x86 atomic instructions when in
1639fact these instructions are identical to the non-lock versions. We need a way to
1640add target specific information to target nodes and have this information
1641carried over to machine instructions. Asm printer (or JIT) can use this
1642information to add the "lock" prefix.
1643
1644//===---------------------------------------------------------------------===//
1645
1646struct B {
1647  unsigned char y0 : 1;
1648};
1649
1650int bar(struct B* a) { return a->y0; }
1651
1652define i32 @bar(%struct.B* nocapture %a) nounwind readonly optsize {
1653  %1 = getelementptr inbounds %struct.B* %a, i64 0, i32 0
1654  %2 = load i8* %1, align 1
1655  %3 = and i8 %2, 1
1656  %4 = zext i8 %3 to i32
1657  ret i32 %4
1658}
1659
1660bar:                                    # @bar
1661# BB#0:
1662        movb    (%rdi), %al
1663        andb    $1, %al
1664        movzbl  %al, %eax
1665        ret
1666
1667Missed optimization: should be movl+andl.
1668
1669//===---------------------------------------------------------------------===//
1670
1671The x86_64 abi says:
1672
1673Booleans, when stored in a memory object, are stored as single byte objects the
1674value of which is always 0 (false) or 1 (true).
1675
1676We are not using this fact:
1677
1678int bar(_Bool *a) { return *a; }
1679
1680define i32 @bar(i8* nocapture %a) nounwind readonly optsize {
1681  %1 = load i8* %a, align 1, !tbaa !0
1682  %tmp = and i8 %1, 1
1683  %2 = zext i8 %tmp to i32
1684  ret i32 %2
1685}
1686
1687bar:
1688        movb    (%rdi), %al
1689        andb    $1, %al
1690        movzbl  %al, %eax
1691        ret
1692
1693GCC produces
1694
1695bar:
1696        movzbl  (%rdi), %eax
1697        ret
1698
1699//===---------------------------------------------------------------------===//
1700
1701Consider the following two functions compiled with clang:
1702_Bool foo(int *x) { return !(*x & 4); }
1703unsigned bar(int *x) { return !(*x & 4); }
1704
1705foo:
1706	movl	4(%esp), %eax
1707	testb	$4, (%eax)
1708	sete	%al
1709	movzbl	%al, %eax
1710	ret
1711
1712bar:
1713	movl	4(%esp), %eax
1714	movl	(%eax), %eax
1715	shrl	$2, %eax
1716	andl	$1, %eax
1717	xorl	$1, %eax
1718	ret
1719
1720The second function generates more code even though the two functions
1721are functionally identical.
1722
1723//===---------------------------------------------------------------------===//
1724
1725Take the following C code:
1726int f(int a, int b) { return (unsigned char)a == (unsigned char)b; }
1727
1728We generate the following IR with clang:
1729define i32 @f(i32 %a, i32 %b) nounwind readnone {
1730entry:
1731  %tmp = xor i32 %b, %a                           ; <i32> [#uses=1]
1732  %tmp6 = and i32 %tmp, 255                       ; <i32> [#uses=1]
1733  %cmp = icmp eq i32 %tmp6, 0                     ; <i1> [#uses=1]
1734  %conv5 = zext i1 %cmp to i32                    ; <i32> [#uses=1]
1735  ret i32 %conv5
1736}
1737
1738And the following x86 code:
1739	xorl	%esi, %edi
1740	testb	$-1, %dil
1741	sete	%al
1742	movzbl	%al, %eax
1743	ret
1744
1745A cmpb instead of the xorl+testb would be one instruction shorter.
1746
1747//===---------------------------------------------------------------------===//
1748
1749Given the following C code:
1750int f(int a, int b) { return (signed char)a == (signed char)b; }
1751
1752We generate the following IR with clang:
1753define i32 @f(i32 %a, i32 %b) nounwind readnone {
1754entry:
1755  %sext = shl i32 %a, 24                          ; <i32> [#uses=1]
1756  %conv1 = ashr i32 %sext, 24                     ; <i32> [#uses=1]
1757  %sext6 = shl i32 %b, 24                         ; <i32> [#uses=1]
1758  %conv4 = ashr i32 %sext6, 24                    ; <i32> [#uses=1]
1759  %cmp = icmp eq i32 %conv1, %conv4               ; <i1> [#uses=1]
1760  %conv5 = zext i1 %cmp to i32                    ; <i32> [#uses=1]
1761  ret i32 %conv5
1762}
1763
1764And the following x86 code:
1765	movsbl	%sil, %eax
1766	movsbl	%dil, %ecx
1767	cmpl	%eax, %ecx
1768	sete	%al
1769	movzbl	%al, %eax
1770	ret
1771
1772
1773It should be possible to eliminate the sign extensions.
1774
1775//===---------------------------------------------------------------------===//
1776
1777LLVM misses a load+store narrowing opportunity in this code:
1778
1779%struct.bf = type { i64, i16, i16, i32 }
1780
1781@bfi = external global %struct.bf*                ; <%struct.bf**> [#uses=2]
1782
1783define void @t1() nounwind ssp {
1784entry:
1785  %0 = load %struct.bf** @bfi, align 8            ; <%struct.bf*> [#uses=1]
1786  %1 = getelementptr %struct.bf* %0, i64 0, i32 1 ; <i16*> [#uses=1]
1787  %2 = bitcast i16* %1 to i32*                    ; <i32*> [#uses=2]
1788  %3 = load i32* %2, align 1                      ; <i32> [#uses=1]
1789  %4 = and i32 %3, -65537                         ; <i32> [#uses=1]
1790  store i32 %4, i32* %2, align 1
1791  %5 = load %struct.bf** @bfi, align 8            ; <%struct.bf*> [#uses=1]
1792  %6 = getelementptr %struct.bf* %5, i64 0, i32 1 ; <i16*> [#uses=1]
1793  %7 = bitcast i16* %6 to i32*                    ; <i32*> [#uses=2]
1794  %8 = load i32* %7, align 1                      ; <i32> [#uses=1]
1795  %9 = and i32 %8, -131073                        ; <i32> [#uses=1]
1796  store i32 %9, i32* %7, align 1
1797  ret void
1798}
1799
1800LLVM currently emits this:
1801
1802  movq  bfi(%rip), %rax
1803  andl  $-65537, 8(%rax)
1804  movq  bfi(%rip), %rax
1805  andl  $-131073, 8(%rax)
1806  ret
1807
1808It could narrow the loads and stores to emit this:
1809
1810  movq  bfi(%rip), %rax
1811  andb  $-2, 10(%rax)
1812  movq  bfi(%rip), %rax
1813  andb  $-3, 10(%rax)
1814  ret
1815
1816The trouble is that there is a TokenFactor between the store and the
1817load, making it non-trivial to determine if there's anything between
1818the load and the store which would prohibit narrowing.
1819
1820//===---------------------------------------------------------------------===//
1821
1822This code:
1823void foo(unsigned x) {
1824  if (x == 0) bar();
1825  else if (x == 1) qux();
1826}
1827
1828currently compiles into:
1829_foo:
1830	movl	4(%esp), %eax
1831	cmpl	$1, %eax
1832	je	LBB0_3
1833	testl	%eax, %eax
1834	jne	LBB0_4
1835
1836the testl could be removed:
1837_foo:
1838	movl	4(%esp), %eax
1839	cmpl	$1, %eax
1840	je	LBB0_3
1841	jb	LBB0_4
1842
18430 is the only unsigned number < 1.
1844
1845//===---------------------------------------------------------------------===//
1846
1847This code:
1848
1849%0 = type { i32, i1 }
1850
1851define i32 @add32carry(i32 %sum, i32 %x) nounwind readnone ssp {
1852entry:
1853  %uadd = tail call %0 @llvm.uadd.with.overflow.i32(i32 %sum, i32 %x)
1854  %cmp = extractvalue %0 %uadd, 1
1855  %inc = zext i1 %cmp to i32
1856  %add = add i32 %x, %sum
1857  %z.0 = add i32 %add, %inc
1858  ret i32 %z.0
1859}
1860
1861declare %0 @llvm.uadd.with.overflow.i32(i32, i32) nounwind readnone
1862
1863compiles to:
1864
1865_add32carry:                            ## @add32carry
1866	addl	%esi, %edi
1867	sbbl	%ecx, %ecx
1868	movl	%edi, %eax
1869	subl	%ecx, %eax
1870	ret
1871
1872But it could be:
1873
1874_add32carry:
1875	leal	(%rsi,%rdi), %eax
1876	cmpl	%esi, %eax
1877	adcl	$0, %eax
1878	ret
1879
1880//===---------------------------------------------------------------------===//
1881
1882The hot loop of 256.bzip2 contains code that looks a bit like this:
1883
1884int foo(char *P, char *Q, int x, int y) {
1885  if (P[0] != Q[0])
1886     return P[0] < Q[0];
1887  if (P[1] != Q[1])
1888     return P[1] < Q[1];
1889  if (P[2] != Q[2])
1890     return P[2] < Q[2];
1891   return P[3] < Q[3];
1892}
1893
1894In the real code, we get a lot more wrong than this.  However, even in this
1895code we generate:
1896
1897_foo:                                   ## @foo
1898## BB#0:                                ## %entry
1899	movb	(%rsi), %al
1900	movb	(%rdi), %cl
1901	cmpb	%al, %cl
1902	je	LBB0_2
1903LBB0_1:                                 ## %if.then
1904	cmpb	%al, %cl
1905	jmp	LBB0_5
1906LBB0_2:                                 ## %if.end
1907	movb	1(%rsi), %al
1908	movb	1(%rdi), %cl
1909	cmpb	%al, %cl
1910	jne	LBB0_1
1911## BB#3:                                ## %if.end38
1912	movb	2(%rsi), %al
1913	movb	2(%rdi), %cl
1914	cmpb	%al, %cl
1915	jne	LBB0_1
1916## BB#4:                                ## %if.end60
1917	movb	3(%rdi), %al
1918	cmpb	3(%rsi), %al
1919LBB0_5:                                 ## %if.end60
1920	setl	%al
1921	movzbl	%al, %eax
1922	ret
1923
1924Note that we generate jumps to LBB0_1 which does a redundant compare.  The
1925redundant compare also forces the register values to be live, which prevents
1926folding one of the loads into the compare.  In contrast, GCC 4.2 produces:
1927
1928_foo:
1929	movzbl	(%rsi), %eax
1930	cmpb	%al, (%rdi)
1931	jne	L10
1932L12:
1933	movzbl	1(%rsi), %eax
1934	cmpb	%al, 1(%rdi)
1935	jne	L10
1936	movzbl	2(%rsi), %eax
1937	cmpb	%al, 2(%rdi)
1938	jne	L10
1939	movzbl	3(%rdi), %eax
1940	cmpb	3(%rsi), %al
1941L10:
1942	setl	%al
1943	movzbl	%al, %eax
1944	ret
1945
1946which is "perfect".
1947
1948//===---------------------------------------------------------------------===//
1949
1950For the branch in the following code:
1951int a();
1952int b(int x, int y) {
1953  if (x & (1<<(y&7)))
1954    return a();
1955  return y;
1956}
1957
1958We currently generate:
1959	movb	%sil, %al
1960	andb	$7, %al
1961	movzbl	%al, %eax
1962	btl	%eax, %edi
1963	jae	.LBB0_2
1964
1965movl+andl would be shorter than the movb+andb+movzbl sequence.
1966
1967//===---------------------------------------------------------------------===//
1968
1969For the following:
1970struct u1 {
1971    float x, y;
1972};
1973float foo(struct u1 u) {
1974    return u.x + u.y;
1975}
1976
1977We currently generate:
1978	movdqa	%xmm0, %xmm1
1979	pshufd	$1, %xmm0, %xmm0        # xmm0 = xmm0[1,0,0,0]
1980	addss	%xmm1, %xmm0
1981	ret
1982
1983We could save an instruction here by commuting the addss.
1984
1985//===---------------------------------------------------------------------===//
1986
1987This (from PR9661):
1988
1989float clamp_float(float a) {
1990        if (a > 1.0f)
1991                return 1.0f;
1992        else if (a < 0.0f)
1993                return 0.0f;
1994        else
1995                return a;
1996}
1997
1998Could compile to:
1999
2000clamp_float:                            # @clamp_float
2001        movss   .LCPI0_0(%rip), %xmm1
2002        minss   %xmm1, %xmm0
2003        pxor    %xmm1, %xmm1
2004        maxss   %xmm1, %xmm0
2005        ret
2006
2007with -ffast-math.
2008
2009//===---------------------------------------------------------------------===//
2010
2011This function (from PR9803):
2012
2013int clamp2(int a) {
2014        if (a > 5)
2015                a = 5;
2016        if (a < 0) 
2017                return 0;
2018        return a;
2019}
2020
2021Compiles to:
2022
2023_clamp2:                                ## @clamp2
2024        pushq   %rbp
2025        movq    %rsp, %rbp
2026        cmpl    $5, %edi
2027        movl    $5, %ecx
2028        cmovlel %edi, %ecx
2029        testl   %ecx, %ecx
2030        movl    $0, %eax
2031        cmovnsl %ecx, %eax
2032        popq    %rbp
2033        ret
2034
2035The move of 0 could be scheduled above the test so that it becomes xor reg,reg.
2036
2037//===---------------------------------------------------------------------===//
2038
2039GCC PR48986.  We currently compile this:
2040
2041void bar(void);
2042void yyy(int* p) {
2043    if (__sync_fetch_and_add(p, -1) == 1)
2044      bar();
2045}
2046
2047into:
2048	movl	$-1, %eax
2049	lock
2050	xaddl	%eax, (%rdi)
2051	cmpl	$1, %eax
2052	je	LBB0_2
2053
2054Instead we could generate:
2055
2056	lock
	decl	(%rdi)
2058	je LBB0_2
2059
2060The trick is to match "fetch_and_add(X, -C) == C".
2061
2062//===---------------------------------------------------------------------===//
2063
2064unsigned t(unsigned a, unsigned b) {
2065  return a <= b ? 5 : -5;
2066}
2067
2068We generate:
2069	movl	$5, %ecx
2070	cmpl	%esi, %edi
2071	movl	$-5, %eax
2072	cmovbel	%ecx, %eax
2073
2074GCC:
2075	cmpl	%edi, %esi
2076	sbbl	%eax, %eax
2077	andl	$-10, %eax
2078	addl	$5, %eax
2079
2080//===---------------------------------------------------------------------===//
2081