Age | Commit message (Collapse) | Author | |
---|---|---|---|
2018-05-02 | arch-x86: implement movntps/movntpd SSE insts | Steve Reinhardt | |
These are non-temporal packed SSE stores. Change-Id: I526cd6551b38d6d35010bc6173f23d017106b466 Reviewed-on: https://gem5-review.googlesource.com/9861 Reviewed-by: Gabe Black <gabeblack@google.com> Maintainer: Gabe Black <gabeblack@google.com> | |||
2015-10-06 | x86: implement rcpps and rcpss SSE insts | Steve Reinhardt | |
These are packed single-precision approximate reciprocal operations, vector and scalar versions, respectively. This code was basically developed by copying the code for sqrtps and sqrtss. The mrcp micro-op was simplified relative to msqrt since there are no double-precision versions of this operation. | |||
2015-07-04 | x86: Adjust the size of the values written to the x87 misc registers | Nikos Nikoleris | |
All x87 misc registers are implemented in an array of 64 bit values but in real hardware the size of some of these registers is smaller. Previsouly all 64 bits where incorrectly set and then later read. To ensure correctness we mask the value in setMiscRegNoEffect to write only the valid bits. Committed by: Nilay Vaish <nilay@cs.wisc.edu> | |||
2015-01-10 | x86 : fxsave and fxrestore missing template code | Emilio Castillo | |
This patch corrects the FXSAVE and FXRSTOR Macroops. The actual code used for saving/restore the FP registers is in the file but it was not used. The FXSAVE and FXRSTOR instructions are used in the kernel for saving and loading the state of the mmx,xmm and fpu registers. This operation is triggered in FS by issuing a Device Not Available Fault. The cr0 register has a TS flag that is set upon each context change. Every time a task access any FP related register (SIMD as well) if the TS flag is set to one, the device not available fault is issued. The kernel saves the current state of the registers, and restore the previous state of the currently running task. Right now Gem5 lacks of this capability. the Device Not Available Fault is never issued, leading to several problems when different threads share the same CPU and SMT is not used. The PARSEC Ferret benchmark is an example of this behavior. In order to test this a hack in the atomic cpu code was done to detect if a static instruction has any FP operands and the cr0 reg TS bit is set. This check must be done in the ISA dependent code. But it seems to be tricky to access the cr0 register while executing an instruction. Committed by: Nilay Vaish <nilay@cs.wisc.edu> | |||
2015-01-03 | x86: implements the simd128 ADDSUBPD instruction | Maxime Martinasso | |
This patch implements the simd128 ADDSUBPD instruction for the x86 architecture. Tested with a simple program in assembly language which executes the instruction. Checked that different versions of the instruction are executed by using the execution tracing option. Committed by: Nilay Vaish <nilay@cs.wisc.edu | |||
2013-09-30 | x86: Add support for FXSAVE, FXSAVE64, FXRSTOR, and FXRSTOR64 | Andreas Sandberg | |
2012-05-19 | x86 ISA: Implement the sse3 haddps instruction. | Marc Orr | |
Shuffle the 32 bit values into position, and then add in parallel. | |||
2010-05-23 | copyright: Change HP copyright on x86 code to be more friendly | Nathan Binkert | |
2009-12-19 | X86: Add a common named flag for signed media operations. | Gabe Black | |
2009-12-19 | X86: Create a common flag with a name to indicate high multiplies. | Gabe Black | |
2009-12-19 | X86: Create a common flag with a name to indicate scalar media instructions. | Gabe Black | |
2009-11-04 | X86: Fix problem with movhps instruction | Vince Weaver | |
This problem is like the one fixed with movhpd a few weeks ago. A +8 displacement is used to access memory when there should be none. This fix is needed for the perlbmk spec2k benchmark to run. | |||
2009-10-30 | X86: Add support for x86 psrldq and pslldq instructions | Vince Weaver | |
These are complicated instructions and the micro-code might be suboptimal. This has been tested with some small sample programs (attached) The psrldq instruction is needed by various spec2k programs. | |||
2009-10-30 | X86: Implement the X86 sse2 haddpd instruction | Vince Weaver | |
This patch implements the haddpd instruction. It fixes the problem in the previous version (pointed out by Gabe Black) where an incorrect result would happen if you issue the instruction with the same argument twice, i.e. "haddpd %xmm0,%xmm0" This instruction is used by many spec2k benchmarks. | |||
2009-10-27 | X86: Replace "DISPLACEMENT" with disp in movhpd. | Gabe Black | |
2009-10-27 | Fix problem with the x86 sse movhpd instruction. | Vince Weaver | |
The movhpd instruction was writing to the wrong memory offset. | |||
2009-10-21 | Implement X86 sse2 movdqu and movdqa instructions | Vince Weaver | |
The movdqa instruction should enforce 16-byte alignment. This implementation does not do that. These instructions are needed for most of x86_64 spec2k to run. | |||
2009-08-17 | X86: Implement MOVQ2DQ. | Gabe Black | |
2009-08-17 | X86: Implement MOVDQ2Q. | Gabe Black | |
2009-08-17 | X86: Implement the media instructions that convert fp values to ints. | Gabe Black | |
2009-08-17 | X86: Implement the instructions that compare fp values and write a mask as a ↵ | Gabe Black | |
result. | |||
2009-08-17 | X86: Implement the instructions that compare fp values and write to rflags. | Gabe Black | |
2009-08-17 | X86: Implement MOVSS. | Gabe Black | |
2009-08-17 | X86: Implement LDMXCSR. | Gabe Black | |
2009-08-17 | X86: Implement STMXCSR. | Gabe Black | |
2009-08-17 | X86: Implement the shuffle media instructions. | Gabe Black | |
2009-08-17 | X86: Implement the mask move instructions. | Gabe Black | |
2009-08-17 | X86: Implement the instructions that move sign bits. | Gabe Black | |
2009-08-17 | X86: Implement the insert/extract instructions. | Gabe Black | |
2009-08-17 | X86: Implement the media average instructions. | Gabe Black | |
2009-08-17 | X86: Implement the multiply and add instructions. | Gabe Black | |
2009-08-17 | X86: Implement the media shifts that operate on 64 bits or less at a time. | Gabe Black | |
2009-08-17 | X86: Implement the sum of absolute differences instructions. | Gabe Black | |
2009-08-17 | X86: Implement the media integer subtract instructions. | Gabe Black | |
2009-08-17 | X86: Implement the integer media multiply instructions. | Gabe Black | |
2009-08-17 | X86: Implement the integer media max instructions. | Gabe Black | |
2009-08-17 | X86: Implement the integer media min instructions. | Gabe Black | |
2009-08-17 | X86: Implement the media integer addition instructions. | Gabe Black | |
2009-08-17 | X86: Implement the instructions that convert between forms of floating point. | Gabe Black | |
2009-08-17 | X86: Implement the instructions that compare fp values and write masks as ↵ | Gabe Black | |
the result. | |||
2009-08-17 | X86: Implement the MOVDDUP instruction. | Gabe Black | |
2009-08-17 | X86: Implement many of the media mov instructions. | Gabe Black | |
2009-08-17 | X86: Implement the media instructions that convert integer values to ↵ | Gabe Black | |
floating point. | |||
2009-08-17 | X86: Implement the floating point media instructions. | Gabe Black | |
2009-08-17 | X86: Implement the floating point media multiply instructions. | Gabe Black | |
2009-08-17 | X86: Implement the floating point media subtract instructions. | Gabe Black | |
2009-08-17 | X86: Implement the floating point media add instructions. | Gabe Black | |
2009-08-17 | X86: Implement the media sqrt instructions. | Gabe Black | |
2009-08-17 | X86: Implement the media floating point max instructions. | Gabe Black | |
2009-08-17 | X86: Implement the media floating point min instructions. | Gabe Black | |