gem5 - gem5

Age	Commit message (Collapse)	Author
2011-05-23	O3: Fix offset calculation into storeQueue buffer for store->load forwarding	Geoffrey Blake
	Calculation of offset to copy from storeQueue[idx].data structure for load to store forwarding fixed to be difference in bytes between store and load virtual addresses. Previous method would induce bug where a load would index into buffer at the wrong location.
2011-05-23	O3: Fix issue w/wbOutstading being decremented multiple times on blocked cache.	Geoffrey Blake
	If a split load fails on a blocked cache wbOutstanding can be decremented twice if the first part of the split load succeeds and the second part fails. Condition the decrementing on not having completed the first part of the load.
2011-05-23	O3: Fix issue with interrupts/faults occuring in the middle of a macro-op	Geoffrey Blake
	This patch fixes two problems with the O3 cpu model. The first is an issue with an instruction fetch causing a fault on the next address while the current macro-op is being issued. This happens when the micro-ops exceed the fetch bandwdith and then on the next cycle the fetch stage attempts to issue a request to the next line while it still has micro-ops to issue if the next line faults a fault is attached to a micro-op in the currently executing macro-op rather than a "nop" from the next instruction block. This leads to an instruction incorrectly faulting when on fetch when it had no reason to fault. A similar problem occurs with interrupts. When an interrupt occurs the fetch stage nominally stops issuing instructions immediately. This is incorrect in the case of a macro-op as the current location might not be interruptable.
2011-05-13	Trace: Allow printing ASIDs and selectively tracing based on user/kernel code.	Chander Sudanthi
	Debug flags are ExecUser, ExecKernel, and ExecAsid. ExecUser and ExecKernel are set by default when Exec is specified. Use minus sign with ExecUser or ExecKernel to remove user or kernel tracing respectively.
2011-05-13	O3: Fix an issue with a load & branch instruction and mem dep squashing	Geoffrey Blake
	Instructions that load an address and are control instructions can execute down the wrong path if they were predicted correctly and then instructions following them are squashed. If an instruction is a memory and control op use the predicted address for the next PC instead of just advancing the PC. Without this change NPC is used for the next instruction, but predPC is used to verify that the branch was successful so the wrong path is silently executed.
2011-05-09	work around gcc 4.5 warning	Nathan Binkert

2011-05-07	NetworkTest: added sim_cycles parameter to the network tester.	Tushar Krishna
	The network tester terminates after injecting for sim_cycles (default=1000), instead of having to explicitly pass --maxticks from the command line as before. If fixed_pkts is enabled, the tester only injects maxpackets number of packets, else it keeps injecting till sim_cycles. The tester also works with zero command line arguments now.
2011-05-04	CPU: Add some useful debug message to the timing simple cpu.	Ali Saidi

2011-05-04	CPU: Fix a case where timing simple cpu faults can nest.	Ali Saidi
	If we fault, change the state to faulting so that we don't fault again in the same cycle.
2011-05-04	O3: Remove assertion for case that is actually handled in code.	Ali Saidi
	If an nonspeculative instruction has a fault it might not be in the nonSpecInsts map.
2011-05-04	O3: Fix a small corner case with the lsq hazard detection logic.	Ali Saidi

2011-04-20	stats: one more name violation	Nathan Binkert

2011-04-19	stats: rename stats so they can be used as python expressions	Nathan Binkert

2011-04-15	trace: reimplement the DTRACE function so it doesn't use a vector	Nathan Binkert
	At the same time, rename the trace flags to debug flags since they have broader usage than simply tracing. This means that --trace-flags is now --debug-flags and --trace-help is now --debug-help
2011-04-15	debug: create a Debug namespace	Nathan Binkert

2011-04-15	includes: fix up code after sorting	Nathan Binkert

2011-04-15	includes: sort all includes	Nathan Binkert

2011-04-04	ARM: Fix checkpoint restoration into O3 CPU and the way O3 switchCpu works.	Ali Saidi
	This change fixes a small bug in the arm copyRegs() code where some registers wouldn't be copied if the processor was in a mode other than MODE_USER. Additionally, this change simplifies the way the O3 switchCpu code works by utilizing TheISA::copyRegs() to copy the required context information rather than the adhoc copying that goes on in the CPU model. The current code makes assumptions about the visibility of int and float registers that aren't true for all architectures in FS mode.
2011-04-04	ARM: Cleanup implementation of ITSTATE and put important code in PCState.	Ali Saidi
	Consolidate all code to handle ITSTATE in the PCState object rather than touching a variety of structures/objects.
2011-04-04	CPU: Remove references to memory copy operations	Ali Saidi

2011-04-04	O3: Tighten memory order violation checking to 16 bytes.	Ali Saidi
	The comment in the code suggests that the checking granularity should be 16 bytes, however in reality the shift by 8 is 256 bytes which seems much larger than required.
2011-03-31	Ruby: have the rubytester pass contextId to Ruby.	Lisa Hsu

2011-03-28	This patch supports cache flushing in MOESI_hammer	Somayeh Sardashti

2011-03-26	mips: cleanup ISA-specific code	Korey Sewell
	*** (1): get rid of expandForMT function MIPS is the only ISA that cares about having a piece of ISA state integrate multiple threads so add constants for MIPS and relieve the other ISAs from having to define this. Also, InOrder was the only core that was actively calling this function * * * (2): get rid of corespecific type The CoreSpecific type was used as a proxy to pass in HW specific params to a MIPS CPU, but since MIPS FS hasnt been touched for awhile, it makes sense to not force every other ISA to use CoreSpecific as well use a special reset function to set it. That probably should go in a PowerOn reset fault anyway.
2011-03-22	This patch fixes a build error in networktest.cc that occurs with gcc4.2	Tushar Krishna

2011-03-21	This patch adds the network tester for simple and garnet networks.	Tushar Krishna
	The tester code is in testers/networktest. The tester can be invoked by configs/example/ruby_network_test.py. A dummy coherence protocol called Network_test is also addded for network-only simulations and testing. The protocol takes in messages from the tester and just pushes them into the network in the appropriate vnet, without storing any state.
2011-03-19	Ruby: Convert AccessModeType to RubyAccessMode	Nilay Vaish
	This patch converts AccessModeType to RubyAccessMode so that both the protocol dependent and independent code uses the same access mode.
2011-03-17	ARM: Fix subtle bug in LDM.	Ali Saidi
	If the instruction faults mid-op the base register shouldn't be written back.
2011-03-17	ARM: Detect and skip udelay() functions in linux kernel.	Ali Saidi
	This change speeds up booting, especially in MP cases, by not executing udelay() on the core but instead skipping ahead tha amount of time that is being delayed.
2011-03-17	O3: Send instruction back to fetch on squash to seed predecoder correctly.	Ali Saidi

2011-03-17	O3: Cleanup the commitInfo comm struct.	Ali Saidi
	Get rid of unused members and use base types rather than derrived values where possible to limit amount of state.
2011-03-17	Mem: Fix issue with dirty block being lost when entire block transferred to ↵	Ali Saidi
	non-cache. This change fixes the problem for all the cases we actively use. If you want to try more creative I/O device attachments (E.g. sharing an L2), this won't work. You would need another level of caching between the I/O device and the cache (which you actually need anyway with our current code to make sure writes propagate). This is required so that you can mark the cache in between as top level and it won't try to send ownership of a block to the I/O device. Asserts have been added that should catch any issues.
2011-03-17	O3: Fix unaligned stores when cache blocked	Ali Saidi
	Without this change the a store can be issued to the cache multiple times. If this case occurs when the l1 cache is out of mshrs (and thus blocked) the processor will never make forward progress because each cycle it will send a single request using the recently freed mshr and not completing the multipart store. This will continue forever.
2011-03-01	Spelling: Fix the a spelling error by changing mmaped to mmapped.	Gabe Black
	There may not be a formally correct spelling for the past tense of mmap, but mmapped is the spelling Google doesn't try to autocorrect. This makes sense because it mirrors the past tense of map->mapped and not the past tense of cape->caped. --HG-- rename : src/arch/alpha/mmaped_ipr.hh => src/arch/alpha/mmapped_ipr.hh rename : src/arch/arm/mmaped_ipr.hh => src/arch/arm/mmapped_ipr.hh rename : src/arch/mips/mmaped_ipr.hh => src/arch/mips/mmapped_ipr.hh rename : src/arch/power/mmaped_ipr.hh => src/arch/power/mmapped_ipr.hh rename : src/arch/sparc/mmaped_ipr.hh => src/arch/sparc/mmapped_ipr.hh rename : src/arch/x86/mmaped_ipr.hh => src/arch/x86/mmapped_ipr.hh
2011-02-25	Ruby: Make DataBlock.hh independent of RubySystem	Nilay Vaish
	This patch changes DataBlock.hh so that it is not dependent on RubySystem. This dependence seems unecessary. All those functions that depende on RubySystem have been moved to DataBlock.cc file.
2011-02-25	O3CPU: Fix iqCount and lsqCount SMT fetch policies.	Timothy M. Jones
	Fixes two of the SMT fetch policies in O3CPU that were returning the count of instructions in the IQ or LSQ rather than the thread ID to fetch from.
2011-02-23	inorder: InstSeqNum bug	Korey Sewell
	Because int and not InstSeqNum was used in a couple of places, you can overflow the int type and thus get wierd bugs when the sequence number is negative (or some wierd value)
2011-02-23	inorder: dyn inst initialization	Korey Sewell
	remove constructors that werent being used (it just gets confusing) use initialization list for all the variables instead of relying on initVars() function
2011-02-23	inorder: cache packet handling	Korey Sewell
	-use a pointer to CacheReqPacket instead of PacketPtr so correct destructors get called on packet deletion - make sure to delete the packet if the cache blocks the sendTiming request or for some reason we dont use the packet - dont overwrite memory requests since in the worst case an instruction will be replaying a request so no need to keep allocating a new request - we dont use retryPkt so delete it - fetch code was split out already, so just assert that this is a memory reference inst. and that the staticInst is available
2011-02-23	O3: When a prefetch causes a fault, don't record it in the inst	Ali Saidi

2011-02-23	O3: If there is an outstanding table walk don't let the inst queue sleep.	Ali Saidi
	If there is an outstanding table walk and no other activity in the CPU it can go to sleep and never wake up. This change makes the instruction queue always active if the CPU is waiting for a store to translate. If Gabe changes the way this code works then the below should be removed as indicated by the todo.
2011-02-23	ARM: Do something for ISB, DSB, DMB	Ali Saidi

2011-02-23	ARM: Fix bug that let two table walks occur in parallel.	Ali Saidi

2011-02-23	O3: Fix bug when a squash occurs right before TLB miss returns.	Ali Saidi
	In this case we need to throw away the TLB miss, not assume it was the one we were waiting for.
2011-02-18	m5: merge inorder/release-notes/make_release changes	Korey Sewell

2011-02-18	inorder: add names and slot #s to res. dprints	Korey Sewell

2011-02-18	inorder: ignore nops in execution unit	Korey Sewell

2011-02-18	inorder: update graduation unit	Korey Sewell
	make sure instructions are able to commit before writing back to the RF do not commit more than 1 non-speculative instruction per cycle
2011-02-18	inorder: recognize isSerializeAfter flag	Korey Sewell
	keep track of when an instruction needs the execution behind it to be serialized. Without this, in SE Mode instructions can execute behind a system call exit().
2011-02-18	inorder: update default thread size(=1)	Korey Sewell
	a lot of structures get allocated based off that MaxThreads parameter so this is an effort to not abuse it