gem5 - gem5

Age	Commit message (Collapse)	Author
2011-04-20	stats: one more name violation	Nathan Binkert

2011-04-19	stats: rename stats so they can be used as python expressions	Nathan Binkert

2011-04-15	trace: reimplement the DTRACE function so it doesn't use a vector	Nathan Binkert
	At the same time, rename the trace flags to debug flags since they have broader usage than simply tracing. This means that --trace-flags is now --debug-flags and --trace-help is now --debug-help
2011-04-15	debug: create a Debug namespace	Nathan Binkert

2011-04-15	includes: fix up code after sorting	Nathan Binkert

2011-04-15	includes: sort all includes	Nathan Binkert

2011-04-04	ARM: Fix checkpoint restoration into O3 CPU and the way O3 switchCpu works.	Ali Saidi
	This change fixes a small bug in the arm copyRegs() code where some registers wouldn't be copied if the processor was in a mode other than MODE_USER. Additionally, this change simplifies the way the O3 switchCpu code works by utilizing TheISA::copyRegs() to copy the required context information rather than the adhoc copying that goes on in the CPU model. The current code makes assumptions about the visibility of int and float registers that aren't true for all architectures in FS mode.
2011-04-04	ARM: Cleanup implementation of ITSTATE and put important code in PCState.	Ali Saidi
	Consolidate all code to handle ITSTATE in the PCState object rather than touching a variety of structures/objects.
2011-04-04	CPU: Remove references to memory copy operations	Ali Saidi

2011-04-04	O3: Tighten memory order violation checking to 16 bytes.	Ali Saidi
	The comment in the code suggests that the checking granularity should be 16 bytes, however in reality the shift by 8 is 256 bytes which seems much larger than required.
2011-03-31	Ruby: have the rubytester pass contextId to Ruby.	Lisa Hsu

2011-03-28	This patch supports cache flushing in MOESI_hammer	Somayeh Sardashti

2011-03-26	mips: cleanup ISA-specific code	Korey Sewell
	*** (1): get rid of expandForMT function MIPS is the only ISA that cares about having a piece of ISA state integrate multiple threads so add constants for MIPS and relieve the other ISAs from having to define this. Also, InOrder was the only core that was actively calling this function * * * (2): get rid of corespecific type The CoreSpecific type was used as a proxy to pass in HW specific params to a MIPS CPU, but since MIPS FS hasnt been touched for awhile, it makes sense to not force every other ISA to use CoreSpecific as well use a special reset function to set it. That probably should go in a PowerOn reset fault anyway.
2011-03-22	This patch fixes a build error in networktest.cc that occurs with gcc4.2	Tushar Krishna

2011-03-21	This patch adds the network tester for simple and garnet networks.	Tushar Krishna
	The tester code is in testers/networktest. The tester can be invoked by configs/example/ruby_network_test.py. A dummy coherence protocol called Network_test is also addded for network-only simulations and testing. The protocol takes in messages from the tester and just pushes them into the network in the appropriate vnet, without storing any state.
2011-03-19	Ruby: Convert AccessModeType to RubyAccessMode	Nilay Vaish
	This patch converts AccessModeType to RubyAccessMode so that both the protocol dependent and independent code uses the same access mode.
2011-03-17	ARM: Fix subtle bug in LDM.	Ali Saidi
	If the instruction faults mid-op the base register shouldn't be written back.
2011-03-17	ARM: Detect and skip udelay() functions in linux kernel.	Ali Saidi
	This change speeds up booting, especially in MP cases, by not executing udelay() on the core but instead skipping ahead tha amount of time that is being delayed.
2011-03-17	O3: Send instruction back to fetch on squash to seed predecoder correctly.	Ali Saidi

2011-03-17	O3: Cleanup the commitInfo comm struct.	Ali Saidi
	Get rid of unused members and use base types rather than derrived values where possible to limit amount of state.
2011-03-17	Mem: Fix issue with dirty block being lost when entire block transferred to ↵	Ali Saidi
	non-cache. This change fixes the problem for all the cases we actively use. If you want to try more creative I/O device attachments (E.g. sharing an L2), this won't work. You would need another level of caching between the I/O device and the cache (which you actually need anyway with our current code to make sure writes propagate). This is required so that you can mark the cache in between as top level and it won't try to send ownership of a block to the I/O device. Asserts have been added that should catch any issues.
2011-03-17	O3: Fix unaligned stores when cache blocked	Ali Saidi
	Without this change the a store can be issued to the cache multiple times. If this case occurs when the l1 cache is out of mshrs (and thus blocked) the processor will never make forward progress because each cycle it will send a single request using the recently freed mshr and not completing the multipart store. This will continue forever.
2011-03-01	Spelling: Fix the a spelling error by changing mmaped to mmapped.	Gabe Black
	There may not be a formally correct spelling for the past tense of mmap, but mmapped is the spelling Google doesn't try to autocorrect. This makes sense because it mirrors the past tense of map->mapped and not the past tense of cape->caped. --HG-- rename : src/arch/alpha/mmaped_ipr.hh => src/arch/alpha/mmapped_ipr.hh rename : src/arch/arm/mmaped_ipr.hh => src/arch/arm/mmapped_ipr.hh rename : src/arch/mips/mmaped_ipr.hh => src/arch/mips/mmapped_ipr.hh rename : src/arch/power/mmaped_ipr.hh => src/arch/power/mmapped_ipr.hh rename : src/arch/sparc/mmaped_ipr.hh => src/arch/sparc/mmapped_ipr.hh rename : src/arch/x86/mmaped_ipr.hh => src/arch/x86/mmapped_ipr.hh
2011-02-25	Ruby: Make DataBlock.hh independent of RubySystem	Nilay Vaish
	This patch changes DataBlock.hh so that it is not dependent on RubySystem. This dependence seems unecessary. All those functions that depende on RubySystem have been moved to DataBlock.cc file.
2011-02-25	O3CPU: Fix iqCount and lsqCount SMT fetch policies.	Timothy M. Jones
	Fixes two of the SMT fetch policies in O3CPU that were returning the count of instructions in the IQ or LSQ rather than the thread ID to fetch from.
2011-02-23	inorder: InstSeqNum bug	Korey Sewell
	Because int and not InstSeqNum was used in a couple of places, you can overflow the int type and thus get wierd bugs when the sequence number is negative (or some wierd value)
2011-02-23	inorder: dyn inst initialization	Korey Sewell
	remove constructors that werent being used (it just gets confusing) use initialization list for all the variables instead of relying on initVars() function
2011-02-23	inorder: cache packet handling	Korey Sewell
	-use a pointer to CacheReqPacket instead of PacketPtr so correct destructors get called on packet deletion - make sure to delete the packet if the cache blocks the sendTiming request or for some reason we dont use the packet - dont overwrite memory requests since in the worst case an instruction will be replaying a request so no need to keep allocating a new request - we dont use retryPkt so delete it - fetch code was split out already, so just assert that this is a memory reference inst. and that the staticInst is available
2011-02-23	O3: When a prefetch causes a fault, don't record it in the inst	Ali Saidi

2011-02-23	O3: If there is an outstanding table walk don't let the inst queue sleep.	Ali Saidi
	If there is an outstanding table walk and no other activity in the CPU it can go to sleep and never wake up. This change makes the instruction queue always active if the CPU is waiting for a store to translate. If Gabe changes the way this code works then the below should be removed as indicated by the todo.
2011-02-23	ARM: Do something for ISB, DSB, DMB	Ali Saidi

2011-02-23	ARM: Fix bug that let two table walks occur in parallel.	Ali Saidi

2011-02-23	O3: Fix bug when a squash occurs right before TLB miss returns.	Ali Saidi
	In this case we need to throw away the TLB miss, not assume it was the one we were waiting for.
2011-02-18	m5: merge inorder/release-notes/make_release changes	Korey Sewell

2011-02-18	inorder: add names and slot #s to res. dprints	Korey Sewell

2011-02-18	inorder: ignore nops in execution unit	Korey Sewell

2011-02-18	inorder: update graduation unit	Korey Sewell
	make sure instructions are able to commit before writing back to the RF do not commit more than 1 non-speculative instruction per cycle
2011-02-18	inorder: recognize isSerializeAfter flag	Korey Sewell
	keep track of when an instruction needs the execution behind it to be serialized. Without this, in SE Mode instructions can execute behind a system call exit().
2011-02-18	inorder: update default thread size(=1)	Korey Sewell
	a lot of structures get allocated based off that MaxThreads parameter so this is an effort to not abuse it
2011-02-18	inorder: don't overuse getLatency()	Korey Sewell
	resources don't need to call getLatency because the latency is already a member in the class. If there is some type of special case where different instructions impose a different latency inside a resource then we can revisit this and add getLatency() back in
2011-02-18	inorder: update max. resource bandwidths	Korey Sewell
	each resource has a certain # of requests it can take per cycle. update the #s here to be more realistic based off of the pipeline width and if the resource needs to be accessed on multiple cycles
2011-02-18	inorder: cleanup in destructors	Korey Sewell
	cleanup hanging pointers and other cruft in the destructors
2011-02-18	inorder: fix cache/fetch unit memory leaks	Korey Sewell
	--- need to delete the cache request's data on clearRequest() now that we are recycling requests --- fetch unit needs to deallocate the fetch buffer blocks when they are replaced or squashed.
2011-02-18	inorder: remove events for zero-cycle resources	Korey Sewell
	if a resource has a zero cycle latency (e.g. RegFile write), then dont allocate an event for it to use
2011-02-18	inorder: update pipeline interface for handling finished resource reqs	Korey Sewell
	formerly, to free up bandwidth in a resource, we could just change the pointer in that resource but at the same time the pipeline stages had visibility to see what happened to a resource request. Now that we are recycling these requests (to avoid too much dynamic allocation), we can't throw away the request too early or the pipeline stage gets bad information. Instead, mark when a request is done with the resource all together and then let the pipeline stage call back to the resource that it's time to free up the bandwidth for more instructions * inteface notes * - When an instruction completes and is done in a resource for that cycle, call done() - When an instruction fails and is done with a resource for that cycle, call done(false) - When an instruction completes, but isnt finished with a resource, call completed() - When an instruction fails, but isnt finished with a resource, call completed(false) * * * inorder: tlbmiss wakeup bug fix
2011-02-18	inorder: remove request map, use request vector	Korey Sewell
	take away all instances of reqMap in the code and make all references use the built-in request vectors inside of each resource. The request map was dynamically allocating a request per instruction. The request vector just allocates N number of requests during instantiation and then the surrounding code is fixed up to reuse those N requests *** setRequest() and clearRequest() are the new accessors needed to define a new request in a resource
2011-02-18	inorder: add valid bit for resource requests	Korey Sewell
	this will allow us to reuse resource requests within a resource instead of always dynamically allocating
2011-02-18	inorder: remove reqRemoveList	Korey Sewell
	we are going to be getting away from creating new resource requests for every instruction so no more need to keep track of a reqRemoveList and clean it up every tick
2011-02-18	inorder: initialize res. req. vectors based on resource bandwidth	Korey Sewell
	first change in an optimization that will stop InOrder from allocating new memory for every instruction's request to a resource. This gets expensive since every instruction needs to access ~10 requests before graduation. Instead, the plan is to allocate just enough resource request objects to satisfy each resource's bandwidth (e.g. the execution unit would need to allocate 3 resource request objects for a 1-issue pipeline since on any given cycle it could have 2 read requests and 1 write request) and then let the instructions contend and reuse those allocated requests. The end result is a smaller memory footprint for the InOrder model and increased simulation performance
2011-02-13	O3: Fetch from the microcode ROM when needed.	Gabe Black