Age | Commit message | Author |
|
This patch changes the order in which the L1 dcache and icache are looked up when
a request comes in. Previously, an instruction-fetch request looked up the dcache
before the icache, so that self-modifying code was handled correctly. But in the
common case the dcache reports a miss and the subsequent icache lookup reports a
hit. Given the invariant that caches under the same controller track disjoint
sets of cache blocks, we can move the icache lookup before the dcache lookup.
On an icache hit, the invariant tells us the dcache would have reported a miss.
On an icache miss, we know the icache would have missed even if the dcache had
been looked up first. Effectively we do the same thing as before, but in the
common case we expect a reduction in the number of lookups. This was empirically
confirmed for MOESI hammer: the ratio of lookups to access requests is now about
1.1 to 1.
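The reordering can be pictured with a minimal sketch; the CacheArray type and lookupL1 function below are made up for illustration and are not Ruby's actual classes:

```cpp
#include <cstdint>
#include <unordered_set>

// Hypothetical stand-ins for the L1 cache arrays; each tracks its own
// (disjoint) set of resident block addresses.
struct CacheArray {
    std::unordered_set<uint64_t> blocks;
    bool isTagPresent(uint64_t addr) const { return blocks.count(addr) != 0; }
};

enum class RequestType { InstFetch, Load, Store };

// For fetch requests, probe the icache first: on a hit the invariant
// guarantees the dcache would have missed, so its lookup is skipped.
// On an icache miss the outcome matches the old order, since the dcache
// result does not depend on whether the icache was probed first.
const CacheArray *lookupL1(const CacheArray &icache, const CacheArray &dcache,
                           uint64_t addr, RequestType type)
{
    if (type == RequestType::InstFetch) {
        if (icache.isTagPresent(addr))
            return &icache;          // common case: one lookup instead of two
        if (dcache.isTagPresent(addr))
            return &dcache;          // self-modifying code still handled
        return nullptr;              // miss in both
    }
    // Data requests keep the usual dcache-first order.
    if (dcache.isTagPresent(addr))
        return &dcache;
    if (icache.isTagPresent(addr))
        return &icache;
    return nullptr;
}
```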
|
|
Remove remnants of the old instruction-scheduling approach, which dynamically
allocated a new resource schedule for every instruction.
|
|
allow the pipeline and resources to use the cached instruction schedule and resource
sked iterator
|
|
Resource skeds are divided into two parts: a front end (common to all instructions)
and a back end (instruction specific). Each of these is implemented as a separate
list, so this iterator wraps the traditional list iterator so that an instruction
can walk its schedule and seamlessly transfer from front end to back end when necessary.
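A rough sketch of such a wrapper, assuming the two halves of the schedule are plain std::lists; SkedEntry and the member names are illustrative, not the real InOrder types:

```cpp
#include <list>
#include <string>

// Hypothetical schedule entry; the real resource sked stores per-stage
// resource requests, but a name is enough for the sketch.
struct SkedEntry { std::string resource; };

// Wraps two std::list iterators so that a walk over the schedule moves from
// the shared front-end list to the instruction-specific back-end list
// transparently.
class SkedIterator {
  public:
    SkedIterator(std::list<SkedEntry> &front, std::list<SkedEntry> &back)
        : frontEnd(front), backEnd(back), curr(front.begin()), inFrontEnd(true)
    { skipToBackEndIfNeeded(); }

    bool done() const { return !inFrontEnd && curr == backEnd.end(); }

    SkedEntry &operator*() { return *curr; }

    SkedIterator &operator++()
    {
        ++curr;
        skipToBackEndIfNeeded();
        return *this;
    }

  private:
    void skipToBackEndIfNeeded()
    {
        // Hop to the back-end list once the front end is exhausted.
        if (inFrontEnd && curr == frontEnd.end()) {
            inFrontEnd = false;
            curr = backEnd.begin();
        }
    }

    std::list<SkedEntry> &frontEnd;
    std::list<SkedEntry> &backEnd;
    std::list<SkedEntry>::iterator curr;
    bool inFrontEnd;
};
```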
|
|
Add a stage scheduler class to replace InstStage in pipeline_traits.cc.
Use that class to define a default front-end resource schedule that all
instructions will follow. It will also replace the back-end schedule in
pipeline_traits.cc. The reason for adding this is so that we can cache
instruction schedules in the future instead of calling the same function
over and over again and constantly allocating memory dynamically on
every instruction just to figure out its schedule.
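A minimal sketch of the caching idea, with hypothetical ScheduleEntry/StageScheduler types and made-up stage and resource IDs; the default front-end schedule is built once and shared instead of being rebuilt per instruction:

```cpp
#include <memory>
#include <vector>

// Hypothetical descriptor of one pipeline stage's resource request.
struct ScheduleEntry {
    int stageNum;
    int resourceId;
};

// Small builder in the spirit of a "stage scheduler": collects the entries
// for one stage of the front-end schedule.
class StageScheduler {
  public:
    StageScheduler(std::vector<ScheduleEntry> &sked, int stage)
        : sked(sked), stageNum(stage) {}

    void needs(int resourceId) { sked.push_back({stageNum, resourceId}); }

  private:
    std::vector<ScheduleEntry> &sked;
    int stageNum;
};

// Build the default front-end schedule exactly once and hand out a shared
// pointer to it, instead of re-deriving and re-allocating it per instruction.
std::shared_ptr<std::vector<ScheduleEntry>> defaultFrontEndSked()
{
    static std::shared_ptr<std::vector<ScheduleEntry>> cached;
    if (!cached) {
        cached = std::make_shared<std::vector<ScheduleEntry>>();
        StageScheduler fetch(*cached, /*stage=*/0);
        fetch.needs(/*resourceId=*/0);   // e.g. fetch unit
        StageScheduler decode(*cached, /*stage=*/1);
        decode.needs(/*resourceId=*/1);  // e.g. decode unit
    }
    return cached;
}
```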
|
|
First step in an optimization: do not dynamically allocate an instruction schedule
for every instruction, but rather use cached schedules.
|
|
|
|
The inst_buffer file isn't used, so remove it.
|
|
Pass/fail ops were used for testing but aren't part of the ISA.
|
|
|
|
|
|
|
|
When a table walk is initiated by the fetch stage, the CPU can
potentially move to the idle state and never wake up.
The fetch stage must call cpu->wakeCPU() when a translation completes
(in finishTranslation()).
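A tiny sketch of the fix, using stand-in Cpu/Fetch types rather than the real CPU classes:

```cpp
// Hypothetical skeleton: when the delayed translation finishes, the fetch
// stage wakes the CPU before continuing, so a CPU that went idle during the
// table walk resumes ticking.
struct Cpu {
    bool awake = false;
    void wakeCPU() { awake = true; }   // stand-in for the real wake-up path
};

struct Fetch {
    Cpu *cpu;

    void finishTranslation(/* fault, request, ... */)
    {
        cpu->wakeCPU();   // the missing call: without it the CPU can sleep forever
        // ... continue the fetch using the completed translation ...
    }
};
```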
|
|
Uncacheable requests were set as such only in atomic mode.
currState->delayed is checked in place of currState->timing for resetting
currState in atomic mode.
|
|
This change fixes an issue where a DTLB fault occurs and redirects fetch to
handle the fault, while the ITLB requires a walk which delays translation. In this
case the status of the CPU isn't updated appropriately, and an additional
instruction fetch occurs. Eventually this hits an assert, as multiple instruction
fetches are occurring in the system, and when the second one returns the
processor is in the wrong state.
Some asserts below are removed: one was always true (because of a typo), and
after initiateAcc() the processor can be in any valid state when a d-side fault
occurs.
|
|
Some ISAs (like ARM) rely on hardware page table walkers. For those ISAs,
when a TLB miss occurs, initiateTranslation() can return with NoFault but with
the translation unfinished.
Instructions experiencing a delayed translation due to a hardware page table
walk are deferred until the translation completes and are kept in the IQ. In
order to keep track of them, the IQ has been augmented with a queue of the
outstanding delayed memory instructions. When their translation completes,
these instructions are re-executed (only their initiateAcc() was already
executed; their DTB translation is now skipped). The IEW stage has been
modified to support such a 2-pass execution.
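A sketch of that bookkeeping, with a made-up DynInst record; the method names only mirror the description above and are not claimed to be the real IQ interface:

```cpp
#include <deque>
#include <memory>

// Hypothetical dynamic-instruction record; only the fields needed for the
// 2-pass sketch are shown.
struct DynInst {
    bool translationCompleted = false;  // set when the HW page table walk ends
    bool initiateAccDone = false;       // first pass (initiateAcc) already ran
};
using DynInstPtr = std::shared_ptr<DynInst>;

class InstructionQueue {
  public:
    // Called when the first pass returned NoFault but the DTB walk is still
    // pending: park the instruction instead of completing it.
    void deferMemInst(const DynInstPtr &inst) { deferredMemInsts.push_back(inst); }

    // Scanned when a walk completes: instructions whose translation finished
    // are handed back for the second execution pass, which skips the DTB
    // translation that has already been done.
    DynInstPtr getDeferredMemInstToExecute()
    {
        for (auto it = deferredMemInsts.begin(); it != deferredMemInsts.end(); ++it) {
            if ((*it)->translationCompleted) {
                DynInstPtr inst = *it;
                deferredMemInsts.erase(it);
                return inst;
            }
        }
        return nullptr;
    }

  private:
    std::deque<DynInstPtr> deferredMemInsts;  // outstanding delayed mem insts
};
```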
|
|
The timer calculations were a bit off, so time would run faster than
it otherwise should.
|
|
Set up the initial timesync event in initState or loadState so that curTick has
been updated to the new value; otherwise the event is scheduled in the past.
|
|
|
|
The TBE pointer in the MESI CMP implementation was not being set to NULL
when the TBE is deallocated. This resulted in a segmentation fault when testing
the protocol with ProtocolTrace switched on.
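The fix amounts to clearing the cached pointer on deallocation; a minimal stand-alone sketch (not the generated SLICC code):

```cpp
// Hypothetical transaction buffer entry and controller state, illustrating
// the fix: clear the cached TBE pointer when the entry is deallocated so a
// later trace/print of the stale pointer cannot segfault.
struct TBE { /* per-transaction bookkeeping */ };

struct L1Controller {
    TBE *tbe = nullptr;

    void allocateTBE() { tbe = new TBE; }

    void deallocateTBE()
    {
        delete tbe;
        tbe = nullptr;   // the missing assignment
    }
};
```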
|
|
If cr0.wp ("write protect" bit) is clear then do not generate page faults when
writing to write-protected pages in kernel mode.
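A hedged sketch of just this condition, with hypothetical parameter names; the real page-fault logic has many more inputs:

```cpp
// 'cpl' is the current privilege level, 'pteWritable' the writable bit of
// the translation being used for the access.
bool writeCausesPageFault(bool cr0_wp, unsigned cpl, bool pteWritable, bool isWrite)
{
    if (!isWrite || pteWritable)
        return false;
    // In user mode a write to a read-only page always faults. In kernel
    // mode (CPL < 3) it faults only when CR0.WP is set.
    if (cpl == 3)
        return true;
    return cr0_wp;
}
```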
|
|
During SYSCALL_64, use dataSize=8 when handling new rip (ref
http://www.intel.com/Assets/PDF/manual/253668.pdf 5.8.8 IA32_LSTAR is a 64-bit
address)
|
|
JMP_FAR_I was unpacking its far pointer operand using sll instead of srl as it
should, and it was also putting the components in the wrong registers for use by
other microcode.
|
|
During iret, access the LDT/GDT at CPL0 rather than after the transition to user
mode (if I'm reading the Intel IA-64 architecture spec correctly, the contents of
the descriptor table are read before the CPL is updated).
|
|
The code for Orion 2.0 makes use of printf() in several places where there was
an error in the configuration of the model. These have been replaced with fatal().
|
|
A missing header file caused RUBY_FS to not compile.
|
|
|
|
By stalling and waiting the mandatory queue instead of recycling it, one can
ensure that no incoming messages are starved when the mandatory queue puts
significant pressure on the L1 cache controller (i.e. the ruby memtester).
--HG--
rename : src/mem/slicc/ast/WakeUpDependentsStatementAST.py => src/mem/slicc/ast/WakeUpAllDependentsStatementAST.py
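A C++ sketch (not SLICC) of the idea, with a hypothetical MandatoryQueue; it shows only the stall-and-wait side: blocked messages are parked per address until an explicit wake-up, instead of being repeatedly re-inserted to retry, so other incoming messages keep flowing:

```cpp
#include <cstdint>
#include <deque>
#include <map>

struct Message { uint64_t addr; };

class MandatoryQueue {
  public:
    // Park a request that cannot be serviced yet (e.g. its block is busy).
    void stallAndWait(const Message &m) { stalled[m.addr].push_back(m); }

    // When the blocking condition for 'addr' clears, move every parked
    // request for that address back to the ready queue; nothing else was
    // held up behind retries in the meantime.
    void wakeUpDependents(uint64_t addr)
    {
        auto it = stalled.find(addr);
        if (it == stalled.end())
            return;
        for (const Message &m : it->second)
            ready.push_back(m);
        stalled.erase(it);
    }

  private:
    std::deque<Message> ready;
    std::map<uint64_t, std::deque<Message>> stalled;
};
```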
|
|
|
|
Split out dynamic and static power numbers for printing to ruby.stats
|
|
|
|
|
|
|
|
The packet now identifies whether static or dynamic data has been allocated; this
is used by Ruby to determine whether to copy the data pointer into the ruby
request. Subsequently, Ruby can be told not to update physical memory when
receiving packets.
|
|
|
|
|
|
Move page table walker state to its own object type, and make the
walker instantiate state for each outstanding walk. By storing the
states in a queue, the walker is able to handle multiple outstanding
timing requests. Note that functional walks use separate state
elements.
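A sketch of the new structure, using made-up WalkerState fields and method names rather than the actual x86 walker interface:

```cpp
#include <cstdint>
#include <list>

// Hypothetical per-walk state: everything one outstanding translation needs.
struct WalkerState {
    uint64_t vaddr;
    int      level;       // current page-table level
    bool     functional;  // functional walks keep separate state objects
};

class Walker {
  public:
    // Start a new timing walk: allocate state and queue it so several walks
    // can be in flight at once.
    WalkerState *startTimingWalk(uint64_t vaddr)
    {
        currStates.push_back(WalkerState{vaddr, /*level=*/0, /*functional=*/false});
        return &currStates.back();   // std::list keeps this pointer stable
    }

    // A functional walk uses its own state element and never enters the queue.
    void walkFunctional(uint64_t vaddr)
    {
        WalkerState state{vaddr, 0, true};
        // ... walk the tables synchronously using 'state' ...
        (void)state;
    }

  private:
    std::list<WalkerState> currStates;  // all outstanding timing walks
};
```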
|
|
In sendSplitData, keep a pointer to the senderState that may be updated after
the call to handle*Packet. This way, if the receiver updates the packet's
senderState, it can still be accessed in sendSplitData.
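A sketch of the pattern, with stand-in Packet/SenderState types; the point is only that the original senderState pointer is saved before the call that may replace it:

```cpp
struct SenderState {
    SenderState *predecessor = nullptr;   // split-request bookkeeping, etc.
};

struct Packet {
    SenderState *senderState = nullptr;
};

// Stand-in for the real send path: the receiver may push its own senderState
// on top of the packet's current one.
void handleSendPacket(Packet *pkt)
{
    pkt->senderState = new SenderState{pkt->senderState};
}

void sendSplitData(Packet *pkt1, Packet *pkt2)
{
    // Remember the states we own before handing the packets down; the calls
    // below may overwrite pkt->senderState.
    SenderState *origState1 = pkt1->senderState;
    SenderState *origState2 = pkt2->senderState;

    handleSendPacket(pkt1);
    handleSendPacket(pkt2);

    // The split-request bookkeeping can still be reached through the saved
    // pointers, regardless of what the receiver chained onto the packets.
    (void)origState1;
    (void)origState2;
}
```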
|
|
|
|
|
|
|
|
|
|
This patch ensures only aligned accesses are passed to Ruby and includes a fix
to the DPRINTF address print.
|
|
|
|
Add checkpointing capability to the Intel 8254 timer, CMOS, I8042,
PS2 Keyboard and Mouse, I82094AA, I8237, I8254, I8259, and speaker
devices
|
|
Add checkpointing capability to the x86 interrupt device and the TLBs
|
|
Calls walker to look up virt. to phys. page mapping
|
|
The x86 local apic now includes a separate latency parameter for interrupts.
|
|
The double packet delete problem is due to the interrupt device deleting a packet that the SimpleTimingPort also deletes. Since MessagePort descends from SimpleTimingPort, simply reimplement the failing code from SimpleTimingPort::recvTiming.
|
|
|