gem5 - gem5

Age	Commit message (Collapse)	Author
2014-10-01	misc: Fix issues identified by static analysis	Andreas Hansson
	Another bunch of issues addressed.
2014-10-01	arm: Use MiscRegIndex rather than int when flattening	Andreas Hansson
	Some additional type checking to avoid future issues.
2014-10-01	arm: More UBSan cleanups after additional full-system runs	Andreas Hansson
	Some incorrect casting to IntRegIndex, and a few uninitialized members in the i8254xGBe device.
2014-09-27	arm: Fixed undefined behaviours identified by gcc	Andreas Hansson
	This patch fixes the runtime errors highlighted by the undefined behaviour sanitizer. In the end there were two issues. First, when rotating an immediate, we ended up shifting an uint32_t by 32 in some cases. This case is fixed by checking for a rotation by 0 positions. Second, the Mrc15 and Mcr15 are operating on an IntReg and a MiscReg, but we used the type RegRegImmOp and passed a MiscRegIndex as an IntRegIndex. This issue is resolved by introducing a MiscRegRegImmOp and RegMiscRegImmOp with the appropriate types. With these fixes there are no runtime errors identified for the full ARM regressions.
2014-09-27	arch: Use const StaticInstPtr references where possible	Andreas Hansson
	This patch optimises the passing of StaticInstPtr by avoiding copying the reference-counting pointer. This avoids first incrementing and then decrementing the reference-counting pointer.
2014-09-27	scons: Address issues related to gcc 4.9.1	Andreas Hansson
	Fix a number few minor issues to please gcc 4.9.1. Removing the '-fuse-linker-plugin' flag means no libraries are part of the LTO process, but hopefully this is an acceptable loss, as the flag causes issues on a lot of systems (only certain combinations of gcc, ld and ar work).
2014-09-27	dev: Output invalid access size in IsaFake panic	Curtis Dunham

2014-09-27	mem: Output precise range when XBar has conflicts	Curtis Dunham

2014-09-27	mem: Provide better diagnostic for unconnected port	Curtis Dunham
	When _masterPort is null, a message to that effect is more helpful than a segfault.
2014-09-27	misc: Fix a bunch of minor issues identified by static analysis	Andreas Hansson
	Add some missing initialisation, and fix a handful benign resource leaks (including some false positives).
2014-09-20	cpu: Remove unused deallocateContext calls	Mitch Hayenga
	The call paths for de-scheduling a thread are halt() and suspend(), from the thread context. There is no call to deallocateContext() in general, though some CPUs chose to define it. This patch removes the function from BaseCPU and the cores which do not require it.
2014-09-20	alpha,arm,mips,power,x86,cpu,sim: Cleanup activate/deactivate	Mitch Hayenga
	activate(), suspend(), and halt() used on thread contexts had an optional delay parameter. However this parameter was often ignored. Also, when used, the delay was seemily arbitrarily set to 0 or 1 cycle (no other delays were ever specified). This patch removes the delay parameter and 'Events' associated with them across all ISAs and cores. Unused activate logic is also removed.
2014-09-20	mem: Rename Bus to XBar to better reflect its behaviour	Andreas Hansson
	This patch changes the name of the Bus classes to XBar to better reflect the actual timing behaviour. The actual instances in the config scripts are not renamed, and remain as e.g. iobus or membus. As part of this renaming, the code has also been clean up slightly, making use of range-based for loops and tidying up some comments. The only changes outside the bus/crossbar code is due to the delay variables in the packet. --HG-- rename : src/mem/Bus.py => src/mem/XBar.py rename : src/mem/coherent_bus.cc => src/mem/coherent_xbar.cc rename : src/mem/coherent_bus.hh => src/mem/coherent_xbar.hh rename : src/mem/noncoherent_bus.cc => src/mem/noncoherent_xbar.cc rename : src/mem/noncoherent_bus.hh => src/mem/noncoherent_xbar.hh rename : src/mem/bus.cc => src/mem/xbar.cc rename : src/mem/bus.hh => src/mem/xbar.hh
2014-04-25	mem: Add access statistics for the snoop filter	Stephan Diestelhorst
	Adds a simple access counter for requests and snoops for the snoop filter and also classifies hits based on whether a single other holder existed or whether multiple shares held the line.
2014-09-20	mem: Tie in the snoop filter in the coherent bus	Stephan Diestelhorst

2014-04-24	mem: Add a simple snoop counter per bus	Stephan Diestelhorst
	This patch adds a simple counter for both total messages and a histogram for the fan-out of snoop messages. The fan-out describes to how many ports snoops had to be sent per incoming request / snoop-from-below. Without any cleverness, this usually means to either all, or all but the requesting port.
2014-04-24	misc: Add functions for doing popcount and power-of-two checking	Stephan Diestelhorst
	Adds two public domain algorithms for determining number of set bits and also whether a value is a power of two, uses the builtin that is available in GCC and clang for popcount.
2014-09-20	mem: Simple Snoop Filter	Stephan Diestelhorst
	This is a first cut at a simple snoop filter that tracks presence of lines in the caches "above" it. The snoop filter can be applied at any given cache hierarchy and will then handle the caches above it appropriately; there is no need to use this only in the last-level bus. This design currently has some limitations: missing stats, no notion of clean evictions (these will not update the underlying snoop filter, because they are not sent from the evicting cache down), no notion of capacity for the snoop filter and thus no need for invalidations caused by capacity pressure in the snoop filter. These are planned to be added on top with future change sets.
2014-08-12	energy: Tighter checking of levels for DFS systems	Stephan Diestelhorst
	There are cases where users might by accident / intention specify less voltage operating points thatn frequency points. We consider one of these cases special: giving only a single voltage to a voltage domain effectively renders it as a static domain. This patch adds additional logic in the auxiliary parts of the functionality to handle these cases properly (simple driver asking for N>1 operating levels, we should return the same voltage for all of them) and adds error checking code in the voltage domain.
2014-07-25	energy: Add the Energy Controller in the right configs	Stephan Diestelhorst
	Tie in the newly created energy controller components in the default configurations.
2014-09-20	energy: Memory-mapped Energy Controller component	Akash Bagdia
	This patch provides an Energy Controller device that provides software (driver) access to a DVFS handler. The device is currently residing in the dev/arm tree, but there is nothing inherently ARM specific in the behaviour. It is currently only tested and supported for ARM Linux, hence the location.
2014-06-16	energy: Small extentions and fixes for DVFS handler	Stephan Diestelhorst
	These additions allow easier interoperability with and querying from an additional controller which will be in a separate patch. Also adding warnings for changing the enabled state of the handler across checkpoint / resume and deviating from the state in the configuration. Contributed-by: Akash Bagdia <akash.bagdia@arm.com>
2014-09-20	mem: Add DDR4 bank group timing	Wendy Elsasser
	Added the following parameter to the DRAMCtrl class: - bank_groups_per_rank This defaults to 1. For the DDR4 case, the default is overridden to indicate bank group architecture, with multiple bank groups per rank. Added the following delays to the DRAMCtrl class: - tCCD_L : CAS-to-CAS, same bank group delay - tRRD_L : RAS-to-RAS, same bank group delay These parameters are only applied when bank group timing is enabled. Bank group timing is currently enabled only for DDR4 memories. For all other memories, these delays will default to '0 ns' In the DRAM controller model, applied the bank group timing to the per bank parameters actAllowedAt and colAllowedAt. The actAllowedAt will be updated based on bank group when an ACT is issued. The colAllowedAt will be updated based on bank group when a RD/WR burst is issued. At the moment no modifications are made to the scheduling.
2014-09-20	mem: Add memory rank-to-rank delay	Wendy Elsasser
	Add the following delay to the DRAM controller: - tCS : Different rank bus turnaround delay This will be applied for 1) read-to-read, 2) write-to-write, 3) write-to-read, and 4) read-to-write command sequences, where the new command accesses a different rank than the previous burst. The delay defaults to 2*tCK for each defined memory class. Note that this does not correspond to one particular timing constraint, but is a way of modelling all the associated constraints. The DRAM controller has some minor changes to prioritize commands to the same rank. This prioritization will only occur when the command stream is not switching from a read to write or vice versa (in the case of switching we have a gap in any case). To prioritize commands to the same rank, the model will determine if there are any commands queued (same type) to the same rank as the previous command. This check will ensure that the 'same rank' command will be able to execute without adding bubbles to the command flow, e.g. any ACT delay requirements can be done under the hoods, allowing the burst to issue seamlessly.
2014-09-20	cpu: Update DRAM traffic gen	Wendy Elsasser
	Add new DRAM_ROTATE mode to traffic generator. This mode will generate DRAM traffic that rotates across banks per rank, command types, and ranks per channel The looping order is illustrated below: for (ranks per channel) for (command types) for (banks per rank) // Generate DRAM Command Series This patch also adds the read percentage as an input argument to the DRAM sweep script. If the simulated read percentage is 0 or 100, the middle for loop does not generate additional commands. This loop is used only when the read percentage is set to 50, in which case the middle loop will toggle between read and write commands. Modified sweep.py script, which generates DRAM traffic. Added input arguments and support for new DRAM_ROTATE mode. The script now has input arguments for: 1) Read percentage 2) Number of ranks 3) Address mapping 4) Traffic generator mode (DRAM or DRAM_ROTATE) The default values are: 100% reads, 1 rank, RoRaBaCoCh address mapping, and DRAM traffic gen mode For the DRAM traffic mode, added multi-rank support.
2014-09-20	dev: Add support for 9p proxying over VirtIO	Andreas Sandberg
	This patch adds support for 9p filesystem proxying over VirtIO. It can currently operate by connecting to a 9p server over a socket (VirtIO9PSocket) or by starting the diod 9p server and connecting over pipe (VirtIO9PDiod). WARNING: Checkpoints are currently not supported for systems with 9p proxies!
2014-09-20	dev: Add a VirtIO block device model	Andreas Sandberg

2014-09-20	dev: Add a VirtIO console device model	Andreas Sandberg

2014-09-20	dev, pci: Implement basic VirtIO support	Andreas Sandberg
	This patch adds support for VirtIO over the PCI bus. It does so by providing the following new SimObjects: * VirtIODeviceBase - Abstract base class for VirtIO devices. * PciVirtIO - VirtIO PCI transport interface. A VirtIO device is hooked up to the guest system by adding a PciVirtIO device to the PCI bus and connecting it to a VirtIO device using the vio parameter. New VirtIO devices should inherit from VirtIODevice base and implementing one or more VirtQueues. The VirtQueues are usually device-specific and all derive from the VirtQueue class. Queues must be registered with the base class from the constructor since the device assumes that the number of queues stay constant.
2014-09-20	dev: Refactor terminal<->UART interface to make it more generic	Andreas Sandberg
	The terminal currently assumes that the transport to the guest always inherits from the Uart class. This assumption breaks when implementing, for example, a VirtIO consoles. This patch removes this assumption by adding pointer to the from the terminal to the uart and replacing it with a more general callback interface. The Uart, or any other class using the terminal, class implements an instance of the callbacks class and registers it with the terminal.
2014-09-20	base: Clean up redundant string functions and use C++11	Andreas Hansson
	This patch does a bit of housekeeping on the string helper functions and relies on the C++11 standard library where possible. It also does away with our custom string hash as an implementation is already part of the standard library.
2014-09-20	base: Add getSectionNames to IniFile	Andrew Bardsley
	Add an accessor to IniFile to list all the sections in the file.
2014-09-20	cpu: Add ExecFlags debug flag	Mitch Hayenga
	Adds a debug flag to print out the flags a instruction is tagged with.
2014-09-20	mem: Remove the GHB prefetcher from the source tree	Mitch Hayenga
	There are two primary issues with this code which make it deserving of deletion. 1) GHB is a way to structure a prefetcher, not a definitive type of prefetcher 2) This prefetcher isn't even structured like a GHB prefetcher. It's basically a worse version of the stride prefetcher. It primarily serves to confuse new gem5 users and most functionality is already present in the stride prefetcher.
2014-09-20	cpu: use probes infrastructure to do simpoint profiling	Dam Sunwoo
	Instead of having code embedded in cpu model to do simpoint profiling use the probes infrastructure to do it.
2014-09-20	config: Cleanup .json config file generation	Andrew Bardsley
	This patch 'completes' .json config files generation by adding in the SimObject references and String-valued parameters not currently printed. TickParamValues are also changed to print in the same tick-value format as in .ini files. This allows .json files to describe a system as fully as the .ini files currently do. This patch adds a new function config_value (which mirrors ini_str) to each ParamValue and to SimObject. This function can then be explicitly changed to give different .json and .ini printing behaviour rather than being written in terms of ini_str.
2014-09-19	arch: Pass faults by const reference where possible	Andreas Hansson
	This patch changes how faults are passed between methods in an attempt to copy as few reference-counting pointer instances as possible. This should avoid unecessary copies being created, contributing to the increment/decrement of the reference counters.
2014-09-19	cpu: Use a deque in o3 rename instruction queue	Andreas Hansson
	Switch from a list to a data structure with better data layout.
2014-09-19	base: Ensure the CP annotation compiles again	Andreas Hansson
	A bit of revamping to get the CP annotate functionality to compile.
2014-09-19	misc: Use safe_cast when assumptions are made about return value	Andreas Hansson
	This patch changes two dynamic_cast to safe_cast as we assume the return value is not NULL (without checking).
2014-09-19	misc: Restore ostream flags where needed	Andreas Hansson
	This patch ensures we adhere to the normal ostream usage rules, and restore the flags after modifying them.
2014-09-19	stats: Fix flow-control bug in Vector2D printing	Andreas Hansson

2014-09-19	misc: Remove assertions ensuring unsigned values >= 0	Andreas Hansson

2014-09-19	mem: Check return value of checkFunctional in SimpleMemory	Andreas Hansson
	Simple fix to ensure we only iterate until we are done.
2014-09-19	mem: Add checks to sendTimingReq in cache	Andreas Hansson
	A small fix to ensure the return value is not ignored.
2014-09-15	ruby: network: revert some of the changes from ad9c042dce54	Nilay Vaish
	The changeset ad9c042dce54 made changes to the structures under the network directory to use a map of buffers instead of vector of buffers. The reasoning was that not all vnets that are created are used and we needlessly allocate more buffers than required and then iterate over them while processing network messages. But the move to map resulted in a slow down which was pointed out by Andreas Hansson. This patch moves things back to using vector of message buffers.
2014-09-12	cpu: Fix memory access in Minor not setting parent Request flags	Andrew Bardsley
	This patch fixes cases where uncacheable/memory type flags are not set correctly on a memory op which is split in the LSQ. Without this patch, request->request if freely used to check flags where the flags should actually come from the accumulation of request fragment flags. This patch also fixes a bug where an uncacheable access which passes through tryToSendRequest more than once can increment LSQ::numAccessesInMemorySystem more than once.
2014-09-12	style: Fix line continuation, especially in debug messages	Andrew Bardsley
	This patch closes a number of space gaps in debug messages caused by the incorrect use of line continuation within strings. (There's also one consistency change to a similar, but correct, use of line continuation)
2014-09-12	minor: Fix typo in DPRINTF for Minor branch prediction	Andreas Hansson

2014-09-09	sim: Automatically unregister probe listeners	Andreas Sandberg
	The ProbeListener base class automatically registers itself with a probe manager. Currently, the class does not unregister a itself when it is destroyed, which makes removing probes listeners somewhat cumbersome. This patch adds an automatic call to manager->removeListener in the ProbeListener destructor, which solves the problem.