gem5 - gem5

Age	Commit message (Collapse)	Author
2015-06-01	kvm, arm, dev: Add an in-kernel GIC implementation	Andreas Sandberg
	This changeset adds a GIC implementation that uses the kernel's built-in support for simulating the interrupt controller. Since there is currently no support for state transfer between gem5 and the kernel, the device model does not support serialization and CPU switching (which would require switching to a gem5-simulated GIC).
2015-06-01	kvm: Handle inst events at the current instruction count	Andreas Sandberg
	There are cases (particularly when attaching GDB) when instruction events are scheduled at the current instruction tick. This used to trigger an assertion error in kvm. This changeset adds a check for this condition and forces KVM to do a quick entry that completes any pending IO operations, but does not execute any new instructions, before servicing the event. We could check if we need to enter KVM at all, but forcing a quick entry is makes the code slightly cleaner and does not hurt correctness (performance is hardly an issue in these cases).
2015-06-01	kvm, arm: Move ARM-specific files to arch/arm/kvm/	Andreas Sandberg
	This changeset moves the ARM-specific KVM CPU implementation to arch/arm/kvm/. This change is expected to keep the source tree somewhat cleaner as we start adding support for ARMv8 and KVM in-kernel interrupt controller simulation. --HG-- rename : src/cpu/kvm/ArmKvmCPU.py => src/arch/arm/kvm/ArmKvmCPU.py rename : src/cpu/kvm/arm_cpu.cc => src/arch/arm/kvm/arm_cpu.cc rename : src/cpu/kvm/arm_cpu.hh => src/arch/arm/kvm/arm_cpu.hh
2015-05-26	arm: implement the CONTEXTIDR_EL2 system reg.	Curtis Dunham

2015-05-26	arm, stats: Update stats to reflect reduction in misc reg reads	Andreas Hansson

2015-05-26	arm: Make address translation faster with better caching	Nathanael Premillieu
	This patch adds better caching of the sys regs for AArch64, thus avoiding unnecessary calls to tc->readMiscReg(MISCREG_CPSR) in the non-faulting case.
2015-05-26	base: Allow multiple interleaved ranges	Andreas Hansson
	This patch changes how the address range calculates intersection such that a system can have a number of non-overlapping interleaved ranges without complaining. Without this patch we end up with a panic.
2015-05-26	stats: Update MinorCPU regressions after accounting fix	Andreas Hansson

2015-05-26	cpu: Fix a bug in counting issued instructions in MinorCPU	Andrew Bardsley
	The MinorCPU would count bubbles in Execute::issue as part of the num_insts_issued and so sometimes reach the instruction issue limit incorrectly. Fixed by checking for a bubble in one new place.
2015-05-26	arm: Implement some missing syscalls (SE mode)	Giacomo Gabrielli
	Adding a few syscalls that were previously considered unimplemented.
2015-05-26	ruby: Deprecation warning for RubyMemoryControl	Andreas Hansson
	A step towards removing RubyMemoryControl and shift users to DRAMCtrl. The latter is faster, more representative, very versatile, and is integrated with power models.
2015-05-23	arm, stats: Update stats to reflect changes to generic timer	Andreas Sandberg
	The addition of a virtual timer affects stats in minor and o3.
2015-05-23	arm, dev: Add support for a memory mapped generic timer	Andreas Sandberg
	There are cases when we don't want to use a system register mapped generic timer, but can't use the SP804. For example, when using KVM on aarch64, we want to intercept accesses to the generic timer, but can't do so if it is using the system register interface. In such cases, we need to use a memory-mapped generic timer. This changeset adds a device model that implements the memory mapped generic timer interface. The current implementation only supports a single frame (i.e., one virtual timer and one physical timer).
2015-05-23	arm: Get rid of pointless have_generic_timer param	Andreas Sandberg
	The ArmSystem class has a parameter to indicate whether it is configured to use the generic timer extension or not. This parameter doesn't affect any feature flags in the current implementation and is therefore completely unnecessary. In fact, we usually don't set it even if a system has a generic timer. If we ever need to check if there is a generic timer present, we should just request a pointer and check if it is non-null instead.
2015-05-23	dev, arm: Add virtual timers to the generic timer model	Andreas Sandberg
	The generic timer model currently does not support virtual counters. Virtual and physical counters both tick with the same frequency. However, virtual timers allow a hypervisor to set an offset that is subtracted from the counter when it is read. This enables the hypervisor to present a time base that ticks with virtual time in the VM (i.e., doesn't tick when the VM isn't running). Modern Linux kernels generally assume that virtual counters exist and try to use them by default.
2015-05-23	dev, arm: Refactor and clean up the generic timer model	Andreas Sandberg
	This changeset cleans up the generic timer a bit and moves most of the register juggling from the ISA code into a separate class in the same source file as the rest of the generic timer. It also removes the assumption that there is always 8 or fewer CPUs in the system. Instead of having a fixed limit, we now instantiate per-core timers as they are requested. This is all in preparation for other patches that add support for virtual timers and a memory mapped interface.
2015-05-23	kvm: Fix dumping code for large registers	Andreas Sandberg
	The register dumping code in kvm tries to print the bytes in large registers (128 bits and larger) instead of printing them as hex. This changeset fixes that.
2015-05-23	kvm, x86: Guard x86-specific APIs in KvmVM	Andreas Sandberg
	Protect x86-specific APIs in KvmVM with compile-time guards to avoid breaking ARM builds.
2015-05-23	build: Don't test for KVM xsave support on ARM	Andreas Sandberg
	The current build tests for KVM unconditionally check for xsave support. This obviously never works on ARM since xsave is x86-specific. This changeset refactors the build tests probing for KVM support and moves the xsave test to an x86-specific section of is_isa_kvm_compatible().
2015-05-23	arm: Workaround incorrect HDLCD register order in kernel	Andreas Sandberg
	Some versions of the kernel incorrectly swap the red and blue color select registers. This changeset adds a workaround for that by swapping them when instantiating a PixelConverter.
2015-05-23	base: Redesign internal frame buffer handling	Andreas Sandberg
	Currently, frame buffer handling in gem5 is quite ad hoc. In practice, we pass around naked pointers to raw pixel data and expect consumers to convert frame buffers using the (broken) VideoConverter. This changeset completely redesigns the way we handle frame buffers internally. In summary, it fixes several color conversion bugs, adds support for more color formats (e.g., big endian), and makes the code base easier to follow. In the new world, gem5 always represents pixel data using the Pixel struct when pixels need to be passed between different classes (e.g., a display controller and the VNC server). Producers of entire frames (e.g., display controllers) should use the FrameBuffer class to represent a frame. Frame producers are expected to create one instance of the FrameBuffer class in their constructors and register it with its consumers once. Consumers are expected to check the dimensions of the frame buffer when they consume it. Conversion between the external representation and the internal representation is supported for all common "true color" RGB formats of up to 32-bit color depth. The external pixel representation is expected to be between 1 and 4 bytes in either big endian or little endian. Color channels are assumed to be contiguous ranges of bits within each pixel word. The external pixel value is scaled to an 8-bit internal representation using a floating multiplication to map it to the entire 8-bit range.
2015-05-23	base: Clean up bitmap generation code	Andreas Sandberg
	The bitmap generation code is hard to follow and incorrectly uses the size of an enum member to calculate the size of a pixel. This changeset cleans up the code and adds some documentation.
2015-05-19	ruby: Fix RubySystem warm-up and cool-down scope	Joel Hestness
	The processes of warming up and cooling down Ruby caches are simulation-wide processes, not just RubySystem instance-specific processes. Thus, the warm-up and cool-down variables should be globally visible to any Ruby components participating in either process. Make these variables static members and track the warm-up and cool-down processes as appropriate. This patch also has two side benefits: 1) It removes references to the RubySystem g_system_ptr, which are problematic for allowing multiple RubySystem instances in a single simulation. Warmup and cooldown variables being static (global) reduces the need for instance-specific dereferences through the RubySystem. 2) From the AbstractController, it removes local RubySystem pointers, which are used inconsistently with other uses of the RubySystem: 11 other uses reference the RubySystem with the g_system_ptr. Only sequencers have local pointers.
2015-05-15	arm: Identify table-walker requests	Andreas Hansson
	This patch ensures all page-table walks are flagged as such.
2015-05-15	misc: Appease gcc 5.1	Andreas Hansson
	Three minor issues are resolved: 1. Apparently gcc 5.1 does not like negation of booleans followed by bitwise AND. 2. Somehow the compiler also gets confused and warns about NoopMachInst being unused (removing it causes compilation errors though). Most likely a compiler bug. 3. There seems to be a number of instances where loop unrolling causes false positives for the array-bounds check. For now, switch to std::array. Potentially we could disable the warning for newer gcc versions, but switching to std::array is probably a good move in any case.
2015-05-15	sim: Don't clear the active CPU vector in System::initState	Andreas Sandberg
	The system class currently clears the vector of active CPUs in initState(). CPUs are added to the list by registerThreadContext() which is called from BaseCPU::init(). This obviously breaks when the System object is initialized after the CPUs. This changeset removes the offending clear() call since the list will be empty after it has been instantiated anyway.
2015-05-15	config: Use null memory for DRAM sweep script	Andreas Hansson
	Do not waste time when we do not care about the data.
2015-05-15	config: Add new MemConfig options to DRAM sweep script	Wendy Elsasser
	Update script to match current MemConfig options with external_memory_system option set to 0.
2015-05-05	syscall_emul: fix warn_once behavior	Steve Reinhardt
	The current ignoreWarnOnceFunc doesn't really work as expected, since it will only generate one warning total, for whichever "warn-once" syscall is invoked first. This patch fixes that behavior by keeping a "warned" flag in the SyscallDesc object, allowing suitably flagged syscalls to warn exactly once per syscall.
2015-05-05	stats, arm: Update stats for missing FPEXC.EN check	Andreas Hansson
	Only one regression is affected.
2015-05-05	arm: Add missing FPEXC.EN check	Andreas Hansson
	Add a missing check to ensure that exceptions are generated properly.
2015-05-05	arm: enable DCZVA by default in SE mode	Giacomo Gabrielli

2015-05-05	stats: Update stats to reflect cache changes	Andreas Hansson

2015-03-17	mem: Create a request copy for deferred snoops	Stephan Diestelhorst
	Sometimes, we need to defer an express snoop in an MSHR, but the original request might complete and deallocate the original pkt->req. In those cases, create a copy of the request so that someone who is inspecting the delayed snoop can also inspect the request still. All of this is rather hacky, but the allocation / linking and general life-time management of Packet and Request is rather tricky. Deleting the copy is another tricky area, testing so far has shown that the right copy is deleted at the right time.
2015-05-05	arm: Relax ordering for some uncacheable accesses	Andreas Sandberg
	We currently assume that all uncacheable memory accesses are strictly ordered. Instead of always enforcing strict ordering, we now only enforce it if the required memory type is device memory or strongly ordered memory.
2015-05-05	mem, cpu: Add a separate flag for strictly ordered memory	Andreas Sandberg
	The Request::UNCACHEABLE flag currently has two different functions. The first, and obvious, function is to prevent the memory system from caching data in the request. The second function is to prevent reordering and speculation in CPU models. This changeset gives the order/speculation requirement a separate flag (Request::STRICT_ORDER). This flag prevents CPU models from doing the following optimizations: * Speculation: CPU models are not allowed to issue speculative loads. * Write combining: CPU models and caches are not allowed to merge writes to the same cache line. Note: The memory system may still reorder accesses unless the UNCACHEABLE flag is set. It is therefore expected that the STRICT_ORDER flag is combined with the UNCACHEABLE flag to prevent this behavior.
2015-05-05	mem, alpha: Move Alpha-specific request flags	Andreas Sandberg
	Move Alpha-specific memory request flags to an architecture-specific header and map them to the architecture specific flag bit range.
2015-05-05	arm: Remove unnecessary boot uncachability	Andreas Hansson
	With the recent patches addressing how we deal with uncacheable accesses there is no longer need for the work arounds put in place to enforce certain sections of memory to be uncacheable during boot.
2015-05-05	mem: Snoop into caches on uncacheable accesses	Andreas Hansson
	This patch takes a last step in fixing issues related to uncacheable accesses. We do not separate uncacheable memory from uncacheable devices, and in cases where it is really memory, there are valid scenarios where we need to snoop since we do not support cache maintenance instructions (yet). On snooping an uncacheable access we thus provide data if possible. In essence this makes uncacheable accesses IO coherent. The snoop filter is also queried to steer the snoops, but not updated since the uncacheable accesses do not allocate a block.
2015-05-05	arch, cpu: Do not forward snoops to table walker	Andreas Hansson
	This patch simplifies the overall CPU by changing the TLB caches such that they do not forward snoops to the table walker port(s). Note that only ARM and X86 are affected. There is no reason for the ports to snoop as they do not actually take any action, and from a performance point of view we are better of not snooping more than we have to. Should it at a later point be required to snoop for a particular TLB design it is easy enough to add it back.
2015-05-05	mem: Pass shared downstream through caches	Andreas Hansson
	This patch ensures that we pass on information about a packet being shared (rather than exclusive), when forwarding a packet downstream. Without this patch there is a risk that a downstream cache considers the line exclusive when it really isn't.
2015-05-05	mem: Add forward snoop check for HardPFReqs	Ali Jafri
	We should always check whether the cache is supposed to be forwarding snoops before generating snoops.
2015-05-05	mem: Add missing stats update for uncacheable MSHRs	Andreas Hansson
	This patch adds a missing counter update for the uncacheable accesses. By updating this counter we also get a meaningful average latency for uncacheable accesses (previously inf).
2015-05-05	mem: Tidy up BaseCache parameters	Andreas Hansson
	This patch simply tidies up the BaseCache parameters and removes the unused "two_queue" parameter.
2015-05-05	mem: Remove templates in cache model	David Guillen
	This patch changes the cache implementation to rely on virtual methods rather than using the replacement policy as a template argument. There is no impact on the simulation performance, and overall the changes make it easier to modify (and subclass) the cache and/or replacement policy.
2015-05-05	cpu: Work around gcc 4.9 issues with Num_OpClasses	Andreas Hansson
	This patch fixes a recent issue with gcc 4.9 (and possibly more) being convinced that indices outside the array bounds are used when initialising the FUPool members.
2015-05-05	stats: Bring regression stats in line with actual behaviour	Andreas Hansson

2015-04-30	stats: arm: updates	Nilay Vaish

2015-04-29	stats: x86: updates due to change in div latency	Nilay Vaish

2015-04-29	arch, base, dev, kern, sym: FreeBSD support	Ruslan Bukin
	This adds support for FreeBSD/aarch64 FS and SE mode (basic set of syscalls only) Committed by: Nilay Vaish <nilay@cs.wisc.edu>