Add UVM Support to current GPGPU-Sim #219

yechen3 · 2021-04-04T00:42:20Z

This PR merges the functional and timing simulation support for Unified Virtual Memory (UVM) from DebashisGanguly's UVMSmart into the current version of GPGPU-Sim. To improve simulation performance, an option, "skip_cycles_enable", skips simulation of shader cores when all warps are waiting on either a barrier or a far memory transaction. It has been tested on the rodinia-2.0 benchmark suite, and the results show that adding UVM support has very little impact on simulation time.
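The skip condition described above can be sketched as follows. This is only an illustration under stated assumptions: `WarpState` and `can_skip_core` are hypothetical names, not identifiers from GPGPU-Sim or UVMSmart, and treating finished warps as skippable is an assumption of this sketch.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical warp states; the simulator's real warp bookkeeping is richer.
enum class WarpState { Ready, WaitingAtBarrier, WaitingOnFarMemory, Done };

// A core is a candidate for skipping only if no warp can make progress:
// every warp is blocked on a barrier or a far-memory transaction.
// Assumption: warps that have already finished do not prevent skipping.
bool can_skip_core(const std::vector<WarpState>& warps) {
  return std::all_of(warps.begin(), warps.end(), [](WarpState s) {
    return s == WarpState::WaitingAtBarrier ||
           s == WarpState::WaitingOnFarMemory || s == WarpState::Done;
  });
}
```

A single ready warp is enough to keep the core in the simulated set, which is why the check uses `std::all_of` rather than counting stalled warps.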

* migrate_cmake: add package dependency checking
* migrate_cmake: port setup_environment to CMake
* migrate_cmake: break dependency checking and env export gen to different .cmake files
* migrate_cmake: use CUDAToolkit_FOUND to test for CUDA compiler
* migrate_cmake: use CUDAToolkit_FOUND to test for CUDA compiler
* migrate_cmake: use CUDAToolkit_FOUND to test for CUDA compiler
* migrate_cmake: properly parse for cuda version number
* migrate_cmake: set highest CUDA supported to be 11.10.x
* migrate_cmake: specify top level CMake file
* migrate_cmake: add libcuda cmake file
* migrate_cmake: use global compiler options and definitions
* migrate_cmake: add cmake file to src
* migrate_cmake: add cmake files for cuda-sim folder
* migrate_cmake: add cmake files to gpgpu-sim folder
* migrate_cmake: add cmake files for intersim
* migrate_cmake: add short test using cmake
* migrate_cmake: bump CXX standard requirement to 17
* Add cmake files for accelwattch
* migrate_cmake: remove use of GLOB to grab source files
* migrate_cmake: comment out the write protection on generated instructions.h
* migrate_cmake: create sym folder and add newline to generated setup file
* migrate_cmake: fix some path issues
* migrate_cmake: let cmake thinks flex and bison generate CXX files
* migrate_cmake: fix not linking pthread properly
* migrate_cmake: remove debug message
* migrate_cmake: add empty libopencl cmake file
* migrate_cmake: install phase and runtime version detect
* Added install phase to install the shared object and add symlinks
* Changes with CUDA toolkit will be detected and triggered a rebuild
* GPGPU-Sim detailed version string will be updated on each build
* Typo fix and fix correct bin dir
* Replace gcc -> g++ in intersim
* ignore setup
* check CMAKE_BUILD_TYPE
* set DCMAKE_BUILD_TYPE

---------

Co-authored-by: JRPAN <[email protected]>

…-sim#72) LDGSTS/LDGDEPBAR was introduced in accel-sim#62, but its increment part was deleted by mistake, so this change adds it back. In some applications, no ldgsts appears between ldgdepbar instructions; in such cases, exception-handling logic inserts an empty vector.

Reported-by: Okkyun Woo <[email protected]>
Signed-off-by: Wonhyuk Yang <[email protected]>
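One way to picture the fix described above is a tracker that keeps one group of pending LDGSTS operations per LDGDEPBAR. The sketch below is a guess at the shape of the logic, not the code from the commit: `LdgstsTracker`, `on_ldgsts`, and `on_ldgdepbar` are hypothetical names, and the per-barrier grouping is an assumption.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical tracker: each LDGDEPBAR closes one group of pending LDGSTS ids.
struct LdgstsTracker {
  std::vector<std::vector<unsigned>> groups;  // one entry per barrier interval
  std::size_t current = 0;                    // index of the open group

  // Record an LDGSTS in the currently open group, creating it if needed.
  void on_ldgsts(unsigned id) {
    if (groups.size() <= current) groups.resize(current + 1);
    groups[current].push_back(id);
  }

  // Close the open group and move on to the next one.
  void on_ldgdepbar() {
    // No LDGSTS since the previous barrier: insert an empty group so that
    // group indices stay aligned with the number of barriers seen.
    if (groups.size() <= current) groups.emplace_back();
    ++current;  // the increment step that must not be dropped
  }
};
```

Without the empty-group insertion, a barrier that follows no LDGSTS would advance the index past the end of `groups`, and later lookups by barrier index would be out of range.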

* add automated clang formatter
* Automated clang-format
* use /bin/bash and add print
* use default checkout ref
* Format only after tests are success
* Run CI on merge group

---------

Co-authored-by: barnes88 <[email protected]>
Co-authored-by: JRPAN <[email protected]>

* run formatter only on PR
* remove unused & uninitialized variables
* fix signed & unsigned comparison warning
* enable merge queue
* resolve conflict
* in formatter, checkout the forked repo, not the base repo in PR
* Try to use jenkins for formatter
* Automated Format

---------

Co-authored-by: purdue-jenkins <[email protected]>

* Temp commit for Justin and Cassie to sync on code changes for adding per-stream status.
* Resolved compile errors.
* Removed redundant parameter
* Passed cuda_stream_id from accelsim to gpgpusim
* Cleaned up unused changes
* Changed vector to map, having operator problems.
* StreamID defaults to zero
* Implemented streams to inc_stats and so on
* Fixed TOTAL_ACCESS counts
* Implemented GLOBAL_TIMER.
* Fixed m_shader->get_kernel SEGFAULT issue in shader.cc.
* Use warp_init to track streamID instead of issue_warp
* Removed temp debug print
* Modified cache_stats to only print data from latest finished stream
  Added optional arg to cache_stats::print_stats, cache_stats::print_fail_stats and their upstream functions. When streamID is specified, print stats from that stream. When not specified, print all stats. NOTE: the current implementation depends on streamID never being equal to -1
* Removed default arg values of streamID
* modified constructor of mem_fetch to pass in streamID
* changed get_streamid to get_streamID
* Added TODO to gpgpusim_entrypoint.cc and power_stat.cc
* Only collect power stats when enabled
* print last finished stream in PTX mode using last_streamID
* take out additional printf
* Add a field to baseline cache to indicate cache level
* save gpu object in cache
* Print stream ID only once per kernel
* rm test print
* use -1 for default stream id
* cleanup debug prints
* remove GLOBAL_TIMER
* Automated clang-format
* Should be correct to print everything in power model
* addressing concerns & errors
* Automated clang-format
* add m_stats_pw in operator+
* Automated Format

---------

Co-authored-by: Justin Qiao <[email protected]>
Co-authored-by: Justin Qiao <[email protected]>
Co-authored-by: Tim Rogers <[email protected]>
Co-authored-by: JRPan <[email protected]>
Co-authored-by: purdue-jenkins <[email protected]>
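The per-stream bookkeeping mentioned in these commits (a map instead of a vector, printing either one stream or all of them) could look roughly like the sketch below. `StreamStats` and its members are hypothetical names chosen for illustration; they are not the `cache_stats` interface, and the output format is invented.

```cpp
#include <cstdint>
#include <map>
#include <sstream>
#include <string>

// Hypothetical per-stream access counter keyed by stream ID.
struct StreamStats {
  std::map<std::uint64_t, std::uint64_t> accesses;

  // Add n accesses to the given stream, creating its entry on first use.
  void inc(std::uint64_t stream_id, std::uint64_t n = 1) {
    accesses[stream_id] += n;
  }

  // Report one stream; an unknown stream is reported with a count of zero.
  std::string print(std::uint64_t stream_id) const {
    auto it = accesses.find(stream_id);
    std::uint64_t count = (it == accesses.end()) ? 0 : it->second;
    std::ostringstream out;
    out << "stream " << stream_id << ": " << count << "\n";
    return out.str();
  }

  // Report every stream, in ascending stream-ID order.
  std::string print_all() const {
    std::string out;
    for (const auto& entry : accesses) out += print(entry.first);
    return out;
  }
};
```

A map keeps the counters sparse, so stream IDs do not need to be small or contiguous, which a vector indexed by stream ID would require.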

* Add accommodations to run gpgpusim with SST simulation framework through balar
* Output setup_environment options when sourcing
* Add SST directive check when creating sim thread
* Add sst side test for jenkins
* sst-integration: update Jenkinsfile with official sst-elements repo and fix bugs in pipeline script
* sst-integration: direct jenkins to rebuild gpgpusim before testing for sst
* sst-integration: fix bugs in sst repos config
* sst-integration: let Jenkins rebuilds simulator
  Since the simulator needs to be configured with both normal mode and sst mode, need to rebuild make target to clean prior runs.
* sst-integration: Update Jenkinsfile to source env vars when running balar test
* sst-integration: refactor code to remove __SST__ flag
* sst-integration: fix a bug that init cluster twice for sst
* sst-integration: fix a bug of not sending mem packets to SST
* sst-integration: remove sst flags from makefiles and setup_env
* sst-integration: add comments to SST changes
* sst-integration: remove rebuilding simulator in jenkins when testing for SST
* sst-integration: revert simulator build script
* Add a function to support querying function argument info for SST
* sst-integration: add version detection for vanadis binary
* Automated Format
* add version detection support for gcc 10+
* sst-integration: add cudaMallocHost for SST
* sst-integration: fix a compilation bug
* sst-integration: add sst balar unittest CI
* sst-integration: specify GPU_ARCH for CI test
* sst-integration: use bash for github actions
* sst-integration: use https links for sst repos
* sst-integration: add SST dependencies to CI config
* sst-integration: remove sudo
* sst-integration: default to yes for apt install
* sst-integration: add manual trigger for github action
* sst-integration: remove wrong on event
* sst-integration: limit CPU usage for compilation
* sst-integration: fix wrong path
* sst-integration: use personal repo for testing
* sst-integration: remove sst-core source in CI to free space
* sst-integration: SST_Cycle use print stats with stream id
* Automated Format
* sst-integration: check for diskspace and try to clean it
* sst-integration: move out of docker image
* sst-integration: testing for ci path
* sst-integration: fix syntax
* sst-integration: pass env vars
* sst-integration: set env properly
* sst-integration: merge LLVM build and test into same job
* sst-integration: fix step order
* sst-integration: checkout correct branch for env-setup
* sst-integration: remove resourcing gpu apps
* sst-integration: revert back to docker github action
* sst-integration: enable debug trace for sst testing
* sst-integration: resourcing gpu app for env vars
* sst-integration: use GPUAPPS_ROOT for path for gpu app
* sst-integration: use GPUAPPS_ROOT for path for gpu app
* sst-integration: enable parallel ci tests and fix not returning with cudaMallocHostSST
* sst-integration: using debug flag for CI run
* sst-integration: revert debug ci run
* sst-integration: CI skips cuda sdk download and launch multiple jobs
* sst-integration: reenable parallel tests
* sst-integration: reduce concurrent test thread count
* sst-integration: skip long test for github runner
* sst-integration: try running CI with single core
* sst-integration: add callback to SST to check thread sync is done in SST_Cycle()
* sst-integration: ignore lookup if already found and add callbacks to SST
* Automated Format
* sst-integration: add support for indirect texture access
* Automated Format
* sst-integration: fix up for PR
* Automated Format

---------

Co-authored-by: purdue-jenkins <[email protected]>

Ni Kang and others added 20 commits October 27, 2021 14:24

Updated the manual

b1bb39e

Merge pull request accel-sim#25 from Connie120/dev

9218352

Updated the manual

rm hw_perf.csv from config folder

2a29ea9

Update Copyrights

011891a

Merge pull request accel-sim#27 from JRPan/accelwattach-dev

e466afb

Merge AccelWattach

set default max concurrent ctas to 32 and validate

f0ad71c

fix trace-driven concurrency segfault

43198e9

update max_concurrent kernel based on compute capability

8f71be8

update configs max concurrent kernel based on compute capability

8ba79fb

Fixed old bug that happens when there are different latencies to the same execution unit

da6a16a

Merge pull request accel-sim#28 from barnes88/concurrent

d38b300

Concurrent Kernel support for Accelsim and update configs

fix sub-core operand collector dispatch rr

92f313b

Update shader.h

ee9b626

This is a relatively critical bug comparing to other memory errors that deserves early merging.

Merge pull request accel-sim#35 from FJShen/patch-1

371340d

Update shader.h

Merge branch 'dev' into fix-cu-findready

92830cd

fix duplicate regfile accesses within same instruction

c9cc0a0

Fixed constant_cycle

33ad7c8

Merge pull request accel-sim#39 from barnes88/reg-duplicates

cfba96c

fix duplicate regfile accesses within same instruction

Merge pull request accel-sim#40 from notseefire/dev

1d39b6f

Fixed constant_cycle

Merge pull request accel-sim#32 from barnes88/fix-cu-findready

13c6711

fix sub-core operand collector dispatch rr

shiyuw3 mentioned this pull request May 11, 2022

UVM support #255

Open

nothingface0 and others added 9 commits December 5, 2022 00:54

Fixed regex for files generated by newer bison versions

b102f61

Ignore YY keywords, remove whitespace

0d8af96

Added regex for non-linux platforms

b22122a

Merge branch 'dev' of github.com:tgrogers/gpgpu-sim_distribution into dev

a17be08

Taking out a comment that is no longer relevant and truncating the commit hash to be more sane

f5d21b1

Merge pull request accel-sim#43 from tgrogers/dev

5d29789

Merging some small changes to the setupenv

truncate the commit hash

e3483ab

Merge branch 'dev' into dev

2b5c462

Getting rid of some busted old email templates

5cfcc39

JRPan and others added 29 commits January 22, 2024 13:13

Adding GitHub Action CI

3c95cd1

Merge pull request accel-sim#63 from accel-sim/FJShen-patch-1

9aeacdf

Update gpgpusim.config

Merge branch 'dev' into dev

a06ebf7

update CI scripts

b2f0ebe

uses actions/checkout@v4

2bbfb8b

Merge branch 'dev' into dev-github-ci

291fb11

fix dubious ownership

77aefac

remove fermi and add newer gen cards

1bdb39a

Merge pull request accel-sim#64 from accel-sim/dev-github-ci

67fc78c

Adding GitHub CI

rename ci tests

d935bd1

Merge pull request accel-sim#65 from accel-sim/dev-ci

b1ff53d

rename ci tests

Merge branch 'dev' into dev

0b93c15

Merge pull request accel-sim#52 from tgrogers/dev

6389301

Adding a non-zero return on error

CMAKE_BUILD_TYPE should be inside ${}

b70d930

Fix Build Type

570d75c

Merge pull request accel-sim#68 from JRPan/dev

7dc9977

CMAKE_BUILD_TYPE should be inside ${}

Added guard to check if L2 is writeback or not (accel-sim#73)

6aa7ed1

Reg bank patch (accel-sim#41)

55419d7

* remove implicit casting, cleanup unused bank_warp_shift parameter
* update cu init function prototype
* remove m_bank_warp_shift from function call

Add support for SHF ptx instruction (accel-sim#70)

081da0a

Change to calculate L2 BW if core freq and icnt freq are not the same (accel-sim#78)

980eb88

we have gcc-11 now. Check version for more than 2 digits. (accel-sim#79)

667834c

* we have gcc-11 now. Check version for more than 2 digits.
* version detection as well - And support c++ 11 by default
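The underlying problem is that a version check which reads a fixed number of characters breaks once the major version has two digits. The sketch below illustrates the idea of parsing whole numeric fields instead; it is not the project's actual detection logic (which this page does not show), and `Version` and `parse_version` are hypothetical names.

```cpp
#include <cstdio>
#include <string>

// Hypothetical holder for the first two fields of a version string.
struct Version {
  int major = 0;
  int minor = 0;
};

// Parse "MAJOR.MINOR[.PATCH]" by reading whole integers, so that "11.4.0"
// yields major 11 rather than a single leading digit.
// Returns false if the text does not start with two dot-separated integers.
bool parse_version(const std::string& text, Version& out) {
  return std::sscanf(text.c_str(), "%d.%d", &out.major, &out.minor) == 2;
}
```

Comparing the parsed integers, rather than comparing version strings lexically, also avoids ordering "11" before "9".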

fix_sst_callbacks: add weak definitions for sst callbacks (accel-sim#81)

3844f75

* fix_sst_callbacks: add weak definitions for sst callbacks
* Automated Format

---------

Co-authored-by: purdue-jenkins <[email protected]>

move get_current_occupancy outside conditional (accel-sim#83)

45caf76

yechen3 force-pushed the dev branch from 48936a7 to 45caf76 Compare January 23, 2025 02:40
