This is a virtualization framework for the CUDA API. Developers can easily customize how a CUDA API call is executed, e.g., execute it as an RPC on a remote machine.
Consists of 4 parts:
- server
- client
- network
- codegen
- How to add customization to a CUDA API: implement_new_apis.md
- Environment setup: please run a docker container mounting the xpuremoting directory and enter it. (On meepo3 or meepo4, you can use the image xpu_remoting:latest.) An example command is:
export container_name=xxx
docker run -dit --shm-size 8G --name $container_name --gpus all --privileged --network host -v path/to/xpuremoting:/workspace xpu_remoting:latest
- Note that we need the Rust nightly toolchain to build the project; it is not installed in the docker image.
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- --default-toolchain nightly
. "$HOME/.cargo/env"
# Optional: install cargo-expand
# See `codegen/README.md` for usage and alternatives
cargo +nightly install cargo-expand
- Version checklist
  - Rust: nightly
  - CMake: at least 3.22.1 (required to run some of the tests)
  - Clang: at least 6.0.0-1ubuntu2
- Rust:
cd /path/to/xpuremoting && cargo build
You should specify the default feature in both the client's and the server's Cargo.toml to decide which communication methods will be compiled. For example, if you want to use the RDMA communication method, add "rdma" to the default feature in client/Cargo.toml and server/Cargo.toml, as sketched below.
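The exact feature set is defined by each crate, but a minimal sketch of what this might look like (assuming the crates declare an rdma feature, as described above) is:

```toml
# Sketch of client/Cargo.toml and server/Cargo.toml -- feature names other
# than "rdma" are assumptions; check the [features] section actually defined there.
[features]
# SHM is always compiled; adding "rdma" to default also builds the RDMA backend.
default = ["rdma"]
rdma = []
```

After editing the default feature, rebuild with cargo build so the selected transport is compiled into both binaries.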
You can use a config.toml file to configure the communication type, buffer size, RDMA server listener socket, and so on. Since cargo uses the current working directory as the running root folder, you should use an absolute path for the config file. The default path is /workspace/config.toml. If you want a specific path, you can use the NETWORK_CONFIG environment variable to customize it. For example: NETWORK_CONFIG=/workspace/config.toml cargo run server.
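Below is a hypothetical sketch of such a config file; the key names and values are illustrative assumptions, since the actual schema is defined by the network crate's config parser:

```toml
# Hypothetical config.toml sketch -- key names and values are illustrative,
# not the authoritative schema.
comm_type = "rdma"                 # communication type, e.g. "shm" or "rdma"
buf_size = 4194304                 # message buffer size in bytes
rdma_listen = "192.168.1.10:7000"  # RDMA server listener socket
```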
The network emulator is a component used to simulate system performance under different network conditions. When enabled, it calculates the network latency and makes the message receiver busy-wait, thus simulating the network overhead caused by RTT and bandwidth. Real-world tests have shown that the emulator's behavior deviates from actual conditions by no more than 5%. You can configure the rtt and bandwidth settings in config.toml.
This feature is only available when using shared memory. To enable the emulator, simply set emulator to true in config.toml.
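For example, the emulator-related settings in config.toml might look like the following sketch (the units and values are illustrative assumptions; verify them against the config parser):

```toml
# Network emulator settings (effective only with shared-memory communication).
emulator = true     # enable busy-wait based network emulation
rtt = 200           # simulated round-trip time, e.g. in microseconds (assumed unit)
bandwidth = 10000   # simulated bandwidth, e.g. in Mbps (assumed unit)
```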
cargo test
Launch two terminals, one for the server and the other for the client.
- server side:
cargo run [--features rdma] server
You should use --features to decide which communication methods will be compiled. SHM is always compiled.
- client side:
cd tests/cuda_api
mkdir -p build && cd build
cmake .. && make
cd ..
find -type f -executable -name "test_*" -exec ./startclient_debug.sh {} \;
P.S. You can use the RUST_LOG environment variable to control the log level (default: debug), e.g., RUST_LOG=info.
Please refer to the application guide to run applications.
Please refer to the link, and use the Dockerfile in the root directory.