Running criu dump on a simple ROCm application (HelloWorld.cpp) on Ubuntu 22.04 (kernel 6.5.0-44-generic) causes a kernel crash. The problem occurs with CRIU built from both the master and criu-dev branches.
HelloWorld.cpp:
#include <hip/hip_runtime.h>
#include <string.h>
#include <stdio.h>
#include <stdlib.h>
#include <iostream>
#include <string>
#include <fstream>
#include <unistd.h>

#define SAMPLE_VERSION "HIP-Examples-Application-v1.0"
#define SUCCESS 0
#define FAILURE 1

using namespace std;

__global__ void helloworld(char* in, char* out)
{
    int num = hipThreadIdx_x + hipBlockDim_x * hipBlockIdx_x;
    out[num] = in[num] + 1;
}

int main(int argc, char* argv[])
{
    hipDeviceProp_t devProp;
    hipGetDeviceProperties(&devProp, 0);
    cout << " System minor " << devProp.minor << endl;
    cout << " System major " << devProp.major << endl;
    cout << " agent prop name " << devProp.name << endl;

    while (1) {
        /* Initial input, output for the host and create memory objects for the kernel */
        const char* input = "GdkknVnqkc";
        size_t strlength = strlen(input);
        cout << "input string:" << endl;
        cout << input << endl;
        char* output = (char*) malloc(strlength + 1);

        char* inputBuffer;
        char* outputBuffer;
        hipMalloc((void**)&inputBuffer, (strlength + 1) * sizeof(char));
        hipMalloc((void**)&outputBuffer, (strlength + 1) * sizeof(char));

        hipMemcpy(inputBuffer, input, (strlength + 1) * sizeof(char), hipMemcpyHostToDevice);

        hipLaunchKernelGGL(
            helloworld,
            dim3(1),
            dim3(strlength),
            0, 0,
            inputBuffer, outputBuffer
        );

        hipMemcpy(output, outputBuffer, (strlength + 1) * sizeof(char), hipMemcpyDeviceToHost);

        hipFree(inputBuffer);
        hipFree(outputBuffer);

        // Add the terminal character to the end of output.
        output[strlength] = '\0';

        cout << "\noutput string:" << endl;
        cout << output << endl;

        free(output);
        sleep(1);
    }

    std::cout << "Passed!\n";
    return SUCCESS;
}
A similar problem occurs on RHEL 9.4 (5.14.0-427.26.1.el9_4.x86_64) when criu dump exits with an error (e.g., when the --shell-job option is not specified). This causes an immediate system reboot.
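Output collected for the Ubuntu 22.04 crash (contents not reproduced here): rocminfo, journalctl --system, dmesg.
A similar problem also occurs with kernel version 6.8.0-38-generic (dmesg output likewise not reproduced here).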