You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I just simply try to run the code for GIANT-XRT training process for ogbn-arxiv, but it seems the code freezes without allocating any GPUs for training.
How to Reproduce?
(If you developed your own code, please provide a short script that reproduces the error. For existing examples, please provide link.)
Steps to reproduce
(Please provide minimal example of code snippet that reproduces the error. For existing examples, please provide link.)
I also observed this issue. It seems because of OOO, but it doesn't raise any alert or error. I figured It can be solved by using GPU with larger memory.
Description
I just simply try to run the code for GIANT-XRT training process for ogbn-arxiv, but it seems the code freezes without allocating any GPUs for training.
How to Reproduce?
(If you developed your own code, please provide a short script that reproduces the error. For existing examples, please provide link.)
Steps to reproduce
(Please provide minimal example of code snippet that reproduces the error. For existing examples, please provide link.)
(Paste the commands you ran that produced the error.)
1.data_dir=./proc_data_xrt/ogbn-arxiv
bash xrt_train.sh ${data_dir}
2.
What have you tried to solve it?
Error message or code output
The code stuck here. And no GPUs are used.
Environment
(Add as much information about your environment as possible, e.g. dependencies versions.)
The text was updated successfully, but these errors were encountered: