#include <vector>
#include <chrono>
#include <iostream>

// Returns the current time in seconds since the clock's epoch.
// Only the difference between two calls is meaningful as an elapsed time.
double get_time() {
    auto now = std::chrono::high_resolution_clock::now();
    return std::chrono::duration<double>(now.time_since_epoch()).count();
}

# AOT ID: ['0_inference']
from ctypes import c_void_p, c_long, c_int
import torch
import math
import random
import os
import tempfile
from math import inf, nan
from torch._inductor.hooks import run_intermediate_hooks
from torch._inductor.utils import maybe_profile

With tqdm took 0.698805 seconds
         2062651 function calls (2061588 primitive calls) in 0.752 seconds

   Ordered by: internal time

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
  1000002    0.350    0.000    0.355    0.000 loading_utils.py:41(update)
  1000001    0.242    0.000    0.597    0.000 loading_utils.py:32(__iter__)
        1    0.102    0.102    0.699    0.699 loading_utils.py:185(with_tqdm_loop)
       58    0.005    0.000    0.005    0.000 {built-in method marshal.loads}

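A report like the one above, sorted by internal time, can be produced with the standard-library `cProfile` and `pstats` modules. The following is a minimal sketch, not the original `loading_utils.py`: `ProgressCounter`, `with_progress_loop`, and `profile_by_internal_time` are hypothetical names standing in for a tqdm-like wrapper whose `update` and `__iter__` are called once per item.

```python
import cProfile
import io
import pstats


class ProgressCounter:
    """Hypothetical stand-in for a tqdm-like wrapper that counts yielded items."""

    def __init__(self, iterable):
        self.iterable = iterable
        self.count = 0

    def update(self, n=1):
        # Called once per item, so it dominates the call count in the profile.
        self.count += n

    def __iter__(self):
        for item in self.iterable:
            self.update()
            yield item


def with_progress_loop(n):
    """Sum the integers below n while iterating through the wrapper."""
    total = 0
    for i in ProgressCounter(range(n)):
        total += i
    return total


def profile_by_internal_time(func, *args, top=5):
    """Run func under cProfile and return (result, report text sorted by tottime)."""
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args)
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    # "tottime" is the key that pstats labels "Ordered by: internal time".
    stats.sort_stats("tottime").print_stats(top)
    return result, buffer.getvalue()


result, report = profile_by_internal_time(with_progress_loop, 100_000)
print(report)
```

Sorting by `tottime` excludes time spent in callees, which is why a per-item method such as `update` can rank above the loop function that drives it.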
Graph break in user code at /workspace/Wan2GP/wan/modules/sage2_core.py:66
Reason: Unsupported: torch.* op returned non-Tensor int call_function <function device_count at 0x795533c18400>
User code traceback:
  File "/workspace/Wan2GP/wan/modules/model.py", line 452, in forward
    y = self.self_attn( xlist, seq_lens, grid_sizes,freqs)
  File "/usr/local/lib/python3.11/dist-packages/torch/nn/modules/module.py", line 1750, in _call_impl
    return forward_call(*args, **kwargs)
  File "/usr/local/lib/python3.11/dist-packages/mmgp/offload.py", line 1926, in check_empty_cuda_cache
    return previous_method(*args, **kwargs)
  File "/workspace/Wan2GP/wan/modules/model.py", line 266, in forward