(partial) EasyBuild log for failed build of /cephyr/NOBACKUP/priv/c3-staff/eb-tmp/eb-lquq6o8j/files_pr21438/d/DeepSpeed/DeepSpeed-0.14.5-foss-2023a-CUDA-12.1.1.eb (PR(s) #21438) (easyblock PR(s) #3450)
6.49s call unit/checkpoint/test_other_optimizer.py::TestOtherOptimizerCheckpoint::test_checkpoint_fp32_optimizer
6.48s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-True-1-dtype1]
6.48s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-True-1-dtype2]
6.48s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_4to2[True-False-1-dtype2]
6.48s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_model_class[EltwiseMultiplicationTestNetwork_Dict]
6.48s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_arg_none[mask1]
6.48s call unit/checkpoint/test_zero_optimizer.py::TestZeROSaveLoadEdgeCase::test_save_before_accum_grad_is_done[3]
6.48s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_schedule[linear-33]
6.47s call unit/runtime/zero/test_zero.py::TestZero3DictFwd::test[list]
6.47s call unit/runtime/half_precision/test_fp16.py::TestZeroEmptyGrad::test[2]
6.47s call unit/runtime/half_precision/test_bf16.py::TestZeroDtypeCocktail::test[fp16-bf16]
6.47s call unit/checkpoint/test_zero_optimizer.py::TestSaveTensorClone::test_save_tensor_clone[False-2]
6.47s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-False-1-dtype2]
6.46s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_4to2[True-True-3-dtype1]
6.46s call unit/runtime/half_precision/test_bf16.py::TestZeroDtypeCocktail::test[bf16-fp16]
6.46s call unit/runtime/test_ds_config_dict.py::TestDeprecatedDeepScaleConfig::test
6.45s call unit/comm/test_dist.py::TestDistInitWithModel::test_already_init[True]
6.45s call unit/runtime/test_ds_config_dict.py::TestBatchConfig::test[2-32-18-1-False]
6.45s call unit/runtime/zero/test_zero.py::TestZeroFrozenWeights::test[1]
6.44s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_4to2[True-False-3-dtype0]
6.44s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_4to2[False-False-3-dtype1]
6.44s call unit/runtime/half_precision/test_fp16.py::TestZero2ReduceScatterOff::test
6.44s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-False-3-dtype0]
6.43s call unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_quantized_linear[8bits-0]
6.42s call unit/runtime/test_lr_schedulers.py::TestLrRange::test[1e-05-1e-05-1-False]
6.42s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_model_class[EltwiseMultiplicationTestNetwork_Tuple]
6.42s call unit/runtime/zero/test_zero.py::TestZeroPartitionCache::test_training_partition_cache[True]
6.41s call unit/runtime/half_precision/test_bf16.py::TestZeroAllowUntestedOptimizer::test
6.41s call unit/runtime/sparse_tensor/test_averaging_sparse_gradients.py::TestSparseAdam::test
6.41s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_4to2[False-True-1-dtype2]
6.40s call unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_quantized_linear[4bits-0]
6.40s call unit/checkpoint/test_sparse.py::TestSparseCheckpoint::test_non_strict_load_sparse[False-False-True-False]
6.40s call unit/runtime/zero/test_zero_nesting_init.py::TestShutdownInNestingInit::test_shutdown_in_nesting_init
6.40s call unit/runtime/test_ds_config_dict.py::TestDistInit::test
6.39s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_4to2[False-True-3-dtype1]
6.39s call unit/runtime/half_precision/test_bf16.py::TestZeroDtypeCocktail::test[default-bf16]
6.39s call unit/runtime/test_data_efficiency.py::TestLegacyCurriculumScheduler::test_fixed_discrete
6.39s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_reduce_scatter[False]
6.39s call unit/runtime/test_autocast.py::TestAutoCastEnable::test_autocast_linear[False-True]
6.38s call unit/runtime/test_runtime_utils.py::TestCheckOverflow::test[True]
6.38s call unit/runtime/half_precision/test_fp16.py::TestZeroStaticScale::test[False-2]
6.37s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningManyParams::test[False]
6.37s call unit/runtime/half_precision/test_dynamic_loss_scale.py::TestUnfused::test_some_overflow
6.37s call unit/checkpoint/test_sparse.py::TestSparseCheckpoint::test_non_strict_load_sparse[True-False-True-False]
6.37s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-False-3-dtype0]
6.37s call unit/comm/test_dist.py::TestDistInitNoEnv::test
6.36s call unit/runtime/half_precision/test_bf16.py::TestZeroDtypeCocktail::test[fp16-fp32]
6.36s call unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_quantized_linear[4bits-1]
6.36s call unit/runtime/test_data_efficiency.py::TestDataEfficiency::test_curriculum_learning
6.36s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_model_class[EltwiseMultiplicationTestNetwork_namedtuple]
6.35s call unit/runtime/half_precision/test_bf16.py::TestZero2ReduceScatterOff::test
6.35s call unit/runtime/half_precision/test_fp16.py::TestFP16AdamTypes::test[False-Adam]
6.35s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorOutputOrderingWithoutGrad::test_ckpt_non_tensor_output_ordering[non_tensor_output3]
6.35s call unit/runtime/zero/test_zero.py::TestZeroFrozenWeights::test[3]
6.34s call unit/runtime/zero/test_zero.py::TestZero3RepeatForwardLoop::test[False]
6.34s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-True-1-dtype0]
6.34s call unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-False-1-dtype1]
6.34s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[False-True-1-dtype1]
6.34s call unit/runtime/half_precision/test_fp16.py::TestZeroStaticScale::test[False-1]
6.33s call unit/runtime/zero/test_zero.py::TestZeroFrozenWeights::test[2]
6.33s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_schedule[log-15]
6.33s call unit/runtime/test_ds_initialize.py::TestClientOptimizer::test[Callable]
6.32s call unit/runtime/half_precision/test_bf16.py::TestZeroDtypeCocktail::test[fp16-fp16]
6.32s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_reduce_scatter[True]
6.32s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_contiguous_gradients[True]
6.32s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype2-False-False]
6.31s call unit/checkpoint/test_zero_optimizer.py::TestZeROSaveLoadEdgeCase::test_save_before_accum_grad_is_done[0]
6.31s call unit/checkpoint/test_sparse.py::TestSparseCheckpoint::test_non_strict_load_sparse[False-False-True-True]
6.31s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[False-False-3-dtype2]
6.31s call unit/runtime/zero/test_zero.py::TestZeroOffloadOptim::test[False]
6.31s call unit/runtime/half_precision/test_fp16.py::TestZeroEmptyGrad::test[3]
6.31s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_prefetching[True]
6.31s call unit/comm/test_dist.py::TestInit::test
6.30s call unit/runtime/half_precision/test_fp16.py::TestZeroSupportedClientOptimizer::test[FusedAdam-3]
6.30s call unit/runtime/test_runtime_utils.py::TestClipGradNorm::test_clipped_val
6.30s call unit/runtime/zero/test_zero_nesting_init.py::TestNestingInit::test_nesting_init
6.28s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_decay_schedule[log-10]
6.28s call unit/runtime/comm/test_coalesced_collectives.py::TestReduceScatterCoalescedTensorSmallerThanWorldSize::test
6.28s call unit/checkpoint/test_lr_scheduler.py::TestLRSchedulerCheckpoint::test_checkpoint_lr_scheduler[0-False]
6.27s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_inputs2_outputs3[mask0]
6.26s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorOutputOrderingWithoutGrad::test_ckpt_non_tensor_output_ordering[non_tensor_output2]
6.26s call unit/runtime/test_ds_config_dict.py::TestBasicConfig::test_accelerator
6.26s call unit/runtime/test_lr_schedulers.py::TestGetLrBeforeTrain::test[OneCycle-params2]
6.26s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_offload_optimizer[False]
6.26s call unit/checkpoint/test_zero_optimizer.py::TestZeROCheckpointFrozenWeights::test_save_exclude_custom_frozen_weights[1]
6.26s call unit/checkpoint/test_zero_optimizer.py::TestZeROSaveLoadEdgeCase::test_load_immediate_save[0]
6.26s call unit/runtime/test_pld.py::TestPLDModel::test_pld_model[0.9]
6.25s call unit/checkpoint/test_zero_optimizer.py::TestZeROCheckpointFrozenWeights::test_save_exclude_custom_frozen_weights[2]
6.25s call unit/runtime/test_ds_config_dict.py::TestBatchConfig::test[2-32-8-2-True]
6.25s call unit/runtime/half_precision/test_fp16.py::TestZero3LazyScatter::test
6.25s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype0-True-True]
6.24s call unit/profiling/flops_profiler/test_flops_profiler.py::TestFlopsProfiler::test
6.24s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_output[True]
6.24s call unit/linear/test_ctx.py::TestInitTransformers::test_config_init
6.23s call unit/runtime/test_ds_config_dict.py::TestBatchConfig::test[2-32-16-1-True]
6.23s call unit/runtime/half_precision/test_fp16.py::TestZeroStaticScale::test[False-3]
6.23s call unit/runtime/test_ds_initialize.py::TestClientLrScheduler::test[Optimizer-_LRScheduler]
6.22s call unit/comm/test_dist.py::TestDistributedFixture::test[2-16]
6.22s call unit/ops/adagrad/test_cpu_adagrad.py::TestCPUAdagradGPUError::test_cpu_adagrad_gpu_error
6.22s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_input[non_tensor4]
6.22s call unit/checkpoint/test_zero_optimizer.py::TestZeROCheckpoint::test_load_module_only[0]
6.22s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-False-3-dtype0]
6.22s call unit/utils/test_init_on_device.py::TestOnDevice::test_on_device[cuda:0]
6.21s call unit/checkpoint/test_zero_optimizer.py::TestZeROSaveLoadEdgeCase::test_immediate_save_load[3]
6.21s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-False-3-dtype1]
6.21s call unit/runtime/test_multi_output_model.py::TestThreeOutputModel::test
6.21s call unit/runtime/half_precision/test_fp16.py::TestAdamwFP16EmptyGrad::test
6.20s call unit/checkpoint/test_zero_optimizer.py::TestZeRONonDistributed::test_chmod_exception_handling[2]
6.20s call unit/comm/test_dist.py::TestDistInitWithModel::test_already_init[False]
6.20s call unit/checkpoint/test_zero_optimizer.py::TestZeROElasticCheckpoint::test_elastic_checkpoint_change_dp[True-False-False]
6.20s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_inputs1_outputs1[mask0]
6.20s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_zero_grad[False]
6.19s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_model_class[EltwiseMultiplicationTestNetwork_NamedTuple]
6.19s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_contiguous_gradients[False]
6.18s call unit/runtime/half_precision/test_fp16.py::TestZeroAllowUntestedOptimizer::test[False-3]
6.17s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-True-1-dtype1]
6.17s call unit/runtime/test_autocast.py::TestAutoCastEnable::test_autocast_linear[True-True]
6.17s call unit/runtime/half_precision/test_fp16.py::TestZeroSupportedClientOptimizer::test[Adam-1]
6.17s call unit/checkpoint/test_zero_optimizer.py::TestZeROElasticCheckpoint::test_elastic_checkpoint_change_dp[False-False-False]
6.17s call unit/runtime/test_lr_schedulers.py::TestGetLrBeforeTrain::test[LRRangeTest-params3]
6.16s call unit/checkpoint/test_sparse.py::TestSparseCheckpoint::test_non_strict_load_sparse[False-False-False-False]
6.16s call unit/runtime/test_ds_config_dict.py::TestBasicConfig::test_check_version
6.16s call unit/checkpoint/test_sparse.py::TestSparseCheckpoint::test_non_strict_load_sparse[True-False-False-False]
6.16s call unit/checkpoint/test_zero_optimizer.py::TestZeROElasticCheckpoint::test_elastic_checkpoint_change_dp[False-True-True]
6.15s call unit/runtime/zero/test_zero_context_return.py::TestReturnParam::test_stage_3_output_type[tensor]
6.15s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_input[non_tensor3]
6.15s call unit/runtime/test_pld.py::TestPLDModel::test_pld_model[0]
6.15s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-False-1-dtype0]
6.15s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_fp16_enabled[False]
6.15s call unit/checkpoint/test_zero_optimizer.py::TestZeROSaveLoadEdgeCase::test_immediate_save_load[2]
6.15s call unit/checkpoint/test_zero_optimizer.py::TestZeROElasticCheckpoint::test_elastic_checkpoint_change_dp[False-False-True]
6.15s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-True-1-dtype2]
6.14s call unit/runtime/zero/test_zero_context_return.py::TestReturnParam::test_stage_3_output_type[dict]
6.14s call unit/runtime/half_precision/test_fp16.py::TestZeroSupportedClientOptimizer::test[FusedAdam-1]
6.14s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype0-True-False]
6.14s call unit/runtime/half_precision/test_dynamic_loss_scale.py::TestFused::test_no_overflow
6.14s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_fp16_enabled[True]
6.13s call unit/runtime/half_precision/test_dynamic_loss_scale.py::TestFused::test_all_overflow
6.13s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_schedule[linear-15]
6.13s call unit/runtime/test_ds_initialize.py::TestClientOptimizer::test[Optimizer]
6.13s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_same_lrscheler_and_callable[_LRScheduler]
6.13s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_diff_lrscheler_and_callable[None]
6.13s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_inputs1_outputs1[mask1]
6.12s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-False-1-dtype1]
6.12s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[False-True-1-dtype0]
6.11s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype2-True-False]
6.11s call unit/runtime/zero/test_zero.py::TestZero3RepeatForwardLoop::test[True]
6.11s call unit/checkpoint/test_sparse.py::TestSparseCheckpoint::test_non_strict_load_sparse[True-False-True-True]
6.11s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_inputs2_outputs2[mask1]
6.10s call unit/checkpoint/test_zero_optimizer.py::TestZeROElasticCheckpoint::test_elastic_checkpoint_change_dp[True-False-True]
6.10s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_input[None]
6.10s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-False-3-dtype2]
6.10s call unit/runtime/test_pld.py::TestPLDModel::test_pld_model[0.1]
6.10s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-True-1-dtype2]
6.10s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_decay_schedule[linear-33]
6.10s call unit/checkpoint/test_shared_weights.py::TestCheckpointSharedWeights::test_checkpoint_shared_weights
6.10s call unit/checkpoint/test_zero_optimizer.py::TestZeROSaveLoadEdgeCase::test_immediate_save_load[0]
6.10s call unit/runtime/zero/test_zero.py::TestZeroAdamOptimizerStepCount::test[3]
6.10s call unit/runtime/half_precision/test_fp16.py::TestZeroEmptyPartition::test[False-3]
6.09s call unit/runtime/test_autocast.py::TestAutoCastDisable::test_disable_autocast_linear[True]
6.09s call unit/runtime/test_lr_schedulers.py::TestOneCycle::test_lr[0.001-0.1-0-21-21]
6.09s call unit/comm/test_dist.py::TestDistInitWithModel::test_no_init[True]
6.09s call unit/runtime/test_ds_config_dict.py::TestBatchConfig::test[2-33-17-2-False]
6.09s call unit/runtime/comm/test_coalesced_collectives.py::TestReduceScatterCoalesced::test_single_input
6.08s call unit/runtime/test_ds_initialize.py::TestClientLrScheduler::test[Callable-None]
6.08s call unit/runtime/test_ds_initialize.py::TestNoOptim::test[0]
6.08s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[False-True-1-dtype2]
6.07s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_output[None]
6.07s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_output[2]
6.07s call unit/runtime/test_ds_config_dict.py::TestInitNoOptimizer::test
6.06s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-False-1-dtype2]
6.06s call unit/checkpoint/test_zero_optimizer.py::TestZeRONonDistributed::test_chmod_exception_handling[1]
6.06s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensorOutputOrdering::test_ckpt_non_tensor_output_ordering[non_tensor_output1]
6.06s call unit/monitor/test_monitor.py::TestCSVMonitor::test_csv_monitor
6.06s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-True-1-dtype0]
6.06s call unit/runtime/test_pld.py::TestPLDModel::test_pld_model[1.0]
6.06s call unit/runtime/half_precision/test_bf16.py::TestZeroSupportedClientOptimizer::test[Adam]
6.05s call unit/runtime/test_ds_config_dict.py::TestArgs::test_no_args
6.05s call unit/runtime/test_runtime_utils.py::TestClipGradNorm::test_gather
6.05s call unit/runtime/test_lr_schedulers.py::TestSchedulerOptimizerParity::test[WarmupLR-params0]
6.04s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_output[2]
6.04s call unit/runtime/test_lr_schedulers.py::TestOneCycle::test_lr[1e-05-0.1-0-10-0]
6.04s call unit/runtime/test_autocast.py::TestAutoCastDisable::test_disable_autocast_linear[False]
6.04s call unit/checkpoint/test_tag_validation.py::TestCheckpointValidationTag::test_checkpoint_unique_tag[IGNORE]
6.04s call unit/runtime/half_precision/test_fp16.py::TestFP16AdamTypes::test[True-AdamW]
6.04s call unit/runtime/half_precision/test_dynamic_loss_scale.py::TestUnfused::test_no_overflow
6.04s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_decay_schedule[linear-10]
6.04s call unit/runtime/test_lr_schedulers.py::TestLrRange::test[0.01-0.01-19-True]
6.03s call unit/runtime/zero/test_zero.py::TestZero3InitForParentWeightInitialization::test
6.03s call unit/runtime/comm/test_coalesced_collectives.py::TestAllToAllQuantReduceFallback::test_1d_tensor
6.03s call unit/runtime/test_lr_schedulers.py::TestOneCycle::test_lr[1e-05-0.01-0.001-10-100]
6.03s call unit/runtime/half_precision/test_fp16.py::TestAdamFP16ZeroOneCycleCompatibility::test[False-2]
6.03s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_inputs2_outputs1[mask0]
6.03s call unit/checkpoint/test_tag_validation.py::TestCheckpointValidationTag::test_checkpoint_unique_tag[WARN]
6.02s call unit/compression/test_compression.py::TestCompression::test_linear_layer_compress
6.02s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-True-3-dtype0]
6.02s call unit/runtime/zero/test_zero.py::TestZeroPartitionCache::test_training_partition_cache[False]
6.02s call unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_half_int8_quantization
6.01s call unit/runtime/zero/test_zero.py::TestZeroAdamOptimizerStepCount::test[2]
6.01s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-False-3-dtype2]
6.01s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[False-False-3-dtype1]
6.01s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-True-3-dtype1]
6.01s call unit/runtime/zero/test_zero_context.py::TestSerialContext::test_ext_param_getattr
6.01s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_output[non_tensor4]
6.01s call unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_float_int4_quantization
6.01s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_inputs2_outputs3[mask0]
6.00s call unit/runtime/zero/test_zero.py::TestZero3DictFwd::test[tuple]
6.00s call unit/checkpoint/test_zero_optimizer.py::TestZeROElasticCheckpoint::test_elastic_checkpoint_change_dp[True-True-False]
6.00s call unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_float_int8_quantization
6.00s call unit/monitor/test_monitor.py::TestCSVMonitor::test_empty_csv_monitor
6.00s call unit/runtime/test_lr_schedulers.py::TestLrRange::test[0.01-0.01-19-False]
6.00s call unit/runtime/test_lr_schedulers.py::TestLrRange::test[0.0001-1e-05-1-True]
5.99s call unit/runtime/test_multi_output_model.py::TestTwoOutputModel::test
5.99s call unit/checkpoint/test_zero_optimizer.py::TestSaveTensorClone::test_save_tensor_clone[True-1]
5.99s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-False-1-dtype1]
5.99s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_output[non_tensor4]
5.98s call unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_half_int4_quantization
5.98s call unit/compression/test_compression.py::TestCompression::test_conv1d_convertion
5.98s call unit/runtime/test_ds_config_dict.py::TestArgs::test_none_args
5.98s call unit/runtime/test_lr_schedulers.py::TestSchedulerOptimizerParity::test[WarmupDecayLR-params1]
5.97s call unit/runtime/half_precision/test_fp16.py::TestZeroAllowUntestedOptimizer::test[False-2]
5.97s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorOutputOrderingWithoutGrad::test_ckpt_non_tensor_output_ordering[non_tensor_output1]
5.97s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-True-1-dtype0]
5.97s call unit/runtime/half_precision/test_fp16.py::TestZeroAllowUntestedOptimizer::test[True-2]
5.96s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_schedule[log-33]
5.96s call unit/runtime/half_precision/test_bf16.py::TestZeroSupportedClientOptimizer::test[FusedAdam]
5.96s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-True-1-dtype1]
5.96s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-False-3-dtype1]
5.96s call unit/comm/test_dist.py::TestDistArgs::test[hello-icosahedron-1138-purple]
5.96s call unit/runtime/test_pld.py::TestNonPLDModel::test_non_pld_model
5.96s call unit/comm/test_dist.py::TestDistInit::test_already_init[True]
5.95s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_inputs2_outputs1[mask1]
5.95s call unit/runtime/test_autocast.py::TestAutoCastEnable::test_autocast_linear[True-False]
5.95s call unit/runtime/test_lr_schedulers.py::TestGetLrBeforeTrain::test[WarmupDecayLR-params1]
5.95s call unit/runtime/test_data.py::TestDataLoaderDropLast::test[1-False]
5.95s call unit/runtime/comm/test_coalesced_collectives.py::TestAllToAllQuantReduceFallback::test_non_divisible
5.95s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_input[2]
5.95s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_inputs2_outputs2[mask0]
5.95s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_inputs2_outputs2[mask1]
5.94s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_decay_schedule[linear-15]
5.94s call unit/runtime/test_ds_config_dict.py::TestConfigLoad::test_json
5.94s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_diff_lrscheler_and_callable_onecyclelr_steplr[None]
5.94s call unit/runtime/test_lr_schedulers.py::TestWarmupCosineLR::test_lr[100-10-0.1-0.2]
5.94s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-True-3-dtype0]
5.93s call unit/runtime/zero/test_zero_context_ancestry.py::TestDSInitWZinit::test
5.93s call unit/runtime/half_precision/test_fp16.py::TestAdamFP16ZeroOneCycleCompatibility::test[False-3]
5.93s call unit/runtime/test_lr_schedulers.py::TestOneCycle::test_mom[0.08-0.09-0.001-101]
5.93s call unit/runtime/test_runtime_utils.py::TestCheckOverflow::test[False]
5.93s call unit/runtime/half_precision/test_fp16.py::TestZeroSupportedClientOptimizer::test[Adam-3]
5.93s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-True-1-dtype2]
5.92s call unit/runtime/half_precision/test_fp16.py::TestZeroSupportedClientOptimizer::test[Adam-2]
5.92s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype2-True-True]
5.92s call unit/runtime/test_lr_schedulers.py::TestLrRange::test[0.0001-0.001-10-True]
5.92s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_arg_none[mask0]
5.92s call unit/checkpoint/test_zero_optimizer.py::TestSaveTensorClone::test_save_tensor_clone[False-1]
5.92s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorOutputOrderingWithGrad::test_ckpt_non_tensor_output_ordering[non_tensor_output1]
5.92s call unit/checkpoint/test_sparse.py::TestSparseCheckpoint::test_non_strict_load_sparse[True-True-True-True]
5.91s call unit/comm/test_dist.py::TestDistInit::test_already_init[False]
5.91s call unit/runtime/zero/test_zero.py::TestZero3DictFwd::test[dict]
5.91s call unit/runtime/test_lr_schedulers.py::TestGetLrBeforeTrain::test[WarmupLR-params0]
5.91s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-False-1-dtype0]
5.91s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_schedule[log-19]
5.91s call unit/elasticity/test_elastic.py::TestElasticConfigChanged::test
5.91s call unit/runtime/half_precision/test_fp16.py::TestFP16AdamTypes::test[True-Adam]
5.90s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-True-3-dtype2]
5.90s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_inputs2_outputs3[mask1]
5.90s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_inputs2_outputs3[mask1]
5.90s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_input[non_tensor3]
5.90s call unit/runtime/test_autocast.py::TestAutoCastDisable::test_missing_amp_autocast[False]
5.90s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_inputs2_outputs2[mask0]
5.90s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype2-False-True]
5.90s call unit/runtime/zero/test_zero.py::TestIncorectAllgatherBucketSize::test[1000]
5.89s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_decay_schedule[linear-19]
5.89s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_output[None]
5.89s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-True-3-dtype1]
5.88s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_output[True]
5.88s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_inputs1_outputs1[mask1]
5.88s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-False-3-dtype1]
5.88s call unit/checkpoint/test_sparse.py::TestSparseCheckpoint::test_non_strict_load_sparse[True-True-True-False]
5.88s call unit/runtime/test_autocast.py::TestAutoCastDisable::test_missing_amp_autocast[True]
5.87s call unit/comm/test_dist.py::TestDistInit::test_no_init[True]
5.87s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-True-3-dtype2]
5.87s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_input[2]
5.87s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype1-False-True]
5.86s call unit/runtime/test_data.py::TestDataLoaderDropLast::test[4-False]
5.86s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-False-3-dtype0]
5.86s call unit/runtime/half_precision/test_fp16.py::TestFP16AdamTypes::test[False-AdamW]
5.86s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_input[True]
5.86s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_input[non_tensor3]
5.86s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithGrad::test_ckpt_inputs1_outputs1[mask0]
5.86s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_output[non_tensor3]
5.86s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorOutputOrderingWithGrad::test_ckpt_non_tensor_output_ordering[non_tensor_output2]
5.86s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_inputs2_outputs1[mask0]
5.85s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorOutputOrderingWithGrad::test_ckpt_non_tensor_output_ordering[non_tensor_output3]
5.85s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_output[non_tensor3]
5.85s call unit/checkpoint/test_tag_validation.py::TestCheckpointValidationTag::test_checkpoint_unique_tag[FAIL]
5.85s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorOutputOrderingWithGrad::test_ckpt_non_tensor_output_ordering[None]
5.85s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_schedule[log-10]
5.85s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[False-False-3-dtype0]
5.85s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_same_lrscheler_and_callable[None]
5.85s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_inputs2_outputs2[mask0]
5.84s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[False-False-1-dtype2]
5.84s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[False-False-1-dtype2]
5.84s call unit/runtime/zero/test_zero_context_return.py::TestReturnParam::test_stage_3_output_type[None]
5.83s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_inputs1_outputs1[mask1]
5.83s call unit/runtime/test_lr_schedulers.py::TestLrRange::test[0.001-0.001-10-False]
5.83s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-False-3-dtype0]
5.83s call unit/checkpoint/test_latest_checkpoint.py::TestLatestCheckpoint::test_existing_latest
5.82s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to2[True-False-1-dtype2]
5.82s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_diff_lrscheler_and_callable_onecyclelr_steplr[_LRScheduler]
5.82s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_inputs2_outputs3[mask1]
5.82s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_inputs2_outputs1[mask1]
5.82s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_diff_lrscheler_and_callable_onecyclelr_steplr[Callable]
5.82s call unit/runtime/test_lr_schedulers.py::TestOneCycle::test_mom[0.08-0.09-0-210]
5.82s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_diff_lrscheler_and_callable[_LRScheduler]
5.81s call unit/runtime/test_lr_schedulers.py::TestLrSchedule::test_lr_warmup_schedule[linear-10]
5.81s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_input[non_tensor4]
5.81s call unit/runtime/compile/test_compile_zero.py::TestZeRO::test_compile_zero[nvme-2-dtype0]
5.80s call unit/runtime/half_precision/test_fp16.py::TestZeroAllowUntestedOptimizer::test[True-3]
5.80s call unit/elasticity/test_elastic.py::TestNonElasticBatchParams::test
5.80s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_inputs1_outputs1[mask0]
5.80s call unit/runtime/test_data.py::TestDataLoaderDropLast::test[1-True]
5.80s call unit/runtime/test_ds_initialize.py::TestClientLrScheduler::test[Callable-_LRScheduler]
5.80s call unit/runtime/test_ds_config_dict.py::TestConfigLoad::test_dict
5.79s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_output[non_tensor3]
5.79s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensorOutputOrdering::test_ckpt_non_tensor_output_ordering[None]
5.79s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_output[True]
5.78s call unit/runtime/half_precision/test_dynamic_loss_scale.py::TestFused::test_some_overflow
5.78s call unit/runtime/half_precision/test_fp16.py::TestZeroAllowUntestedOptimizer::test[False-1]
5.78s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_input[non_tensor4]
5.78s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype0-False-True]
5.78s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_arg_none[mask0]
5.78s call unit/runtime/test_data.py::TestDataLoaderDropLast::test[4-True]
5.77s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype1-True-False]
5.77s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensorOutputOrdering::test_ckpt_non_tensor_output_ordering[non_tensor_output3]
5.77s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_input[None]
5.77s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_input[True]
5.76s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_inputs2_outputs3[mask0]
5.76s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_arg_none[mask1]
5.76s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_inputs2_outputs1[mask1]
5.75s call unit/runtime/test_ds_initialize.py::TestClientLrScheduler::test[Optimizer-None]
5.75s call unit/runtime/half_precision/test_fp16.py::TestZeroAllowUntestedOptimizer::test[True-1]
5.74s call unit/runtime/zero/test_zero.py::TestZero3ParamPartitioningBase::test_offload_optimizer[True]
5.74s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensor::test_ckpt_non_tensor_output[2]
5.74s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithGrad::test_ckpt_non_tensor_input[True]
5.74s call unit/runtime/test_ds_initialize.py::TestClientLrScheduler::test[Optimizer-Callable]
5.73s call unit/runtime/zero/test_zero_context.py::TestSerialContext::test_throughput_calculation
5.73s call unit/runtime/half_precision/test_dynamic_loss_scale.py::TestUnfused::test_all_overflow
5.73s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype1-True-True]
5.73s call unit/ops/adam/test_adamw.py::TestAdamConfigs::test[AdamW-False-False-False-resulting_optimizer0]
5.73s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_same_lrscheler_and_callable[Callable]
5.72s call unit/checkpoint/test_latest_checkpoint.py::TestLatestCheckpoint::test_missing_latest
5.71s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype1-False-False]
5.71s call unit/runtime/test_ds_initialize.py::TestClientLrScheduler::test[Callable-Callable]
5.71s setup unit/checkpoint/test_universal_checkpoint.py::TestZeROUniversalCheckpointDP::test_dp_world_size_2to4[True-False-3-dtype2]
5.70s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestCheckpointNonTensorOutputOrdering::test_ckpt_non_tensor_output_ordering[non_tensor_output2]
5.70s call unit/comm/test_dist.py::TestGroupedDistTest::test_two[1138]
5.69s call unit/runtime/test_ds_config_dict.py::TestNoModel::test
5.69s call unit/comm/test_dist.py::TestDistributedFixture::test[4-32]
5.69s call unit/comm/test_dist.py::TestGroupedDistTest::test_one[1138]
5.69s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_input[2]
5.69s call unit/profiling/flops_profiler/test_flops_profiler.py::TestFlopsProfiler::test_flops_profiler_in_inference
5.68s call unit/runtime/test_ds_initialize.py::TestClientLrScheduler::test[None-Callable]
5.67s call unit/runtime/test_ds_initialize.py::TestClientOptimizer::test[None]
5.67s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_input[None]
5.66s call unit/runtime/test_ds_initialize.py::TestClientLrSchedulerInit::test_diff_lrscheler_and_callable[Callable]
5.66s call unit/runtime/zero/test_zero.py::TestZeroOffloadOptim::test[True]
5.66s call unit/runtime/zero/test_zero.py::TestIncorectAllgatherBucketSize::test[1001]
5.66s call unit/runtime/zero/test_zero_context.py::TestGatherUpdate::test
5.65s call unit/comm/test_dist.py::TestWorldSizeOverrideDistTest::test_world_size_2
5.65s call unit/checkpoint/test_zero_optimizer.py::TestZeRONonDistributed::test_chmod_exception_handling[3]
5.65s call unit/checkpoint/test_zero_optimizer.py::TestSaveTensorClone::test_save_tensor_clone[True-2]
5.64s call unit/runtime/test_lr_schedulers.py::TestOneCycle::test_mom[0.08-0.09-0-211]
5.64s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestActivationCheckpointWithoutGrad::test_ckpt_arg_none[mask1]
5.64s call unit/runtime/half_precision/test_fp16.py::TestZeroEmptyPartition::test[True-3]
5.64s call unit/runtime/zero/test_zero_context.py::TestScatterGather::test
5.63s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorWithoutGrad::test_ckpt_non_tensor_output[non_tensor4]
5.63s call unit/runtime/zero/test_zero.py::TestEmptyParameterGroup::test_empty_param_groups[dtype0-False-False]
5.62s call unit/comm/test_dist.py::TestDistributedFixture::test[2-32]
5.61s call unit/comm/test_dist.py::TestDistributedFixture::test[4-16]
5.61s call unit/runtime/compile/test_compile_zero.py::TestZeRO::test_compile_zero[nvme-2-dtype2]
5.61s call unit/runtime/compile/test_compile_zero.py::TestZeRO::test_compile_zero[nvme-2-dtype1]
5.60s call unit/runtime/activation_checkpointing/test_activation_checkpointing.py::TestActivationCheckpoint::test_ckpt_inputs2_outputs2[mask1]
5.59s call unit/comm/test_dist.py::TestWorldSizeOverrideDistTest::test_world_size_1
5.54s call unit/runtime/activation_checkpointing/test_activation_checkpointing_non_reentrant.py::TestCheckpointNonTensorOutputOrderingWithoutGrad::test_ckpt_non_tensor_output_ordering[None]
5.52s call unit/runtime/zero/test_zero_dynamic_class.py::TestNewClassDeclaredNestingInit::test_new_class_declared_nesting_init
5.52s call unit/runtime/compile/test_compile_zero.py::TestZeRO::test_compile_zero[nvme-1-dtype1]
5.51s call unit/runtime/compile/test_compile_zero.py::TestZeRO::test_compile_zero[nvme-1-dtype0]
5.51s call unit/runtime/compile/test_compile_zero.py::TestZeRO::test_compile_zero[nvme-1-dtype2]
5.50s call unit/runtime/zero/test_zero_context.py::TestMiCSGatheredParametersFree::test
5.48s call unit/runtime/zero/test_zero_context.py::TestSerialContext::test_subclass_param
5.47s call unit/runtime/zero/test_zero_dynamic_class.py::TestNewClassDeclaredInsideNestingInit::test_new_class_declared_inside_nesting_init
5.44s setup unit/comm/test_dist.py::TestDistributedFixture::test[4-32]
5.37s setup unit/comm/test_dist.py::TestDistributedFixture::test[2-16]
5.37s call unit/runtime/zero/test_zero_context.py::TestSerialContext::test_scattered_init_dist
5.28s call unit/runtime/zero/test_zero_context_ancestry.py::TestSerialParamInit::test_subclass_param_init
5.23s setup unit/comm/test_dist.py::TestDistributedFixture::test[2-32]
5.21s call unit/runtime/zero/test_zero_context.py::TestZeroGatheredParametersFree::test
5.16s setup unit/comm/test_dist.py::TestDistributedFixture::test[4-16]
5.12s call unit/runtime/zero/test_zero_context.py::TestSerialContext::test_scatter_halftype
4.53s call unit/comm/test_dist.py::TestDistInitWithModel::test_no_init[False]
4.34s call unit/comm/test_dist.py::TestDistInit::test_no_init[False]
2.13s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-3-full-dtype0]
2.12s call unit/ops/adam/test_adamw.py::TestAdamConfigs::test[AdamW-True-False-True-resulting_optimizer6]
1.99s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[nvme-3-full-dtype1]
1.94s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[nvme-3-full-dtype2]
1.92s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[nvme-3-local-dtype2]
1.92s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-3-full-dtype1]
1.91s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[cpu-3-full-False]
1.90s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[nvme-3-full-dtype0]
1.88s call unit/ops/adam/test_hybrid_adam.py::TestHybridAdam::test_hybrid_adam_equal[8-fp16]
1.88s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[nvme-3-local-False]
1.88s call unit/ops/adam/test_adamw.py::TestAdamConfigs::test[AdamW-True-False-False-resulting_optimizer2]
1.88s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[nvme-3-full-False]
1.86s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-3-local-dtype2]
1.85s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-3-local-dtype1]
1.83s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-3-full-dtype2]
1.83s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[cpu-3-local-True]
1.82s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[nvme-3-full-True]
1.82s call unit/ops/adam/test_adamw.py::TestAdamConfigs::test[AdamW-False-False-True-resulting_optimizer4]
1.82s call unit/ops/lion/test_cpu_lion.py::TestCPULion::test_fused_lion_equal[64-fp16]
1.82s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-3-local-dtype0]
1.82s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[cpu-3-local-False]
1.81s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[nvme-3-local-True]
1.80s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[cpu-3-full-True]
1.79s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[nvme-3-local-dtype1]
1.79s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[none-3-full-dtype1]
1.78s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[nvme-3-local-dtype0]
1.78s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-1-full-dtype2]
1.76s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[None-fp32-zero3]
1.71s call unit/ops/adam/test_cpu_adam.py::TestCPUAdam::test_fused_adam_equal[64-fp16]
1.64s call unit/ops/lion/test_lion.py::TestLionConfigs::test[Lion-True-DeepSpeedCPULion]
1.64s call unit/ops/adam/test_adamw.py::TestAdamConfigs::test[Adam-True-False-True-resulting_optimizer14]
1.60s call unit/ops/adam/test_adamw.py::TestAdamConfigs::test[Adam-True-False-False-resulting_optimizer10]
1.59s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[None-bf16-zero3]
1.55s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[None-bf16-zero2]
1.54s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[none-3-full-dtype0]
1.49s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-2-full-dtype2]
1.48s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-2-full-dtype1]
1.47s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-2-full-dtype0]
1.47s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-1-full-dtype0]
1.46s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[cpu-1-full-dtype1]
1.45s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[none-3-local-dtype0]
1.40s call unit/runtime/test_mup_optimizers.py::TestMuPOptimizers::test[False-MuAdam-Adam]
1.39s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[fp32-fp16-zero3]
1.37s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[fp16-bf16-zero3]
1.36s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[cpu-1-full-False]
1.36s call unit/ops/aio/test_aio.py::TestWrite::test_parallel_write[True-True-True]
1.36s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[cpu-2-full-False]
1.36s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[cpu-2-full-True]
1.35s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[cpu-1-full-True]
1.35s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[none-3-local-dtype1]
1.35s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[none-3-local-dtype2]
1.32s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[none-3-full-dtype2]
1.32s call unit/ops/adam/test_adamw.py::TestAdamConfigs::test[Adam-False-False-False-resulting_optimizer8]
1.32s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[bf16-bf16-zero3]
1.32s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[fp32-bf16-zero3]
1.31s call unit/ops/aio/test_aio.py::TestRead::test_async_read[True-False-True-False]
1.28s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[bf16-fp16-zero3]
1.28s call unit/launcher/test_run.py::test_parser_multinode
1.26s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_bf16_fragments[False]
1.23s call unit/ops/aio/test_aio.py::TestRead::test_async_read[False-True-False-True]
1.22s call unit/runtime/zero/test_zero_leaf_module.py::TestSetZ3LeafModule::test_choose_module_by_rank
1.21s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[bf16-fp32-zero3]
1.20s call unit/ops/aio/test_aio.py::TestRead::test_parallel_read[True-True-True]
1.19s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[bf16-bf16-zero1]
1.19s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[none-2-full-dtype0]
1.18s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[fp16-fp32-zero3]
1.18s call unit/ops/adam/test_adamw.py::TestAdamConfigs::test[AdamW-True-True-False-resulting_optimizer3]
1.18s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[none-3-full-False]
1.17s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[fp32-fp32-zero3]
1.16s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[none-3-full-True]
1.15s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[fp16-bf16-zero1]
1.15s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[none-3-local-False]
1.14s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentGet::test_zero_fragments[none-3-local-True]
1.10s call unit/runtime/zero/test_zero_leaf_module.py::TestSetZ3LeafModule::test_no_grad_input_error
1.08s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[fp16-fp16-zero3]
1.07s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[None-fp16-zero3]
1.04s call unit/ops/aio/test_aio.py::TestWrite::test_async_write[False-False-True-True]
1.02s call unit/runtime/zero/test_zero_tensor_fragment.py::TestTensorFragmentUpdate::test_zero_fragments[none-2-full-dtype1]
1.02s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[fp32-bf16-zero2]
1.02s call unit/runtime/test_ds_initialize.py::TestOptimizerImplementation::test[None-fp16-zero1]
1.01s call unit/ops/adam/test_hybrid_adam.py::TestHybridAdam::test_hybrid_adam_equal[8-bf16]
1.00s call unit/ops/aio/test_aio.py::TestRead::test_parallel_read[False-False-True]
1.00s call unit/runtime/test_mup_optimizers.py::TestMuPOptimizers::test[False-MuSGD-SGD]
(2601 durations < 1s hidden. Use -vv to show these durations.)
===================================================================================== short test summary info =====================================================================================
FAILED tests/unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_zero3_int4_post_init_quant_nvme_offload - AssertionError
FAILED tests/unit/inference/quantization/test_intX_quantization.py::TestQuantizedInt::test_zero3_int4_quantized_initialization_nvme_offload - AssertionError
===================================================== 2 failed, 1052 passed, 129 skipped, 4426 deselected, 172 warnings in 6045.42s (1:40:45) =====================================================
(at easybuild/tools/run.py:695 in parse_cmd_output)
== 2024-11-01 18:14:00,357 build_log.py:267 INFO ... (took 1 hour 48 mins 15 secs)
== 2024-11-01 18:14:00,357 build_log.py:267 INFO ... (took 1 hour 48 mins 35 secs)
== 2024-11-01 18:14:00,357 filetools.py:2025 INFO Removing lock /apps/Test/software/.locks/_apps_Test_software_DeepSpeed_0.14.5-foss-2023a-CUDA-12.1.1.lock...
== 2024-11-01 18:14:00,364 filetools.py:385 INFO Path /apps/Test/software/.locks/_apps_Test_software_DeepSpeed_0.14.5-foss-2023a-CUDA-12.1.1.lock successfully removed.
== 2024-11-01 18:14:00,364 filetools.py:2029 INFO Lock removed: /apps/Test/software/.locks/_apps_Test_software_DeepSpeed_0.14.5-foss-2023a-CUDA-12.1.1.lock
== 2024-11-01 18:14:00,364 easyblock.py:4297 WARNING build failed (first 300 chars): cmd "export PATH=/cephyr/NOBACKUP/priv/c3-staff/eb-tmp/eb-ehjcjfxy/tmpo3g409wd/bin:$PATH PYTHONPATH=/cephyr/NOBACKUP/priv/c3-staff/eb-tmp/eb-ehjcjfxy/tmpo3g409wd/lib/python3.11/site-packages:$PYTHONPATH && ln -s $PWD/tests/ ../tests && cd ../ && pytest tests/unit/ -k "not TestTensorBoard and not Te
== 2024-11-01 18:14:00,365 easyblock.py:326 INFO Closing log for application name DeepSpeed version 0.14.5