Created
September 4, 2025 06:38
| ((.venv12) ) ➜ 2024q2-sdxl-mlperf-sprint git:(mi355_llama_working_harness_v1) ✗ ./LLAMA_inference/run_docker_8b_mi355.sh | |
| always | |
| root@smci355-ccs-aus-n10-09:/mlperf/harness# ./run_offline.sh --shortfin-config shortfin_8b_config_fp8.json | |
| Warning: Missing argument '--test-mode' | |
| Info: Defaulting to test mode 'PerformanceOnly' | |
| Warning: Missing argument '--test-scenario' | |
| Info: Defaulting to test scenario 'Offline' | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:39.128962 | |
| INFO:root:#################################################################################################################################################################################### | |
| Running python3.11 harness_alt_mi355.py --devices 0,1,2,3,4,5,6,7 --scenario Offline --test_mode PerformanceOnly --prefill_bs 4 --decode_bs 4 --user_conf_path user.conf --count 50 --tensor_path /data/mlperf_llama3.1_405b_dataset_8313_processed_fp16_eval.pkl --logfile_outdir OutputOfflinePerformanceOnly --debug False --verbose False --user_conf_path user.conf --shortfin_config shortfin_8b_config_fp8.json | |
| ############################################################################################################################################################################################## | |
| WARNING:root:Override count with 50 | |
| INFO:Llama-405B-Dataset:Loading dataset... | |
| INFO:Llama-405B-Dataset:Finished loading dataset. | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:41.771636 | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:41.772333 | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:41.774435 | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:41.776947 | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:41.777720 | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:41.786088 | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:41.790101 | |
| INFO:shortfin_apps.llm.components.service_debug_dumper:[debug_service.py] Please find debug dumps for service.py in /root/.shortfin/debug/llm_service_invocation_dumps/2025-09-04T06:34:41.818436 | |
| INFO:root:NUMA hardware info: {'numa_node_distance': [[10, 32], [32, 10]], 'node_cpu_info': {0: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191], 1: [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255]}} | |
| INFO:root:GPU: 0 | |
| INFO:root:Nearest nodes = [1] | |
| INFO:root:NUMA hardware info: {'numa_node_distance': [[10, 32], [32, 10]], 'node_cpu_info': {0: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191], 1: [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255]}} | |
| INFO:root:GPU: 0 | |
| INFO:root:Nearest nodes = [0] | |
| INFO:root:NUMA hardware info: {'numa_node_distance': [[10, 32], [32, 10]], 'node_cpu_info': {0: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191], 1: [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255]}} | |
| INFO:root:NUMA hardware info: {'numa_node_distance': [[10, 32], [32, 10]], 'node_cpu_info': {0: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191], 1: [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255]}} | |
| INFO:root:GPU: 0 | |
| INFO:root:Nearest nodes = [0] | |
| INFO:root:GPU: 0 | |
| INFO:root:Nearest nodes = [1] | |
| INFO:root:NUMA hardware info: {'numa_node_distance': [[10, 32], [32, 10]], 'node_cpu_info': {0: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191], 1: [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255]}} | |
| INFO:root:GPU: 0 | |
| INFO:root:Nearest nodes = [1] | |
| INFO:root:NUMA hardware info: {'numa_node_distance': [[10, 32], [32, 10]], 'node_cpu_info': {0: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191], 1: [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255]}} | |
| INFO:root:GPU: 0 | |
| INFO:root:Nearest nodes = [0] | |
| INFO:root:NUMA hardware info: {'numa_node_distance': [[10, 32], [32, 10]], 'node_cpu_info': {0: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191], 1: [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255]}} | |
| INFO:root:GPU: 0 | |
| INFO:root:Nearest nodes = [0] | |
| INFO:root:NUMA hardware info: {'numa_node_distance': [[10, 32], [32, 10]], 'node_cpu_info': {0: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191], 1: [64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255]}} | |
| INFO:root:GPU: 0 | |
| INFO:root:Nearest nodes = [1] | |
| INFO:shortfin_apps.llm.components.manager:Created local system with ['amdgpu:0:0@0'] devices | |
| INFO:shortfin_apps.llm.components.manager:Created local system with ['amdgpu:0:0@0'] devices | |
| INFO:shortfin_apps.llm.components.manager:Created local system with ['amdgpu:0:0@0'] devices | |
| INFO:shortfin_apps.llm.components.manager:Created local system with ['amdgpu:0:0@0'] devices | |
| INFO:shortfin_apps.llm.components.manager:Created local system with ['amdgpu:0:0@0'] devices | |
| INFO:shortfin_apps.llm.components.manager:Created local system with ['amdgpu:0:0@0'] devices | |
| INFO:shortfin_apps.llm.components.manager:Created local system with ['amdgpu:0:0@0'] devices | |
| INFO:shortfin_apps.llm.components.manager:Created local system with ['amdgpu:0:0@0'] devices | |
| INFO:root:Allocating page table (shape=[512, 2097152], dtype=float8_e4m3fn, size=1.0GiB) on DeviceAffinity(amdgpu:0:0@0[0x1]) | |
| INFO:root:Allocating page table (shape=[512, 2097152], dtype=float8_e4m3fn, size=1.0GiB) on DeviceAffinity(amdgpu:0:0@0[0x1]) | |
| INFO:root:Allocating page table (shape=[512, 2097152], dtype=float8_e4m3fn, size=1.0GiB) on DeviceAffinity(amdgpu:0:0@0[0x1]) | |
| INFO:root:Allocating page table (shape=[512, 2097152], dtype=float8_e4m3fn, size=1.0GiB) on DeviceAffinity(amdgpu:0:0@0[0x1]) | |
| INFO:root:Allocating page table (shape=[512, 2097152], dtype=float8_e4m3fn, size=1.0GiB) on DeviceAffinity(amdgpu:0:0@0[0x1]) | |
| INFO:root:Allocating page table (shape=[512, 2097152], dtype=float8_e4m3fn, size=1.0GiB) on DeviceAffinity(amdgpu:0:0@0[0x1]) | |
| INFO:root:Allocating page table (shape=[512, 2097152], dtype=float8_e4m3fn, size=1.0GiB) on DeviceAffinity(amdgpu:0:0@0[0x1]) | |
| INFO:root:Allocating page table (shape=[512, 2097152], dtype=float8_e4m3fn, size=1.0GiB) on DeviceAffinity(amdgpu:0:0@0[0x1]) | |
| INFO:root:Loading parameter fiber 'model' from: /shark-dev/llama3.1/8b/fp8/weight/native_fp8_e4m3fn_llama3_8b.irpa | |
| INFO:root:Loading parameter fiber 'model' from: /shark-dev/llama3.1/8b/fp8/weight/native_fp8_e4m3fn_llama3_8b.irpa | |
| INFO:shortfin_apps.llm.components.manager:Starting system manager | |
| INFO:shortfin_apps.llm.components.manager:Starting system manager | |
| INFO:root:Loading parameter fiber 'model' from: /shark-dev/llama3.1/8b/fp8/weight/native_fp8_e4m3fn_llama3_8b.irpa | |
| INFO:root:Loading parameter fiber 'model' from: /shark-dev/llama3.1/8b/fp8/weight/native_fp8_e4m3fn_llama3_8b.irpa | |
| INFO:root:Loading parameter fiber 'model' from: /shark-dev/llama3.1/8b/fp8/weight/native_fp8_e4m3fn_llama3_8b.irpa | |
| INFO:root:Loading parameter fiber 'model' from: /shark-dev/llama3.1/8b/fp8/weight/native_fp8_e4m3fn_llama3_8b.irpa | |
| INFO:root:Loading parameter fiber 'model' from: /shark-dev/llama3.1/8b/fp8/weight/native_fp8_e4m3fn_llama3_8b.irpa | |
| INFO:shortfin_apps.llm.components.manager:Starting system manager | |
| INFO:shortfin_apps.llm.components.manager:Starting system manager | |
| INFO:shortfin_apps.llm.components.manager:Starting system manager | |
| INFO:shortfin_apps.llm.components.manager:Starting system manager | |
| INFO:shortfin_apps.llm.components.manager:Starting system manager | |
| INFO:root:Loading parameter fiber 'model' from: /shark-dev/llama3.1/8b/fp8/weight/native_fp8_e4m3fn_llama3_8b.irpa | |
| INFO:shortfin_apps.llm.components.manager:Starting system manager | |
| INFO:root:Start Test! | |
| INFO:micro_llama_process_samples:SampleResponder-1 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-1 end time: 11735.911352843 | |
| INFO:micro_llama_process_samples:SampleResponder-3 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-3 end time: 11735.911615087 | |
| INFO:micro_llama_process_samples:SampleResponder-4 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-4 end time: 11735.911683876 | |
| Exception in thread INFO:micro_llama_process_samples:SampleResponder-1 Sending response | |
| Thread-1 (process_response_loop): | |
| Traceback (most recent call last): | |
| INFO:micro_llama_process_samples:SampleResponder-2 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-2 end time: 11735.916337491 | |
| File "/usr/lib/python3.12/threading.py", line 1073, in _bootstrap_inner | |
| INFO:micro_llama_process_samples:SampleResponder-1 end time: 11735.916447408 | |
| INFO:micro_llama_process_samples:SampleResponder-3 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-3 end time: 11735.916582975 | |
| INFO:micro_llama_process_samples:SampleResponder-3 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-3 end time: 11735.916847579 | |
| INFO:micro_llama_process_samples:SampleResponder-4 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-4 end time: 11735.916954287 | |
| self.run() | |
| File "/usr/lib/python3.12/threading.py", line 1010, in run | |
| self._target(*self._args, **self._kwargs) | |
| File "/mlperf/harness/llama_backend.py", line 267, in process_response_loop | |
| _process_response(response) | |
| File "/mlperf/harness/llama_backend.py", line 243, in _process_response | |
| processed_output = self.dataset.postProcess( | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| File "/mlperf/harness/dataset.py", line 65, in postProcess | |
| assert len(query_id_list) == len( | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| AssertionError: len(query_id_list)=1, len(output_seq)=74, query_id_list=[1], output_seq='ResponderErrorCodes.KVCACHE_PAGES_FULL: Not enough memory pages available.' | |
| INFO:micro_llama_process_samples:SampleResponder-2 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-2 end time: 11735.920174615 | |
| INFO:micro_llama_process_samples:SampleResponder-3 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-3 end time: 11735.92036932 | |
| INFO:micro_llama_process_samples:SampleResponder-4 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-4 end time: 11735.920503657 | |
| INFO:micro_llama_process_samples:SampleResponder-2 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-2 end time: 11735.922886313 | |
| INFO:micro_llama_process_samples:SampleResponder-3 Sending response | |
| INFO:micro_llama_process_samples:SampleResponder-3 end time: 11735.92304434 |
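
Note on the failure above: the AssertionError reports `len(output_seq)=74` because `output_seq` is not a token sequence but the 74-character error string `ResponderErrorCodes.KVCACHE_PAGES_FULL: Not enough memory pages available.`. The page-table lines also show `shape=[512, 2097152]` with `dtype=float8_e4m3fn`, which at one byte per element is exactly the logged 1.0GiB. The following is a minimal sketch, not the harness's actual code, of how a caller might detect such an error string before post-processing and how the page-table size can be checked. The names `ERROR_PATTERN`, `parse_error_response`, and `page_table_bytes` are hypothetical, and the pattern assumes error strings always take the form seen in this log.

```python
import re

# Hypothetical pattern: matches strings shaped like the one in the
# AssertionError above ("ResponderErrorCodes.<CODE>: <message>").
# This shape is inferred from a single log line, not from shortfin's API.
ERROR_PATTERN = re.compile(
    r"^ResponderErrorCodes\.(?P<code>[A-Z_]+):\s*(?P<message>.*)$"
)


def parse_error_response(output_seq):
    """Return (code, message) if output_seq is an error string, else None."""
    if not isinstance(output_seq, str):
        return None
    match = ERROR_PATTERN.match(output_seq)
    if match is None:
        return None
    return match.group("code"), match.group("message")


def page_table_bytes(shape, bytes_per_element=1):
    """Size in bytes of a dense table; float8_e4m3fn is one byte per element."""
    total = bytes_per_element
    for dim in shape:
        total *= dim
    return total


logged_error = (
    "ResponderErrorCodes.KVCACHE_PAGES_FULL: "
    "Not enough memory pages available."
)

print(len(logged_error))                         # 74, as in the assertion
print(parse_error_response(logged_error))        # error code and message
print(parse_error_response([101, 102, 103]))     # None for a token list
print(page_table_bytes([512, 2097152]) / 2**30)  # 1.0 (GiB)
```

A check like this would let the response loop report the error code for the affected query instead of dying on the length assertion. It does not address the underlying KVCACHE_PAGES_FULL condition itself.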