Last active
September 10, 2019 14:53
DGX Station benchmark
Welcome to NVIDIA DGX Station Version 4.0.7 (GNU/Linux 4.15.0-47-generic x86_64)
user@user-DGX-Station:~$ lscpu
Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              40
On-line CPU(s) list: 0-39
Thread(s) per core:  2
Core(s) per socket:  20
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2698 v4 @ 2.20GHz
Stepping:            1
CPU MHz:             1203.348
CPU max MHz:         3600.0000
CPU min MHz:         1200.0000
BogoMIPS:            4397.56
Virtualization:      VT-x
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0-39
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single pti intel_ppin ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a rdseed adx smap intel_pt xsaveopt cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts flush_l1d
user@user-DGX-Station:~$ free -h
              total        used        free      shared  buff/cache   available
Mem:           251G        1.1G        249G        2.4M        1.6G        248G
Swap:            0B          0B          0B
user@user-DGX-Station:~$ nvidia-smi
Tue Sep 10 21:48:37 2019
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 410.129      Driver Version: 410.129      CUDA Version: 10.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla V100-DGXS...  On   | 00000000:07:00.0 Off |                    0 |
| N/A   45C    P0    40W / 300W |     27MiB / 32478MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  Tesla V100-DGXS...  On   | 00000000:08:00.0 Off |                    0 |
| N/A   46C    P0    40W / 300W |      0MiB / 32478MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   2  Tesla V100-DGXS...  On   | 00000000:0E:00.0 Off |                    0 |
| N/A   46C    P0    40W / 300W |      0MiB / 32478MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   3  Tesla V100-DGXS...  On   | 00000000:0F:00.0 Off |                    0 |
| N/A   48C    P0    40W / 300W |      0MiB / 32478MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0      2774      G   /usr/lib/xorg/Xorg                             9MiB |
|    0      2800      G   /usr/bin/gnome-shell                          15MiB |
+-----------------------------------------------------------------------------+
user@user-DGX-Station:~$ git clone https://github.com/Mysticial/Flops.git | |
Cloning into 'Flops'... | |
remote: Enumerating objects: 726, done. | |
remote: Total 726 (delta 0), reused 0 (delta 0), pack-reused 726 | |
Receiving objects: 100% (726/726), 8.98 MiB | 4.26 MiB/s, done. | |
Resolving deltas: 100% (464/464), done. | |
user@user-DGX-Station:~$ cd Flops/version3/binaries-linux/ | |
user@user-DGX-Station:~/Flops/version3/binaries-linux$ chmod +x 2006-Core2 | |
user@user-DGX-Station:~/Flops/version3/binaries-linux$ ./2006-Core2 | |
Running Core 2 tuned binary with 1 thread... | |
Single-Precision - 128-bit SSE - Add/Sub | |
GFlops = 10.72 | |
Result = 1.35699e+06 | |
Double-Precision - 128-bit SSE2 - Add/Sub | |
GFlops = 5.36 | |
Result = 684236 | |
Single-Precision - 128-bit SSE - Multiply | |
GFlops = 21.456 | |
Result = 2.72048e+06 | |
Double-Precision - 128-bit SSE2 - Multiply | |
GFlops = 10.704 | |
Result = 1.37177e+06 | |
Single-Precision - 128-bit SSE - Multiply + Add | |
GFlops = 21.456 | |
Result = 2.28675e+06 | |
Double-Precision - 128-bit SSE2 - Multiply + Add | |
GFlops = 10.728 | |
Result = 1.14783e+06 | |
Running Core 2 tuned binary with 40 thread(s)... | |
Single-Precision - 128-bit SSE - Add/Sub | |
GFlops = 214.752 | |
Result = 2.74514e+07 | |
Double-Precision - 128-bit SSE2 - Add/Sub | |
GFlops = 108.096 | |
Result = 1.37612e+07 | |
Single-Precision - 128-bit SSE - Multiply | |
GFlops = 432.144 | |
Result = 5.50881e+07 | |
Double-Precision - 128-bit SSE2 - Multiply | |
GFlops = 215.904 | |
Result = 2.75107e+07 | |
Single-Precision - 128-bit SSE - Multiply + Add | |
GFlops = 432.24 | |
Result = 4.58174e+07 | |
Double-Precision - 128-bit SSE2 - Multiply + Add | |
GFlops = 215.976 | |
Result = 2.29539e+07 | |
user@user-DGX-Station:~/Flops/version3/binaries-linux$ chmod +x 2013-Haswell | |
user@user-DGX-Station:~/Flops/version3/binaries-linux$ ./2013-Haswell | |
Running Haswell tuned binary with 1 thread... | |
Single-Precision - 128-bit AVX - Add/Sub | |
GFlops = 10.72 | |
Result = 1.39171e+06 | |
Double-Precision - 128-bit AVX - Add/Sub | |
GFlops = 5.36 | |
Result = 677634 | |
Single-Precision - 128-bit AVX - Multiply | |
GFlops = 21.456 | |
Result = 2.72694e+06 | |
Double-Precision - 128-bit AVX - Multiply | |
GFlops = 10.728 | |
Result = 1.37212e+06 | |
Single-Precision - 128-bit AVX - Multiply + Add | |
GFlops = 21.456 | |
Result = 2.30148e+06 | |
Double-Precision - 128-bit AVX - Multiply + Add | |
GFlops = 10.728 | |
Result = 1.14076e+06 | |
Single-Precision - 128-bit FMA3 - Fused Multiply Add | |
GFlops = 41.376 | |
Result = 2.62872e+06 | |
Double-Precision - 128-bit FMA3 - Fused Multiply Add | |
GFlops = 20.688 | |
Result = 1.31907e+06 | |
Single-Precision - 256-bit AVX - Add/Sub | |
GFlops = 20.672 | |
Result = 2.62571e+06 | |
Double-Precision - 256-bit AVX - Add/Sub | |
GFlops = 10.336 | |
Result = 1.31198e+06 | |
Single-Precision - 256-bit AVX - Multiply | |
GFlops = 41.376 | |
Result = 5.27805e+06 | |
Double-Precision - 256-bit AVX - Multiply | |
GFlops = 20.64 | |
Result = 2.62802e+06 | |
Single-Precision - 256-bit AVX - Multiply + Add | |
GFlops = 41.376 | |
Result = 4.39652e+06 | |
Double-Precision - 256-bit AVX - Multiply + Add | |
GFlops = 20.736 | |
Result = 2.20224e+06 | |
Single-Precision - 256-bit FMA3 - Fused Multiply Add | |
GFlops = 82.56 | |
Result = 5.24702e+06 | |
Double-Precision - 256-bit FMA3 - Fused Multiply Add | |
GFlops = 41.376 | |
Result = 2.63064e+06 | |
Running Haswell tuned binary with 40 thread(s)... | |
Single-Precision - 128-bit AVX - Add/Sub | |
GFlops = 214.912 | |
Result = 2.73979e+07 | |
Double-Precision - 128-bit AVX - Add/Sub | |
GFlops = 108.176 | |
Result = 1.37815e+07 | |
Single-Precision - 128-bit AVX - Multiply | |
GFlops = 432.048 | |
Result = 5.50837e+07 | |
Double-Precision - 128-bit AVX - Multiply | |
GFlops = 215.976 | |
Result = 2.75056e+07 | |
Single-Precision - 128-bit AVX - Multiply + Add | |
GFlops = 432.288 | |
Result = 4.58675e+07 | |
Double-Precision - 128-bit AVX - Multiply + Add | |
GFlops = 216.24 | |
Result = 2.29816e+07 | |
Single-Precision - 128-bit FMA3 - Fused Multiply Add | |
GFlops = 832.608 | |
Result = 5.3022e+07 | |
Double-Precision - 128-bit FMA3 - Fused Multiply Add | |
GFlops = 416.304 | |
Result = 2.65274e+07 | |
Single-Precision - 256-bit AVX - Add/Sub | |
GFlops = 416.832 | |
Result = 5.30652e+07 | |
Double-Precision - 256-bit AVX - Add/Sub | |
GFlops = 208.32 | |
Result = 2.66027e+07 | |
Single-Precision - 256-bit AVX - Multiply | |
GFlops = 832.608 | |
Result = 1.06004e+08 | |
Double-Precision - 256-bit AVX - Multiply | |
GFlops = 416.448 | |
Result = 5.30501e+07 | |
Single-Precision - 256-bit AVX - Multiply + Add | |
GFlops = 832.608 | |
Result = 8.82891e+07 | |
Double-Precision - 256-bit AVX - Multiply + Add | |
GFlops = 416.208 | |
Result = 4.42095e+07 | |
Single-Precision - 256-bit FMA3 - Fused Multiply Add | |
GFlops = 1664.64 | |
Result = 1.06053e+08 | |
Double-Precision - 256-bit FMA3 - Fused Multiply Add | |
GFlops = 831.84 | |
Result = 5.29934e+07 | |
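Note on the Flops output above: one rough way to read it is to compare the 1-thread and 40-thread figures for the same kernel. The Python sketch below is only an illustration. The helper name `scaling` and the hard-coded dictionary are assumptions made here, and the values are copied from the 256-bit FMA3 lines of the 2013-Haswell log. Since `lscpu` reports 20 physical cores with 2 threads each, a speedup close to 20 would suggest that this kernel scales with physical cores rather than with hardware threads.

```python
# GFlops figures copied from the 2013-Haswell log above (256-bit FMA3 lines).
FMA3_256 = {
    "single": {"one_thread": 82.56, "forty_threads": 1664.64},
    "double": {"one_thread": 41.376, "forty_threads": 831.84},
}

PHYSICAL_CORES = 20  # from lscpu: 1 socket x 20 cores per socket


def scaling(one_thread, forty_threads, cores=PHYSICAL_CORES):
    """Return (speedup, speedup per physical core) for one kernel."""
    speedup = forty_threads / one_thread
    return speedup, speedup / cores


for precision, r in FMA3_256.items():
    speedup, per_core = scaling(r["one_thread"], r["forty_threads"])
    print(f"{precision}: speedup {speedup:.2f}x, {per_core:.2f} per physical core")
```

The same helper can be applied to any other pair of lines from the two runs, for example the 128-bit SSE figures from the 2006-Core2 binary.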
user@user-DGX-Station:~/Flops/version3/binaries-linux$ cd ~ | |
user@user-DGX-Station:~$ sudo docker pull nvcr.io/nvidia/tensorflow:19.08-py3 | |
19.08-py3: Pulling from nvidia/tensorflow | |
Digest: sha256:64e296668d398a106f64bd840772ffb63372148b8c1170b152e7e577013661c9 | |
Status: Image is up to date for nvcr.io/nvidia/tensorflow:19.08-py3 | |
user@user-DGX-Station:~$ sudo nvidia-docker run -it --rm nvcr.io/nvidia/tensorflow:19.08-py3 | |
================ | |
== TensorFlow == | |
================ | |
NVIDIA Release 19.08 (build 7791926) | |
TensorFlow Version 1.14.0 | |
Container image Copyright (c) 2019, NVIDIA CORPORATION. All rights reserved. | |
Copyright 2017-2019 The TensorFlow Authors. All rights reserved. | |
Various files include modifications (c) NVIDIA CORPORATION. All rights reserved. | |
NVIDIA modifications are covered by the license terms that apply to the underlying project or file. | |
NOTE: Legacy NVIDIA Driver detected. Compatibility mode ENABLED. | |
NOTE: MOFED driver for multi-node communication was not detected. | |
Multi-node communication performance may be reduced. | |
root@692ced701c1d:/workspace# git clone --no-checkout https://github.com/IntelPython/ibench.git | |
Cloning into 'ibench'... | |
remote: Enumerating objects: 21, done. | |
remote: Counting objects: 100% (21/21), done. | |
remote: Compressing objects: 100% (21/21), done. | |
remote: Total 467 (delta 8), reused 2 (delta 0), pack-reused 446 | |
Receiving objects: 100% (467/467), 81.04 KiB | 365.00 KiB/s, done. | |
Resolving deltas: 100% (253/253), done. | |
root@692ced701c1d:/workspace# cd ibench | |
root@692ced701c1d:/workspace/ibench# git checkout d2a81d04352427437e6e383654cfbd36e99c5ae9 | |
Note: checking out 'd2a81d04352427437e6e383654cfbd36e99c5ae9'. | |
You are in 'detached HEAD' state. You can look around, make experimental | |
changes and commit them, and you can discard any commits you make in this | |
state without impacting any branches by performing another checkout. | |
If you want to create a new branch to retain commits you create, you may | |
do so (now or later) by using -b with the checkout command again. Example: | |
git checkout -b <new-branch-name> | |
HEAD is now at d2a81d0 Add comments to clarify the steps | |
root@692ced701c1d:/workspace/ibench# python -m ibench run -b all --size small --runs 3 --file result.json | |
Cholesky: | |
Cholesky: N = 10000 | |
Cholesky: elapsed 0.990914 gflops 336.389816 | |
Cholesky: elapsed 0.967183 gflops 344.643459 | |
Cholesky: elapsed 0.979904 gflops 340.169316 | |
Cholesky: gflops 340.169316 | |
Det: | |
Det: N = 15000 | |
Det: elapsed 6.110545 gflops 368.215933 | |
Det: elapsed 5.973589 gflops 376.657959 | |
Det: elapsed 5.944663 gflops 378.490784 | |
Det: gflops 376.657959 | |
Dot: | |
Dot: N = 5000 | |
Dot: elapsed 0.573518 gflops 435.906081 | |
Dot: elapsed 0.573810 gflops 435.684571 | |
Dot: elapsed 0.571029 gflops 437.805993 | |
Dot: gflops 435.906081 | |
Fft: | |
Fft: N = 520000 | |
Fft: elapsed 18.262345 gflops 2.703333 | |
Fft: elapsed 18.233072 gflops 2.707673 | |
Fft: elapsed 18.234067 gflops 2.707525 | |
Fft: gflops 2.707525 | |
Inv: | |
Inv: N = 10000 | |
Inv: elapsed 7.173745 gflops 278.794422 | |
Inv: elapsed 7.098740 gflops 281.740136 | |
Inv: elapsed 7.067508 gflops 282.985177 | |
Inv: gflops 281.740136 | |
Lu: | |
Lu: N = 20000 | |
Lu: elapsed 17.991874 gflops 296.430114 | |
Lu: elapsed 17.741165 gflops 300.619109 | |
Lu: elapsed 17.668941 gflops 301.847929 | |
Lu: gflops 300.619109 | |
Qr: | |
Qr: N = 5000 | |
Qr: elapsed 2.123459 gflops 78.488287 | |
Qr: elapsed 2.098615 gflops 79.417451 | |
Qr: elapsed 2.090776 gflops 79.715220 | |
Qr: gflops 79.417451 | |
Svd: | |
Svd: N = 5000 | |
Svd: elapsed 19.762689 gflops 8.433400 | |
Svd: elapsed 19.579588 gflops 8.512266 | |
Svd: elapsed 19.605364 gflops 8.501075 | |
Svd: gflops 8.501075 | |
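Note on the ibench output above: the final `gflops` line of each benchmark appears to be the median of the three runs, and the per-run figures are consistent with a flop count divided by the elapsed time. The Python sketch below checks both against the logged numbers for three of the benchmarks. The flop-count formulas (N^3/3 for Cholesky, 2N^3/3 for Det, 2N^3 for Dot) are inferred here from the log and are an assumption, not taken from the ibench source. The names `LOG`, `FLOP_COUNT` and `gflops` are illustrative only.

```python
from statistics import median

# Values copied from the ibench log above: N, first-run elapsed seconds,
# the three per-run GFLOPS figures, and the final summary line.
LOG = {
    "Cholesky": (10000, 0.990914, [336.389816, 344.643459, 340.169316], 340.169316),
    "Det": (15000, 6.110545, [368.215933, 376.657959, 378.490784], 376.657959),
    "Dot": (5000, 0.573518, [435.906081, 435.684571, 437.805993], 435.906081),
}

# Assumed flop counts, inferred from the logged numbers (not from ibench itself).
FLOP_COUNT = {
    "Cholesky": lambda n: n**3 / 3,
    "Det": lambda n: 2 * n**3 / 3,
    "Dot": lambda n: 2 * n**3,
}


def gflops(name, n, elapsed_s):
    """GFLOPS implied by the assumed flop count and a measured elapsed time."""
    return FLOP_COUNT[name](n) / elapsed_s / 1e9


for name, (n, elapsed, runs, summary) in LOG.items():
    recomputed = gflops(name, n, elapsed)
    print(f"{name}: recomputed {recomputed:.3f}, logged {runs[0]:.3f}, "
          f"median of runs {median(runs):.3f}, summary {summary:.3f}")
```

Small differences in the last digits are expected because the elapsed times in the log are rounded to six decimal places.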
root@692ced701c1d:/workspace/ibench# cd /workspace | |
root@692ced701c1d:/workspace# git clone https://github.com/axboe/fio.git -b fio-3.15 | |
Cloning into 'fio'... | |
remote: Enumerating objects: 11, done. | |
remote: Counting objects: 100% (11/11), done. | |
remote: Compressing objects: 100% (8/8), done. | |
remote: Total 28931 (delta 4), reused 8 (delta 3), pack-reused 28920 | |
Receiving objects: 100% (28931/28931), 18.45 MiB | 811.00 KiB/s, done. | |
Resolving deltas: 100% (19259/19259), done. | |
Note: checking out '01bf5128d0581e267383f280c6a1dcd26517240f'. | |
You are in 'detached HEAD' state. You can look around, make experimental | |
changes and commit them, and you can discard any commits you make in this | |
state without impacting any branches by performing another checkout. | |
If you want to create a new branch to retain commits you create, you may | |
do so (now or later) by using -b with the checkout command again. Example: | |
git checkout -b <new-branch-name> | |
root@692ced701c1d:/workspace# cd fio/ | |
root@692ced701c1d:/workspace/fio# ./configure | |
Operating system Linux | |
CPU x86_64 | |
Big endian no | |
Compiler gcc | |
Cross compile no | |
Static build no | |
Wordsize 64 | |
zlib yes | |
Linux AIO support no | |
POSIX AIO support yes | |
POSIX AIO support needs -lrt yes | |
POSIX AIO fsync yes | |
POSIX pshared support yes | |
Solaris AIO support no | |
__sync_fetch_and_add yes | |
__sync_synchronize yes | |
__sync_val_compare_and_swap yes | |
libverbs no | |
rdmacm no | |
asprintf() yes | |
vasprintf() yes | |
Linux fallocate yes | |
POSIX fadvise yes | |
POSIX fallocate yes | |
sched_setaffinity(3 arg) yes | |
sched_setaffinity(2 arg) no | |
clock_gettime yes | |
CLOCK_MONOTONIC yes | |
CLOCK_MONOTONIC_RAW yes | |
CLOCK_MONOTONIC_PRECISE no | |
clockid_t yes | |
gettimeofday yes | |
fdatasync yes | |
sync_file_range yes | |
EXT4 move extent yes | |
Linux splice(2) yes | |
GUASI no | |
libnuma yes | |
libnuma v2 yes | |
strsep yes | |
strcasestr yes | |
strlcat no | |
getopt_long_only() yes | |
inet_aton yes | |
socklen_t yes | |
__thread yes | |
RUSAGE_THREAD yes | |
SCHED_IDLE yes | |
TCP_NODELAY yes | |
Net engine window_size yes | |
TCP_MAXSEG yes | |
RLIMIT_MEMLOCK yes | |
pwritev/preadv yes | |
pwritev2/preadv2 yes | |
IPv6 helpers yes | |
http engine no | |
Rados engine no | |
Rados Block Device engine no | |
setvbuf yes | |
Gluster API engine no | |
s390_z196_facilities no | |
HDFS engine no | |
MTD yes | |
libpmem no | |
libpmemblk no | |
PMDK pmemblk engine no | |
PMDK dev-dax engine no | |
PMDK libpmem engine no | |
DDN's Infinite Memory Engine no | |
iscsi engine no | |
lex/yacc for arithmetic no | |
getmntent yes | |
getmntinfo no | |
Static Assert yes | |
bool yes | |
strndup yes | |
Valgrind headers no | |
Zoned block device support yes | |
march_armv8_a_crc_crypto no | |
cuda no | |
mkdir(a, b) yes | |
Build march=native yes | |
CUnit no | |
__kernel_rwf_t yes | |
-Wimplicit-fallthrough yes | |
MADV_HUGEPAGE yes | |
gettid no | |
TCMalloc support no | |
root@692ced701c1d:/workspace/fio# make -j $(nproc) | |
FIO_VERSION = fio-3.15 | |
CC crc/crc32.o | |
CC crc/crc16.o | |
CC crc/crc32c-arm64.o | |
CC crc/crc32c-intel.o | |
CC crc/crc32c.o | |
CC crc/crc7.o | |
CC crc/crc64.o | |
CC crc/fnv.o | |
CC crc/md5.o | |
CC crc/sha1.o | |
CC crc/murmur3.o | |
CC crc/sha256.o | |
CC crc/sha512.o | |
CC crc/sha3.o | |
CC crc/test.o | |
CC crc/xxhash.o | |
CC lib/axmap.o | |
CC lib/bloom.o | |
CC lib/flist_sort.o | |
CC lib/getrusage.o | |
CC lib/gauss.o | |
CC lib/lfsr.o | |
CC lib/ieee754.o | |
CC lib/memcpy.o | |
CC lib/memalign.o | |
CC lib/hweight.o | |
CC lib/mountcheck.o | |
CC lib/num2str.o | |
CC lib/prio_tree.o | |
CC lib/output_buffer.o | |
CC lib/pattern.o | |
CC lib/rbtree.o | |
CC lib/rand.o | |
CC lib/strntol.o | |
CC gettime.o | |
CC init.o | |
CC ioengines.o | |
CC lib/zipf.o | |
CC stat.o | |
CC log.o | |
CC time.o | |
CC filesetup.o | |
CC eta.o | |
CC verify.o | |
CC memory.o | |
CC io_u.o | |
CC parse.o | |
CC pshared.o | |
CC fio_sem.o | |
CC rwlock.o | |
CC options.o | |
CC smalloc.o | |
CC filehash.o | |
CC profile.o | |
CC debug.o | |
CC engines/cpu.o | |
CC engines/mmap.o | |
CC engines/sync.o | |
CC engines/null.o | |
CC engines/net.o | |
CC engines/ftruncate.o | |
CC engines/filecreate.o | |
CC server.o | |
CC client.o | |
CC iolog.o | |
CC backend.o | |
CC libfio.o | |
CC flow.o | |
CC gettime-thread.o | |
CC cconv.o | |
CC helpers.o | |
CC json.o | |
CC td_error.o | |
CC idletime.o | |
CC profiles/tiobench.o | |
CC profiles/act.o | |
CC io_u_queue.o | |
CC filelock.o | |
CC workqueue.o | |
CC rate-submit.o | |
CC optgroup.o | |
CC helper_thread.o | |
CC steadystate.o | |
CC zone-dist.o | |
CC engines/posixaio.o | |
CC engines/falloc.o | |
CC engines/e4defrag.o | |
CC engines/splice.o | |
CC oslib/asprintf.o | |
CC oslib/strlcat.o | |
CC engines/mtd.o | |
CC oslib/libmtd.o | |
CC oslib/libmtd_legacy.o | |
CC zbd.o | |
CC diskutil.o | |
CC fifo.o | |
CC blktrace.o | |
CC cgroup.o | |
CC trim.o | |
CC engines/sg.o | |
CC oslib/linux-dev-lookup.o | |
CC engines/io_uring.o | |
CC fio.o | |
CC t/log.o | |
CC t/genzipf.o | |
CC oslib/strcasestr.o | |
CC oslib/strndup.o | |
CC t/btrace2fio.o | |
CC t/dedupe.o | |
CC t/debug.o | |
CC t/arch.o | |
CC t/verify-state.o | |
CC t/stest.o | |
CC t/ieee754.o | |
CC t/axmap.o | |
CC t/gen-rand.o | |
CC t/lfsr-test.o | |
LINK t/ieee754 | |
LINK t/stest | |
LINK t/lfsr-test | |
LINK t/fio-verify-state | |
LINK t/gen-rand | |
LINK t/axmap | |
LINK t/fio-genzipf | |
LINK t/fio-dedupe | |
LINK t/fio-btrace2fio | |
LINK fio | |
root@692ced701c1d:/workspace/fio# ./fio --loops=5 --size=1g --runtime=10 --stonewall --direct=1 --group_reporting \ | |
> --name=SeqQ32T1read --bs=128k --iodepth=32 --rw=read \ | |
> --name=SeqQ32T1write --bs=128k --iodepth=32 --rw=write \ | |
> --name=4kQ8T8read --bs=4k --iodepth=8 --numjobs=8 --rw=randread \ | |
> --name=4kQ8T8write --bs=4k --iodepth=8 --numjobs=8 --rw=randwrite \ | |
> --name=4kQ32T1read --bs=4k --iodepth=32 --rw=randread \ | |
> --name=4kQ32T1write --bs=4k --iodepth=32 --rw=randwrite \ | |
> --name=4kQ1T1read --bs=4k --iodepth=1 --rw=randread \ | |
> --name=4kQ1T1write --bs=4k --iodepth=1 --rw=randwrite | |
SeqQ32T1read: (g=0): rw=read, bs=(R) 128KiB-128KiB, (W) 128KiB-128KiB, (T) 128KiB-128KiB, ioengine=psync, iodepth=32 | |
SeqQ32T1write: (g=1): rw=write, bs=(R) 128KiB-128KiB, (W) 128KiB-128KiB, (T) 128KiB-128KiB, ioengine=psync, iodepth=32 | |
4kQ8T8read: (g=2): rw=randread, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=psync, iodepth=8 | |
... | |
4kQ8T8write: (g=3): rw=randwrite, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=psync, iodepth=8 | |
... | |
4kQ32T1read: (g=4): rw=randread, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=psync, iodepth=32 | |
4kQ32T1write: (g=5): rw=randwrite, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=psync, iodepth=32 | |
4kQ1T1read: (g=6): rw=randread, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=psync, iodepth=1 | |
4kQ1T1write: (g=7): rw=randwrite, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=psync, iodepth=1 | |
fio-3.15 | |
Starting 22 processes | |
SeqQ32T1read: Laying out IO file (1 file / 1024MiB) | |
SeqQ32T1write: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8read: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8read: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8read: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8read: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8read: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8read: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8read: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8read: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8write: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8write: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8write: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8write: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8write: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8write: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8write: Laying out IO file (1 file / 1024MiB) | |
4kQ8T8write: Laying out IO file (1 file / 1024MiB) | |
4kQ32T1read: Laying out IO file (1 file / 1024MiB) | |
4kQ32T1write: Laying out IO file (1 file / 1024MiB) | |
4kQ1T1read: Laying out IO file (1 file / 1024MiB) | |
4kQ1T1write: Laying out IO file (1 file / 1024MiB) | |
Jobs: 1 (f=1): [_(21),w(1)][79.2%][w=256MiB/s][w=65.5k IOPS][eta 00m:22s] | |
SeqQ32T1read: (groupid=0, jobs=1): err= 0: pid=3012: Tue Sep 10 14:02:50 2019 | |
read: IOPS=3901, BW=488MiB/s (511MB/s)(4877MiB/10001msec) | |
clat (usec): min=14, max=882, avg=248.18, stdev=234.46 | |
lat (usec): min=14, max=882, avg=248.25, stdev=234.43 | |
clat percentiles (usec): | |
| 1.00th=[ 15], 5.00th=[ 15], 10.00th=[ 15], 20.00th=[ 15], | |
| 30.00th=[ 15], 40.00th=[ 15], 50.00th=[ 92], 60.00th=[ 474], | |
| 70.00th=[ 478], 80.00th=[ 482], 90.00th=[ 482], 95.00th=[ 486], | |
| 99.00th=[ 586], 99.50th=[ 611], 99.90th=[ 742], 99.95th=[ 750], | |
| 99.99th=[ 791] | |
bw ( KiB/s): min=401284, max=507795, per=91.62%, avg=457547.74, stdev=30392.39, samples=19 | |
iops : min= 3135, max= 3967, avg=3574.00, stdev=237.45, samples=19 | |
lat (usec) : 20=49.95%, 50=0.04%, 100=0.02%, 250=0.01%, 500=48.49% | |
lat (usec) : 750=1.44%, 1000=0.06% | |
cpu : usr=0.58%, sys=15.05%, ctx=19533, majf=0, minf=42 | |
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% | |
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
issued rwts: total=39019,0,0,0 short=0,0,0,0 dropped=0,0,0,0 | |
latency : target=0, window=0, percentile=100.00%, depth=32 | |
SeqQ32T1write: (groupid=1, jobs=1): err= 0: pid=3013: Tue Sep 10 14:02:50 2019 | |
write: IOPS=2071, BW=259MiB/s (272MB/s)(2815MiB/10872msec); 0 zone resets | |
clat (usec): min=68, max=1369.1k, avg=359.39, stdev=15630.43 | |
lat (usec): min=69, max=1369.1k, avg=360.84, stdev=15630.43 | |
clat percentiles (usec): | |
| 1.00th=[ 72], 5.00th=[ 73], 10.00th=[ 73], 20.00th=[ 74], | |
| 30.00th=[ 75], 40.00th=[ 84], 50.00th=[ 87], 60.00th=[ 94], | |
| 70.00th=[ 98], 80.00th=[ 113], 90.00th=[ 115], 95.00th=[ 127], | |
| 99.00th=[ 167], 99.50th=[ 194], 99.90th=[ 229], 99.95th=[ 725], | |
| 99.99th=[943719] | |
bw ( KiB/s): min= 655, max=894252, per=100.00%, avg=409956.00, stdev=298909.51, samples=12 | |
iops : min= 5, max= 6986, avg=3202.33, stdev=2335.14, samples=12 | |
lat (usec) : 100=73.54%, 250=26.40%, 500=0.01%, 750=0.01% | |
lat (msec) : 2=0.01%, 500=0.01%, 1000=0.02%, 2000=0.01% | |
cpu : usr=0.76%, sys=27.38%, ctx=192, majf=0, minf=12 | |
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% | |
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
issued rwts: total=0,22523,0,0 short=0,0,0,0 dropped=0,0,0,0 | |
latency : target=0, window=0, percentile=100.00%, depth=32 | |
4kQ8T8read: (groupid=2, jobs=8): err= 0: pid=3014: Tue Sep 10 14:02:50 2019 | |
read: IOPS=53.0k, BW=211MiB/s (221MB/s)(2109MiB/10001msec) | |
clat (usec): min=99, max=3850, avg=146.54, stdev=23.08 | |
lat (usec): min=99, max=3850, avg=146.72, stdev=23.09 | |
clat percentiles (usec): | |
| 1.00th=[ 117], 5.00th=[ 123], 10.00th=[ 127], 20.00th=[ 131], | |
| 30.00th=[ 135], 40.00th=[ 139], 50.00th=[ 143], 60.00th=[ 147], | |
| 70.00th=[ 153], 80.00th=[ 159], 90.00th=[ 172], 95.00th=[ 186], | |
| 99.00th=[ 219], 99.50th=[ 231], 99.90th=[ 260], 99.95th=[ 273], | |
| 99.99th=[ 314] | |
bw ( KiB/s): min=147195, max=214268, per=70.27%, avg=151708.63, stdev=1850.06, samples=152 | |
iops : min=36796, max=53563, avg=37924.42, stdev=462.48, samples=152 | |
lat (usec) : 100=0.01%, 250=99.83%, 500=0.16%, 750=0.01% | |
lat (msec) : 2=0.01%, 4=0.01% | |
cpu : usr=1.86%, sys=11.29%, ctx=540117, majf=0, minf=75 | |
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% | |
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
issued rwts: total=539779,0,0,0 short=0,0,0,0 dropped=0,0,0,0 | |
latency : target=0, window=0, percentile=100.00%, depth=8 | |
4kQ8T8write: (groupid=3, jobs=8): err= 0: pid=3022: Tue Sep 10 14:02:50 2019 | |
write: IOPS=31.5k, BW=123MiB/s (129MB/s)(1256MiB/10200msec); 0 zone resets | |
clat (usec): min=4, max=1306.4k, avg=252.44, stdev=12776.17 | |
lat (usec): min=4, max=1306.4k, avg=252.63, stdev=12776.17 | |
clat percentiles (usec): | |
| 1.00th=[ 7], 5.00th=[ 10], 10.00th=[ 11], 20.00th=[ 13], | |
| 30.00th=[ 15], 40.00th=[ 17], 50.00th=[ 20], 60.00th=[ 26], | |
| 70.00th=[ 194], 80.00th=[ 229], 90.00th=[ 265], 95.00th=[ 293], | |
| 99.00th=[ 523], 99.50th=[ 775], 99.90th=[ 1401], 99.95th=[ 1729], | |
| 99.99th=[759170] | |
bw ( KiB/s): min=20367, max=501819, per=100.00%, avg=181662.04, stdev=15622.51, samples=110 | |
iops : min= 5087, max=125451, avg=45412.24, stdev=3905.66, samples=110 | |
lat (usec) : 10=6.44%, 20=46.47%, 50=8.16%, 100=0.29%, 250=25.34% | |
lat (usec) : 500=12.19%, 750=0.52%, 1000=0.42% | |
lat (msec) : 2=0.14%, 4=0.02%, 50=0.01%, 100=0.01%, 250=0.01% | |
lat (msec) : 500=0.01%, 750=0.01%, 1000=0.01%, 2000=0.01% | |
cpu : usr=0.84%, sys=8.83%, ctx=135933, majf=0, minf=82 | |
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% | |
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
issued rwts: total=0,321484,0,0 short=0,0,0,0 dropped=0,0,0,0 | |
latency : target=0, window=0, percentile=100.00%, depth=8 | |
4kQ32T1read: (groupid=4, jobs=1): err= 0: pid=3030: Tue Sep 10 14:02:50 2019 | |
read: IOPS=8640, BW=33.8MiB/s (35.4MB/s)(338MiB/10001msec) | |
clat (usec): min=95, max=8677, avg=115.05, stdev=30.52 | |
lat (usec): min=95, max=8677, avg=115.12, stdev=30.52 | |
clat percentiles (usec): | |
| 1.00th=[ 99], 5.00th=[ 109], 10.00th=[ 110], 20.00th=[ 110], | |
| 30.00th=[ 111], 40.00th=[ 111], 50.00th=[ 112], 60.00th=[ 113], | |
| 70.00th=[ 125], 80.00th=[ 126], 90.00th=[ 127], 95.00th=[ 127], | |
| 99.00th=[ 128], 99.50th=[ 129], 99.90th=[ 135], 99.95th=[ 137], | |
| 99.99th=[ 163] | |
bw ( KiB/s): min=27901, max=34240, per=83.27%, avg=28779.26, stdev=1330.24, samples=19 | |
iops : min= 6975, max= 8560, avg=7194.42, stdev=332.65, samples=19 | |
lat (usec) : 100=1.47%, 250=98.53%, 500=0.01%, 750=0.01% | |
lat (msec) : 2=0.01%, 10=0.01% | |
cpu : usr=1.38%, sys=5.77%, ctx=86417, majf=0, minf=11 | |
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% | |
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
issued rwts: total=86413,0,0,0 short=0,0,0,0 dropped=0,0,0,0 | |
latency : target=0, window=0, percentile=100.00%, depth=32 | |
4kQ32T1write: (groupid=5, jobs=1): err= 0: pid=3031: Tue Sep 10 14:02:50 2019 | |
write: IOPS=24.6k, BW=96.2MiB/s (101MB/s)(1024MiB/10650msec); 0 zone resets | |
clat (usec): min=3, max=1300.0k, avg=34.66, stdev=4996.97 | |
lat (usec): min=3, max=1300.0k, avg=34.75, stdev=4996.97 | |
clat percentiles (usec): | |
| 1.00th=[ 5], 5.00th=[ 5], 10.00th=[ 5], 20.00th=[ 5], | |
| 30.00th=[ 5], 40.00th=[ 5], 50.00th=[ 6], 60.00th=[ 7], | |
| 70.00th=[ 7], 80.00th=[ 9], 90.00th=[ 58], 95.00th=[ 77], | |
| 99.00th=[ 94], 99.50th=[ 96], 99.90th=[ 103], 99.95th=[ 128], | |
| 99.99th=[ 159] | |
bw ( KiB/s): min= 4843, max=318196, per=100.00%, avg=134503.25, stdev=90968.52, samples=12 | |
iops : min= 1210, max=79549, avg=33625.50, stdev=22742.31, samples=12 | |
lat (usec) : 4=0.11%, 10=82.44%, 20=1.52%, 50=4.10%, 100=11.61% | |
lat (usec) : 250=0.22%, 500=0.01% | |
lat (msec) : 2000=0.01% | |
cpu : usr=2.80%, sys=27.19%, ctx=60132, majf=0, minf=11 | |
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% | |
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
issued rwts: total=0,262145,0,0 short=0,0,0,0 dropped=0,0,0,0 | |
latency : target=0, window=0, percentile=100.00%, depth=32 | |
4kQ1T1read: (groupid=6, jobs=1): err= 0: pid=3032: Tue Sep 10 14:02:50 2019 | |
read: IOPS=8657, BW=33.8MiB/s (35.5MB/s)(338MiB/10001msec) | |
clat (usec): min=95, max=4880, avg=114.85, stdev=18.65 | |
lat (usec): min=95, max=4881, avg=114.92, stdev=18.65 | |
clat percentiles (usec): | |
| 1.00th=[ 99], 5.00th=[ 108], 10.00th=[ 109], 20.00th=[ 110], | |
| 30.00th=[ 111], 40.00th=[ 111], 50.00th=[ 112], 60.00th=[ 112], | |
| 70.00th=[ 125], 80.00th=[ 126], 90.00th=[ 127], 95.00th=[ 127], | |
| 99.00th=[ 129], 99.50th=[ 129], 99.90th=[ 133], 99.95th=[ 135], | |
| 99.99th=[ 163] | |
bw ( KiB/s): min=23045, max=34358, per=69.14%, avg=23942.63, stdev=2523.49, samples=19 | |
iops : min= 5761, max= 8589, avg=5985.21, stdev=630.86, samples=19 | |
lat (usec) : 100=2.07%, 250=97.92%, 500=0.01% | |
lat (msec) : 2=0.01%, 10=0.01% | |
cpu : usr=0.97%, sys=6.12%, ctx=86584, majf=0, minf=11 | |
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% | |
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
issued rwts: total=86581,0,0,0 short=0,0,0,0 dropped=0,0,0,0 | |
latency : target=0, window=0, percentile=100.00%, depth=1 | |
4kQ1T1write: (groupid=7, jobs=1): err= 0: pid=3033: Tue Sep 10 14:02:50 2019 | |
write: IOPS=38.3k, BW=149MiB/s (157MB/s)(1600MiB/10702msec); 0 zone resets | |
clat (usec): min=3, max=1504.4k, avg=22.01, stdev=3430.79 | |
lat (usec): min=3, max=1504.4k, avg=22.11, stdev=3430.79 | |
clat percentiles (usec): | |
| 1.00th=[ 4], 5.00th=[ 4], 10.00th=[ 5], 20.00th=[ 5], | |
| 30.00th=[ 5], 40.00th=[ 5], 50.00th=[ 5], 60.00th=[ 6], | |
| 70.00th=[ 6], 80.00th=[ 7], 90.00th=[ 39], 95.00th=[ 70], | |
| 99.00th=[ 91], 99.50th=[ 96], 99.90th=[ 103], 99.95th=[ 112], | |
| 99.99th=[ 190] | |
bw ( KiB/s): min= 1053, max=472944, per=100.00%, avg=218439.27, stdev=160092.70, samples=15 | |
iops : min= 263, max=118236, avg=54609.80, stdev=40023.20, samples=15 | |
lat (usec) : 4=6.63%, 10=81.95%, 20=1.20%, 50=2.56%, 100=7.48% | |
lat (usec) : 250=0.17%, 500=0.01%, 750=0.01% | |
lat (msec) : 2=0.01%, 4=0.01%, 10=0.01%, 20=0.01%, 250=0.01% | |
lat (msec) : 500=0.01%, 2000=0.01% | |
cpu : usr=4.68%, sys=30.36%, ctx=56291, majf=0, minf=11 | |
IO depths : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0% | |
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0% | |
issued rwts: total=0,409575,0,0 short=0,0,0,0 dropped=0,0,0,0 | |
latency : target=0, window=0, percentile=100.00%, depth=1 | |
Run status group 0 (all jobs): | |
READ: bw=488MiB/s (511MB/s), 488MiB/s-488MiB/s (511MB/s-511MB/s), io=4877MiB (5114MB), run=10001-10001msec | |
Run status group 1 (all jobs): | |
WRITE: bw=259MiB/s (272MB/s), 259MiB/s-259MiB/s (272MB/s-272MB/s), io=2815MiB (2952MB), run=10872-10872msec | |
Run status group 2 (all jobs): | |
READ: bw=211MiB/s (221MB/s), 211MiB/s-211MiB/s (221MB/s-221MB/s), io=2109MiB (2211MB), run=10001-10001msec | |
Run status group 3 (all jobs): | |
WRITE: bw=123MiB/s (129MB/s), 123MiB/s-123MiB/s (129MB/s-129MB/s), io=1256MiB (1317MB), run=10200-10200msec | |
Run status group 4 (all jobs): | |
READ: bw=33.8MiB/s (35.4MB/s), 33.8MiB/s-33.8MiB/s (35.4MB/s-35.4MB/s), io=338MiB (354MB), run=10001-10001msec | |
Run status group 5 (all jobs): | |
WRITE: bw=96.2MiB/s (101MB/s), 96.2MiB/s-96.2MiB/s (101MB/s-101MB/s), io=1024MiB (1074MB), run=10650-10650msec | |
Run status group 6 (all jobs): | |
READ: bw=33.8MiB/s (35.5MB/s), 33.8MiB/s-33.8MiB/s (35.5MB/s-35.5MB/s), io=338MiB (355MB), run=10001-10001msec | |
Run status group 7 (all jobs): | |
WRITE: bw=149MiB/s (157MB/s), 149MiB/s-149MiB/s (157MB/s-157MB/s), io=1600MiB (1678MB), run=10702-10702msec | |
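The fio summary figures can be cross-checked against the job's own counters: IOPS is the number of issued I/Os divided by the runtime, and bandwidth is that rate multiplied by the block size. A minimal sketch for the 4kQ1T1write job, using the values printed above. The 4 KiB block size is an assumption inferred from the job name (the job file is not shown here), and `fio_check` is a hypothetical helper name:

```shell
#!/bin/sh
# Hypothetical helper: recompute IOPS and bandwidth for the 4kQ1T1write job
# from the counters fio printed above.
fio_check() {
    ios=409575        # "issued rwts: total=0,409575,0,0" (writes)
    block_bytes=4096  # ASSUMPTION: 4 KiB blocks, inferred from the job name
    runtime_ms=10702  # "run=10702-10702msec"

    awk -v ios="$ios" -v bs="$block_bytes" -v ms="$runtime_ms" 'BEGIN {
        secs = ms / 1000
        printf "IOPS: %.0f\n", ios / secs
        printf "BW:   %.1f MiB/s\n", ios * bs / secs / 1048576
    }'
}

fio_check
```

The results should land close to the reported `IOPS=38.3k` and `BW=149MiB/s`; small differences come from how fio rounds its summary values.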
root@692ced701c1d:/workspace/fio# cd /workspace | |
root@692ced701c1d:/workspace# git clone https://github.com/tensorflow/benchmarks.git -b cnn_tf_v1.13_compatible | |
Cloning into 'benchmarks'... | |
remote: Enumerating objects: 40, done. | |
remote: Counting objects: 100% (40/40), done. | |
remote: Compressing objects: 100% (29/29), done. | |
remote: Total 4174 (delta 21), reused 17 (delta 11), pack-reused 4134 | |
Receiving objects: 100% (4174/4174), 2.17 MiB | 1.63 MiB/s, done. | |
Resolving deltas: 100% (2793/2793), done. | |
root@692ced701c1d:/workspace# cd benchmarks/scripts/tf_cnn_benchmarks/ | |
root@692ced701c1d:/workspace/benchmarks/scripts/tf_cnn_benchmarks# vim run.sh | |
root@692ced701c1d:/workspace/benchmarks/scripts/tf_cnn_benchmarks# cat run.sh | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=64 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=1 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=64 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=1 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=128 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=1 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=128 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=1 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=256 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=1 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=256 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=1 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=512 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=1 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=64 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=2 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=64 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=2 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=128 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=2 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=128 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=2 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=256 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=2 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=256 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=2 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=512 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=2 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=64 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=3 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=64 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=3 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=128 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=3 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=128 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=3 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=256 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=3 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=256 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=3 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=512 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=3 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=64 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=4 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=64 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=4 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=128 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=4 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=128 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=4 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=256 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=False --distortions=False --local_parameter_device=gpu --num_gpus=4 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=256 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=4 --display_every=10 | |
python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=512 --num_batches=100 --model=resnet50 --optimizer=sgd --variable_update=replicated --use_fp16=True --distortions=False --local_parameter_device=gpu --num_gpus=4 --display_every=10 | |
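The 28 invocations in run.sh differ only in `--num_gpus`, `--batch_size` and `--use_fp16`, so the same matrix can be produced with nested loops. A minimal sketch, assuming the flag values shown above. `gen_matrix` is a hypothetical helper name, not part of tf_cnn_benchmarks, and it only prints the command lines so they can be reviewed before being run:

```shell
#!/bin/sh
# Hypothetical helper: prints the same command lines as run.sh, in the same order.
gen_matrix() {
    common="--optimizer=sgd --variable_update=replicated"
    tail_flags="--distortions=False --local_parameter_device=gpu"
    for gpus in 1 2 3 4; do
        for batch in 64 128 256 512; do
            for fp16 in False True; do
                # run.sh has no FP32 run at batch size 512, so skip that combination.
                if [ "$batch" = 512 ] && [ "$fp16" = False ]; then
                    continue
                fi
                echo "python tf_cnn_benchmarks.py --data_format=NHWC --batch_size=$batch --num_batches=100 --model=resnet50 $common --use_fp16=$fp16 $tail_flags --num_gpus=$gpus --display_every=10"
            done
        done
    done
}

gen_matrix
```

Piping the printed lines to `sh` from the tf_cnn_benchmarks directory would run the matrix; printing first keeps the generated commands easy to compare against the hand-written file.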
root@692ced701c1d:/workspace/benchmarks/scripts/tf_cnn_benchmarks# sh run.sh | |
2019-09-10 14:14:28.007945: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:14:30.023041 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:14:30.027451 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:14:30.050584 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:14:30.050790 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:14:30.076690: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:14:30.079387: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x40b6af0 executing computations on platform Host. Devices: | |
2019-09-10 14:14:30.079428: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:14:30.083433: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:14:30.972866: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x40b7000 executing computations on platform CUDA. Devices: | |
2019-09-10 14:14:30.972907: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:14:30.972921: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:14:30.972932: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:14:30.972942: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:14:30.982199: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:14:30.984747: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:14:30.986562: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:14:30.988391: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:14:30.988427: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:14:30.990480: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:14:30.992303: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:14:30.992670: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:14:30.994771: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:14:30.996007: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:14:30.999773: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:14:31.011971: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:14:31.012005: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:14:32.783594: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:14:32.783634: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:14:32.783647: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:14:32.783654: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:14:32.783661: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:14:32.783667: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:14:32.791691: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:14:32.793914: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:14:32.796062: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:14:32.798258: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 64 global | |
64 per device | |
Num batches: 100 | |
Num epochs: 0.00 | |
Devices: ['/gpu:0'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:14:32.804810 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:14:32.805390 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:14:32.814951 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:14:32.815586 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:14:32.832383 140493887063872 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:14:33.195570 140493887063872 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:14:35.705537 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:14:35.707565 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:14:35.718987 140493887063872 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:14:36.514074 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:14:36.600245 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:14:36.759890 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:14:36.764339 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:14:36.764563 140493887063872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:14:37.020146 140493887063872 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:14:37.468795: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:14:37.470325: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:14:37.471836: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:14:37.473342: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:14:37.473374: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:14:37.473416: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:14:37.473437: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:14:37.473456: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:14:37.473473: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:14:37.473492: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:14:37.473511: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:14:37.485387: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:14:37.485446: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:14:37.485456: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:14:37.485464: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:14:37.485471: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:14:37.485478: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:14:37.485485: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:14:37.493479: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:14:37.495080: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:14:37.496672: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:14:37.498259: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:14:38.017979: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:14:38.513500 140493887063872 session_manager.py:500] Running local_init_op. | |
I0910 14:14:38.560857 140493887063872 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:14:40.057473: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:14:40.379174: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 375.1 +/- 0.0 (jitter = 0.0) 7.695 | |
10 images/sec: 375.4 +/- 0.1 (jitter = 0.4) 8.122 | |
20 images/sec: 375.3 +/- 0.1 (jitter = 0.5) 8.039 | |
30 images/sec: 375.3 +/- 0.1 (jitter = 0.5) 7.974 | |
40 images/sec: 375.3 +/- 0.1 (jitter = 0.5) 7.807 | |
50 images/sec: 375.3 +/- 0.1 (jitter = 0.4) 7.639 | |
60 images/sec: 375.3 +/- 0.0 (jitter = 0.4) 7.610 | |
70 images/sec: 375.3 +/- 0.0 (jitter = 0.4) 8.210 | |
80 images/sec: 375.4 +/- 0.0 (jitter = 0.3) 7.699 | |
90 images/sec: 375.4 +/- 0.0 (jitter = 0.4) 7.823 | |
100 images/sec: 375.3 +/- 0.0 (jitter = 0.4) 7.743 | |
---------------------------------------------------------------- | |
total images/sec: 375.17 | |
---------------------------------------------------------------- | |
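Each run ends with a single `total images/sec:` line, which makes the results easy to collect when the output has been saved to a file. A minimal sketch, assuming the log is fed in on stdin without any extra line decoration. `summarize_runs` is a hypothetical helper name, and the runs are numbered in the order they appear, which for run.sh matches the order of its command lines:

```shell
#!/bin/sh
# Hypothetical helper: reads a tf_cnn_benchmarks log on stdin and prints one
# line per finished run, numbered in the order the runs appear.
summarize_runs() {
    awk '/^total images\/sec:/ { n++; printf "run %d: %s images/sec\n", n, $3 }'
}

# Example using the summary line from the run above.
printf '%s\n' 'Done warm up' 'total images/sec: 375.17' | summarize_runs
```

Capturing the log in the first place could be done with something like `sh run.sh 2>&1 | tee run.log` (the file name is illustrative), followed by `summarize_runs < run.log`.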
2019-09-10 14:15:04.238046: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:15:06.232517 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:15:06.236979 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:15:06.260249 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:15:06.260469 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:15:06.284705: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:15:06.287244: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x45b0c30 executing computations on platform Host. Devices: | |
2019-09-10 14:15:06.287278: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:15:06.290724: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:15:07.130993: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x45b1140 executing computations on platform CUDA. Devices: | |
2019-09-10 14:15:07.131040: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:15:07.131061: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:15:07.131077: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:15:07.131092: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:15:07.134313: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:15:07.136989: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:15:07.139629: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:15:07.142273: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:15:07.142322: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:15:07.145260: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:15:07.147050: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:15:07.147396: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:15:07.149438: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:15:07.150627: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:15:07.154879: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:15:07.168760: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:15:07.168805: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:15:09.016489: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:15:09.016539: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:15:09.016549: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:15:09.016556: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:15:09.016563: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:15:09.016569: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:15:09.024561: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:15:09.026792: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:15:09.029027: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:15:09.031196: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  64 global
             64 per device
Num batches: 100
Num epochs:  0.00
Devices:     ['/gpu:0']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:15:09.037749 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:15:09.038326 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:15:09.046648 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:15:09.051440 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:15:09.068489 139697050122048 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:15:09.437392 139697050122048 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:15:12.105584 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:15:12.108491 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:15:12.120260 139697050122048 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:15:13.469039 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:15:13.556470 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:15:13.720479 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:15:13.725033 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:15:13.725260 139697050122048 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:15:14.075250 139697050122048 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:15:14.568947: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:15:14.570451: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:15:14.571934: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:15:14.573426: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:15:14.573456: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:15:14.573491: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:15:14.573509: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:15:14.573527: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:15:14.573544: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:15:14.573561: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:15:14.573578: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:15:14.585324: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:15:14.585382: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:15:14.585394: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:15:14.585402: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:15:14.585409: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:15:14.585416: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:15:14.585423: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:15:14.593141: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:15:14.594725: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:15:14.596310: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:15:14.597906: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:15:15.153951: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:15:15.656506 139697050122048 session_manager.py:500] Running local_init_op. | |
I0910 14:15:15.711758 139697050122048 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:15:17.575268: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:15:17.911464: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step  Img/sec                                     total_loss
1     images/sec: 689.8 +/- 0.0 (jitter = 0.0)    7.913
10    images/sec: 692.6 +/- 0.6 (jitter = 1.2)    7.814
20    images/sec: 692.3 +/- 0.5 (jitter = 1.6)    7.953
30    images/sec: 692.7 +/- 0.4 (jitter = 1.6)    7.762
40    images/sec: 692.7 +/- 0.3 (jitter = 1.6)    7.956
50    images/sec: 692.4 +/- 0.3 (jitter = 1.4)    7.889
60    images/sec: 692.3 +/- 0.3 (jitter = 1.3)    8.009
70    images/sec: 692.2 +/- 0.2 (jitter = 1.2)    7.811
80    images/sec: 692.2 +/- 0.2 (jitter = 1.2)    7.829
90    images/sec: 692.2 +/- 0.2 (jitter = 1.2)    7.842
100   images/sec: 692.2 +/- 0.2 (jitter = 1.1)    7.834
----------------------------------------------------------------
total images/sec: 691.58
----------------------------------------------------------------
2019-09-10 14:15:33.287054: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:15:35.317740 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:15:35.322293 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:15:35.345879 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:15:35.346103 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:15:35.368733: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:15:35.371342: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2bc9ad0 executing computations on platform Host. Devices: | |
2019-09-10 14:15:35.371380: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:15:35.375354: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:15:36.248835: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2bc2fa0 executing computations on platform CUDA. Devices: | |
2019-09-10 14:15:36.248875: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:15:36.248893: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:15:36.248909: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:15:36.248926: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:15:36.251779: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:15:36.254163: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:15:36.256521: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:15:36.258875: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:15:36.258921: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:15:36.261616: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:15:36.263962: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:15:36.264416: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:15:36.266248: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:15:36.267235: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:15:36.270798: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:15:36.282704: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:15:36.282741: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:15:38.061528: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:15:38.061571: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:15:38.061583: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:15:38.061593: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:15:38.061603: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:15:38.061612: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:15:38.069655: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:15:38.071865: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:15:38.074037: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:15:38.076207: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  128 global
             128 per device
Num batches: 100
Num epochs:  0.01
Devices:     ['/gpu:0']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:15:38.082777 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:15:38.083382 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:15:38.091226 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:15:38.091602 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:15:38.107810 140310324315968 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:15:38.466178 140310324315968 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:15:40.952657 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:15:40.954668 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:15:40.965989 140310324315968 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:15:41.754404 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:15:41.839752 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:15:41.997255 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:15:42.001689 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:15:42.001904 140310324315968 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:15:42.255005 140310324315968 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:15:42.695529: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:15:42.697053: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:15:42.698558: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:15:42.700100: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:15:42.700129: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:15:42.700162: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:15:42.700181: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:15:42.700198: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:15:42.700215: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:15:42.700231: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:15:42.700249: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:15:42.711978: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:15:42.712039: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:15:42.712049: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:15:42.712058: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:15:42.712065: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:15:42.712072: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:15:42.712079: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:15:42.719888: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:15:42.721498: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:15:42.723096: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:15:42.724719: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:15:43.240712: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:15:43.737771 140310324315968 session_manager.py:500] Running local_init_op. | |
I0910 14:15:43.783949 140310324315968 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:15:45.269380: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:15:45.590576: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step  Img/sec                                     total_loss
1     images/sec: 397.2 +/- 0.0 (jitter = 0.0)    7.973
10    images/sec: 396.4 +/- 0.2 (jitter = 0.8)    7.882
20    images/sec: 396.4 +/- 0.2 (jitter = 0.8)    7.909
30    images/sec: 396.1 +/- 0.2 (jitter = 1.1)    7.805
40    images/sec: 395.8 +/- 0.1 (jitter = 1.1)    7.989
50    images/sec: 395.6 +/- 0.1 (jitter = 1.2)    7.868
60    images/sec: 395.5 +/- 0.1 (jitter = 1.1)    7.944
70    images/sec: 395.4 +/- 0.1 (jitter = 1.0)    7.757
80    images/sec: 395.3 +/- 0.1 (jitter = 1.0)    7.826
90    images/sec: 395.3 +/- 0.1 (jitter = 0.8)    7.947
100   images/sec: 395.2 +/- 0.1 (jitter = 0.8)    7.789
----------------------------------------------------------------
total images/sec: 395.08
----------------------------------------------------------------
2019-09-10 14:16:28.465683: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:16:30.496468 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:16:30.501045 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:16:30.524539 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:16:30.524753 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:16:30.548702: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:16:30.551402: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x394c060 executing computations on platform Host. Devices: | |
2019-09-10 14:16:30.551439: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:16:30.555321: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:16:31.492745: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3945530 executing computations on platform CUDA. Devices: | |
2019-09-10 14:16:31.492779: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:16:31.492791: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:16:31.492803: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:16:31.492812: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:16:31.495267: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:16:31.497403: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:16:31.499819: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:16:31.502224: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:16:31.502270: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:16:31.505020: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:16:31.507436: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:16:31.507923: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:16:31.510082: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:16:31.511097: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:16:31.514765: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:16:31.527010: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:16:31.527046: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:16:33.313778: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:16:33.313821: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:16:33.313832: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:16:33.313839: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:16:33.313845: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:16:33.313851: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:16:33.322112: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:16:33.324370: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:16:33.326570: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:16:33.328799: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  128 global
             128 per device
Num batches: 100
Num epochs:  0.01
Devices:     ['/gpu:0']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:16:33.335533 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:16:33.336146 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:16:33.343653 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:16:33.348349 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:16:33.364998 140580007774016 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:16:33.723358 140580007774016 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:16:36.319929 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:16:36.322793 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:16:36.334097 140580007774016 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:16:37.643736 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:16:37.728603 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:16:37.888648 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:16:37.893090 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:16:37.893323 140580007774016 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:16:38.235394 140580007774016 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:16:38.715452: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:16:38.717052: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:16:38.718579: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:16:38.720079: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:16:38.720109: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:16:38.720143: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:16:38.720161: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:16:38.720180: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:16:38.720206: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:16:38.720225: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:16:38.720248: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:16:38.732009: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:16:38.732068: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:16:38.732079: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:16:38.732087: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:16:38.732094: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:16:38.732100: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:16:38.732108: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:16:38.739951: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:16:38.741566: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:16:38.743171: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:16:38.744793: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:16:39.295176: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:16:39.799729 140580007774016 session_manager.py:500] Running local_init_op. | |
I0910 14:16:39.862023 140580007774016 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:16:41.703017: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:16:42.036113: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step    Img/sec                                     total_loss
1       images/sec: 778.1 +/- 0.0 (jitter = 0.0)    7.830
10      images/sec: 778.3 +/- 0.3 (jitter = 0.9)    7.923
20      images/sec: 778.3 +/- 0.3 (jitter = 0.9)    7.879
30      images/sec: 778.2 +/- 0.2 (jitter = 0.7)    7.797
40      images/sec: 778.2 +/- 0.2 (jitter = 0.7)    7.767
50      images/sec: 778.2 +/- 0.2 (jitter = 0.7)    7.844
60      images/sec: 778.4 +/- 0.2 (jitter = 0.7)    7.794
70      images/sec: 778.4 +/- 0.2 (jitter = 0.7)    7.776
80      images/sec: 778.4 +/- 0.1 (jitter = 0.7)    7.743
90      images/sec: 778.4 +/- 0.1 (jitter = 0.8)    7.890
100     images/sec: 778.4 +/- 0.1 (jitter = 0.7)    7.918
----------------------------------------------------------------
total images/sec: 778.05
----------------------------------------------------------------
2019-09-10 14:17:06.530284: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:17:08.567694 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:17:08.572294 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:17:08.595902 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:17:08.596122 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:17:08.620706: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:17:08.623456: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x4599350 executing computations on platform Host. Devices: | |
2019-09-10 14:17:08.623493: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:17:08.627239: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:17:09.508583: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x45928b0 executing computations on platform CUDA. Devices: | |
2019-09-10 14:17:09.508613: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:17:09.508622: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:17:09.508630: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:17:09.508636: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:17:09.510456: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:17:09.512033: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:17:09.513840: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:17:09.515865: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:17:09.515899: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:17:09.518216: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:17:09.520253: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:17:09.520647: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:17:09.523039: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:17:09.524517: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:17:09.528215: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:17:09.540129: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:17:09.540163: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:17:11.324749: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:17:11.324792: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:17:11.324802: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:17:11.324808: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:17:11.324814: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:17:11.324820: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:17:11.332875: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:17:11.335144: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:17:11.337331: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:17:11.339496: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  256 global
             256 per device
Num batches: 100
Num epochs:  0.02
Devices:     ['/gpu:0']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:17:11.345422 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:17:11.345918 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:17:11.353884 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:17:11.354410 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:17:11.371625 139704933136192 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:17:11.745485 139704933136192 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:17:14.301219 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:17:14.303290 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:17:14.314931 139704933136192 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:17:15.124845 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:17:15.212653 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:17:15.374406 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:17:15.378963 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:17:15.379185 139704933136192 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:17:15.639576 139704933136192 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:17:16.089637: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:17:16.091145: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:17:16.092636: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:17:16.094119: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:17:16.094148: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:17:16.094181: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:17:16.094200: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:17:16.094217: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:17:16.094235: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:17:16.094252: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:17:16.094269: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:17:16.106013: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:17:16.106071: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:17:16.106083: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:17:16.106091: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:17:16.106098: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:17:16.106105: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:17:16.106111: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:17:16.113818: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:17:16.115412: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:17:16.117008: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:17:16.118599: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:17:16.651760: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:17:17.153269 139704933136192 session_manager.py:500] Running local_init_op. | |
I0910 14:17:17.201341 139704933136192 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:17:18.732226: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:17:19.054275: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step    Img/sec                                     total_loss
1       images/sec: 406.8 +/- 0.0 (jitter = 0.0)    8.000
10      images/sec: 406.8 +/- 0.1 (jitter = 0.5)    7.877
20      images/sec: 406.7 +/- 0.1 (jitter = 0.4)    7.968
30      images/sec: 406.6 +/- 0.1 (jitter = 0.5)    7.934
40      images/sec: 406.4 +/- 0.1 (jitter = 0.8)    7.734
50      images/sec: 406.3 +/- 0.1 (jitter = 0.7)    7.799
60      images/sec: 406.3 +/- 0.1 (jitter = 0.6)    7.819
70      images/sec: 406.3 +/- 0.1 (jitter = 0.6)    7.740
80      images/sec: 406.3 +/- 0.1 (jitter = 0.6)    7.907
90      images/sec: 406.2 +/- 0.1 (jitter = 0.6)    7.853
100     images/sec: 406.2 +/- 0.1 (jitter = 0.6)    7.827
----------------------------------------------------------------
total images/sec: 406.17
----------------------------------------------------------------
2019-09-10 14:18:39.248265: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:18:41.235126 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:18:41.239565 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:18:41.262663 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:18:41.262866 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:18:41.288731: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:18:41.291306: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3327a70 executing computations on platform Host. Devices: | |
2019-09-10 14:18:41.291342: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:18:41.295140: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:18:42.117245: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3327dd0 executing computations on platform CUDA. Devices: | |
2019-09-10 14:18:42.117278: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:18:42.117289: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:18:42.117297: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:18:42.117305: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:18:42.119612: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:18:42.121799: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:18:42.124077: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:18:42.126372: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:18:42.126414: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:18:42.128998: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:18:42.131276: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:18:42.131738: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:18:42.133810: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:18:42.134823: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:18:42.138445: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:18:42.150376: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:18:42.150412: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:18:43.956251: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:18:43.956292: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:18:43.956303: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:18:43.956310: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:18:43.956316: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:18:43.956323: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:18:43.964345: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:18:43.966609: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:18:43.968846: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:18:43.971003: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 256 global | |
256 per device | |
Num batches: 100 | |
Num epochs: 0.02 | |
Devices: ['/gpu:0'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:18:43.977334 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:18:43.977944 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:18:43.986305 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:18:43.991040 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:18:44.007900 139667033356096 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:18:44.368425 139667033356096 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:18:47.003370 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:18:47.006294 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:18:47.017845 139667033356096 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:18:48.346340 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:18:48.432383 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:18:48.594961 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:18:48.599460 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:18:48.599704 139667033356096 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:18:48.946296 139667033356096 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:18:49.432361: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:18:49.433871: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:18:49.435362: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:18:49.436882: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:18:49.436911: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:18:49.436945: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:18:49.436963: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:18:49.436980: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:18:49.436997: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:18:49.437015: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:18:49.437032: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:18:49.448728: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:18:49.448786: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:18:49.448796: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:18:49.448804: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:18:49.448810: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:18:49.448817: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:18:49.448823: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:18:49.456572: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:18:49.458238: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:18:49.459823: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:18:49.461400: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:18:50.017381: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:18:50.524705 139667033356096 session_manager.py:500] Running local_init_op. | |
I0910 14:18:50.586862 139667033356096 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:18:52.465906: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:18:52.800992: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 838.5 +/- 0.0 (jitter = 0.0) 7.928 | |
10 images/sec: 838.3 +/- 0.4 (jitter = 1.0) 7.880 | |
20 images/sec: 838.0 +/- 0.3 (jitter = 1.5) 7.827 | |
30 images/sec: 838.0 +/- 0.2 (jitter = 1.2) 7.852 | |
40 images/sec: 837.9 +/- 0.2 (jitter = 1.0) 7.931 | |
50 images/sec: 838.0 +/- 0.2 (jitter = 0.9) 7.947 | |
60 images/sec: 838.0 +/- 0.2 (jitter = 0.9) 7.867 | |
70 images/sec: 837.9 +/- 0.2 (jitter = 1.0) 7.820 | |
80 images/sec: 837.9 +/- 0.1 (jitter = 1.0) 7.837 | |
90 images/sec: 837.8 +/- 0.1 (jitter = 1.1) 7.868 | |
100 images/sec: 837.8 +/- 0.1 (jitter = 1.0) 7.831 | |
---------------------------------------------------------------- | |
total images/sec: 837.62 | |
---------------------------------------------------------------- | |
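The per-step lines and the final summary above follow a fixed layout, so the throughput figures can be pulled out of a saved log mechanically. What follows is a minimal sketch, assuming the log text is available as a string. The helper name `parse_throughput`, the dictionary keys, and the regular expressions are illustrative and are not part of tf_cnn_benchmarks.

```python
import re

# Hypothetical helper, not part of tf_cnn_benchmarks: extracts throughput
# figures from log text shaped like the output above.
STEP_RE = re.compile(
    r"^\s*(\d+)\s+images/sec:\s+([\d.]+)\s+\+/-\s+([\d.]+)\s+"
    r"\(jitter = ([\d.]+)\)\s+([\d.]+)",
    re.MULTILINE,
)
TOTAL_RE = re.compile(r"total images/sec:\s+([\d.]+)")


def parse_throughput(log_text):
    """Return (steps, totals) found in log_text."""
    steps = [
        {
            "step": int(m.group(1)),
            "images_per_sec": float(m.group(2)),
            "plus_minus": float(m.group(3)),
            "jitter": float(m.group(4)),
            "total_loss": float(m.group(5)),
        }
        for m in STEP_RE.finditer(log_text)
    ]
    totals = [float(m.group(1)) for m in TOTAL_RE.finditer(log_text)]
    return steps, totals


# Two step lines and the summary line copied from the run above.
sample = """\
Step Img/sec total_loss
1 images/sec: 838.5 +/- 0.0 (jitter = 0.0) 7.928
100 images/sec: 837.8 +/- 0.1 (jitter = 1.0) 7.831
total images/sec: 837.62
"""
steps, totals = parse_throughput(sample)
print(steps[-1]["images_per_sec"], totals)
```

The patterns are not anchored at the end of the line, so trailing characters after the loss value do not prevent a match. Timestamped TensorFlow lines are skipped because they do not start with a bare step number followed by `images/sec:`.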
2019-09-10 14:19:37.101038: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:19:39.116133 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:19:39.120661 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:19:39.144004 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:19:39.144218 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:19:39.168721: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:19:39.171222: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x30dbdf0 executing computations on platform Host. Devices: | |
2019-09-10 14:19:39.171254: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:19:39.174375: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:19:40.174912: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x30dc300 executing computations on platform CUDA. Devices: | |
2019-09-10 14:19:40.174959: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:19:40.174976: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:19:40.174990: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:19:40.175003: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:19:40.178405: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:19:40.181227: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:19:40.184028: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:19:40.186842: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:19:40.186887: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:19:40.189674: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:19:40.191415: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:19:40.191738: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:19:40.193734: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:19:40.194897: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:19:40.199100: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:19:40.212789: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:19:40.212826: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:19:42.021289: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:19:42.021330: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:19:42.021341: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:19:42.021348: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:19:42.021354: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:19:42.021361: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:19:42.029272: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:19:42.031518: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:19:42.033666: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:19:42.035762: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 512 global | |
512 per device | |
Num batches: 100 | |
Num epochs: 0.04 | |
Devices: ['/gpu:0'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:19:42.042193 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:19:42.042770 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:19:42.050614 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:19:42.055263 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:19:42.071847 140478665234240 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:19:42.436394 140478665234240 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:19:45.031168 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:19:45.034001 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:19:45.045301 140478665234240 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:19:46.350355 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:19:46.434978 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:19:46.594228 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:19:46.598637 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:19:46.598854 140478665234240 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:19:46.938732 140478665234240 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:19:47.418791: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:19:47.420354: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:19:47.421862: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:19:47.423338: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:19:47.423366: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:19:47.423401: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:19:47.423419: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:19:47.423436: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:19:47.423452: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:19:47.423470: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:19:47.423486: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:19:47.435119: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:19:47.435177: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:19:47.435188: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:19:47.435196: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:19:47.435203: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:19:47.435210: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:19:47.435217: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:19:47.443014: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:19:47.444598: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:19:47.446170: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:19:47.447750: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:19:47.989868: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:19:48.486477 140478665234240 session_manager.py:500] Running local_init_op. | |
I0910 14:19:48.541121 140478665234240 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:19:50.368899: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:19:50.702323: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 848.7 +/- 0.0 (jitter = 0.0) 7.887 | |
10 images/sec: 848.4 +/- 0.2 (jitter = 0.4) 7.908 | |
20 images/sec: 848.4 +/- 0.2 (jitter = 0.8) 7.870 | |
30 images/sec: 848.4 +/- 0.1 (jitter = 0.8) 7.917 | |
40 images/sec: 848.4 +/- 0.1 (jitter = 0.8) 7.860 | |
50 images/sec: 848.4 +/- 0.1 (jitter = 0.8) 7.763 | |
60 images/sec: 848.4 +/- 0.1 (jitter = 0.8) 7.756 | |
70 images/sec: 848.4 +/- 0.1 (jitter = 0.8) 7.749 | |
80 images/sec: 848.3 +/- 0.1 (jitter = 0.8) 7.755 | |
90 images/sec: 848.3 +/- 0.1 (jitter = 0.8) 7.722 | |
100 images/sec: 848.3 +/- 0.1 (jitter = 0.8) 7.713 | |
---------------------------------------------------------------- | |
total images/sec: 848.17 | |
---------------------------------------------------------------- | |
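The two single-GPU runs above differ only in batch size (256 vs. 512 per device), so their summary figures can be compared directly. The sketch below reproduces the reported `Num epochs` values and computes the relative throughput change. It assumes an epoch of 1,281,167 images (the standard ImageNet-1k training set size), which the log itself does not state. The variable names are illustrative.

```python
# Assumption: ImageNet-1k training set size; not printed anywhere in the log.
IMAGENET_TRAIN_IMAGES = 1_281_167

# Global batch size -> total images/sec, taken from the two run summaries above.
runs = {256: 837.62, 512: 848.17}
NUM_BATCHES = 100  # "Num batches" reported by both runs

for batch_size, images_per_sec in runs.items():
    # Fraction of one pass over the (assumed) dataset covered by the run.
    epochs = NUM_BATCHES * batch_size / IMAGENET_TRAIN_IMAGES
    print(f"batch {batch_size}: {epochs:.2f} epochs, {images_per_sec} images/sec")

# Relative throughput change when going from batch 256 to batch 512.
gain = runs[512] / runs[256] - 1.0
print(f"throughput change: {gain:+.2%}")
```

Under that assumption the computed epoch fractions round to the 0.02 and 0.04 shown in the run headers, and doubling the batch size changes single-GPU throughput by only about one percent for this synthetic-data ResNet-50 run.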
2019-09-10 14:21:13.007125: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:21:15.035794 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:21:15.040489 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:21:15.064074 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:21:15.064295 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:21:15.088711: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:21:15.091283: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2faa950 executing computations on platform Host. Devices: | |
2019-09-10 14:21:15.091317: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:21:15.094614: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:21:16.102762: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2faae60 executing computations on platform CUDA. Devices: | |
2019-09-10 14:21:16.102801: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:21:16.102816: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:21:16.102826: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:21:16.102836: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:21:16.105699: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:21:16.108100: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:21:16.110464: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:21:16.112826: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:21:16.112869: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:21:16.115577: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:21:16.117557: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:21:16.117852: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:21:16.119583: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:21:16.120583: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:21:16.124216: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:21:16.136173: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:21:16.136209: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:21:17.951546: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:21:17.951583: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:21:17.951594: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:21:17.951600: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:21:17.951607: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:21:17.951613: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:21:17.959628: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:21:17.961828: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:21:17.964004: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:21:17.966159: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 128 global | |
64 per device | |
Num batches: 100 | |
Num epochs: 0.01 | |
Devices: ['/gpu:0', '/gpu:1'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:21:17.972665 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:21:17.973247 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:21:17.981930 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:21:17.982311 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:21:17.998647 140578430060352 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:21:18.360758 140578430060352 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:21:20.863417 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:21:20.865456 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:21:20.876849 140578430060352 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:21:24.969537 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:21:25.055859 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:21:25.462713 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:21:25.838522 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:21:25.838754 140578430060352 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:21:26.094898 140578430060352 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:21:26.974114: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:21:26.975641: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:21:26.977147: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:21:26.978629: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:21:26.978658: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:21:26.978692: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:21:26.978709: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:21:26.978726: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:21:26.978742: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:21:26.978759: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:21:26.978776: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:21:26.990482: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:21:26.990539: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:21:26.990549: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:21:26.990557: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:21:26.990563: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:21:26.990569: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:21:26.990576: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:21:26.998265: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:21:26.999836: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:21:27.001408: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:21:27.002992: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:21:28.072702: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:21:30.050031 140578430060352 session_manager.py:500] Running local_init_op. | |
I0910 14:21:30.640582 140578430060352 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:21:33.312138: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:21:33.985353: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 730.6 +/- 0.0 (jitter = 0.0) 7.687 | |
10 images/sec: 731.9 +/- 0.4 (jitter = 1.6) 8.002 | |
20 images/sec: 732.0 +/- 0.2 (jitter = 1.3) 7.834 | |
30 images/sec: 731.8 +/- 0.2 (jitter = 1.6) 7.958 | |
40 images/sec: 731.6 +/- 0.2 (jitter = 1.7) 7.925 | |
50 images/sec: 731.9 +/- 0.2 (jitter = 1.8) 7.717 | |
60 images/sec: 732.2 +/- 0.2 (jitter = 1.5) 7.856 | |
70 images/sec: 732.5 +/- 0.2 (jitter = 1.7) 8.079 | |
80 images/sec: 732.7 +/- 0.2 (jitter = 1.7) 7.821 | |
90 images/sec: 732.6 +/- 0.2 (jitter = 1.7) 7.871 | |
100 images/sec: 732.6 +/- 0.2 (jitter = 1.6) 7.881 | |
---------------------------------------------------------------- | |
total images/sec: 732.26 | |
---------------------------------------------------------------- | |
2019-09-10 14:21:59.850233: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:22:01.857193 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:22:01.861687 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:22:01.885066 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:22:01.885278 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:22:01.908719: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:22:01.911279: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2fe9c20 executing computations on platform Host. Devices: | |
2019-09-10 14:22:01.911312: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:22:01.914664: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:22:02.791802: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x4860330 executing computations on platform CUDA. Devices: | |
2019-09-10 14:22:02.791838: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:22:02.791849: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:22:02.791857: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:22:02.791866: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:22:02.794607: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:22:02.796978: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:22:02.799343: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:22:02.801707: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:22:02.801749: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:22:02.804423: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:22:02.805996: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:22:02.806274: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:22:02.807968: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:22:02.808961: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:22:02.812512: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:22:02.824369: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:22:02.824403: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:22:04.600829: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:22:04.600869: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:22:04.600880: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:22:04.600887: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:22:04.600893: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:22:04.600899: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:22:04.608958: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:22:04.611161: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:22:04.613269: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:22:04.615392: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 128 global | |
64 per device | |
Num batches: 100 | |
Num epochs: 0.01 | |
Devices: ['/gpu:0', '/gpu:1'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:22:04.621623 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:22:04.622202 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:22:04.630498 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:22:04.635244 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:22:04.652157 139913248937792 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:22:05.017952 139913248937792 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:22:07.683459 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:22:07.686371 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:22:07.698026 139913248937792 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:22:13.078341 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:22:13.165047 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:22:13.582752 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:22:13.967311 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:22:13.967549 139913248937792 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:22:14.230364 139913248937792 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:22:15.338605: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:22:15.340129: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:22:15.341660: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:22:15.343177: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:22:15.343207: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:22:15.343242: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:22:15.343260: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:22:15.343277: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:22:15.343293: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:22:15.343311: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:22:15.343330: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:22:15.355167: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:22:15.355228: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:22:15.355239: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:22:15.355248: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:22:15.355255: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:22:15.355263: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:22:15.355269: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:22:15.363077: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:22:15.364692: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:22:15.366287: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:22:15.367929: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:22:16.525356: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:22:18.515134 139913248937792 session_manager.py:500] Running local_init_op. | |
I0910 14:22:19.141375 139913248937792 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:22:22.505595: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:22:23.232971: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 1296.0 +/- 0.0 (jitter = 0.0) 8.091 | |
10 images/sec: 1303.0 +/- 2.5 (jitter = 10.1) 7.844 | |
20 images/sec: 1292.5 +/- 2.8 (jitter = 14.6) 7.962 | |
30 images/sec: 1290.0 +/- 2.1 (jitter = 11.9) 7.945 | |
40 images/sec: 1288.5 +/- 1.7 (jitter = 9.4) 7.991 | |
50 images/sec: 1287.1 +/- 1.5 (jitter = 9.0) 7.895 | |
60 images/sec: 1286.7 +/- 1.3 (jitter = 8.1) 7.971 | |
70 images/sec: 1286.3 +/- 1.1 (jitter = 7.1) 7.906 | |
80 images/sec: 1286.1 +/- 1.0 (jitter = 7.0) 7.802 | |
90 images/sec: 1286.3 +/- 0.9 (jitter = 6.8) 7.878 | |
100 images/sec: 1286.5 +/- 0.8 (jitter = 7.8) 7.815 | |
---------------------------------------------------------------- | |
total images/sec: 1285.55 | |
---------------------------------------------------------------- | |
2019-09-10 14:22:40.788280: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:22:42.793020 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:22:42.797582 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:22:42.821213 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:22:42.821424 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:22:42.844707: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:22:42.847382: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x409e030 executing computations on platform Host. Devices: | |
2019-09-10 14:22:42.847423: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:22:42.851444: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:22:43.789975: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x4097500 executing computations on platform CUDA. Devices: | |
2019-09-10 14:22:43.790014: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:22:43.790027: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:22:43.790038: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:22:43.790048: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:22:43.792929: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:22:43.795356: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:22:43.797544: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:22:43.799187: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:22:43.799217: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:22:43.801062: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:22:43.802738: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:22:43.803064: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:22:43.804972: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:22:43.806084: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:22:43.809961: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:22:43.821910: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:22:43.821942: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:22:45.589580: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:22:45.589622: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:22:45.589633: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:22:45.589640: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:22:45.589646: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:22:45.589652: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:22:45.597620: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:22:45.599830: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:22:45.601972: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:22:45.604103: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  256 global
             128 per device
Num batches: 100
Num epochs:  0.02
Devices:     ['/gpu:0', '/gpu:1']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:22:45.610832 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:22:45.611408 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:22:45.619366 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:22:45.619744 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:22:45.635990 140266530510656 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:22:45.999307 140266530510656 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:22:48.550598 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:22:48.552660 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:22:48.564251 140266530510656 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:22:52.737199 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:22:52.825487 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:22:53.239958 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:22:53.623955 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:22:53.624194 140266530510656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:22:53.885126 140266530510656 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:22:54.782751: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:22:54.784293: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:22:54.785785: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:22:54.787266: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:22:54.787296: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:22:54.787331: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:22:54.787348: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:22:54.787366: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:22:54.787382: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:22:54.787397: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:22:54.787415: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:22:54.799057: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:22:54.799117: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:22:54.799127: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:22:54.799135: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:22:54.799141: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:22:54.799148: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:22:54.799154: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:22:54.806917: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:22:54.808502: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:22:54.810070: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:22:54.811651: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:22:55.907285: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:22:57.894693 140266530510656 session_manager.py:500] Running local_init_op. | |
I0910 14:22:58.504334 140266530510656 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:23:01.230805: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:23:01.905064: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 784.8 +/- 0.0 (jitter = 0.0) 7.943 | |
10 images/sec: 783.9 +/- 0.4 (jitter = 1.0) 7.891 | |
20 images/sec: 783.4 +/- 0.3 (jitter = 1.5) 7.917 | |
30 images/sec: 783.2 +/- 0.2 (jitter = 1.4) 7.786 | |
40 images/sec: 782.9 +/- 0.2 (jitter = 1.4) 7.899 | |
50 images/sec: 782.9 +/- 0.2 (jitter = 1.7) 7.873 | |
60 images/sec: 782.7 +/- 0.2 (jitter = 1.5) 7.820 | |
70 images/sec: 782.6 +/- 0.2 (jitter = 1.5) 7.785 | |
80 images/sec: 782.5 +/- 0.2 (jitter = 1.5) 7.772 | |
90 images/sec: 782.5 +/- 0.1 (jitter = 1.6) 7.847 | |
100 images/sec: 782.4 +/- 0.1 (jitter = 1.6) 7.804 | |
---------------------------------------------------------------- | |
total images/sec: 782.23 | |
---------------------------------------------------------------- | |
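Note: the per-step progress lines above follow a regular layout (step, running images/sec, uncertainty, jitter, loss), so they can be pulled out of the log mechanically. The following is a minimal sketch, not part of the benchmark itself; the names `STEP_RE` and `parse_steps` are hypothetical, and the pattern is an assumption based only on the lines shown in this log.

```python
import re

# Matches progress lines such as:
#   "10 images/sec: 783.9 +/- 0.4 (jitter = 1.0) 7.891"
# Trailing text after the loss value (if any) is ignored.
STEP_RE = re.compile(
    r"^\s*(\d+)\s+images/sec:\s+([\d.]+)\s+\+/-\s+([\d.]+)\s+"
    r"\(jitter = ([\d.]+)\)\s+([\d.]+)"
)

def parse_steps(log_text):
    """Return one dict per progress line found in log_text."""
    rows = []
    for line in log_text.splitlines():
        m = STEP_RE.match(line)
        if m:
            rows.append({
                "step": int(m.group(1)),
                "images_per_sec": float(m.group(2)),
                "uncertainty": float(m.group(3)),
                "jitter": float(m.group(4)),
                "total_loss": float(m.group(5)),
            })
    return rows

# Three lines copied from the run above; the header line is skipped.
sample = """\
Step Img/sec total_loss
1 images/sec: 784.8 +/- 0.0 (jitter = 0.0) 7.943
10 images/sec: 783.9 +/- 0.4 (jitter = 1.0) 7.891
100 images/sec: 782.4 +/- 0.1 (jitter = 1.6) 7.804
"""
rows = parse_steps(sample)
for row in rows:
    print(row["step"], row["images_per_sec"], row["total_loss"])
```

Deprecation warnings and device-discovery lines do not match the pattern, so the full log can be passed in unfiltered.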
2019-09-10 14:23:45.600726: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:23:47.584838 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:23:47.589249 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:23:47.612278 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:23:47.612483 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:23:47.636707: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:23:47.639307: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3f0b020 executing computations on platform Host. Devices: | |
2019-09-10 14:23:47.639348: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:23:47.643307: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:23:48.524085: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3f044f0 executing computations on platform CUDA. Devices: | |
2019-09-10 14:23:48.524123: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:23:48.524135: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:23:48.524145: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:23:48.524154: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:23:48.526746: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:23:48.528936: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:23:48.531461: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:23:48.533992: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:23:48.534034: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:23:48.536900: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:23:48.539422: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:23:48.539907: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:23:48.542071: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:23:48.543073: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:23:48.546688: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:23:48.558710: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:23:48.558744: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:23:50.362620: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:23:50.362663: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:23:50.362673: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:23:50.362680: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:23:50.362686: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:23:50.362692: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:23:50.370660: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:23:50.372905: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:23:50.375087: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:23:50.377276: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  256 global
             128 per device
Num batches: 100
Num epochs:  0.02
Devices:     ['/gpu:0', '/gpu:1']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:23:50.383536 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:23:50.384116 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:23:50.393350 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:23:50.398173 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:23:50.415276 140504485586752 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:23:50.782157 140504485586752 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:23:53.464663 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:23:53.467579 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:23:53.479324 140504485586752 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:23:58.887962 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:23:58.975307 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:23:59.395919 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:23:59.782965 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:23:59.783206 140504485586752 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:24:00.047021 140504485586752 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:24:01.156133: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:24:01.157660: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:24:01.159161: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:24:01.160681: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:24:01.160712: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:24:01.160746: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:24:01.160765: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:24:01.160783: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:24:01.160802: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:24:01.160821: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:24:01.160840: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:24:01.172536: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:24:01.172601: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:24:01.172613: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:24:01.172622: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:24:01.172629: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:24:01.172636: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:24:01.172644: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:24:01.180462: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:24:01.182089: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:24:01.183680: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:24:01.185277: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:24:02.361322: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:24:04.354744 140504485586752 session_manager.py:500] Running local_init_op. | |
I0910 14:24:05.001707 140504485586752 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:24:08.421011: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:24:09.150870: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 1509.3 +/- 0.0 (jitter = 0.0) 7.795 | |
10 images/sec: 1503.9 +/- 1.4 (jitter = 5.6) 7.907 | |
20 images/sec: 1500.6 +/- 1.1 (jitter = 4.7) 7.837 | |
30 images/sec: 1499.8 +/- 0.9 (jitter = 4.9) 7.865 | |
40 images/sec: 1499.1 +/- 0.8 (jitter = 4.7) 7.863 | |
50 images/sec: 1499.1 +/- 0.7 (jitter = 4.1) 7.809 | |
60 images/sec: 1499.1 +/- 0.6 (jitter = 4.1) 7.763 | |
70 images/sec: 1499.5 +/- 0.6 (jitter = 4.6) 7.798 | |
80 images/sec: 1499.2 +/- 0.5 (jitter = 4.6) 7.779 | |
90 images/sec: 1499.0 +/- 0.5 (jitter = 4.2) 7.834 | |
100 images/sec: 1498.7 +/- 0.5 (jitter = 4.5) 7.834 | |
---------------------------------------------------------------- | |
total images/sec: 1497.98 | |
---------------------------------------------------------------- | |
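Note: the two resnet50 runs on ['/gpu:0', '/gpu:1'] print the same configuration block but report different totals (782.23 and 1497.98 images/sec). The log does not show which command-line option differs between them. The sketch below only does the arithmetic on the reported totals; `TOTAL_RE`, `total_images_per_sec` and `per_device` are hypothetical helper names, and the device count of 2 is taken from the "Devices:" line of each run.

```python
import re

# Matches summary lines such as "total images/sec: 782.23".
TOTAL_RE = re.compile(r"total images/sec:\s*([\d.]+)")

def total_images_per_sec(log_text):
    """Return every reported total, in the order it appears in the log."""
    return [float(value) for value in TOTAL_RE.findall(log_text)]

def per_device(total, num_devices):
    """Average throughput per GPU for a data-parallel run."""
    return total / num_devices

# Summary lines copied from the two 2-GPU resnet50 runs above.
sample = """\
total images/sec: 782.23
total images/sec: 1497.98
"""
totals = total_images_per_sec(sample)
first, second = totals
ratio = second / first

print("per-GPU, first run: ", per_device(first, 2))
print("per-GPU, second run:", per_device(second, 2))
print("second / first:     ", ratio)
```

The ratio describes only these two measurements with synthetic data and 100 batches; it says nothing about the cause of the difference.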
2019-09-10 14:24:35.550971: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:24:37.575564 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:24:37.580106 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:24:37.603518 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:24:37.603733 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:24:37.628730: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:24:37.631293: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2ccbe40 executing computations on platform Host. Devices: | |
2019-09-10 14:24:37.631330: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:24:37.634484: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:24:38.429684: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2cc54c0 executing computations on platform CUDA. Devices: | |
2019-09-10 14:24:38.429719: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:24:38.429731: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:24:38.429740: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:24:38.429748: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:24:38.432086: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:24:38.434270: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:24:38.436478: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:24:38.438702: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:24:38.438739: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:24:38.441257: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:24:38.443465: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:24:38.443889: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:24:38.445934: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:24:38.446905: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:24:38.450450: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:24:38.462229: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:24:38.462262: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:24:40.243775: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:24:40.243817: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:24:40.243830: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:24:40.243837: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:24:40.243842: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:24:40.243848: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:24:40.251920: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:24:40.254154: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:24:40.256281: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:24:40.258410: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 512 global | |
256 per device | |
Num batches: 100 | |
Num epochs: 0.04 | |
Devices: ['/gpu:0', '/gpu:1'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
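Note on the configuration block above: the per-device batch size and the "Num epochs: 0.04" figure can be reproduced with simple arithmetic. The sketch below is not taken from the benchmark script. It assumes the synthetic dataset uses the size of the ImageNet-1k (ILSVRC-2012) training split, 1,281,167 images, as its epoch length, and the variable names are illustrative.

```python
# Values printed in the configuration block of the log.
global_batch_size = 512
num_devices = 2          # Devices: ['/gpu:0', '/gpu:1']
num_batches = 100

# Assumption: epoch length equals the ImageNet-1k (ILSVRC-2012) training split.
IMAGENET_TRAIN_IMAGES = 1_281_167

per_device_batch = global_batch_size // num_devices
images_processed = num_batches * global_batch_size
epochs = images_processed / IMAGENET_TRAIN_IMAGES

print(per_device_batch)   # 256
print(images_processed)   # 51200
print(round(epochs, 2))   # 0.04
```

Under that assumption, 100 batches of 512 images cover about 4% of one epoch, which matches the value the log reports.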
W0910 14:24:40.264776 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:24:40.265367 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:24:40.274071 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:24:40.274465 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:24:40.291061 140465520486208 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:24:40.660473 140465520486208 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:24:43.193747 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:24:43.195797 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:24:43.207361 140465520486208 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:24:47.347999 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:24:47.435391 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:24:47.846177 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:24:48.224692 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:24:48.224929 140465520486208 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:24:48.483238 140465520486208 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:24:49.371149: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:24:49.372665: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:24:49.374149: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:24:49.375634: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:24:49.375666: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:24:49.375707: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:24:49.375728: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:24:49.375748: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:24:49.375768: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:24:49.375787: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:24:49.375807: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:24:49.387674: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:24:49.387733: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:24:49.387744: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:24:49.387753: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:24:49.387760: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:24:49.387767: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:24:49.387774: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:24:49.395656: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:24:49.397259: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:24:49.398853: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:24:49.400477: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:24:50.491537: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:24:52.478892 140465520486208 session_manager.py:500] Running local_init_op. | |
I0910 14:24:53.103343 140465520486208 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:24:55.816936: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:24:56.494341: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 805.3 +/- 0.0 (jitter = 0.0) 7.936 | |
10 images/sec: 805.0 +/- 0.5 (jitter = 0.6) 7.861 | |
20 images/sec: 804.3 +/- 0.4 (jitter = 1.2) 7.871 | |
30 images/sec: 803.9 +/- 0.3 (jitter = 1.5) 7.901 | |
40 images/sec: 803.9 +/- 0.2 (jitter = 1.4) 7.808 | |
50 images/sec: 803.9 +/- 0.2 (jitter = 1.4) 7.802 | |
60 images/sec: 803.6 +/- 0.2 (jitter = 1.6) 7.736 | |
70 images/sec: 803.5 +/- 0.2 (jitter = 1.6) 7.754 | |
80 images/sec: 803.4 +/- 0.2 (jitter = 1.5) 7.776 | |
90 images/sec: 803.4 +/- 0.2 (jitter = 1.3) 7.701 | |
100 images/sec: 803.3 +/- 0.1 (jitter = 1.5) 7.707 | |
---------------------------------------------------------------- | |
total images/sec: 803.21 | |
---------------------------------------------------------------- | |
2019-09-10 14:26:17.101463: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:26:19.133256 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:26:19.137797 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:26:19.161279 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:26:19.161490 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:26:19.184711: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:26:19.187359: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x30bfa80 executing computations on platform Host. Devices: | |
2019-09-10 14:26:19.187390: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:26:19.190534: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:26:20.017661: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x30b9190 executing computations on platform CUDA. Devices: | |
2019-09-10 14:26:20.017707: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:26:20.017723: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:26:20.017736: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:26:20.017748: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:26:20.021181: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:26:20.024024: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:26:20.026878: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:26:20.029698: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:26:20.029745: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:26:20.032875: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:26:20.034383: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:26:20.034665: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:26:20.036394: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:26:20.037393: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:26:20.041017: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:26:20.052905: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:26:20.052940: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:26:21.860352: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:26:21.860392: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:26:21.860403: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:26:21.860410: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:26:21.860416: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:26:21.860422: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:26:21.868529: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:26:21.870714: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:26:21.872869: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:26:21.875005: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 512 global | |
256 per device | |
Num batches: 100 | |
Num epochs: 0.04 | |
Devices: ['/gpu:0', '/gpu:1'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:26:21.881631 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:26:21.882249 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:26:21.889526 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:26:21.895006 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:26:21.912440 140219604993856 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:26:22.286693 140219604993856 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:26:24.972420 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:26:24.975386 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:26:24.987217 140219604993856 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:26:30.412933 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:26:30.501047 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:26:30.924280 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:26:31.314418 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:26:31.314657 140219604993856 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:26:31.579755 140219604993856 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:26:32.687091: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:26:32.688625: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:26:32.690125: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:26:32.691622: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:26:32.691653: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:26:32.691687: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:26:32.691705: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:26:32.691723: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:26:32.691740: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:26:32.691755: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:26:32.691773: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:26:32.703621: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:26:32.703680: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:26:32.703692: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:26:32.703700: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:26:32.703706: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:26:32.703713: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:26:32.703720: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:26:32.711478: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:26:32.713088: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:26:32.714673: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:26:32.716263: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:26:33.863633: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:26:35.855477 140219604993856 session_manager.py:500] Running local_init_op. | |
I0910 14:26:36.503135 140219604993856 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:26:39.864129: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:26:40.591895: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up
Step  Img/sec                                    total_loss
1     images/sec: 1637.3 +/- 0.0 (jitter = 0.0)  7.941
10    images/sec: 1640.3 +/- 0.7 (jitter = 1.8)  7.941
20    images/sec: 1639.6 +/- 0.6 (jitter = 2.7)  7.881
30    images/sec: 1638.7 +/- 0.5 (jitter = 3.7)  7.858
40    images/sec: 1638.1 +/- 0.5 (jitter = 3.2)  7.845
50    images/sec: 1638.1 +/- 0.4 (jitter = 3.2)  7.878
60    images/sec: 1638.3 +/- 0.4 (jitter = 2.9)  7.795
70    images/sec: 1638.2 +/- 0.3 (jitter = 2.9)  7.733
80    images/sec: 1638.4 +/- 0.3 (jitter = 2.7)  7.745
90    images/sec: 1638.4 +/- 0.3 (jitter = 2.7)  7.707
100   images/sec: 1638.5 +/- 0.3 (jitter = 2.6)  7.678
----------------------------------------------------------------
total images/sec: 1638.03
----------------------------------------------------------------
2019-09-10 14:27:24.666012: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:27:26.685798 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:27:26.690323 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:27:26.713760 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:27:26.713972 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:27:26.736723: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:27:26.739181: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3faec20 executing computations on platform Host. Devices: | |
2019-09-10 14:27:26.739216: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:27:26.742711: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:27:27.539075: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3faef80 executing computations on platform CUDA. Devices: | |
2019-09-10 14:27:27.539109: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:27:27.539120: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:27:27.539130: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:27:27.539138: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:27:27.541656: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:27:27.543800: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:27:27.545926: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:27:27.548027: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:27:27.548062: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:27:27.550465: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:27:27.552595: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:27:27.552982: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:27:27.554680: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:27:27.555666: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:27:27.559227: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:27:27.571105: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:27:27.571138: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:27:29.338866: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:27:29.338908: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:27:29.338919: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:27:29.338925: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:27:29.338931: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:27:29.338937: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:27:29.347004: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:27:29.349219: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:27:29.351382: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:27:29.353560: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  1024 global
             512 per device
Num batches: 100
Num epochs:  0.08
Devices:     ['/gpu:0', '/gpu:1']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:27:29.359734 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:27:29.360307 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:27:29.369190 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:27:29.373850 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:27:29.390364 140315749975872 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:27:29.751940 140315749975872 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:27:32.344232 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:27:32.347057 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:27:32.358335 140315749975872 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:27:37.570375 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:27:37.654525 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:27:38.058170 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:27:38.429493 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:27:38.429723 140315749975872 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:27:38.683707 140315749975872 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:27:39.764123: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:27:39.765637: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:27:39.767119: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:27:39.768604: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:27:39.768633: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:27:39.768667: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:27:39.768685: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:27:39.768701: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:27:39.768717: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:27:39.768733: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:27:39.768749: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:27:39.780334: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:27:39.780392: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:27:39.780402: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:27:39.780410: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:27:39.780417: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:27:39.780423: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:27:39.780431: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:27:39.788214: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:27:39.789794: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:27:39.791359: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:27:39.792926: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:27:40.918063: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:27:42.901646 140315749975872 session_manager.py:500] Running local_init_op. | |
I0910 14:27:43.553680 140315749975872 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:27:46.930607: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:27:47.657608: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up
Step  Img/sec                                    total_loss
1     images/sec: 1677.6 +/- 0.0 (jitter = 0.0)  7.881
10    images/sec: 1674.6 +/- 1.0 (jitter = 2.7)  7.880
20    images/sec: 1674.5 +/- 0.6 (jitter = 2.9)  7.836
30    images/sec: 1674.7 +/- 0.5 (jitter = 2.7)  7.768
40    images/sec: 1674.3 +/- 0.5 (jitter = 2.7)  7.737
50    images/sec: 1674.4 +/- 0.4 (jitter = 2.5)  7.663
60    images/sec: 1674.4 +/- 0.4 (jitter = 2.9)  7.622
70    images/sec: 1674.2 +/- 0.3 (jitter = 2.8)  7.608
80    images/sec: 1674.5 +/- 0.3 (jitter = 3.2)  7.590
90    images/sec: 1674.6 +/- 0.3 (jitter = 3.3)  7.569
100   images/sec: 1674.5 +/- 0.3 (jitter = 3.1)  7.527
----------------------------------------------------------------
total images/sec: 1674.28
----------------------------------------------------------------
2019-09-10 14:29:08.669428: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:29:10.656620 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:29:10.661067 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:29:10.684174 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:29:10.684391 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:29:10.708721: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:29:10.711331: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x34e5040 executing computations on platform Host. Devices: | |
2019-09-10 14:29:10.711372: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:29:10.714902: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:29:11.547614: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x34de510 executing computations on platform CUDA. Devices: | |
2019-09-10 14:29:11.547655: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:29:11.547669: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:29:11.547679: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:29:11.547689: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:29:11.550588: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:29:11.553011: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:29:11.555391: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:29:11.557807: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:29:11.557848: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:29:11.560576: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:29:11.562285: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:29:11.562579: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:29:11.564325: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:29:11.565341: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:29:11.568983: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:29:11.580898: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:29:11.580931: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:29:13.377180: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:29:13.377221: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:29:13.377232: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:29:13.377239: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:29:13.377245: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:29:13.377251: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:29:13.385337: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:29:13.387552: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:29:13.389660: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:29:13.391819: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  192 global
             64 per device
Num batches: 100
Num epochs:  0.01
Devices:     ['/gpu:0', '/gpu:1', '/gpu:2']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:29:13.398070 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:29:13.398653 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:29:13.407375 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:29:13.407935 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:29:13.426264 140108273727296 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:29:13.800179 140108273727296 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:29:16.361115 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:29:16.363334 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:29:16.375196 140108273727296 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:29:23.781066 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:29:23.869140 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:29:24.535114 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:29:25.302121 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:29:25.302373 140108273727296 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:29:25.565145 140108273727296 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:29:27.057529: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:29:27.059127: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:29:27.060884: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:29:27.062640: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:29:27.062672: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:29:27.062708: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:29:27.062727: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:29:27.062744: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:29:27.062762: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:29:27.062781: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:29:27.062799: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:29:27.074690: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:29:27.074750: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:29:27.074760: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:29:27.074768: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:29:27.074775: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:29:27.074781: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:29:27.074789: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:29:27.082613: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:29:27.084232: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:29:27.085840: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:29:27.087426: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:29:28.765411: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:29:31.292917 140108273727296 session_manager.py:500] Running local_init_op. | |
I0910 14:29:32.368149 140108273727296 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:29:36.315383: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:29:37.345462: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 1059.7 +/- 0.0 (jitter = 0.0) 7.680 | |
10 images/sec: 1059.5 +/- 0.4 (jitter = 0.8) 7.907 | |
20 images/sec: 1059.6 +/- 0.3 (jitter = 1.5) 7.866 | |
30 images/sec: 1059.0 +/- 0.3 (jitter = 2.0) 7.926 | |
40 images/sec: 1058.2 +/- 0.3 (jitter = 2.2) 7.935 | |
50 images/sec: 1057.8 +/- 0.4 (jitter = 2.3) 7.797 | |
60 images/sec: 1057.8 +/- 0.4 (jitter = 2.2) 7.832 | |
70 images/sec: 1057.8 +/- 0.3 (jitter = 2.2) 7.922 | |
80 images/sec: 1057.7 +/- 0.3 (jitter = 1.9) 7.809 | |
90 images/sec: 1057.2 +/- 0.3 (jitter = 2.3) 7.848 | |
100 images/sec: 1056.8 +/- 0.3 (jitter = 2.9) 7.803 | |
---------------------------------------------------------------- | |
total images/sec: 1056.40 | |
---------------------------------------------------------------- | |
2019-09-10 14:30:06.546204: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:30:08.574933 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:30:08.579488 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:30:08.603000 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:30:08.603210 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:30:08.628722: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:30:08.631197: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x39b6c70 executing computations on platform Host. Devices: | |
2019-09-10 14:30:08.631223: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:30:08.633866: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:30:09.536844: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x39b0380 executing computations on platform CUDA. Devices: | |
2019-09-10 14:30:09.536878: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:30:09.536889: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:30:09.536897: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:30:09.536906: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:30:09.539260: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:30:09.541237: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:30:09.546732: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:30:09.548709: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:30:09.548744: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:30:09.550947: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:30:09.552897: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:30:09.553179: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:30:09.554908: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:30:09.555908: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:30:09.559524: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:30:09.571359: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:30:09.571392: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:30:11.359701: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:30:11.359738: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:30:11.359748: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:30:11.359754: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:30:11.359761: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:30:11.359767: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:30:11.367853: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:30:11.370095: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:30:11.372362: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:30:11.374537: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 192 global | |
64 per device | |
Num batches: 100 | |
Num epochs: 0.01 | |
Devices: ['/gpu:0', '/gpu:1', '/gpu:2'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:30:11.380581 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:30:11.381137 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:30:11.389535 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:30:11.394240 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:30:11.410990 139862905734976 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:30:11.774429 139862905734976 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:30:14.393703 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:30:14.396571 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:30:14.408024 139862905734976 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:30:23.494632 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:30:23.580518 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:30:24.393837 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:30:25.147120 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:30:25.147358 139862905734976 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:30:25.405670 139862905734976 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:30:26.863619: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:30:26.865215: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:30:26.866987: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:30:26.868881: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:30:26.868912: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:30:26.868947: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:30:26.868966: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:30:26.868982: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:30:26.868999: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:30:26.869016: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:30:26.869034: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:30:26.880888: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:30:26.880945: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:30:26.880955: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:30:26.880962: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:30:26.880969: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:30:26.880975: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:30:26.880982: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:30:26.888736: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:30:26.890344: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:30:26.891927: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:30:26.893526: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:30:28.642114: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:30:31.150029 139862905734976 session_manager.py:500] Running local_init_op. | |
I0910 14:30:32.218698 139862905734976 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:30:37.024126: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:30:38.113790: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 1860.4 +/- 0.0 (jitter = 0.0) 8.040 | |
10 images/sec: 1852.7 +/- 6.0 (jitter = 16.8) 7.810 | |
20 images/sec: 1848.8 +/- 3.8 (jitter = 16.5) 7.898 | |
30 images/sec: 1846.2 +/- 2.9 (jitter = 15.0) 7.924 | |
40 images/sec: 1845.3 +/- 2.4 (jitter = 14.1) 7.868 | |
50 images/sec: 1844.1 +/- 2.1 (jitter = 14.3) 7.889 | |
60 images/sec: 1843.9 +/- 1.9 (jitter = 16.0) 7.920 | |
70 images/sec: 1844.2 +/- 1.7 (jitter = 14.6) 7.861 | |
80 images/sec: 1844.2 +/- 1.6 (jitter = 14.6) 7.786 | |
90 images/sec: 1844.2 +/- 1.5 (jitter = 15.7) 7.890 | |
100 images/sec: 1844.1 +/- 1.4 (jitter = 14.5) 7.715 | |
---------------------------------------------------------------- | |
total images/sec: 1842.96 | |
---------------------------------------------------------------- | |
2019-09-10 14:30:58.862575: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:31:00.887480 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:31:00.892082 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:31:00.915890 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:31:00.916103 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:31:00.940724: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:31:00.943524: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2d80910 executing computations on platform Host. Devices: | |
2019-09-10 14:31:00.943560: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:31:00.947318: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:31:01.788284: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2d80e20 executing computations on platform CUDA. Devices: | |
2019-09-10 14:31:01.788333: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:31:01.788349: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:31:01.788363: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:31:01.788376: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:31:01.791845: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:31:01.794753: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:31:01.797617: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:31:01.800465: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:31:01.800526: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:31:01.803872: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:31:01.806009: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:31:01.806307: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:31:01.808057: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:31:01.809089: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:31:01.812747: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:31:01.824786: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:31:01.824822: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:31:03.616053: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:31:03.616097: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:31:03.616107: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:31:03.616114: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:31:03.616121: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:31:03.616127: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:31:03.624240: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:31:03.626439: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:31:03.628592: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:31:03.630723: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 384 global | |
128 per device | |
Num batches: 100 | |
Num epochs: 0.03 | |
Devices: ['/gpu:0', '/gpu:1', '/gpu:2'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:31:03.637035 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:31:03.637629 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:31:03.647360 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:31:03.648018 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:31:03.665177 139713216689984 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:31:04.041753 139713216689984 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:31:06.613382 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:31:06.615478 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:31:06.627191 139713216689984 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:31:14.082817 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:31:14.171029 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:31:14.845149 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:31:15.620382 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:31:15.620633 139713216689984 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:31:15.885851 139713216689984 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:31:17.393387: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:31:17.395242: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:31:17.396816: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:31:17.398514: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:31:17.398545: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:31:17.398582: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:31:17.398601: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:31:17.398619: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:31:17.398636: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:31:17.398653: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:31:17.398671: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:31:17.410674: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:31:17.410736: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:31:17.410747: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:31:17.410755: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:31:17.410762: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:31:17.410771: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:31:17.410778: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:31:17.418533: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:31:17.420180: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:31:17.421786: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:31:17.423369: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:31:19.098435: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:31:21.618732 139713216689984 session_manager.py:500] Running local_init_op. | |
I0910 14:31:22.686068 139713216689984 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:31:26.611475: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:31:27.638775: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss
1 images/sec: 1149.6 +/- 0.0 (jitter = 0.0) 7.894
10 images/sec: 1148.6 +/- 0.6 (jitter = 1.5) 7.914
20 images/sec: 1148.6 +/- 0.3 (jitter = 1.0) 7.929
30 images/sec: 1148.3 +/- 0.2 (jitter = 1.1) 7.821
40 images/sec: 1148.1 +/- 0.2 (jitter = 1.2) 7.887
50 images/sec: 1148.1 +/- 0.2 (jitter = 1.3) 7.840
60 images/sec: 1148.1 +/- 0.2 (jitter = 1.4) 7.807
70 images/sec: 1147.9 +/- 0.2 (jitter = 1.5) 7.780
80 images/sec: 1147.6 +/- 0.2 (jitter = 1.8) 7.796
90 images/sec: 1147.5 +/- 0.2 (jitter = 1.8) 7.732
100 images/sec: 1147.3 +/- 0.2 (jitter = 1.8) 7.750
----------------------------------------------------------------
total images/sec: 1147.10
----------------------------------------------------------------
2019-09-10 14:32:14.609965: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:32:16.587633 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:32:16.592045 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:32:16.615108 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:32:16.615323 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:32:16.640722: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:32:16.643587: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2bdce90 executing computations on platform Host. Devices: | |
2019-09-10 14:32:16.643626: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:32:16.647652: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:32:17.607048: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x2bdd1f0 executing computations on platform CUDA. Devices: | |
2019-09-10 14:32:17.607097: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:32:17.607114: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:32:17.607128: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:32:17.607140: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:32:17.610448: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:32:17.613143: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:32:17.615800: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:32:17.618468: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:32:17.618512: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:32:17.621288: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:32:17.623048: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:32:17.623389: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:32:17.625416: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:32:17.626590: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:32:17.630782: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:32:17.644455: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:32:17.644504: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:32:19.444678: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:32:19.444720: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:32:19.444731: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:32:19.444739: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:32:19.444745: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:32:19.444751: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:32:19.453029: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:32:19.455246: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:32:19.457397: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:32:19.459538: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14
Model: resnet50
Dataset: imagenet (synthetic)
Mode: training
SingleSess: False
Batch size: 384 global
            128 per device
Num batches: 100
Num epochs: 0.03
Devices: ['/gpu:0', '/gpu:1', '/gpu:2']
NUMA bind: False
Data format: NHWC
Optimizer: sgd
Variables: replicated
AllReduce: None
==========
W0910 14:32:19.465443 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:32:19.466017 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:32:19.475411 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:32:19.480674 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:32:19.497605 140460509329216 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:32:19.866362 140460509329216 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:32:22.526313 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:32:22.529229 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:32:22.540902 140460509329216 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:32:31.768654 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:32:31.855767 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:32:32.681496 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:32:33.447739 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:32:33.447987 140460509329216 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:32:33.711128 140460509329216 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:32:35.168059: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:32:35.169649: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:32:35.171173: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:32:35.172755: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:32:35.172786: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:32:35.172821: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:32:35.172840: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:32:35.172857: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:32:35.172874: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:32:35.172890: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:32:35.172910: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:32:35.185408: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:32:35.185469: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:32:35.185480: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:32:35.185488: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:32:35.185494: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:32:35.185501: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:32:35.185508: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:32:35.193297: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:32:35.194894: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:32:35.196481: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:32:35.198087: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:32:36.967002: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:32:39.493895 140460509329216 session_manager.py:500] Running local_init_op. | |
I0910 14:32:40.576540 140460509329216 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:32:45.436112: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:32:46.536391: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss
1 images/sec: 2175.0 +/- 0.0 (jitter = 0.0) 7.807
10 images/sec: 2175.4 +/- 2.6 (jitter = 9.7) 7.885
20 images/sec: 2178.1 +/- 1.6 (jitter = 7.2) 7.872
30 images/sec: 2178.3 +/- 1.3 (jitter = 6.4) 7.828
40 images/sec: 2178.7 +/- 1.1 (jitter = 6.3) 7.835
50 images/sec: 2177.7 +/- 1.1 (jitter = 6.8) 7.777
60 images/sec: 2177.2 +/- 1.0 (jitter = 7.8) 7.759
70 images/sec: 2177.1 +/- 1.0 (jitter = 7.7) 7.777
80 images/sec: 2177.1 +/- 0.9 (jitter = 7.9) 7.775
90 images/sec: 2176.4 +/- 0.8 (jitter = 8.4) 7.766
100 images/sec: 2176.3 +/- 0.8 (jitter = 7.9) 7.755
----------------------------------------------------------------
total images/sec: 2175.48
----------------------------------------------------------------
2019-09-10 14:33:15.866450: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:33:17.905010 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:33:17.909595 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:33:17.933259 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:33:17.933474 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:33:17.956725: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:33:17.959067: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3ab88a0 executing computations on platform Host. Devices: | |
2019-09-10 14:33:17.959095: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:33:17.961790: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:33:18.850970: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3ab8c00 executing computations on platform CUDA. Devices: | |
2019-09-10 14:33:18.851004: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:33:18.851017: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:33:18.851025: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:33:18.851034: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:33:18.853408: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:33:18.855362: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:33:18.857314: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:33:18.859290: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:33:18.859325: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:33:18.861556: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:33:18.863520: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:33:18.863891: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:33:18.865769: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:33:18.866755: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:33:18.870352: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:33:18.882180: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:33:18.882213: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:33:20.650176: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:33:20.650219: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:33:20.650230: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:33:20.650237: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:33:20.650243: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:33:20.650249: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:33:20.658206: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:33:20.660421: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:33:20.662605: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:33:20.664798: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14
Model: resnet50
Dataset: imagenet (synthetic)
Mode: training
SingleSess: False
Batch size: 768 global
            256 per device
Num batches: 100
Num epochs: 0.06
Devices: ['/gpu:0', '/gpu:1', '/gpu:2']
NUMA bind: False
Data format: NHWC
Optimizer: sgd
Variables: replicated
AllReduce: None
==========
W0910 14:33:20.669903 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:33:20.670258 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:33:20.675268 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:33:20.675594 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:33:20.689463 140656632866624 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:33:21.041664 140656632866624 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:33:23.559040 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:33:23.561100 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:33:23.572565 140656632866624 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:33:30.864940 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:33:30.951225 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:33:31.605484 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:33:32.362595 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:33:32.362834 140656632866624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:33:32.621598 140656632866624 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:33:34.100666: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:33:34.102496: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:33:34.104035: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:33:34.105685: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:33:34.105717: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:33:34.105751: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:33:34.105769: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:33:34.105786: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:33:34.105802: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:33:34.105818: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:33:34.105833: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:33:34.117745: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:33:34.117806: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:33:34.117816: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:33:34.117824: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:33:34.117831: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:33:34.117838: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:33:34.117845: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:33:34.125567: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:33:34.127183: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:33:34.128771: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:33:34.130360: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:33:35.773982: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:33:38.281344 140656632866624 session_manager.py:500] Running local_init_op. | |
I0910 14:33:39.331176 140656632866624 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:33:43.195456: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:33:44.219971: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step  Img/sec  total_loss
1     images/sec: 1192.5 +/- 0.0 (jitter = 0.0)  7.916
10    images/sec: 1191.6 +/- 0.5 (jitter = 1.6)  7.829
20    images/sec: 1192.3 +/- 0.4 (jitter = 1.7)  7.843
30    images/sec: 1192.0 +/- 0.3 (jitter = 1.8)  7.825
40    images/sec: 1191.8 +/- 0.3 (jitter = 1.9)  7.767
50    images/sec: 1191.7 +/- 0.2 (jitter = 1.8)  7.742
60    images/sec: 1191.4 +/- 0.2 (jitter = 2.1)  7.701
70    images/sec: 1191.2 +/- 0.2 (jitter = 2.1)  7.699
80    images/sec: 1191.0 +/- 0.2 (jitter = 2.2)  7.680
90    images/sec: 1191.0 +/- 0.2 (jitter = 2.1)  7.621
100   images/sec: 1190.9 +/- 0.2 (jitter = 2.1)  7.635
----------------------------------------------------------------
total images/sec: 1190.78
----------------------------------------------------------------
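The per-step lines above follow a regular pattern, so they can be turned into numbers for comparison across runs. The following is a minimal sketch, assuming the log format is exactly as shown here; `parse_steps`, `STEP_RE`, `TOTAL_RE` and `SAMPLE` are illustrative names and are not part of tf_cnn_benchmarks.

```python
import re

# Matches lines such as:
#   "10 images/sec: 1191.6 +/- 0.5 (jitter = 1.6) 7.829"
# There is no end-of-line anchor, so trailing residue after the loss is ignored.
STEP_RE = re.compile(
    r"^\s*(\d+)\s+images/sec:\s+([\d.]+)\s+\+/-\s+([\d.]+)\s+"
    r"\(jitter = ([\d.]+)\)\s+([\d.]+)"
)
# Matches the summary line "total images/sec: 1190.78".
TOTAL_RE = re.compile(r"total images/sec:\s+([\d.]+)")


def parse_steps(log_text):
    """Return (rows, total).

    rows is a list of (step, images_per_sec, uncertainty, jitter, loss);
    total is the summary throughput, or None if no summary line is present.
    """
    rows, total = [], None
    for line in log_text.splitlines():
        m = STEP_RE.match(line)
        if m:
            step, rate, unc, jitter, loss = m.groups()
            rows.append((int(step), float(rate), float(unc),
                         float(jitter), float(loss)))
            continue
        m = TOTAL_RE.search(line)
        if m:
            total = float(m.group(1))
    return rows, total


# A few lines copied from the run above, used as sample input.
SAMPLE = """\
Step Img/sec total_loss
1 images/sec: 1192.5 +/- 0.0 (jitter = 0.0) 7.916
10 images/sec: 1191.6 +/- 0.5 (jitter = 1.6) 7.829
100 images/sec: 1190.9 +/- 0.2 (jitter = 2.1) 7.635
----------------------------------------------------------------
total images/sec: 1190.78
----------------------------------------------------------------
"""

rows, total = parse_steps(SAMPLE)
print(rows[-1], total)
```

Deprecation warnings and device-discovery lines do not match either pattern, so a full log can be passed in without filtering first.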
2019-09-10 14:35:07.601698: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:35:09.625108 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:35:09.629648 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:35:09.653266 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:35:09.653477 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:35:09.676691: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:35:09.679294: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x33f44d0 executing computations on platform Host. Devices: | |
2019-09-10 14:35:09.679327: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:35:09.682479: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:35:10.529886: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x33eda30 executing computations on platform CUDA. Devices: | |
2019-09-10 14:35:10.529926: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:35:10.529939: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:35:10.529950: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:35:10.529960: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:35:10.532725: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:35:10.534964: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:35:10.537237: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:35:10.539485: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:35:10.539528: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:35:10.542103: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:35:10.544351: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:35:10.544780: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:35:10.546496: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:35:10.547494: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:35:10.551107: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:35:10.563043: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:35:10.563082: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:35:12.345276: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:35:12.345312: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:35:12.345323: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:35:12.345330: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:35:12.345337: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:35:12.345344: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:35:12.353353: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:35:12.355553: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:35:12.357723: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:35:12.359861: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  768 global
             256 per device
Num batches: 100
Num epochs:  0.06
Devices:     ['/gpu:0', '/gpu:1', '/gpu:2']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:35:12.366173 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:35:12.366757 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:35:12.374750 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:35:12.379552 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:35:12.396618 140291211429696 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:35:12.762197 140291211429696 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:35:15.441936 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:35:15.444860 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:35:15.456628 140291211429696 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:35:24.748151 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:35:24.835855 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:35:25.665865 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:35:26.437985 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:35:26.438231 140291211429696 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:35:26.702493 140291211429696 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:35:28.176218: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:35:28.177812: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:35:28.179423: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:35:28.181154: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:35:28.181188: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:35:28.181234: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:35:28.181257: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:35:28.181277: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:35:28.181297: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:35:28.181317: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:35:28.181337: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:35:28.193578: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:35:28.193642: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:35:28.193653: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:35:28.193662: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:35:28.193669: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:35:28.193676: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:35:28.193682: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:35:28.201526: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:35:28.203125: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:35:28.204720: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:35:28.206300: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:35:30.003689: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:35:32.526953 140291211429696 session_manager.py:500] Running local_init_op. | |
I0910 14:35:33.634510 140291211429696 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:35:38.550253: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:35:39.657877: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step  Img/sec  total_loss
1     images/sec: 2401.6 +/- 0.0 (jitter = 0.0)  7.908
10    images/sec: 2412.1 +/- 2.3 (jitter = 8.9)  7.929
20    images/sec: 2412.3 +/- 1.7 (jitter = 9.4)  7.830
30    images/sec: 2409.2 +/- 1.6 (jitter = 8.4)  7.826
40    images/sec: 2408.9 +/- 1.2 (jitter = 7.5)  7.757
50    images/sec: 2409.0 +/- 1.1 (jitter = 8.4)  7.784
60    images/sec: 2408.6 +/- 1.0 (jitter = 7.5)  7.744
70    images/sec: 2408.0 +/- 0.9 (jitter = 6.8)  7.679
80    images/sec: 2407.6 +/- 0.8 (jitter = 6.8)  7.649
90    images/sec: 2407.7 +/- 0.8 (jitter = 6.6)  7.618
100   images/sec: 2407.4 +/- 0.7 (jitter = 6.6)  7.575
----------------------------------------------------------------
total images/sec: 2406.89
----------------------------------------------------------------
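The summary for this run (768 global batch, 3 devices, 100 batches, 2406.89 total images/sec) can be reduced to per-GPU throughput and time per step with simple arithmetic. The sketch below is only a worked calculation on the numbers printed above. The ImageNet training-set size is an assumption brought in from outside the log, used only to cross-check the reported "Num epochs: 0.06".

```python
# Values copied from the run summary above (resnet50, synthetic data, 3 GPUs).
total_images_per_sec = 2406.89
global_batch = 768
num_devices = 3
num_batches = 100

# Assumption: ImageNet-1k training set size; not printed anywhere in the log.
IMAGENET_TRAIN_IMAGES = 1_281_167

# Throughput divided evenly across the GPUs that took part in the run.
per_device_rate = total_images_per_sec / num_devices

# One step processes one global batch, so step time is batch / throughput.
step_time_s = global_batch / total_images_per_sec

# Fraction of one pass over the dataset covered by the benchmark.
epochs = num_batches * global_batch / IMAGENET_TRAIN_IMAGES

print(f"per-device images/sec: {per_device_rate:.1f}")
print(f"seconds per step:      {step_time_s:.3f}")
print(f"epochs covered:        {epochs:.2f}")
```

This gives roughly 802 images/sec per GPU and about 0.32 s per step, and the epoch estimate rounds to the 0.06 shown in the run header.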
2019-09-10 14:36:26.756683: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:36:28.777641 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:36:28.782037 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:36:28.805100 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:36:28.805310 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:36:28.828723: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:36:28.831522: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x462cb10 executing computations on platform Host. Devices: | |
2019-09-10 14:36:28.831563: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:36:28.835566: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:36:29.683237: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x5ea1e90 executing computations on platform CUDA. Devices: | |
2019-09-10 14:36:29.683275: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:36:29.683287: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:36:29.683297: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:36:29.683306: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:36:29.686131: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:36:29.688632: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:36:29.691109: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:36:29.693613: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:36:29.693654: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:36:29.696517: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:36:29.698348: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:36:29.698682: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:36:29.700715: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:36:29.701908: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:36:29.706148: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:36:29.719880: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:36:29.719917: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:36:31.536367: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:36:31.536408: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:36:31.536420: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:36:31.536429: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:36:31.536436: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:36:31.536442: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:36:31.544454: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:36:31.546716: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:36:31.548895: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:36:31.551042: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  1536 global
             512 per device
Num batches: 100
Num epochs:  0.12
Devices:     ['/gpu:0', '/gpu:1', '/gpu:2']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:36:31.557293 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:36:31.557902 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:36:31.566257 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:36:31.571086 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:36:31.588083 140153510606656 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:36:31.949458 140153510606656 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:36:34.601019 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:36:34.603948 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:36:34.615601 140153510606656 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:36:43.831400 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:36:43.918720 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:36:44.743673 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:36:45.510879 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:36:45.511124 140153510606656 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:36:45.774051 140153510606656 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:36:47.260350: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:36:47.262202: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:36:47.263754: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:36:47.265319: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:36:47.265352: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:36:47.265397: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:36:47.265419: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:36:47.265438: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:36:47.265458: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:36:47.265478: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:36:47.265498: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:36:47.277504: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:36:47.277563: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:36:47.277574: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:36:47.277582: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:36:47.277589: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:36:47.277596: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:36:47.277603: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:36:47.285324: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:36:47.286911: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:36:47.288468: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:36:47.290053: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:36:49.026768: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:36:51.533483 140153510606656 session_manager.py:500] Running local_init_op. | |
I0910 14:36:52.590555 140153510606656 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:36:57.377398: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:36:58.474053: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss
1 images/sec: 2485.6 +/- 0.0 (jitter = 0.0) 7.865
10 images/sec: 2488.5 +/- 1.4 (jitter = 3.8) 7.840
20 images/sec: 2488.0 +/- 0.9 (jitter = 4.1) 7.782
30 images/sec: 2487.2 +/- 0.8 (jitter = 3.8) 7.706
40 images/sec: 2485.5 +/- 0.9 (jitter = 5.0) 7.615
50 images/sec: 2485.5 +/- 0.8 (jitter = 5.3) 7.584
60 images/sec: 2485.3 +/- 0.7 (jitter = 5.4) 7.542
70 images/sec: 2485.3 +/- 0.7 (jitter = 5.4) 7.521
80 images/sec: 2485.6 +/- 0.6 (jitter = 5.4) 7.500
90 images/sec: 2484.9 +/- 0.6 (jitter = 5.7) 7.490
100 images/sec: 2484.9 +/- 0.5 (jitter = 5.8) 7.474
----------------------------------------------------------------
total images/sec: 2484.56
----------------------------------------------------------------
2019-09-10 14:38:21.365960: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:38:23.393384 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:38:23.397919 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:38:23.421399 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:38:23.421613 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:38:23.444707: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:38:23.447263: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3251db0 executing computations on platform Host. Devices: | |
2019-09-10 14:38:23.447298: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:38:23.450949: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:38:24.233403: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x4ac82a0 executing computations on platform CUDA. Devices: | |
2019-09-10 14:38:24.233436: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:38:24.233448: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:38:24.233456: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:38:24.233465: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:38:24.235791: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:38:24.237931: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:38:24.240262: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:38:24.242604: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:38:24.242644: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:38:24.245324: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:38:24.247671: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:38:24.248111: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:38:24.250044: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:38:24.251025: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:38:24.254593: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:38:24.266346: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:38:24.266379: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:38:26.032171: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:38:26.032212: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:38:26.032223: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:38:26.032230: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:38:26.032237: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:38:26.032243: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:38:26.040285: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:38:26.042474: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:38:26.044607: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:38:26.046722: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14
Model: resnet50
Dataset: imagenet (synthetic)
Mode: training
SingleSess: False
Batch size: 256 global
            64 per device
Num batches: 100
Num epochs: 0.02
Devices: ['/gpu:0', '/gpu:1', '/gpu:2', '/gpu:3']
NUMA bind: False
Data format: NHWC
Optimizer: sgd
Variables: replicated
AllReduce: None
==========
W0910 14:38:26.052796 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:38:26.053402 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:38:26.062278 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:38:26.062677 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:38:26.079476 140479090112320 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:38:26.445255 140479090112320 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:38:29.036207 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:38:29.038326 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:38:29.050102 140479090112320 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:38:39.846082 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:38:39.934732 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:38:40.865384 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:38:42.026576 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:38:42.026824 140479090112320 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:38:42.293349 140479090112320 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:38:44.319723: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:38:44.321581: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:38:44.323313: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:38:44.325092: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:38:44.325126: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:38:44.325164: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:38:44.325184: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:38:44.325203: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:38:44.325221: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:38:44.325239: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:38:44.325259: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:38:44.337884: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:38:44.337946: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:38:44.337957: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:38:44.337966: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:38:44.337972: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:38:44.337980: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:38:44.337987: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:38:44.345757: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:38:44.347368: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:38:44.348958: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:38:44.350572: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:38:46.679864: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:38:49.708881 140479090112320 session_manager.py:500] Running local_init_op. | |
I0910 14:38:51.242393 140479090112320 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:38:56.471976: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:38:57.857615: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss
1 images/sec: 1435.8 +/- 0.0 (jitter = 0.0) 7.719
10 images/sec: 1437.9 +/- 1.7 (jitter = 6.7) 7.944
20 images/sec: 1439.0 +/- 1.0 (jitter = 4.1) 7.885
30 images/sec: 1438.5 +/- 0.8 (jitter = 3.8) 7.907
40 images/sec: 1438.4 +/- 0.7 (jitter = 3.6) 7.931
50 images/sec: 1437.7 +/- 0.6 (jitter = 4.5) 7.796
60 images/sec: 1437.3 +/- 0.6 (jitter = 4.9) 7.828
70 images/sec: 1436.6 +/- 0.5 (jitter = 5.3) 7.883
80 images/sec: 1435.6 +/- 0.7 (jitter = 5.2) 7.760
90 images/sec: 1434.2 +/- 0.8 (jitter = 5.8) 7.825
100 images/sec: 1434.3 +/- 0.8 (jitter = 5.2) 7.815
----------------------------------------------------------------
total images/sec: 1433.72
----------------------------------------------------------------
2019-09-10 14:39:29.193982: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:39:31.194494 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:39:31.198968 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:39:31.222224 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:39:31.222431 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:39:31.248721: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:39:31.251280: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x31917d0 executing computations on platform Host. Devices: | |
2019-09-10 14:39:31.251314: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:39:31.254771: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:39:32.064302: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3191b30 executing computations on platform CUDA. Devices: | |
2019-09-10 14:39:32.064351: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:39:32.064367: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:39:32.064379: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:39:32.064391: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:39:32.067818: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:39:32.070653: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:39:32.073478: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:39:32.076270: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:39:32.076316: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:39:32.079543: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:39:32.081668: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:39:32.081945: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:39:32.083638: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:39:32.084632: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:39:32.088180: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:39:32.099957: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:39:32.099989: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:39:33.860195: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:39:33.860235: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:39:33.860248: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:39:33.860255: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:39:33.860262: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:39:33.860268: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:39:33.868318: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:39:33.870509: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:39:33.872661: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:39:33.874773: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14
Model: resnet50
Dataset: imagenet (synthetic)
Mode: training
SingleSess: False
Batch size: 256 global
            64 per device
Num batches: 100
Num epochs: 0.02
Devices: ['/gpu:0', '/gpu:1', '/gpu:2', '/gpu:3']
NUMA bind: False
Data format: NHWC
Optimizer: sgd
Variables: replicated
AllReduce: None
==========
W0910 14:39:33.880883 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:39:33.881470 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:39:33.890151 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:39:33.895046 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:39:33.912401 140373536360256 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:39:34.278283 140373536360256 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:39:36.983918 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:39:36.986926 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:39:36.998892 140373536360256 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:39:50.398809 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:39:50.487930 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:39:51.618725 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:39:52.795925 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:39:52.796192 140373536360256 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:39:53.066552 140373536360256 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:39:55.149767: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:39:55.151607: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:39:55.153463: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:39:55.155187: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:39:55.155220: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:39:55.155254: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:39:55.155274: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:39:55.155291: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:39:55.155308: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:39:55.155325: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:39:55.155343: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:39:55.168207: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:39:55.168269: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:39:55.168281: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:39:55.168289: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:39:55.168296: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:39:55.168302: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:39:55.168308: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:39:55.176192: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:39:55.177821: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:39:55.179408: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:39:55.180997: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:39:57.639631: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:40:00.674548 140373536360256 session_manager.py:500] Running local_init_op. | |
I0910 14:40:02.234329 140373536360256 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:40:08.754754: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:40:10.229719: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step   Img/sec                                       total_loss
1      images/sec: 2373.4 +/- 0.0 (jitter = 0.0)     8.015
10     images/sec: 2374.5 +/- 3.6 (jitter = 8.9)     7.900
20     images/sec: 2370.0 +/- 2.4 (jitter = 10.2)    7.928
30     images/sec: 2369.8 +/- 1.9 (jitter = 10.1)    7.899
40     images/sec: 2370.9 +/- 1.6 (jitter = 9.3)     7.836
50     images/sec: 2371.3 +/- 1.5 (jitter = 12.1)    7.880
60     images/sec: 2370.8 +/- 1.4 (jitter = 11.6)    7.853
70     images/sec: 2371.2 +/- 1.3 (jitter = 10.6)    7.800
80     images/sec: 2371.6 +/- 1.2 (jitter = 10.8)    7.787
90     images/sec: 2372.2 +/- 1.2 (jitter = 12.9)    7.886
100    images/sec: 2372.4 +/- 1.1 (jitter = 12.7)    7.696
----------------------------------------------------------------
total images/sec: 2370.85
----------------------------------------------------------------
2019-09-10 14:40:33.355935: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:40:35.351562 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:40:35.356027 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:40:35.379163 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:40:35.379368 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:40:35.404723: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:40:35.407009: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3f6ada0 executing computations on platform Host. Devices: | |
2019-09-10 14:40:35.407030: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:40:35.409093: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:40:36.177913: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x57cccd0 executing computations on platform CUDA. Devices: | |
2019-09-10 14:40:36.177952: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:40:36.177965: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:40:36.177975: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:40:36.177985: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:40:36.180824: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:40:36.183185: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:40:36.185524: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:40:36.187890: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:40:36.187930: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:40:36.190617: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:40:36.192957: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:40:36.193266: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:40:36.194965: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:40:36.195946: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:40:36.199504: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:40:36.211310: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:40:36.211343: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:40:37.981401: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:40:37.981441: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:40:37.981451: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:40:37.981457: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:40:37.981463: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:40:37.981469: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:40:37.989418: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:40:37.991696: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:40:37.993831: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:40:37.995936: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  512 global
             128 per device
Num batches: 100
Num epochs:  0.04
Devices:     ['/gpu:0', '/gpu:1', '/gpu:2', '/gpu:3']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:40:38.001996 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:40:38.002607 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:40:38.010828 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:40:38.011236 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:40:38.027825 140068409890624 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:40:38.394341 140068409890624 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:40:40.938336 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:40:40.940391 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:40:40.951967 140068409890624 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:40:51.559518 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:40:51.646619 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:40:52.556450 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:40:53.695517 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:40:53.695766 140068409890624 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:40:53.956712 140068409890624 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:40:55.917164: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:40:55.918885: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:40:55.920731: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:40:55.922285: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:40:55.922319: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:40:55.922354: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:40:55.922372: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:40:55.922389: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:40:55.922406: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:40:55.922431: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:40:55.922461: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:40:55.934869: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:40:55.934930: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:40:55.934942: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:40:55.934950: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:40:55.934956: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:40:55.934962: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:40:55.934969: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:40:55.942813: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:40:55.944464: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:40:55.946085: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:40:55.947695: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:40:58.241728: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:41:01.260072 140068409890624 session_manager.py:500] Running local_init_op. | |
I0910 14:41:02.757647 140068409890624 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:41:07.928875: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:41:09.316597: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step   Img/sec                                       total_loss
1      images/sec: 1548.1 +/- 0.0 (jitter = 0.0)     7.911
10     images/sec: 1545.3 +/- 1.1 (jitter = 4.1)     7.909
20     images/sec: 1544.6 +/- 0.8 (jitter = 3.4)     7.886
30     images/sec: 1544.6 +/- 0.6 (jitter = 3.5)     7.778
40     images/sec: 1544.4 +/- 0.5 (jitter = 3.3)     7.868
50     images/sec: 1544.2 +/- 0.4 (jitter = 3.4)     7.778
60     images/sec: 1544.3 +/- 0.4 (jitter = 3.5)     7.791
70     images/sec: 1544.3 +/- 0.4 (jitter = 3.5)     7.748
80     images/sec: 1544.6 +/- 0.4 (jitter = 3.5)     7.754
90     images/sec: 1544.6 +/- 0.3 (jitter = 3.8)     7.691
100    images/sec: 1544.7 +/- 0.3 (jitter = 3.5)     7.702
----------------------------------------------------------------
total images/sec: 1544.37
----------------------------------------------------------------
2019-09-10 14:41:58.338581: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:42:00.345152 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:42:00.349649 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:42:00.372978 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:42:00.373190 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:42:00.396688: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:42:00.399358: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x4a9bb50 executing computations on platform Host. Devices: | |
2019-09-10 14:42:00.399392: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:42:00.402927: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:42:01.175071: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x4a9c060 executing computations on platform CUDA. Devices: | |
2019-09-10 14:42:01.175103: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:42:01.175113: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:42:01.175121: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:42:01.175128: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:42:01.177331: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:42:01.179466: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:42:01.181590: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:42:01.183731: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:42:01.183769: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:42:01.186188: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:42:01.188303: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:42:01.188701: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:42:01.190412: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:42:01.191398: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:42:01.194987: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:42:01.206841: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:42:01.206877: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:42:02.971131: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:42:02.971175: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:42:02.971185: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:42:02.971192: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:42:02.971198: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:42:02.971204: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:42:02.979115: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:42:02.981260: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:42:02.983389: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:42:02.985780: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow:  1.14
Model:       resnet50
Dataset:     imagenet (synthetic)
Mode:        training
SingleSess:  False
Batch size:  512 global
             128 per device
Num batches: 100
Num epochs:  0.04
Devices:     ['/gpu:0', '/gpu:1', '/gpu:2', '/gpu:3']
NUMA bind:   False
Data format: NHWC
Optimizer:   sgd
Variables:   replicated
AllReduce:   None
==========
W0910 14:42:02.991589 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:42:02.992118 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:42:03.000892 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:42:03.007102 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:42:03.023902 140084530083648 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:42:03.396012 140084530083648 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:42:06.038249 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:42:06.041155 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:42:06.052740 140084530083648 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:42:19.066690 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:42:19.153356 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:42:20.247669 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:42:21.380501 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:42:21.380757 140084530083648 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:42:21.642261 140084530083648 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:42:23.576749: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:42:23.578613: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:42:23.580386: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:42:23.581974: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:42:23.582006: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:42:23.582044: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:42:23.582066: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:42:23.582083: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:42:23.582099: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:42:23.582120: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:42:23.582140: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:42:23.594867: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:42:23.594928: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:42:23.594942: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:42:23.594952: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:42:23.594958: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:42:23.594965: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:42:23.594972: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:42:23.602803: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:42:23.604400: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:42:23.605992: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:42:23.607597: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:42:25.980176: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:42:29.000877 140084530083648 session_manager.py:500] Running local_init_op. | |
I0910 14:42:30.535269 140084530083648 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:42:37.047870: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:42:38.519216: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss
1 images/sec: 2836.0 +/- 0.0 (jitter = 0.0) 7.836
10 images/sec: 2847.8 +/- 3.1 (jitter = 15.4) 7.857
20 images/sec: 2843.7 +/- 2.7 (jitter = 13.8) 7.873
30 images/sec: 2838.9 +/- 2.8 (jitter = 14.3) 7.816
40 images/sec: 2838.3 +/- 2.4 (jitter = 18.0) 7.818
50 images/sec: 2837.6 +/- 2.1 (jitter = 12.7) 7.773
60 images/sec: 2837.9 +/- 1.8 (jitter = 10.9) 7.723
70 images/sec: 2836.2 +/- 1.7 (jitter = 11.3) 7.727
80 images/sec: 2836.4 +/- 1.6 (jitter = 10.5) 7.670
90 images/sec: 2835.1 +/- 1.5 (jitter = 10.6) 7.736
100 images/sec: 2833.2 +/- 1.5 (jitter = 13.3) 7.713
----------------------------------------------------------------
total images/sec: 2832.00
----------------------------------------------------------------
2019-09-10 14:43:10.852005: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:43:12.849007 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:43:12.853490 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:43:12.876675 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:43:12.876881 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:43:12.900728: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:43:12.903535: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3d9a760 executing computations on platform Host. Devices: | |
2019-09-10 14:43:12.903580: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:43:12.907585: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:43:13.729113: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3d9ac70 executing computations on platform CUDA. Devices: | |
2019-09-10 14:43:13.729150: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:43:13.729164: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:43:13.729174: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:43:13.729183: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:43:13.731730: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:43:13.734102: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:43:13.736581: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:43:13.739039: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:43:13.739080: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:43:13.741924: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:43:13.744380: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:43:13.744844: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:43:13.746831: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:43:13.747991: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:43:13.752180: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:43:13.765892: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:43:13.765929: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:43:15.551812: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:43:15.551853: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:43:15.551863: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:43:15.551870: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:43:15.551876: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:43:15.551883: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:43:15.559899: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:43:15.562079: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:43:15.564197: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:43:15.566344: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14
Model: resnet50
Dataset: imagenet (synthetic)
Mode: training
SingleSess: False
Batch size: 1024 global
            256 per device
Num batches: 100
Num epochs: 0.08
Devices: ['/gpu:0', '/gpu:1', '/gpu:2', '/gpu:3']
NUMA bind: False
Data format: NHWC
Optimizer: sgd
Variables: replicated
AllReduce: None
==========
W0910 14:43:15.572416 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:43:15.573017 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:43:15.581869 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:43:15.582245 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:245: The name tf.truncated_normal is deprecated. Please use tf.random.truncated_normal instead. | |
W0910 14:43:15.598568 139844950951744 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:43:15.965062 139844950951744 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:43:18.525661 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:43:18.527738 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:43:18.539392 139844950951744 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:43:29.232611 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:43:29.320429 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:43:30.241079 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:43:31.388153 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:43:31.388401 139844950951744 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:43:31.651107 139844950951744 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:43:33.634589: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:43:33.636350: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:43:33.637971: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:43:33.639558: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:43:33.639590: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:43:33.639624: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:43:33.639643: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:43:33.639661: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:43:33.639678: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:43:33.639694: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:43:33.639711: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:43:33.652171: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:43:33.652232: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:43:33.652243: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:43:33.652251: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:43:33.652258: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:43:33.652264: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:43:33.652271: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:43:33.660022: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:43:33.661638: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:43:33.663217: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:43:33.664818: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:43:35.981527: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:43:39.000194 139844950951744 session_manager.py:500] Running local_init_op. | |
I0910 14:43:40.515112 139844950951744 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:43:45.714711: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:43:47.097411: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss
1 images/sec: 1597.2 +/- 0.0 (jitter = 0.0) 7.912
10 images/sec: 1597.7 +/- 0.5 (jitter = 1.9) 7.832
20 images/sec: 1597.7 +/- 0.5 (jitter = 2.4) 7.820
30 images/sec: 1597.3 +/- 0.5 (jitter = 2.0) 7.779
40 images/sec: 1597.0 +/- 0.4 (jitter = 2.0) 7.698
50 images/sec: 1596.4 +/- 0.4 (jitter = 2.4) 7.668
60 images/sec: 1596.2 +/- 0.4 (jitter = 2.5) 7.652
70 images/sec: 1595.6 +/- 0.4 (jitter = 2.5) 7.602
80 images/sec: 1595.4 +/- 0.4 (jitter = 2.7) 7.606
90 images/sec: 1595.1 +/- 0.4 (jitter = 2.9) 7.549
100 images/sec: 1594.7 +/- 0.4 (jitter = 3.6) 7.540
----------------------------------------------------------------
total images/sec: 1594.55
----------------------------------------------------------------
2019-09-10 14:45:12.106464: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:45:14.095043 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:45:14.099504 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:45:14.122840 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:45:14.123056 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:45:14.148721: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:45:14.151158: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x338eaa0 executing computations on platform Host. Devices: | |
2019-09-10 14:45:14.151197: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:45:14.155209: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:45:14.938336: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x338ee00 executing computations on platform CUDA. Devices: | |
2019-09-10 14:45:14.938371: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:45:14.938382: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:45:14.938391: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:45:14.938400: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:45:14.940939: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:45:14.943287: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:45:14.945629: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:45:14.947958: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:45:14.948002: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:45:14.950709: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:45:14.952981: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:45:14.953268: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:45:14.954995: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:45:14.955993: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:45:14.959616: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:45:14.971484: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:45:14.971521: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:45:16.766139: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:45:16.766181: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:45:16.766192: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:45:16.766199: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:45:16.766205: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:45:16.766211: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:45:16.774212: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:45:16.776464: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:45:16.778609: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:45:16.780744: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14
Model: resnet50
Dataset: imagenet (synthetic)
Mode: training
SingleSess: False
Batch size: 1024 global
            256 per device
Num batches: 100
Num epochs: 0.08
Devices: ['/gpu:0', '/gpu:1', '/gpu:2', '/gpu:3']
NUMA bind: False
Data format: NHWC
Optimizer: sgd
Variables: replicated
AllReduce: None
==========
W0910 14:45:16.786987 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:45:16.787610 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:45:16.795541 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:45:16.800475 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:45:16.818038 140505083311936 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:45:17.184011 140505083311936 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:45:19.924096 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:45:19.927115 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:45:19.939219 140505083311936 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:45:33.494938 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:45:33.585247 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:45:34.727916 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:45:35.915678 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:45:35.915928 140505083311936 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:45:36.188548 140505083311936 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:45:38.263424: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:45:38.265288: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:45:38.267183: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:45:38.269051: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:45:38.269084: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:45:38.269120: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:45:38.269140: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:45:38.269158: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:45:38.269175: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:45:38.269193: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:45:38.269212: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:45:38.281934: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:45:38.281996: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:45:38.282006: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:45:38.282015: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:45:38.282021: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:45:38.282028: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:45:38.282034: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:45:38.289954: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:45:38.291599: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:45:38.293206: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:45:38.294782: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:45:40.777442: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:45:43.813993 140505083311936 session_manager.py:500] Running local_init_op. | |
I0910 14:45:45.382833 140505083311936 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:45:51.948691: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:45:53.433423: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 3165.2 +/- 0.0 (jitter = 0.0) 7.944 | |
10 images/sec: 3165.8 +/- 1.1 (jitter = 1.9) 7.908 | |
20 images/sec: 3168.8 +/- 1.5 (jitter = 3.9) 7.823 | |
30 images/sec: 3168.9 +/- 1.1 (jitter = 7.2) 7.786 | |
40 images/sec: 3167.5 +/- 1.1 (jitter = 7.4) 7.710 | |
50 images/sec: 3166.9 +/- 0.9 (jitter = 7.2) 7.712 | |
60 images/sec: 3168.0 +/- 1.0 (jitter = 8.1) 7.654 | |
70 images/sec: 3168.1 +/- 0.9 (jitter = 8.3) 7.621 | |
80 images/sec: 3168.1 +/- 0.9 (jitter = 8.6) 7.562 | |
90 images/sec: 3168.0 +/- 0.8 (jitter = 8.4) 7.541 | |
100 images/sec: 3168.4 +/- 0.7 (jitter = 8.3) 7.511 | |
---------------------------------------------------------------- | |
total images/sec: 3167.72 | |
---------------------------------------------------------------- | |
2019-09-10 14:46:43.058012: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
WARNING: Logging before flag parsing goes to stderr. | |
W0910 14:46:45.087880 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/mobilenet.py:388: The name tf.nn.avg_pool is deprecated. Please use tf.nn.avg_pool2d instead. | |
W0910 14:46:45.092403 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/experimental/deepspeech.py:131: The name tf.nn.rnn_cell.RNNCell is deprecated. Please use tf.compat.v1.nn.rnn_cell.RNNCell instead. | |
W0910 14:46:45.115877 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:722: The name tf.ConfigProto is deprecated. Please use tf.compat.v1.ConfigProto instead. | |
W0910 14:46:45.116084 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:3503: The name tf.Session is deprecated. Please use tf.compat.v1.Session instead. | |
2019-09-10 14:46:45.140710: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 2198780000 Hz | |
2019-09-10 14:46:45.143276: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x3062300 executing computations on platform Host. Devices: | |
2019-09-10 14:46:45.143309: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): <undefined>, <undefined> | |
2019-09-10 14:46:45.146618: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcuda.so.1 | |
2019-09-10 14:46:45.926347: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x48e7650 executing computations on platform CUDA. Devices: | |
2019-09-10 14:46:45.926394: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (0): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:46:45.926412: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (1): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:46:45.926426: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (2): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:46:45.926439: I tensorflow/compiler/xla/service/service.cc:175] StreamExecutor device (3): Tesla V100-DGXS-32GB, Compute Capability 7.0 | |
2019-09-10 14:46:45.929875: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:46:45.932745: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:46:45.935583: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:46:45.938430: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:46:45.938477: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:46:45.941454: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:46:45.943216: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:46:45.943555: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:46:45.945592: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:46:45.946771: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:46:45.950975: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:46:45.964752: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:46:45.964790: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:46:47.763412: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:46:47.763451: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:46:47.763461: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:46:47.763468: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:46:47.763475: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:46:47.763481: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:46:47.771501: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:46:47.773692: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:46:47.775869: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:46:47.778028: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
TensorFlow: 1.14 | |
Model: resnet50 | |
Dataset: imagenet (synthetic) | |
Mode: training | |
SingleSess: False | |
Batch size: 2048 global | |
512 per device | |
Num batches: 100 | |
Num epochs: 0.16 | |
Devices: ['/gpu:0', '/gpu:1', '/gpu:2', '/gpu:3'] | |
NUMA bind: False | |
Data format: NHWC | |
Optimizer: sgd | |
Variables: replicated | |
AllReduce: None | |
========== | |
W0910 14:46:47.783979 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2762: The name tf.set_random_seed is deprecated. Please use tf.compat.v1.set_random_seed instead. | |
Generating training model | |
W0910 14:46:47.784570 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2782: The name tf.train.get_or_create_global_step is deprecated. Please use tf.compat.v1.train.get_or_create_global_step instead. | |
W0910 14:46:47.793416 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2739: The name tf.get_variable is deprecated. Please use tf.compat.v1.get_variable instead. | |
W0910 14:46:47.798059 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/variable_mgr.py:316: The name tf.variable_scope is deprecated. Please use tf.compat.v1.variable_scope instead. | |
W0910 14:46:47.814631 139836202166080 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:129: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use `tf.keras.layers.Conv2D` instead. | |
W0910 14:46:48.174123 139836202166080 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:261: max_pooling2d (from tensorflow.python.layers.pooling) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use keras.layers.MaxPooling2D instead. | |
W0910 14:46:50.758634 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/convnet_builder.py:334: The name tf.nn.xw_plus_b is deprecated. Please use tf.compat.v1.nn.xw_plus_b instead. | |
W0910 14:46:50.761462 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/model.py:319: The name tf.losses.sparse_softmax_cross_entropy is deprecated. Please use tf.compat.v1.losses.sparse_softmax_cross_entropy instead. | |
W0910 14:46:50.772780 139836202166080 deprecation.py:323] From /usr/local/lib/python3.6/dist-packages/tensorflow/python/ops/losses/losses_impl.py:121: add_dispatch_support.<locals>.wrapper (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Use tf.where in 2.0, which has the same broadcast rule as np.where | |
W0910 14:47:03.522151 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/models/resnet_model.py:333: The name tf.train.piecewise_constant is deprecated. Please use tf.compat.v1.train.piecewise_constant instead. | |
W0910 14:47:03.607043 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:1209: The name tf.train.GradientDescentOptimizer is deprecated. Please use tf.compat.v1.train.GradientDescentOptimizer instead. | |
W0910 14:47:04.679104 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2090: The name tf.train.get_global_step is deprecated. Please use tf.compat.v1.train.get_global_step instead. | |
W0910 14:47:05.784687 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2126: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead. | |
Initializing graph | |
W0910 14:47:05.784940 139836202166080 deprecation_wrapper.py:119] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2183: The name tf.train.Saver is deprecated. Please use tf.compat.v1.train.Saver instead. | |
W0910 14:47:06.040027 139836202166080 deprecation.py:323] From /workspace/benchmarks/scripts/tf_cnn_benchmarks/benchmark_cnn.py:2238: Supervisor.__init__ (from tensorflow.python.training.supervisor) is deprecated and will be removed in a future version. | |
Instructions for updating: | |
Please switch to tf.train.MonitoredTrainingSession | |
2019-09-10 14:47:07.945425: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 0 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:07:00.0 | |
2019-09-10 14:47:07.947246: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 1 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:08:00.0 | |
2019-09-10 14:47:07.949090: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 2 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0e:00.0 | |
2019-09-10 14:47:07.950916: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1640] Found device 3 with properties: | |
name: Tesla V100-DGXS-32GB major: 7 minor: 0 memoryClockRate(GHz): 1.53 | |
pciBusID: 0000:0f:00.0 | |
2019-09-10 14:47:07.950948: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudart.so.10.1 | |
2019-09-10 14:47:07.950982: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:47:07.951000: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcufft.so.10 | |
2019-09-10 14:47:07.951017: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcurand.so.10 | |
2019-09-10 14:47:07.951033: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusolver.so.10 | |
2019-09-10 14:47:07.951049: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcusparse.so.10 | |
2019-09-10 14:47:07.951066: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
2019-09-10 14:47:07.963704: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1763] Adding visible gpu devices: 0, 1, 2, 3 | |
2019-09-10 14:47:07.963762: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1181] Device interconnect StreamExecutor with strength 1 edge matrix: | |
2019-09-10 14:47:07.963772: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1187] 0 1 2 3 | |
2019-09-10 14:47:07.963780: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 0: N Y Y Y | |
2019-09-10 14:47:07.963786: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 1: Y N Y Y | |
2019-09-10 14:47:07.963793: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 2: Y Y N Y | |
2019-09-10 14:47:07.963799: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1200] 3: Y Y Y N | |
2019-09-10 14:47:07.971680: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 30438 MB memory) -> physical GPU (device: 0, name: Tesla V100-DGXS-32GB, pci bus id: 0000:07:00.0, compute capability: 7.0) | |
2019-09-10 14:47:07.973274: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:1 with 30463 MB memory) -> physical GPU (device: 1, name: Tesla V100-DGXS-32GB, pci bus id: 0000:08:00.0, compute capability: 7.0) | |
2019-09-10 14:47:07.974844: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:2 with 30463 MB memory) -> physical GPU (device: 2, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0e:00.0, compute capability: 7.0) | |
2019-09-10 14:47:07.976407: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1326] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:3 with 30463 MB memory) -> physical GPU (device: 3, name: Tesla V100-DGXS-32GB, pci bus id: 0000:0f:00.0, compute capability: 7.0) | |
2019-09-10 14:47:10.332216: W tensorflow/compiler/jit/mark_for_compilation_pass.cc:1412] (One-time warning): Not using XLA:CPU for cluster because envvar TF_XLA_FLAGS=--tf_xla_cpu_global_jit was not set. If you want XLA:CPU, either set that envvar, or use experimental_jit_scope to enable XLA:CPU. To confirm that XLA is active, pass --vmodule=xla_compilation_cache=1 (as a proper command-line flag, not via TF_XLA_FLAGS) or set the envvar XLA_FLAGS=--xla_hlo_profile. | |
I0910 14:47:13.328989 139836202166080 session_manager.py:500] Running local_init_op. | |
I0910 14:47:14.842230 139836202166080 session_manager.py:502] Done running local_init_op. | |
Running warm up | |
2019-09-10 14:47:21.165005: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcublas.so.10 | |
2019-09-10 14:47:22.630395: I tensorflow/stream_executor/platform/default/dso_loader.cc:42] Successfully opened dynamic library libcudnn.so.7 | |
Done warm up | |
Step Img/sec total_loss | |
1 images/sec: 3289.7 +/- 0.0 (jitter = 0.0) 7.853 | |
10 images/sec: 3288.5 +/- 1.6 (jitter = 5.1) 7.798 | |
20 images/sec: 3288.1 +/- 0.9 (jitter = 3.4) 7.711 | |
30 images/sec: 3289.6 +/- 0.8 (jitter = 3.3) 7.608 | |
40 images/sec: 3289.9 +/- 0.6 (jitter = 4.1) 7.547 | |
50 images/sec: 3289.9 +/- 0.6 (jitter = 4.1) 7.516 | |
60 images/sec: 3290.2 +/- 0.5 (jitter = 4.1) 7.492 | |
70 images/sec: 3290.7 +/- 0.5 (jitter = 4.1) 7.480 | |
80 images/sec: 3290.9 +/- 0.5 (jitter = 4.3) 7.476 | |
90 images/sec: 3291.1 +/- 0.4 (jitter = 4.2) 7.477 | |
100 images/sec: 3291.0 +/- 0.4 (jitter = 3.9) 7.468 | |
---------------------------------------------------------------- | |
total images/sec: 3290.58 | |
---------------------------------------------------------------- | |
root@692ced701c1d:/workspace/benchmarks/scripts/tf_cnn_benchmarks# |