Keras benchmarks slow on a 256 threads machine as compared to 16 threads laptop #21626
planetprivate
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Why would a 256 threads machine slower in benchmarks as compared to a laptop with 16 threads when running.
python model_benchmark/image_classification_benchmark.py
Any suggestions glad to test.
nproc= 256
RAM 1TB
AMD EPYC 7713 64-Core Processor (supermicro server)
Wall Time: 9496.9490 seconds.
Validation Accuracy: 0.9819
examples_per_second: 2.2220
nproc=16
16GB RAM
AMD Ryzen 7 PRO 5850U (thinkpad P14s)
Wall Time: 4028.4497 seconds.
Validation Accuracy: 0.9841
examples_per_second: 5.3173
This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
SERVER
LAPTOP
Beta Was this translation helpful? Give feedback.
All reactions