I found out why the M1 supposedly beat other GPUs in OpenCL benchmarks. Apple's OpenCL queue profiling API is not Khronos-conformant: it reports Mach timebase ticks (24 MHz) as if they were nanoseconds (1 GHz), so the reported times are 1/41.7 of the actual values. It's highly probable that some kind of profiling software assumed that CL_QUEUE_PROFILING_ENABLE works correctly and did not correct for the error.

I made this thread because I thought sites covering the use of Apple Silicon Macs in science were lacking, and I thought it would be a helpful place for current and future scientists to get info on Apple Silicon Macs.
Please post only things regarding the usage of Apple Silicon Macs in science: articles, GitHub repositories, sites, etc. on this topic.
I'm gonna start with this:
Apple Silicon Performance in Scientific Computing
PDF version:
Take whatever results these researchers got and divide the reported speed by 41.7. If the M1 is reported as 10x faster than the V100, it is actually about 4x slower (10 / 41.7 ≈ 0.24x).
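For anyone who wants to redo the arithmetic, here is a minimal Python sketch of the correction. It assumes the 24 MHz Mach timebase figure quoted above and that the affected numbers really came from OpenCL event profiling; timings taken with a wall clock should not need it. The function and constant names are made up for illustration and are not part of any Apple or Khronos API.

```python
from fractions import Fraction

# Assumption taken from the post: Mach timebase ticks at 24 MHz on Apple Silicon.
MACH_TICK_HZ = 24_000_000
NS_PER_SECOND = 1_000_000_000

# Real nanoseconds per tick: 1e9 / 24e6 = 125/3, roughly 41.67.
NS_PER_TICK = Fraction(NS_PER_SECOND, MACH_TICK_HZ)


def corrected_ns(reported_ns: int) -> Fraction:
    """Treat a value labelled as nanoseconds as Mach ticks and
    convert it to real nanoseconds (hypothetical helper)."""
    return reported_ns * NS_PER_TICK


def corrected_speedup(reported_speedup: float) -> float:
    """Scale down a speedup that was computed from the too-short
    M1 timings (hypothetical helper)."""
    return reported_speedup / float(NS_PER_TICK)


if __name__ == "__main__":
    # 24,000,000 ticks is one real second.
    print(corrected_ns(24_000_000))
    # A reported 10x lead over the V100, after correction.
    print(corrected_speedup(10.0))
```

The exact factor is 125/3, which is where the rounded 41.7 comes from. A corrected speedup below 1 means the M1 is slower, so a reported 10x turns into roughly a 4x deficit.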