_mat_

October 9, 2017

Please send the result files to matthias [at] hwbot.org, I will have a look at them.

August 8, 2017

Probably uploaded in the wrong category, right? The error messages are not very clear on that. You have to be sure to upload CPU scores in the "GPUPI for CPU" categories and GPU scores into the "GPUPI" categories.

It's best to submit the scores inside the benchmark, if your bench system is connected to the internet.

July 22, 2017

Ryzen is not split up in GPUPI. Each compatible OpenCL platform will show one Ryzen device. You can install multiple AMD OpenCL platforms to try out multiple OpenCL drivers, each will perform differently.

Search for the AMD App SDK, version 3.0 will install the OpenCL 2.0 driver, version 2.9 should result in OpenCL 1.2.

July 11, 2017

What? Skylake X is not supported by Intel's official OpenCL drivers? Now I see why the current scores where made with the AMD OpenCL driver.

That's really sad.

July 2, 2017

That's strange indeed. Please send one of these result files to matthias at hwbot org, I will have a look at it. Thank you.

July 2, 2017

The submission node is the root node of the HWBOT validation file. But the errors before that already make it very clear, that the data could not be decrypted correctly. Sry, that's bad news. Seems like the file was damaged at some point.

July 2, 2017

Check GPUPI.log there should be an extended error description which XML node makes the file invalid. Please post it here, might be a bug in the validation.

June 28, 2017

A little preview of some of the features of GPUPI 3.0 for my fellow overclockers. Command line version for Windows:

attachment.php?attachmentid=223510

Autoselection of the compute platform, Batch Size and Reduction Size depending by prebenching it for the user:

$ ./GPUPI_x64 -c -d 100M
GPUPI 3.0 (64 bit)

API: OpenCL GPU with 1 devices

API: OpenCL CPU with 2 devices

API: CUDA with 1 devices

Testing device: OpenCL CPU -> Intel® OpenCL -> Intel Core i7-6950X

=> 1M, 16: 2.294076 (Kernel: 2.250068, Reduction: 0.043280)

=> 1M, 32: 2.248263 (Kernel: 2.209528, Reduction: 0.037883)

=> 1M, 64: 2.270596 (Kernel: 2.231809, Reduction: 0.038067)

=> 1M, 128: 2.245034 (Kernel: 2.207602, Reduction: 0.036715)

=> 1M, 256: 2.279390 (Kernel: 2.229491, Reduction: 0.049113)

=> 1M, 512: 2.266061 (Kernel: 2.193988, Reduction: 0.071337)

=> 2M, 16: 2.315099 (Kernel: 2.236380, Reduction: 0.078018)

=> 2M, 32: 2.288076 (Kernel: 2.219284, Reduction: 0.068005)

=> 2M, 64: 2.287389 (Kernel: 2.226804, Reduction: 0.059873)

=> 2M, 128: 2.249376 (Kernel: 2.191177, Reduction: 0.057482)

=> 2M, 256: 2.283105 (Kernel: 2.215427, Reduction: 0.066962)

=> 2M, 512: 2.254495 (Kernel: 2.194912, Reduction: 0.058892)

=> 4M, 16: 2.307497 (Kernel: 2.218491, Reduction: 0.088419)

=> 4M, 32: 2.260795 (Kernel: 2.183106, Reduction: 0.077162)

=> 4M, 64: 2.304972 (Kernel: 2.238267, Reduction: 0.066159)

=> 4M, 128: 2.255765 (Kernel: 2.196260, Reduction: 0.058924)

=> 4M, 256: 2.277544 (Kernel: 2.209126, Reduction: 0.067898)

=> 4M, 512: 2.249683 (Kernel: 2.191406, Reduction: 0.057736)

=> 5M, 16: 2.304984 (Kernel: 2.217214, Reduction: 0.087279)

=> 5M, 32: 2.265134 (Kernel: 2.187128, Reduction: 0.077524)

=> 5M, 64: 2.279445 (Kernel: 2.212483, Reduction: 0.066463)

=> 5M, 128: 2.238783 (Kernel: 2.180829, Reduction: 0.057460)

=> 5M, 256: 2.299566 (Kernel: 2.231994, Reduction: 0.067090)

=> 5M, 512: 2.267714 (Kernel: 2.197324, Reduction: 0.069908)

=> 10M, 16: 2.311983 (Kernel: 2.226683, Reduction: 0.084900)

=> 10M, 32: 2.271478 (Kernel: 2.194653, Reduction: 0.076431)

=> 10M, 64: 2.261646 (Kernel: 2.190358, Reduction: 0.070862)

=> 10M, 128: 2.238901 (Kernel: 2.181579, Reduction: 0.056898)

=> 10M, 256: 2.278743 (Kernel: 2.215327, Reduction: 0.062982)

=> 10M, 512: 2.271698 (Kernel: 2.204813, Reduction: 0.066495)

=> 20M, 16: 2.316387 (Kernel: 2.224665, Reduction: 0.091349)

=> 20M, 32: 2.264053 (Kernel: 2.185630, Reduction: 0.078063)

=> 20M, 64: 2.302941 (Kernel: 2.229473, Reduction: 0.073087)

=> 20M, 128: 2.256671 (Kernel: 2.193457, Reduction: 0.062854)

=> 20M, 256: 2.256185 (Kernel: 2.194374, Reduction: 0.061453)

=> 20M, 512: 2.239121 (Kernel: 2.177762, Reduction: 0.061003)

=> 100M, 16: 2.351734 (Kernel: 2.219785, Reduction: 0.131625)

=> 100M, 32: 2.284867 (Kernel: 2.182241, Reduction: 0.102328)

=> 100M, 64: 2.331753 (Kernel: 2.241809, Reduction: 0.089634)

=> 100M, 128: 2.314707 (Kernel: 2.239817, Reduction: 0.074468)

=> 100M, 256: 2.272911 (Kernel: 2.204067, Reduction: 0.068538)

=> 100M, 512: 2.262099 (Kernel: 2.198868, Reduction: 0.062920)

Testing device: OpenCL CPU -> Experimental OpenCL 2.1 CPU Only Platform -> Intel Core i7-6950X

=> 1M, 16: 2.256272 (Kernel: 2.208813, Reduction: 0.046615)

=> 1M, 32: 2.259307 (Kernel: 2.213463, Reduction: 0.045005)

=> 1M, 64: 2.261296 (Kernel: 2.217321, Reduction: 0.043054)

=> 1M, 128: 2.255290 (Kernel: 2.210472, Reduction: 0.043979)

=> 1M, 256: 2.276476 (Kernel: 2.228651, Reduction: 0.046952)

=> 1M, 512: 2.290972 (Kernel: 2.212072, Reduction: 0.078073)

=> 2M, 16: 2.281812 (Kernel: 2.198045, Reduction: 0.082942)

=> 2M, 32: 2.257342 (Kernel: 2.186708, Reduction: 0.069812)

=> 2M, 64: 2.274022 (Kernel: 2.210600, Reduction: 0.062644)

=> 2M, 128: 2.242322 (Kernel: 2.182171, Reduction: 0.059333)

=> 2M, 256: 2.284035 (Kernel: 2.214316, Reduction: 0.068943)

=> 2M, 512: 2.245358 (Kernel: 2.182079, Reduction: 0.062468)

=> 4M, 16: 2.314692 (Kernel: 2.223573, Reduction: 0.090524)

=> 4M, 32: 2.287976 (Kernel: 2.207894, Reduction: 0.079501)

=> 4M, 64: 2.280830 (Kernel: 2.212408, Reduction: 0.067748)

=> 4M, 128: 2.246577 (Kernel: 2.185500, Reduction: 0.060417)

=> 4M, 256: 2.267059 (Kernel: 2.196838, Reduction: 0.069591)

=> 4M, 512: 2.245285 (Kernel: 2.185138, Reduction: 0.059543)

=> 5M, 16: 2.297634 (Kernel: 2.208198, Reduction: 0.088819)

=> 5M, 32: 2.252606 (Kernel: 2.173261, Reduction: 0.078811)

=> 5M, 64: 2.289753 (Kernel: 2.219699, Reduction: 0.069525)

=> 5M, 128: 2.241125 (Kernel: 2.181959, Reduction: 0.058589)

=> 5M, 256: 2.272509 (Kernel: 2.203694, Reduction: 0.068293)

=> 5M, 512: 2.255514 (Kernel: 2.184216, Reduction: 0.070740)

=> 10M, 16: 2.283480 (Kernel: 2.197331, Reduction: 0.085667)

=> 10M, 32: 2.259312 (Kernel: 2.181606, Reduction: 0.077262)

=> 10M, 64: 2.273700 (Kernel: 2.201239, Reduction: 0.071997)

=> 10M, 128: 2.239782 (Kernel: 2.180927, Reduction: 0.058395)

=> 10M, 256: 2.288214 (Kernel: 2.223593, Reduction: 0.064161)

=> 10M, 512: 2.275962 (Kernel: 2.210551, Reduction: 0.064933)

=> 20M, 16: 2.298107 (Kernel: 2.206268, Reduction: 0.091459)

=> 20M, 32: 2.282686 (Kernel: 2.204328, Reduction: 0.077989)

=> 20M, 64: 2.264337 (Kernel: 2.189890, Reduction: 0.074074)

=> 20M, 128: 2.261340 (Kernel: 2.197828, Reduction: 0.063139)

=> 20M, 256: 2.254334 (Kernel: 2.191051, Reduction: 0.062873)

=> 20M, 512: 2.261168 (Kernel: 2.199776, Reduction: 0.060975)

=> 100M, 16: 2.363773 (Kernel: 2.221893, Reduction: 0.141445)

=> 100M, 32: 2.325590 (Kernel: 2.220586, Reduction: 0.104667)

=> 100M, 64: 2.281907 (Kernel: 2.196272, Reduction: 0.085290)

=> 100M, 128: 2.290375 (Kernel: 2.214583, Reduction: 0.075452)

=> 100M, 256: 2.274559 (Kernel: 2.203661, Reduction: 0.070590)

=> 100M, 512: 2.287639 (Kernel: 2.223986, Reduction: 0.063316)

Best device found: OpenCL CPU -> Intel® OpenCL -> Intel Core i7-6950X with 5M, 128.

Timer: HPET (14.32 MHz)

Init HWiNFO: Ok

OpenCL CPU: Intel Core i7-6950X (20 CUs, 3000 MHz)

Compiling OpenCL kernels ... done.

Calculating 100.000.000th digit of PI. 20 iterations.

Allocated device memory : 83.89 MB

Batch Size : 5M

Reduction Size : 128

00h 00m 00.480s Batch 1 finished.

00h 00m 00.945s Batch 2 finished.

00h 00m 01.403s Batch 3 finished.

00h 00m 01.850s Batch 4 finished.

00h 00m 02.263s Batch 5 finished.

00h 00m 02.734s Batch 6 finished.

00h 00m 03.201s Batch 7 finished.

00h 00m 03.649s Batch 8 finished.

00h 00m 04.089s Batch 9 finished.

00h 00m 04.502s Batch 10 finished.

00h 00m 04.980s Batch 11 finished.

00h 00m 05.450s Batch 12 finished.

00h 00m 05.915s Batch 13 finished.

00h 00m 06.367s Batch 14 finished.

00h 00m 06.784s Batch 15 finished.

00h 00m 07.257s Batch 16 finished.

00h 00m 07.724s Batch 17 finished.

00h 00m 08.187s Batch 18 finished.

00h 00m 08.639s Batch 19 finished.

00h 00m 09.055s PI value output -> CB840E219

Highest clocks measured:

CPU: 3800.11 MHz

GPU: 202.50 MHz

GPU memory: 101.25 MHz

Statistics:

Calculation + Reduction time: 8.822s + 0.231s

PI calculation is done!

June 26, 2017

Thank you for your kind words, very much appreciated.

June 25, 2017

Windows 8 and above are effected by the RTC skewing bug when bclock is changed in Windows. I don't think that Skylake and Kaby Lake are any exception to this rule, but I haven't tested it myself yet.

To circumvent HPET you have to use Windows 7.

Edit: Rules of HWBOT allow the legacy benchmarks on SL and KL, so I guess it has been tested and it's not affecting the RTC timer.

Well, with the next version GPUPI I will remove the HPET restriction on SL and KL.

June 12, 2017

It's not a good sign for your windows install, if the COM library can't be initialized. These are pretty basic methods to query WMI information. Disable the hardware detection for CPU and GPU on the submission dialog.

May 30, 2017

Hey Nick, overclockers.at (where the download is hosted) was temporarily down due to a new forum software update. It has been back a few hours later, so is the download as well. Sry for the inconvencience!

May 8, 2017

Wow!

5M/512?! :eek:

April 7, 2017

Way to go, congrats! Six of these puppies could actually do it!

February 23, 2017

Well done! GPUPI 1B on those big chips is a hell of a ride, congrats!

February 15, 2017

You need to install the Intel OpenCL drivers: https://software.intel.com/en-us/articles/opencl-drivers

February 1, 2017

If you want an even better score, use the CUDA platform instead of OpenCL. It should perform best for NVIDIA cards.

January 17, 2017

Congrats, nice setup! Overclocking of the GPUs not possible? Some pictures would be nice as well.

December 31, 2016

Awesome clocks, congrats!

December 18, 2016

Just awesome!

November 21, 2016

Damn, I like your testing style!

November 4, 2016

The name of the device is retrieved via the opencl driver, which normally just takes the CPUID brand string as it is shown in CPU-Z, a hardcoded value inside the CPU. GPUPI removes various prefixes and postfixes to be able to submit the result to HWBOT.

September 19, 2016

Oops, I meant that I am avoiding QPC when HPET is not enabled. Sorry, I have currently a lot on my plate.

I can't remember if it's precisely ACPI that's vulnerable, but on Windows 7 - which is not affected by the RTC bug - QPC gets skewed if HPET is disabled. My best guess is, that it falls back to ACPI, otherwise the fallback to RTC would not produce skewed results. See my results here: https://www.overclockers.at/articles/gpupi-2-1 ... I should have displayed the timer frequencies as well, hrmpf.

September 18, 2016

QPC falls back in various ways depending on the hardware and OS version, it can be HPET, ACPI or even RTC. The timer resolution of QPC differs greatly as well so it's difficult to find out which timing method is currently in use.

September 18, 2016

The broad problem is that only the TSC clock is affected by the clock skew on Windows 8+10. Does anyone mind if I change clock enforcement to blacklist TSC rather than whitelist HPET and ACPI? This should allow all other platform clocks.

I've researched this topic for some days back when I was developing GPUPI and in my opinion the only manageable option for me was to ban TSC from 8 and 10 and only allow HPET there. Using ACPI as a timer if available is possible but depends on how it's done. I would not advise to use Windows' QPC functions for example.

Sign In

_mat_

Posts

Joined

Last visited

Days Won

Content Type

Profiles

Forums

Events

Blogs

Posts posted by _mat_

error gpupi 1B and 32B

GPUPI 1B and 100m validation

GPUPI: Split Ryzen into two OpenCL devices

GPUPI - SuperPI on the GPU

GPUPI - SuperPI on the GPU

GPUPI - SuperPI on the GPU

GPUPI - SuperPI on the GPU

GPUPI - SuperPI on the GPU

GPUPI - SuperPI on the GPU

GPUPI - SuperPI on the GPU

need help, gpupi for Ryzen 5

"GPUPI - 1B" benchmark (broken download link)

$@39@ - Core i7 7700K @ 6981MHz - 3min 44sec 192ms GPUPI for CPU - 1B

H2o vs. Ln2 - 4x GeForce GTX 1080 Ti @ 2214/1451MHz - 2sec 673ms GPUPI - 1B

Xtreme Addict - Core i7 6950X @ 5480MHz - 2min 0sec 36ms GPUPI for CPU - 1B

GPUPI for CPU error

topyoyoguybest - Titan X Pascal - 11sec 590ms GPUPI - 1B

Pijonson - >4 GeForce GTX 1080 @ 1607/1251MHz - 2sec 390ms GPUPI - 1B

Splave - Core i7 7700K @ 6830MHz - 3min 50sec 33ms GPUPI for CPU - 1B

Vivi - GeForce GTX 1060 @ 3012/1901MHz - 22sec 60ms GPUPI - 1B

k|ngp|n - Titan X Pascal @ 2632/1251MHz - 8sec 987ms GPUPI - 1B

GPUPI - SuperPI on the GPU

Math turns benchmark: y-cruncher meets HWBOT

Math turns benchmark: y-cruncher meets HWBOT

Math turns benchmark: y-cruncher meets HWBOT

HWBOT

Browse

Activity