Jump to content
HWBOT Community Forums

GPUPI - SuperPI on the GPU


_mat_

Recommended Posts

I'm sorry to bring this up again... but despite all efforts Nvidia G200 still refuses to work with GPUPI 2.2 (legacy). Although the error message is different this time.

 

If you manage to fix this issue, I promise to bench all G200 videocards I can find. :D

We certainly will get to the bottom of this. :)

 

Have you tried more recent drivers? The drivers you are using only support CUDA 6.0.5, but the legacy version is now bundled with CUDA 6.5. Please use the newest drivers possible. The 340.52 should work for your combination: http://www.geforce.com/drivers/results/77225

 

Btw, try to use OpenCL as well. Maybe it works better.

Edited by _mat_
Link to comment
Share on other sites

This is certainly step in the right direction. Now all presets up to 100M works fine @ CUDA :). 500M and above however results in this error:

 

gpupi22_error4eaj4u.png

 

OpenCL never worked for me, not even in old 1.4 GPUPI. I think you said it is caused by poor nvidia OCL implementation on these old GPUs.

Link to comment
Share on other sites

I research some more and the problem seems to be register limitations of these old graphics cards. So the calculation code of the high precision loops is too complex to be successfully processed. To fix this I would have to get myself a GTX 200 series card and restructure the code to use less registers. I might do that, but it will take a while.

 

Btw, OpenCL on GTX 200 series should not be a problem. CatEye (Turrican's sister) benched a GTX 295 on Windows 7 64 bit with GeForce 340.52 drivers. Have you tried that?

 

http://hwbot.org/submission/2776702_cateye_gpupi___1b_geforce_gtx_295_11min_46sec_73ms

 

Please also be sure to only have cudart32_65.dll and cudart64_65.dll next to the GPUPI executable. If you still have cudart32_60.dll etc in there, delete these files.

Link to comment
Share on other sites

Strange - I've just installed 340.52 and OpenCL is still giving me "invalid result" message. GPUPI 1.4 @ CUDA is working fine. GPUPI 2.2 @ OCL = the same as 1.4 @ OCL.

 

2.2 @ CUDA... as I wrote before. Maybe I should try it on clean windows install. Maybe it will help.

gpupi22_error57dskh.png

Link to comment
Share on other sites

Strange - I've just installed 340.52 and OpenCL is still giving me "invalid result" message. GPUPI 1.4 @ CUDA is working fine. GPUPI 2.2 @ OCL = the same as 1.4 @ OCL.

 

2.2 @ CUDA... as I wrote before. Maybe I should try it on clean windows install. Maybe it will help.

It won't be your OS install.

Said it earlier, this bench is broken.

Link to comment
Share on other sites

And I said earlier as well, Mr Scott, you have little to no knowledge on this topic. Parallel processing is a relatively new technology, especially for these old cards. APIs, drivers and GPUs themselves have bugs and several limitations, which are constantly improved with every generation. I am actually quite impressed that so much old hardware could be successfully benched.

 

So Mr, stop with your over-simplifying oneliner bunnyextraction. It's not appreciated here.

Link to comment
Share on other sites

GPUPI_93h_11m_too_slow.jpg

 

Help! I know that the point was to go slow, but this is too much! I'm not even in the half and the speed go to like 24-30h per batch. Will that speed stay, or it improve? I did not touched anything... and I will refrain from clicking on the window everagain - I promise!

Edited by trodas
Link to comment
Share on other sites

And I said earlier as well, Mr Scott, you have little to no knowledge on this topic. Parallel processing is a relatively new technology, especially for these old cards. APIs, drivers and GPUs themselves have bugs and several limitations, which are constantly improved with every generation. I am actually quite impressed that so much old hardware could be successfully benched.

 

So Mr, stop with your over-simplifying oneliner bunnyextraction. It's not appreciated here.

All I want to know is why CPU's that ran on version 1.4 will no longer run on version 2.2.

Are we sacrificing some older stuff to bench newer stuff for compatibility reasons? That's all I'm asking.

You stated before that CPU support shouldn't have changed version to version but clearly it has.

Sorry if you think I'm wasting your time. Just figured you wanted to know. I won't bother you any more.

BTW, I know a little more than you credit me for......not that it mattered anyway.

Link to comment
Share on other sites

If you want to know if something changed for compatibility reasons, why not just ask that. Claiming "the bench is broken" is needlessly negative and will of course cause a reaction.

 

Mat's putting a lot of work in this benchmark and it's one of the better ones we currently have at the bot. It wouldn't hurt being a bit more positive in your approach.

 

Anyway, just my 2c.

Link to comment
Share on other sites

If you want to know if something changed for compatibility reasons, why not just ask that. Claiming "the bench is broken" is needlessly negative and will of course cause a reaction.

 

Mat's putting a lot of work in this benchmark and it's one of the better ones we currently have at the bot. It wouldn't hurt being a bit more positive in your approach.

 

Anyway, just my 2c.

 

Well technically if it did work and now doesn't, it is "broken". As the dictionary says "Not functioning, not in order" under broken. :lol:

Link to comment
Share on other sites

Well, works pretty well to me. I discovered (by my mistake) only one serious bug, the cosmetic bug is, that when you click into the window during bench, you get only the last line of the bench reported on the screen now... but that it is.

 

IMHO it work pretty well, considering how much stuff it have to work... I say it is a small miracle that it does work on variety of different hardware, considering downclocked machines to unsane limits, etc. :)

Link to comment
Share on other sites

All I want to know is why CPU's that ran on version 1.4 will no longer run on version 2.2.

Are we sacrificing some older stuff to bench newer stuff for compatibility reasons? That's all I'm asking.

You stated before that CPU support shouldn't have changed version to version but clearly it has.

Sorry if you think I'm wasting your time. Just figured you wanted to know. I won't bother you any more.

BTW, I know a little more than you credit me for......not that it mattered anyway.

That's completely different than saying that the bench is broken and I am happy to answer it.

 

As I said before, CPU support has not changed at all. The hardware limitations are OpenCL support (which depends on SSE availabilty) and double precision support. So if it's not listed as available device, the installed drivers do not recognise it as device, that can run OpenCL code. If it's listed but ignored because of missing double precision support, it's because the drivers say that dp is not available. In conclusion and as answered a few posts above the limitations depend heavily on the installed drivers, not on the version of GPUPI.

 

I would recommend to install the AMD OpenCL drivers also for Intel CPUs especially on hardware, because they support CPUs since SSE2.x. The Intel drivers only offer support since SSE4.1, which excludes all Pentium 4 models. Seems like that may be the problem that mr. paco posted here.

 

Regarding GeForce 200 series cards, there also have been no deliberate changes to sacrifice compatibility. But there have been changes to the kernel calls to allow multiple GPUs and since then many smaller kernel improvements (mostly performance enhancements) were implemented. That's why the GPUPI 2.2 kernel with high precision exceeds the available registers and shared memory limitations of the GeForce GTX 285. As soon as I get my hands on one of those cards, I will optimize the legacy version to slim the code down. Maybe just for these cards.

 

So the bench is far away from being broken and it would be sad if it would be after all the time that I have invested. These are just driver bugs or certain limitations for old hardware, that I would never have anticapted to work at all.

 

Well, works pretty well to me. I discovered (by my mistake) only one serious bug, the cosmetic bug is, that when you click into the window during bench, you get only the last line of the bench reported on the screen now... but that it is.
Can anybody reproduce this? I tried with all my test systems and had no luck at all. If I had a guess, I would say that the output buffer was and is deleted because there are memory problems. The clicking just redraws the window, that's all. Let's see if the run turns out valid ... keep me posted please. :)
Link to comment
Share on other sites

_mat_ - send details by PM how I exactly slow the thing down...

...and publicaly I can say it seems to be recovering from my clicks:

 

GPUPI_119h_and_batch_9_done.jpg

 

So, if I wait so long and it will not validate, them I would be kinda mad :D (I bet that is understandable, looking at the times need to perform that superslow run...) :)

Link to comment
Share on other sites

  • 2 weeks later...

Hoooray! Just there loops to go, at that speed it should be roughly in 3 days finally finished:

 

GPUPI_303h_loop_17.jpg

 

...and we see, if that redraw is just a glitch or priority thing! If I believe, I would say: let's pray.

 

But I don't. I believe in _mat_ and that the validation file will be okay! :D

Link to comment
Share on other sites

The validation file only avoids photochopping of screenshots and validates the result itself. It does not show irregularities in loops, if you know what I mean. Only GPUPI 2.2 ensures that to 100%. Well, let's see if the screen gets redrawn, but I won't bet on it. ;)

Link to comment
Share on other sites

mat,

can you please take a look at this data file? Suddenly I'm unable to submit my results - Invalid data file: Unable to parse the datafile. Similar error shows up when uploading directly from GPUPI.

 

Testing system is:

2x Xeon 5050 (stock, not overclocked)

8GB RAM

Windows 7 x64, HPET on

GPUPI 2.1.2 or 2.2 (both 64bit)

 

http://hw-museum.cz/data/temp/2x_xeon_5050_GPUPI_for%20CPU_100M_02m-14.666s.result

 

Thanks :)

Link to comment
Share on other sites

Update:

Single CPU on the 771 platform now results in the same error. Weird, it was working fine two days ago... and I'm not aware of any changes in OS, drivers or other SW.

 

Another PC using Pentium D 945 + windows 7 x64 works just fine. Well, the OpenCL driver is older and I'm running legacy version of GPUPI because of that... still, the "Unable to parse the datafile" error sounds completely unrelated to this.

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...