Question XFX 5700 XT Cards

Anthony

Member
Member since
Nov 28, 2020
Posts
22
Reaction score
6
Points
2
Preface: @Mini_Me has demonstrated profound knowledge when modding the 5700 XT series of graphics cards, and for those looking for an awesome tutorial, please reference https://www.igorslab.de/community/threads/gigabyte-5700-xt-bios-mod-fails.3304/.

Story: I currently own 40 XFX 5700 XT video cards, and I have 40 more on order, for a total of 80 cards. With this many cards, squeezing an additional 1 MegaHash (MH) out of each card would translate into +80MH. Out of the box, with the stock BIOS, these XFX 5700 XT cards deliver about 51MH per card. With BIOS modifications and tweaks, I can get between 54MH and 56MH per card.

Configuration: I have 5 rigs in operation and soon I will have 5 more in operation. Each rig uses 8 cards. I am using HiveOS to manage my rigs.

The importance of bios modifications: Basically, using bios modifications allows each card to provide a better hashrate -- and as hashrate increases, potential ETH mined increases. (More Hash == More $$). Bios modifications also change how each card consumes electricity -- and aside from the initial cost of computer components, electricity is the second greatest cost when it comes to mining.
  • Stock Bios: 80 Cards x 51MH from each card = 4.080GH
  • Modified Bios: 80 Cards x (54MH || 56MH) = (4.320GH || 4.480GH).
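The arithmetic in the two bullets above can be reproduced with a short sketch. The card count and per-card rates come from this post; the function name and the percentage gain calculation are only illustrative additions.

```python
# Fleet hashrate from the per-card figures quoted in this post.
CARDS = 80  # 40 owned + 40 on order


def fleet_hashrate_gh(cards: int, mh_per_card: float) -> float:
    """Total hashrate in GH for `cards` identical cards."""
    return cards * mh_per_card / 1000.0


stock = fleet_hashrate_gh(CARDS, 51)
modded_low = fleet_hashrate_gh(CARDS, 54)
modded_high = fleet_hashrate_gh(CARDS, 56)

# Relative gain of the modified BIOS over stock, as a fraction.
gain_low = (modded_low - stock) / stock
gain_high = (modded_high - stock) / stock

print(f"stock:    {stock:.3f} GH")
print(f"modified: {modded_low:.3f} GH to {modded_high:.3f} GH")
print(f"gain:     {gain_low:.1%} to {gain_high:.1%}")
```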
The delicate balance in bios modification reflects a specific relationship between: 1.) Power Consumption, 2.) Temperature, 3.) Hashrate, and 4.) Stability vs. Instability. In my opinion, the goal of proper bios modification means that: (a.) each card consumes as little power as necessary, (b.) each card remains within "safe" temperatures to prevent card damage, (c.) each card produces the maximum hashrate, (d.) each card remains stable, which is to say that the card does not produce "invalid" shares or other errors that get in the way of making money.

The above mentioned thread link reflects a dialogue between two members, @Mini_Me and @AlleyCat. The thread is several pages long because it reflects a series of tests (trials, errors, and new trials with new errors, etc.) with different bios values and with different values in HiveOS. There are multiple variables in play (both on the bios tweaking side with MPT and RBE and on the software tweaking side in HiveOS), and changing one variable influences other variables. The thread is so long because many different settings were tried out in order to achieve the best possible result for specific card manufacturers, in this case manufacturer Sapphire and manufacturer Gigabyte.

This post in particular captured my attention: https://www.igorslab.de/community/threads/gigabyte-5700-xt-bios-mod-fails.3304/post-79315.

RBE modified values in use:
Option 1 (preferable and recommended): apply the Apple Inc. VRAM timing straps linked below. Apply them once for the MT61K256M32 Micron memory and save the vBIOS; then load the saved vBIOS, apply the straps again for the K4Z80325BC Samsung memory if it is present, and save the vBIOS again.
https://www.igorslab.de/community/a...-mt61k256m32_gddr6_optimized_timings-zip.6544

MPT modified values in use:
Features Tab:
PPTable Features -> Feature Control = Nothing done. Left at default.
Overdrive Features = All Boxes Checked.

Overdrive Limits Tab:
GFX Maximum Clock = 1550
Memory Maximum Clock = 1000
Power Limit Maximum = 0
Power Limit Minimum = 0
Memory Timing Control = 1
Fan RPM Maximum = 3500
Fan RPM Minimum = 1100
Fan Acoustic Limit RPM Maximum = 3500
Fan Acoustic Limit RPM Minimum = 1100
Zero RPM Control = 0

Power and Voltage Tab:
Maximum Voltage GFX = 900
Maximum Voltage SoC = 1150
Minimum Voltage GFX = 700
Minimum Voltage SoC = 750
Power Limit GPU = 140
TDC Limit GFX (A) = 140
TDC Limit SoC (A) = 14

Frequency Tab:
GFX Maximum = 1400
GFX Minimum = 300
SoC Maximum = 1267
SoC Minimum = 507
Memory DPM 0 = 100
Memory DPM 1 = 500
Memory DPM 2 = 625
Memory DPM 3 = 960

Curve Tab:
(Left Alone, no change).
AVFS (GHz->V)
Override box is not ticked.
a = 0.017810
b = -0.047280
c = 0.054020
StaticVoltageOffset (GHz->V) = 0.000000

Fan Tab:
PWM Minimum = 15
Fan Acoustic Limit RPM = 1550
Fan Throttling RPM = 3200
Fan Maximum RPM = 3500
Fan Target Temperature = 85
Fan GFX Clock = 800
Zero RPM Enable Box is ticked.
Stop Temperature = 60
Start Temperature = 68

As you can see, in order to achieve stability (no invalid shares, rejected shares, or PhoenixMiner restarts), I have to run most cards at 1375 core. Other cards I had to reduce to 1325 core. Some cards run at memory 930, some at 905, some at 900, and some at 890.

I know my cards can give better performance. Even though I have lowered the power values to reduce the number of invalid and rejected shares, PhoenixMiner still restarts from time to time, and sometimes a card will "stop" and disappear, which triggers the PhoenixMiner restart. For these cards, I reduce the values.

Any advice from master @Mini_Me would be appreciated.


RIG #1
RIG1.png

RIG #2
RIG2.png

RIG #3
RIG3.png

RIG #4
RIG4.png

RIG #5
RIG5.png
 
The rig worked for 6 hours. During this time gpu1 had 2 invalid shares. And after 6 hours, this card was recognized as dead and the rig was restarted.
View attachment 12415
Very well, let us increase the MVDD to 1350 mV. If there is still a problem after that, I believe it is better not to cap the SoC maximum clock and to leave it at 1267 MHz.
 
For this GPU1, the MVDD value was already 1350 mV. I also did not set the SoC clock at all; I left it at the BIOS value of 1267 MHz.
Screenshot_20210414-161303.jpg
 
I am reporting this only because it might be useful to someone.
I reduced the memory clock of the problematic graphics card from 900 to 898, and the problem was fixed.
The overclocking settings are now in place and the system is stable.
10.JPG
 
Greetings,

I will very soon post all the unlocked vBIOS files in a new thread so that everyone can download the appropriate one for their cards.

Have a good day,

Regards.
 
  1. TeamRedMiner (55.5MH per XFX 5700 XT) produces less hash per card than PhoenixMiner (56.60MH per XFX 5700 XT). Over 8 cards per rig, the rate went from about 453MH to about 444MH. However, if the loss in MH produces a more stable system, then that's okay in my book. If PhoenixMiner causes instability or causes a system reboot, then you're not making money during the reboot process. If you have a system that reboots a lot over 24 hours, then you lose money each time the system reboots.
  2. Average Core Temp Across 40 cards = 47.425
  3. Average Mem Temp Across 40 cards = 72.75
  4. Some cards produced errors (invalid shares, rejected shares) within 5 minutes of operation. For these cards, I lowered mem clock from 905 to 900, and left all other values alone. That seems to have helped.
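The trade-off in point 1 can be put into numbers with a rough sketch. It assumes that TeamRedMiner runs with no downtime at all and that a rig hashes at zero during a PhoenixMiner restart or reboot; both are simplifying assumptions, and the function names are illustrative only.

```python
# Break-even downtime between the two miners, using the per-card rates above.
PHOENIX_MH = 56.60  # PhoenixMiner, per card
TRM_MH = 55.5       # TeamRedMiner, per card
CARDS_PER_RIG = 8


def rig_hashrate(mh_per_card: float, cards: int = CARDS_PER_RIG) -> float:
    """Hashrate of one rig in MH."""
    return mh_per_card * cards


def break_even_uptime(stable_mh: float, fast_mh: float) -> float:
    """Uptime fraction the faster miner needs to match the slower one
    running 100% of the time (assumes zero hashrate while down)."""
    return stable_mh / fast_mh


uptime = break_even_uptime(TRM_MH, PHOENIX_MH)
max_downtime_min = (1 - uptime) * 24 * 60  # minutes per 24 hours

print(f"PhoenixMiner rig: {rig_hashrate(PHOENIX_MH):.1f} MH")
print(f"TeamRedMiner rig: {rig_hashrate(TRM_MH):.1f} MH")
print(f"break-even downtime: {max_downtime_min:.0f} min per day")
```

Under these assumptions, the faster miner stops paying off once it loses roughly half an hour per day to restarts.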
Overclocking Template I use in HiveOS:
Core Clock = 1420
Core State = 2
Core Voltage = 790
Memory Controller Voltage = 780
Memory Clock = 905 ... (Adjusted down to 900 for cards that give errors within 5 mins of startup)
Mem State = blank
Memory Voltage = 1350
Fan % = blank
Power Limit = blank
Aggressive Undervolting = off
Amdmemtweak REF = blank
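For a rig where only some cards need the lower memory clock, the per-card values can be generated rather than typed by hand. This is a minimal sketch that assumes the Memory Clock field accepts one value per card separated by spaces; the card indices and the helper name are hypothetical.

```python
# Build a per-card memory clock string from the template values above.
DEFAULT_MEM_CLOCK = 905   # template value
FALLBACK_MEM_CLOCK = 900  # for cards that give errors within 5 minutes


def mem_clock_field(num_cards: int, error_cards: set) -> str:
    """One memory clock per card, space separated.

    Cards whose index is in `error_cards` get the fallback clock.
    """
    return " ".join(
        str(FALLBACK_MEM_CLOCK if i in error_cards else DEFAULT_MEM_CLOCK)
        for i in range(num_cards)
    )


# Hypothetical example: cards 1 and 5 of an 8-card rig showed early errors.
field = mem_clock_field(8, {1, 5})
print(field)
```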

MPT:
Features Tab:
PPTable Features = Not Touched. Feature Control = Nothing Touched.
Overdrive Features = All boxes are ticked.

Overdrive Limits Tab:
GFX Maximum Clock = 1440
Memory Maximum Clock = 1000
Power Limit Maximum = 0
Power Limit Minimum = 0
Memory Timing Control = 1
Fan RPM Maximum = 3500
Fan RPM Minimum = 1100
Fan Acoustic Limit RPM Maximum = 3500
Fan Acoustic Limit RPM Minimum = 1100
Zero RPM Control = 0

Power and Voltage Tab:
Maximum Voltage GFX = 1050
Maximum Voltage SoC = 1050
Minimum Voltage GFX = 750
Minimum Voltage SoC = 750
Power Limit GPU = 140
TDC Limit GFX (A) = 120
TDC Limit SoC (A) = 12
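As a quick sanity check, the HiveOS template values can be compared against the MPT limits listed so far. This sketch assumes the Overdrive Limits and the GFX voltage range act as hard bounds on what the template may request; the dictionary keys and function name are made up for illustration.

```python
# Compare the HiveOS overclocking template with the MPT limits above.
OVERDRIVE_LIMITS = {"gfx_max_clock": 1440, "mem_max_clock": 1000}
VOLTAGE_LIMITS = {"gfx_min_mv": 750, "gfx_max_mv": 1050}
TEMPLATE = {"core_clock": 1420, "mem_clock": 905, "core_mv": 790}


def check_template(template: dict, od: dict, volts: dict) -> list:
    """Return a list of human-readable problems; empty if all values fit."""
    problems = []
    if template["core_clock"] > od["gfx_max_clock"]:
        problems.append("core clock above GFX Maximum Clock")
    if template["mem_clock"] > od["mem_max_clock"]:
        problems.append("memory clock above Memory Maximum Clock")
    if not volts["gfx_min_mv"] <= template["core_mv"] <= volts["gfx_max_mv"]:
        problems.append("core voltage outside GFX voltage range")
    return problems


issues = check_template(TEMPLATE, OVERDRIVE_LIMITS, VOLTAGE_LIMITS)
print(issues if issues else "template fits within the MPT limits")
```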

Frequency Tab:
GFX Maximum = 1270
GFX Minimum = 300
SoC Maximum = 1267
SoC Minimum = 507
Memory DPM 0 = 100
Memory DPM 1 = 500
Memory DPM 2 = 625
Memory DPM 3 = 950

Curve Tab:
Nothing touched.
AVFS (GHz->V) Override - not ticked.
a = 0.017810
b = -0.047280
c = 0.054020
StaticVoltageOffset (GHz->V) = 0.000000

Fan Tab:
PWM Minimum = 15
Fan Acoustic Limit RPM = 1550
Fan Throttling RPM = 3200
Fan Maximum RPM = 3500
Fan Target Temperature = 85
Fan Target GFX Clock = 800
Zero RPM Enable = ticked.
Stop Temperature = 60
Start Temperature = 70

RBE:
I am using the Apple straps.
 
Yes, but if I follow the steps on page 32, the 124 W Power Limit GPU setting limits the efficiency of this card. I think there is something wrong with the API of this card. I only get the correct hashrate with Power Limit GPU at 140 W, and consumption at the wall is then equivalent to the Sapphire 5700 XT:

View attachment 9716
I would appreciate it if you could redo the guide, as DS_SOCCLK and DS_DCEFCLK were removed because they do not work well with Micron memory.
 
I have moved the rig to a room with air conditioning set at 24 °C and a floor fan, and now I have 3 cards at 78-80 °C and one card at 84 °C. I'm going to give your BIOS a try with the hottest one. The BIOS that I have on my cards is a modified one, following the page 32 guide of @Mini_Me. I haven't followed the guide on page 15 (for cards with a sensitive (restricted) power curve) because it involves using Windows on the rig and I have HiveOS, but if I have to do it for better mem temps, I will. Will this guide help me with mem temps? I don't yet understand the basics of BIOS modding; I just follow the guides. Regarding the OC settings, I have tried lowering the memory voltage, but I get an unstable system (I'm currently at 1290 mV).
At this time I have gone 48 hours without GPU dead errors with these values:

View attachment 10444

If I have to sacrifice a little hashrate to get safe mem temps, I will do it. What values do you guys suggest to achieve this? I understand that if I reduce the memory voltage I have to reduce the mem clocks, but in what increments should I do it?

Thanks in advance
Please use one of the settings below to decrease the memory temperature:

Core clock, VDD : Memory clock, VDDCI, MVDD, Hashrate in TeamRedMiner,

- 1330 MHz, 750 mV : 850 (1700/2) MHz, 800 mV, 1350 mV, ~52.10 MH/s,
- 1345 MHz, 750 or 760 mV : 865 (1730/2) MHz, 810 mV, 1350 mV, ~53.10 MH/s,
- 1360 MHz, 760 or 770 mV : 880 (1760/2) MHz, 820 mV, 1350 mV, ~54.10 MH/s.
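The three settings above can be treated as a small lookup table. The sketch below picks the lowest-clocked setting that still reaches a wanted hashrate, on the assumption that a lower memory clock and VDDCI mean lower memory temperature. Where the list gives two VDD options, the lower one is used here; the field and function names are illustrative.

```python
# The three suggested settings, lowest clocks first.
PRESETS = [
    {"core_mhz": 1330, "vdd_mv": 750, "mem_mhz": 850, "vddci_mv": 800, "mvdd_mv": 1350, "mh": 52.10},
    {"core_mhz": 1345, "vdd_mv": 750, "mem_mhz": 865, "vddci_mv": 810, "mvdd_mv": 1350, "mh": 53.10},
    {"core_mhz": 1360, "vdd_mv": 760, "mem_mhz": 880, "vddci_mv": 820, "mvdd_mv": 1350, "mh": 54.10},
]


def coolest_preset_for(min_mh: float):
    """Lowest memory clock preset reaching at least `min_mh`, or None."""
    candidates = [p for p in PRESETS if p["mh"] >= min_mh]
    if not candidates:
        return None
    return min(candidates, key=lambda p: p["mem_mhz"])


choice = coolest_preset_for(53.0)
if choice is not None:
    print(f"core {choice['core_mhz']} MHz, mem {choice['mem_mhz']} MHz "
          f"({choice['mem_mhz'] * 2} effective), ~{choice['mh']:.2f} MH/s")
```

Each step in the list adds 15 MHz to both clocks and 10 mV to VDDCI for about 1 MH/s, so stepping down one setting at a time is a natural interval to test.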
 