Basics • GPUs • Graphics • Practice • Reviews

Gaming Benchmarks: FPS, Percentiles, Frame Time, Variances, Power Consumption and Efficiency simply explained

7. July 2021 06:25

Power consumption measurement with NVIDIA PCAT

Interesting now, however, is the permanent measurement of the power consumption (colloquially “consumption”) with PCAT and the necessary hardware as well as the linking of this data via a suitable API with the just mentioned FrameView from NVIDIA. If you don’t know my articles about power consumption measurement with high-resolution oscillograph technology (8-channel analog in master/slave with two storage oscillographs), I would like to refer you to the exact explanation of my measurement technique and the actual problem, but I will have to come back to it later anyway. Also to explain what PCAT can and cannot do. First of all, here are the two important links for a better understanding in advance:

– Graphics card vs. power supply – basics, causes of errors and correct power supply sizing
– The battle of graphics card vs. power supply – power consumption and load peaks demystified

But back to PCAT and what you can do with this new tool. The comfortable and perhaps financially or technically limited tester will be happy about the fact that PCAT elegantly solves two problems at the same time. It reads out as hardware more or less in real time (100 ms) the power consumption of the live 12V and 3.3 volt rails (PCIe and slot) on the graphics card and can also display the whole thing graphically. In addition, there is an interface (API) that FrameView accesses to log everything in a single log file with chronological accuracy. But more on that in a moment. First, let’s take a look at what NVIDIA has packed into the box:

PCAT consists of a MCU board (Multi Controller Unit) for the measurement of up to three 6+2 pin connectors including an optional OLED display for on-the-fly indication, the PCIe 4.0 compatible riser card for direct measurement at the motherboard slot and spacers which help to compensate the height difference of this card in order to still be able to screw the graphics card securely. Let’s take a closer look at the MCU board first.

The measuring principle is simple. One simply measures the voltage drop in parallel with a very low impedance resistor (shunt), which is located in each individual live rail, in order to be able to deduce the current flowing. In addition, one measures the remaining voltage after the shunt and thus automatically has the power consumption that runs over the respective rail. Add it all up and you get the total power consumption of the graphics card. This so-called monitoring is provided by the well-known INA3221, which could be found as a monitoring chip on many graphics cards, even though it is now often replaced by a similar chip from On Semi.

The microcontroller is the MEGA32U4 from ATMEL. This is the usual low-power 8-bit AVR RISC microcontroller, which has 32 KB of programmable Flash program memory and also includes 2.5 KB of SRAM and a 1 KB EEPROM, supports USB 2.0 output, has a 12-channel 10-bit A/D converter, and supports the JTAG interface. The computing power is about 16 MIPS and 16 MHz. Sound a little too Arduino? Logically, the principle is always the same and very similar moreover, which I also use for all my motherboard tests, which also include the graphics cards.

The riser card is similar in principle, also here an INA3221 is located, which monitors a total of 2 shunts, once for the 12 volt and once for the 3.3 volt rail. The shunts are of course much smaller because the currents are also lower. Coupling and communication takes place via the bus interface already mentioned.

PCAT – The Software

The software reads the whole story, so it communicates with the MCU board of the PCAT system via the USB. Arduino uses that too, though I’ve since switched to Ethernet for smaller intervals. The intervals of 100 ms used by PCAT are quite large, but they are quite sufficient for a simple acquisition over a longer period of time. A peak load will only be caught by chance, but even that is possible with a bit of luck. But PCAT, to be fair, is simply not designed for such analyses. You can quickly get into the middle five-digit range, which is beyond the scope of most people and therefore not up for discussion.

But what the PCAT software can do: it also acts as an API, so that the latest versions of NVIDIA’s FrameView can also access it and read out the relevant data. What you can do with it, we will see later in the graphical evaluation.

Pages:

18 Antworten

Zeige alle Kommentare an

Kommentar

Lade neue Kommentare

Blubbie

Urgestein

808 Kommentare 275 Likes

#1 Jul 07, 2021

Genau wegen solchen fundierten und gut erklärten Grundlagen Artikeln ist Igor's Seite absolute Spitzenklasse!
Weiter so!
(y):coffee:

Antwort 8 Likes

RX480

Urgestein

1,857 Kommentare 850 Likes

#2 Jul 07, 2021

Das klingt total interessant, an welchen Stellen FrameView misst.

Da gabs letztens Diskussionen bei CB, ob net CFX totalen Mist misst.

Antwort Gefällt mir

LurkingInShadows

Urgestein

1,348 Kommentare 550 Likes

#3 Jul 07, 2021

@Igor Wallossek :
Frage zu den Mikrorucklern; du schreibst ab 8ms wirds merkbar, ab 16 sehr deutlich; aber ist das nicht Monitorabhängig?

Also solange Varianz (ms) < (1000/Hz)/2 (ms) ists nicht merkbar? Da kommt auch deine Aussage hin, allerdings nur bei einem 60 Hz Monitor.

Antwort Gefällt mir

Igor Wallossek

10,173 Kommentare 18,755 Likes

#4 Jul 07, 2021

Die ganzen Sync-Verfahren lasse ich mal außen vor, weil ich sie ja eh nicht aktivieren darf. Und ich beziehe mich hier weniger auf Monitore, sondern physiologische Grenzen und das, was der mernsch selbst wahrnehmen kann.

Antwort Gefällt mir

Martin Gut

Urgestein

7,751 Kommentare 3,558 Likes

#5 Jul 07, 2021

Ja und nein. Bei 240 Hz merkt man es viel weniger als bei 60 Hz wenn mal ein Bild ausfällt. Unterschiede von mehr als 16 ms sind bei 60 wie auch bei 240 Hz spürbar. Unterschiede von 2 oder 4 ms sind bei 60 Hz meist gar nicht sichtbar. Bei 240 Hz sind sie wohl sichtbar, aber so kurz dass man es auch kaum wahr nimmt.

Antwort 1 Like

RX480

Urgestein

1,857 Kommentare 850 Likes

#6 Jul 07, 2021

Das Auge sieht ja net den absoluten Wert, merkt hingegen die Schwankung.(x)
D.h. in der Praxis hilft ein Cappen per Fps-Limit, um die Schwankung max Fps zu minFps zu verringern,
was zusätzlich noch dem Inputlag hilft.

(x) worst Case = beim Frameverlauf wenn auf einen Berg direkt ein Tal folgt = größte Differenz
(natürlich kann Buffern das etwas abmildern; ... interessant ist dann wirklich Wie und Wo die Frametimes gemessen werden)

Antwort 1 Like

LurkingInShadows

Urgestein

1,348 Kommentare 550 Likes

#7 Jul 07, 2021

Solange aber dieser worst case bei der halben Bildfrequenz liegt kann ein Monitor das NIE darstellen => wird nie merkbar

zB.: 60 Hz ergibt 16 2/3 ms Bildanzeigeintervall, wenn die GPU mehr Zeit braucht wird das letzte Bild nochmals für ein komplettes weiteres Intervall dargestellt, oder?
=> <16 2/3 sollte sich ausgehen, kann aber mit lag daneben gehen, 8 1/3 geht sich sicher aus.

Bei 240 Hz müssten alle Zeiten geviertelt sein, um den selben Effekt zu erreichen, also 4 1/6 ms Intervall; 2 1/12ms zum sichergehen.

Meine Frage oben bezog sich dann mit monitorabhängig darauf, dass 8 ms Varianz einem 60Hz Monitor schurzegal sind, bei einem 240 Hz Monitor aber zur erneuten Darstellung des Frames führen => Ruckler

Ob der sichtbar/spürbar wäre weiß ich nicht, wenn ja müssten die Nutzer der beiden Monitore in der Grafik verschiedene Grün-Rottöne als Messlatte nehmen um flüssiges Spielen zu garantieren.

Antwort Gefällt mir

RX480

Urgestein

1,857 Kommentare 850 Likes

#8 Jul 07, 2021

240Hz ist vllt. gleich ein zu großer Sprung für nen Vgl., idealer wäre 144Hz, weil sehr verbreitet, bzw. bei 4k auch 96...120Hz@HDR gebräuchlich.
(96Hz waren die Modelle mit Bandbreitenlimit)

Antwort Gefällt mir

Martin Gut

Urgestein

7,751 Kommentare 3,558 Likes

#9 Jul 07, 2021

Die Varianz ist nicht die Frametime. Für deine Überlegungen mit der Anzeige auf dem Bildschirm sind die Frametimes entscheidend. So wie du es beschreibst, läuft es, wenn man V-Sync an hat und die FPS auf die Bildschirmfrequenz limitiert.

Die Varianz ist (wenn ich Igors Beschreibung richtig verstehe) der Unterschied zwischen aufeinander folgenden Renderzeiten (Frametimes). Ein Frame hatte beispielsweise eine Frametime von 10 ms, der nächste 19 ms. Das gäbe dann eine Varianz von 9 ms. Das zeigt, wie unregelmässig das Game läuft. Die Varianz lässt sich aber nicht mit den Zeitintervallen der Bildschirmfrequenz vergleichen.

Bitte korrigiert mich wenn ich es falsch interpretiert habe.

Antwort 1 Like

Igor Wallossek

10,173 Kommentare 18,755 Likes

#10 Jul 07, 2021

Du liegst völlig richtig, es ist die Differenz zwischen zwei aufeinanderfolgenden Frame Times. Bei sehr hohen FPS-Zahlen sind kurze Drops oder Spikes auch nichts, was die Elektronic so schnell wegbügelt.

Antwort Gefällt mir

RX480

Urgestein

1,857 Kommentare 850 Likes

#11 Jul 07, 2021

Wie war das genau bei Vsync+Fps-Limit ?

Je nachdem welcher Abstand zu der Moni-Frequenz (Hz), gibts 1 oder 2 oder 3 Frames Buffer.(x)
Bei AMD weniger Buffer als bei NV.(nur im UltraLatencyMode wirds auch bei NV weniger)

Falls man denn versuchen will die Varianz etwas wegzubügeln.

(x) manche Games haben neuerdings sogar die Möglichkeit Vsync mit niedrigem oder normalen Buffer auszuwählen.

Antwort Gefällt mir

Martin Gut

Urgestein

7,751 Kommentare 3,558 Likes

#12 Jul 07, 2021

Ich kenne es nur von Nvidia. Die Grösse des Buffers kann man seit Jahren in der Nvidia-Systemsteuerung einstellen. Es ist eine separate Einstellung. Für eine kurze Latenz sollte man den Wert möglichst tief setzen und ev. den Ultra-modus einstellen. Dafür hat man dann keinen Buffer, der Ruckler wegbügeln kann.

Antwort Gefällt mir

RX480

Urgestein

1,857 Kommentare 850 Likes

#13 Jul 07, 2021

seit Jahren, really?

Erst seit Navi mit AntiLag kam, hat NV ein bisschen nachgelegt und am Anfang funzte mit Turing UltraLow mehr schlecht als recht
in Games. Showcases mal außen vor.

Antwort Gefällt mir

Martin Gut

Urgestein

7,751 Kommentare 3,558 Likes

#14 Jul 07, 2021

Der Ultra-Modus ist recht neu. Aber die Einstellung wie viele Frames der Buffer haben soll und ob man 3-fachpuffer verwenden will, sind über 5 Jahre alt. Wie gut es funktioniert hat, kann ich nicht sagen, da ich das nicht messen kann. Zumindest konnte man es mit den falsche Einstellungen spürbar schlechter machen. :p

Antwort Gefällt mir

RX480

Urgestein

1,857 Kommentare 850 Likes

#15 Jul 07, 2021

Das stimmt,
zum Glück gibts die BlurBusters auch schon seit Jahren.
War schon interessant mit RTSS und genauen Einstellungen.
(weil 60Hz ist net 60Hz exactly)

btw.
Ganz nebenbei ist z.Bsp. auch 59,9394 bzgl. der Linkgeschwindigkeit bei meinem 4k-Moni dann niedriger, wenn ich ich auf dem Desktop
FHD einstelle. Wenn ich dumb W10 die Wahl lasse, bekomme ich 60Hz mit ner verkappten Linkgeschwindigkeit ala 4k, ... und schon ist man
Dabei mit dem "Vram net runtertakten -Problem", falls man bei DS die Häkchen rausnimmt (im MPT).

Antwort Gefällt mir

Kearvaig

Mitglied

32 Kommentare 9 Likes

#16 Jul 08, 2021

Vielen Dank für den Artikel, endlich mal einige Fragen beantwortet...

Antwort Gefällt mir

Raistlin666

Neuling

1 Kommentare 0 Likes

#17 Nov 25, 2022

Kurze Frage:
Bei Nvidia gibts weder den Interpreter noch das Sheet zu laden (ist auch nicht mehr im Install-Ordner zu finden). Ich hab gestern ne Analyse starten wollen, aber bei 10mins recording kommt kein Tool ansatzweise mit der Datenmenge klar - gibts da irgendwelche Tipps? Excel kann ich komplett knicken, nicht nur dass es bei 15MB csv irgendwie 1,5GB RAM braucht, es crasht auch dauernd und zeit irgendwie random Daten an, wenn ich die korrekte Spalte wähle (100k Zeilen, Datenauswahl hat 100k Zeilen, Diagramm zeigt aber nur 3000). Mit PowerBI und Google Sheets komme ich auch nicht weiter...

Antwort Gefällt mir

BullX

Neuling

4 Kommentare 0 Likes

#18 Nov 21, 2023

Hallo zusammen,

kennt jemand eine Quelle für eine aktualisierte Überstellung der Leistungsaufnahme von Grafikkarten ?
So wie hier im Artikel gezeigt ist es top. Nur leider ist dort die neue Generation nicht vertreten.

Antwort Gefällt mir

Alle Kommentare lesen unter igor´sLAB Community →

Danke für die Spende

Du fandest, der Beitrag war interessant und möchtest uns unterstützen? Klasse!

Hier erfährst Du, wie: Hier spenden.

Hier kannst Du per PayPal spenden.

Boosters for the Rocket (Lake) – G.Skill Ripjaws V DDR4-5066 CL20 2x 8 GB Memory Kit Review with Overclocking

Original vs. Clone – Prusa MK3S vs. Fysetc Replica for the Half of the Price!

About the author

View All Posts

Igor Wallossek

Editor-in-chief and name-giver of igor'sLAB as the content successor of Tom's Hardware Germany, whose license was returned in June 2019 in order to better meet the qualitative demands of web content and challenges of new media such as YouTube with its own channel.

Computer nerd since 1983, audio freak since 1979 and pretty much open to anything with a plug or battery for over 50 years.

Follow Igor:
YouTube Facebook Instagram Twitter