PCI (GPU) passthrough hardening: option ROM edition #1087

marmarek · 2024-10-12T12:27:02Z

The problem you're addressing (if any)

Passing a GPU (or any PCI device, for that matter) through to a less trusted VM may allow that VM to reflash the device's firmware. Just after a reboot (during firmware and OS startup), the device is not yet isolated in a VM and may try to compromise the whole host. This can be done in at least two ways:

doing DMA into arbitrary addresses - this should be covered by the early boot DMA protection already
providing malicious option ROM for firmware to execute

Theoretically, reflashing malicious firmware should not be possible due to (at least) the signature check done by the GPU firmware update mechanism, but history shows that this check sometimes turns out to be buggy or ineffective, or in some cases even non-existent.

Describe the solution you'd like

I see two solutions:

An option to disable loading option ROM, either globally, or per-device. This of course is acceptable only if OS driver (within a VM) can work correctly if option ROM wasn't loaded. This also kinda assumes the dGPU is not the only GPU in the system (which is true in setups where dGPU is used for passthrough).
An option to enforce UEFI SecureBoot signature on option ROM independently of enabling SecureBoot for loading OS. This is less robust, as it still may allow attacks where properly signed option ROM is used (maybe downgraded to an earlier version by malicious actor?) but some configuration data is changed to exploit it. But on the other hand, it's probably more compatible (especially if executing option ROM is necessary for later using the dGPU in a VM, or if it's the only GPU in the system).

In either case, there needs to be a way for the OS to verify whether the mechanism was enabled, so that it can inform the user if passthrough is safe for a given device. Similarly, the OS needs to be informed whether early boot DMA protection was enabled. Maybe there is some ACPI table that can be used to pass this info to the OS? Or maybe the OS can inspect the coreboot config (cbfs?) to check if the option is enabled?

Where is the value to a user, and who might that user be?

Use GPU passthrough with reduced risk of compromising the whole system.

Describe alternatives you've considered

Alternative solution could be reliably blocking reflashing dGPU firmware by the VM. And ensure device reset on reboot works reliably too. In other words - ensure that all VM-controlled state is discarded on reboot.

I think this solution would require changes to the board design, and thus be significantly harder to make in practice.

Additional context

We consider making a feature like this mandatory for allowing Qubes OS certification of systems with dGPU. Without such feature, we don't consider dGPU passthrough safe enough to certify such system, and thus it doesn't make much sense for users to buy systems like this if dGPU would be allowed only in dom0, as it would be mostly wasted.

This is especially relevant for V5x models with nvidia.

zirblazer · 2024-10-16T02:44:21Z

An option to disable loading option ROM, either globally, or per-device. This of course is acceptable only if OS driver (within a VM) can work correctly if option ROM wasn't loaded. This also kinda assumes the dGPU is not the only GPU in the system (which is true in setups where dGPU is used for passthrough).

The MSI boards already have a global Option ROM disable, but I couldn't convince them to offer per-slot granularity. It is either fully on, fully off, or GPU Option ROMs only.

An option to enforce UEFI SecureBoot signature on option ROM independently of enabling SecureBoot for loading OS. This is less robust, as it still may allow attacks where properly signed option ROM is used (maybe downgraded to an earlier version by malicious actor?) but some configuration data is changed to exploit it. But on the other hand, it's probably more compatible (especially if executing option ROM is necessary for later using the dGPU in a VM, or if it's the only GPU in the system).

You want to enable Secure Boot, then tell it to skip checking Boot Loaders so that it validates Option ROMs only. So, Point 8 here: #929
Never expected someone from Qubes to ask for something to be made less secure, heh.

Alternative solution could be reliably blocking reflashing dGPU firmware by the VM. And ensure device reset on reboot works reliably too. In other words - ensure that all VM-controlled state is discarded on reboot.

And how can you possibly do that? As far as I know, VBIOS flashing works by using vendor tools that tell the GPU to use its internal I2C/SPI/whatever Controller to flash the ROM. If you passed the card through, these vendor tools would work as in a bare metal environment, and I don't see how you are going to block that.
What you can do is sideload the VBIOS of the card on VM launch (not sure about Xen, but on standalone QEMU it is possible and I have been sporadically using it since 2015). So you dump the VBIOS and tell QEMU to load it every time the VM is launched.
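A minimal sketch of the sideloading idea, assuming the VBIOS has already been dumped to a file. It relies on the `romfile=` property of QEMU's `vfio-pci` device; the PCI address, the ROM path, and the helper name `vfio_device_arg` are placeholders made up for illustration.

```python
import shlex


def vfio_device_arg(host_bdf: str, romfile: str) -> list[str]:
    """Build QEMU arguments for a passed-through device with a sideloaded ROM.

    host_bdf and romfile are illustrative values supplied by the caller.
    """
    # QEMU separates device properties with commas, so reject values
    # that would be parsed as extra properties.
    if "," in host_bdf or "," in romfile:
        raise ValueError("commas are not allowed in device properties")
    return ["-device", f"vfio-pci,host={host_bdf},romfile={romfile}"]


# Hypothetical address and path; adjust to the actual system.
args = ["qemu-system-x86_64", "-enable-kvm", "-m", "4G"]
args += vfio_device_arg("0000:01:00.0", "/var/lib/vbios/gpu.rom")
print(shlex.join(args))
```

With this, the guest sees the ROM image from the file rather than whatever is currently stored in the card's flash. It does not stop the guest from rewriting that flash.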

I think this solution would require changes to the board design, and thus be significantly harder to make in practice.

Sure, you can try asking one of the third party vendors to add a Flash Write disable jumper, which I recall having seen on a few historical PC motherboards when Flash ROM was first introduced. But unless you're a high bidder that can ask for a few thousand custom cards, it is not going to happen, so I won't bother with it.

marmarek · 2024-10-16T15:59:01Z

The MSI boards already have a global Option ROM disable, but I couldn't convince them to offer per-slot granularity. It is either fully on, fully off, or GPU Option ROMs only.

This may be enough, if there are no other devices needing Option ROM. In this particular case we are talking about a laptop, so the customization is limited (yes, I know you can still attach almost any PCIe device, but it's much less common in practice).

You want to enable Secure Boot, then tell it to skip checking Boot Loaders so that it validates Option ROMs only. So, Point 8 here: #929

Yes, exactly.

Never expected someone from Qubes to ask for something to be made less secure, heh.

Well, I want to enable it for Option ROM even if you need it disabled for the OS.

And how can you possibly do that? As far as I know, VBIOS flashing works by using vendor tools that tell the GPU to use its internal I2C/SPI/whatever Controller to flash the ROM. If you passed the card through, these vendor tools would work as in a bare metal environment, and I don't see how you are going to block that.

Yes, exactly, that's the problem.

What you can do is sideload the VBIOS of the card on VM launch (not sure about Xen, but on standalone QEMU it is possible and I have been sporadically using it since 2015). So you dump the VBIOS and tell QEMU to load it every time the VM is launched.

That doesn't help much if the internal flash can still be modified, even if it is not loaded by that VM. The Option ROM could still be changed and would be used by the firmware on the next reboot (unless you do a similar trick in firmware to side-load the Option ROM?).

Sure, you can try asking one of the third party vendors to add a Flash Write disable jumper, which I recall having seen on a few historical PC motherboards when Flash ROM was first introduced. But unless you're a high bidder that can ask for a few thousand custom cards, it is not going to happen, so I won't bother with it.

Yes, it would be a technically better solution (as it's more comprehensive than just the Option ROM), but it is not feasible at this scale.

zirblazer · 2024-10-16T21:01:58Z

And how can you possibly do that? As far as I know, VBIOS flashing works by using vendor tools that tell the GPU to use its internal I2C/SPI/whatever Controller to flash the ROM. If you passed the card through, these vendor tools would work as in a bare metal environment, and I don't see how you are going to block that.

Yes, exactly, that's the problem.

I don't see that as a problem you can actually fix.

What you can do is sideload the VBIOS of the card on VM launch (not sure about Xen, but on standalone QEMU it is possible and I have been sporadically using it since 2015). So you dump the VBIOS and tell QEMU to load it every time the VM is launched.

That doesn't help much if the internal flash can still be modified, even if it is not loaded by that VM. The Option ROM could still be changed and would be used by the firmware on the next reboot (unless you do a similar trick in firmware to side-load the Option ROM?).

Not if you disable loading Option ROMs. It may also be possible to hash the Option ROM (Point 7 of my writeup) so that you know it hasn't changed. And yeah, putting an Option ROM in firmware and loading it for that device instead of its own one, QEMU style, should also be possible.
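The hashing idea can be sketched roughly as follows. This is only an illustration in Python of comparing a ROM image against a digest recorded while the device was in a known-good state; real firmware would do the equivalent in its own code, and the choice of SHA-256, the function names, and the stand-in ROM bytes are all assumptions made for the example.

```python
import hashlib
import hmac


def rom_sha256(rom: bytes) -> str:
    """Return the SHA-256 digest of a ROM image as a hex string."""
    return hashlib.sha256(rom).hexdigest()


def rom_unchanged(rom: bytes, expected_hex: str) -> bool:
    """Check a ROM image against a digest recorded in a known-good state."""
    return hmac.compare_digest(rom_sha256(rom), expected_hex.lower())


# Stand-in image for illustration only: the 0x55 0xAA expansion ROM
# signature followed by zero padding, not a real VBIOS.
known_good = b"\x55\xaa" + bytes(510)
baseline = rom_sha256(known_good)

# Flip a single bit to simulate a reflashed ROM.
tampered = bytearray(known_good)
tampered[100] ^= 0x01

print(rom_unchanged(known_good, baseline))       # True
print(rom_unchanged(bytes(tampered), baseline))  # False
```

A check like this only detects a change in the image the firmware reads. It still depends on the baseline having been recorded before any untrusted VM had access to the device.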

marmarek · 2024-10-16T22:47:07Z

What I care about is that a reflashed GPU (which, as we've already established, is hard to prevent in the first place) is not able to attack the host. There are many ideas for how to achieve that: in the issue description, the comments, and the other issue.

pietrushnic · 2024-10-29T10:20:04Z

doing DMA into arbitrary addresses - this should be covered by the early boot DMA protection already

The only problem I see here is that it is not validated. We would need grants to enable the DMA attacking tool in the automation process. We have capable hardware. That could confirm in every release that DMA protection is correctly applied.

providing malicious option ROM for firmware to execute

Option ROMs are typically signed by the Microsoft Option ROM UEFI CA 2023 or an older one. We can have such a certificate in our DB; the problem is whether we trust MSFT. OTOH, in a "clean" state the user could enroll the hash of the Option ROM into the DB, as sbctl users do to remove MSFT from the chain of trust.

Theoretically, reflashing malicious firmware should not be possible due to (at least) the signature check done by the GPU firmware update mechanism, but history shows that this check sometimes turns out to be buggy or ineffective, or in some cases even non-existent.

This is an exciting part. Do you have any examples of such issues? Because of that, OCP requested a standard update mechanism for GPU firmware, and a document was created.

An option to disable loading option ROM, either globally, or per-device.

This was probably already requested and at least partially implemented. Please check #139

This of course is acceptable only if OS driver (within a VM) can work correctly if option ROM wasn't loaded.

And for complex modern devices, that can be the core issue.

This also kinda assumes the dGPU is not the only GPU in the system (which is true in setups where dGPU is used for passthrough).

To prevent soft-bricking, one would imagine that boot firmware would detect that fact and warn the user or even not allow the user to self-soft-brick.

An option to enforce UEFI SecureBoot signature on option ROM independently of enabling SecureBoot for loading OS.

This and many other improvements could be employed in UEFI Secure Boot. I already have a ton of requirements in that space. I will explore our options as part of my training campaign in 2025. It may not be hard to implement that at least partially.

In either case, there needs to be a mechanism for the OS to verify if the mechanism was enabled to inform the user if passthrough is safe for a given device.

TPM measurement + event log? There are also UEFI variables dedicated to exposing firmware capabilities to OS like OsIndicationsSupported.
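As a rough sketch of how an OS-side tool could read such a variable on Linux: efivarfs exposes each variable as a file whose first 4 bytes are the variable attributes, followed by the data, which for OsIndicationsSupported is a 64-bit little-endian bitmask. The bit names below are the ones I am confident the UEFI specification defines. None of them describes option ROM policy, so a bit or variable for this feature would be something new and is not shown. The function name is made up for the example.

```python
import struct

# Path under efivarfs (EFI_GLOBAL_VARIABLE GUID).
OS_INDICATIONS_SUPPORTED = (
    "/sys/firmware/efi/efivars/"
    "OsIndicationsSupported-8be4df61-93ca-11d2-aa0d-00e098032b8c"
)

# Subset of the bits defined by the UEFI specification.
KNOWN_BITS = {
    0x01: "BOOT_TO_FW_UI",
    0x02: "TIMESTAMP_REVOCATION",
    0x04: "FILE_CAPSULE_DELIVERY_SUPPORTED",
    0x08: "FMP_CAPSULE_SUPPORTED",
    0x10: "CAPSULE_RESULT_VAR_SUPPORTED",
}


def parse_os_indications(raw: bytes) -> tuple[int, list[str]]:
    """Split an efivarfs blob into the raw value and decoded bit names."""
    if len(raw) < 12:
        raise ValueError("expected 4 attribute bytes plus a 64-bit value")
    _attributes, value = struct.unpack_from("<IQ", raw)
    names = [name for bit, name in KNOWN_BITS.items() if value & bit]
    return value, names


# Synthetic blob for illustration: attributes 0x6, value 0x5.
sample = struct.pack("<IQ", 0x6, 0x5)
print(parse_os_indications(sample))
```

On a real system the blob would come from reading `OS_INDICATIONS_SUPPORTED` in binary mode instead of the synthetic `sample`.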

Maybe there is some ACPI table that can be used to pass this info to the OS?

I guess we should employ guidance from here and expose things in ACPI DMAR table, some information already should be there, but the point is there is no validation of that.
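For reference, a hedged sketch of what the OS can already read from the DMAR table. It assumes the layout from the VT-d specification as I understand it: a 36-byte ACPI header, then the host address width at offset 36 and a flags byte at offset 37. On Linux the raw table is typically available to root at `/sys/firmware/acpi/tables/DMAR`. The function name and the synthetic table are illustrative only.

```python
import struct

DMAR_TABLE_PATH = "/sys/firmware/acpi/tables/DMAR"  # usually root-only

# Flag bits of the DMAR table's flags byte (offset 37).
DMAR_FLAGS = {
    0x01: "INTR_REMAP",
    0x02: "X2APIC_OPT_OUT",
    0x04: "DMA_CTRL_PLATFORM_OPT_IN_FLAG",
}


def parse_dmar_flags(table: bytes) -> list[str]:
    """Return the names of the flags set in a raw DMAR table."""
    if len(table) < 48 or table[:4] != b"DMAR":
        raise ValueError("not a DMAR table")
    flags = table[37]
    return [name for bit, name in DMAR_FLAGS.items() if flags & bit]


# Synthetic 48-byte table for illustration: signature, length, the rest
# of the ACPI header zeroed, host address width 39, flags 0x05, and
# 10 reserved bytes.
sample = (
    b"DMAR"
    + struct.pack("<I", 48)
    + bytes(28)
    + bytes([39, 0x05])
    + bytes(10)
)
print(parse_dmar_flags(sample))  # ['INTR_REMAP', 'DMA_CTRL_PLATFORM_OPT_IN_FLAG']
```

Note that a set flag only reflects what the firmware reports. As said above, it does not by itself validate that DMA protection was actually applied during early boot.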

Or maybe OS can inspect coreboot config (cbfs?) to check if the option is enabled?

There are better directions than this. Relying on custom coreboot files exposed to the OS will create technical debt, and appropriate mechanisms already exist in the UEFI world. We should ask what to do with non-UEFI builds. Still, I think we should get back to the question of what standard mechanism OSes use for such a capability, and "standard" most likely means whatever Windows uses. Checking the Linux approach would also be useful.

@zirblazer It is hard to read your write-up. It should be split, TBH. Every point is separate (it could be linked for better context).

@marmarek I don't think it is possible to make boot firmware responsible for controlling peripheral updates when those peripherals have their closed-source verification mechanism. We cannot handle all possible mocking of buses in the system without affecting correct operation. Unless we reach SPDM and device authentication for the whole system, the feature is unlikely to be implemented. Getting updates only from reasonably trustworthy sources with known paths for escalation, e.g., LVFS, can be done, but that does not prevent malicious actors from gaining privileges in the system and abusing those to deliver the wrong firmware to peripherals if those allow unauthenticated updates. That is on the peripheral vendor to provide the correct update mechanism or on the open-source firmware community to deliver support for a transparent mechanism. The best thing we can do is to look for best practices regarding peripheral firmware updates, test that on given hardware, and provide advice on what hardware is recommended now. Even together, we do not have enough resources to solve that problem.

P.S. Maybe this is good discussion for December DUG?

marmarek added the enhancement label (New feature or request) on Oct 12, 2024
