-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Discussion of Scale-out Architecture for High-volume Repositories TAP (TAP 21) #190
Comments
You mention TOFU but it's a little unclear how the client reacts when top level repository changes the "initial sub-repo metadata". Assume client has cached earlier sub-repo metadata and the new sub-repo metadata is not compatible with the sub-repo metadata in client cache (as it can't be guaranteed to be). How does client react? |
By "initial sub-repo metadata", we mean only Under normal circumstances, key rotation for a sub-repo would involve deploying a new So, the scenario you describe should not occur under normal operations. If it were to occur, presumably the top-level repo has to be considered canonical, since it is the root of trust (ie. the top-level repo's The client behaviour should presumably be to:
|
This is currently not part of the spec but I believe it would be a good addition: verify-from-bootstrap feature in python-tuf
If this scenario is not expected to happen... then I'm thinking I maybe have not understood who the signers/keyowners of the sub-repo are. Can you talk more about that? The reason I'm thinking I don't quite understand the setup is this:
|
I believe the client behaviour described above ought to handle this scenario reasonably well. But please correct me if you see a flaw in that process. We're working on incorporating it into the Specification section of the TAP.
First off, we are primarily targeting this scenario ("signers are repository automation"). I'll update the TAP to reflect this. That said, we considered re-using the same root metadata across all the sub-repos. However, when an online key (targets, snapshot or timestamp) is rotated, all of the metadata signed by that key needs to be re-signed. For a high-volume repository this can take a (relatively) long time. Separate root metadata per sub-repo allows for an incremental roll-out of new online keys. Note that this still allows implementers to re-use keys across sub-repos. In fact, I think this would be the recommended approach. |
I meant that I don't understand what the advantage of TAP-21 in general is for the "signers are repository automation" case. Compare to a setup where there are no sub-repos and the top-level TUF repo only contains a set of public keys as artifacts (no project indexes or packages are added to TUF): These keys are used to sign the project indexes and/or actual packages outside of tuf by the repository automation, and clients get these keys using TUF and use them to verify indexes/packages. So TUF is only used as a repository signing key rotation mechanism, not as a delegation mechanism. In this setup:
It's entirely possible I've missed something: Apart from the theoretical ability to switch to "project signing" at a later date, do you believe TAP-21 has real security advantages over the simple setup described above? |
How do clients get the index & package signatures in this scenario? That's the payload of the sub-repos. |
I suppose there's many ways to handle that but for discussion let's say well-known URLs based on the artifact URL: if client wants to download "$PACKAGE_URL", signature is at "${PACKAGE_URL}.signature" |
This is an advantage of TUF, not TAP-21 specifically. TUF also affords protection against a variety of attacks that the "simple setup" appears not to provide. For example, TUF mitigates against endless data attacks by including the size of downloaded files within TUF metadata.
I don't doubt that there are alternative mechanisms to provide the same protections as TUF. But each would presumably complicate the alternative solution. This, in turn, undermines the argument for its simplicity. Why re-invent the wheel?
TAP-21 preserves all but a handful of the protections afforded by TUF. So, yes. |
Left some comments in your TAP PR, @ergonlogic. I think your real problem there is different than the one you have proposed a solution for. As I suggested there, do you think getting on a call to discuss your problem in its entire context would help serve you better? |
Sure but this pretty much a form of DOS which the attacker could do anyway if they control the repository or the mirror. I don't think this makes a meaningful difference between the two models.
I think you may be underestimating the complexity of running the sub-repositories. The two models being compared have, in my opinion, vastly different levels of simplicity -- and this overall difference would not be changed by some small changes in either one |
The TUF "security model" refers to an idealized repository that we've found out does not usually exist. I'd really like to see specific relevant attacks that are being protected from: DOS protection is nice but not that persuasive... (I'm not trying be difficult by the way: There may well be attacks that this protects from, I just haven't been able to find any that a simpler solution wouldn't handle) |
Thank you, I've responded in the PR. In the #tuf Slack channel, @mnm678 reviewed our metadata download calculations, and surmised that previous calculations (for PEP-458) underestimated the size of snapshot metadata. The problems we describe in the TAP are not purely theoretical. We are observing them IRL. That said, maybe we've misinterpreted the root cause. Could you elaborate?
I'm happy to discuss further on a call. FWIW, we did discuss this on the last community call. Is there another one coming up this week? If not, I'm happy to chat in another venue. |
Perhaps. However, based on my experience implementing Rugged, I don't think it would add too much complexity. Each sub-repo is just a simple TUF repo, after all.
Agreed. TAP-21 is implementing TUF for sub-repos, whereas the alternative is essentially just publishing some signatures.
I disagree. I think you're comparing apples and oranges. For an alternative to actually provide security comparable to TUF, it's reasonable to think that it would become significantly more complex than just publishing signatures. |
Within a sub-repo, all of the protections afforded by TUF are present. As far as I can tell, the "simple" alternative does not appear to protect against rollback or indefinite freeze attacks. For example, an attacker could present an older version of an index, along with its previously published signature. Since the signature was generated by a valid key and matched the older index, a client would presumably validate it.
I'm sorry if I am being defensive. I guess I just don't really understand how showing that TUF (not TAP-21, but TUF itself) is more secure than a theoretical rudimentary alternative is relevant. Registries are free to adopt TUF or not. Is it overkill for sub-repos? Perhaps. But I'd prefer to err on the side of caution. TUF itself, along with PHP-TUF, Rugged and other implementations, have all passed security audits. It doesn't seem to me that starting from scratch is a worthwhile endeavour. |
No need to discuss again on the community call, but let me reach out to you over DM on the CNCF Slack to chat separately. Thanks! |
At the recent TUF Community meeting, we discussed TAP-21:
We ran out of time at that meeting, and so scheduled a follow-up call. In the follow-up call, we discussed:
|
Reflecting on this further, this approach would seem to also require a large volume of target delegation rules. Unlike Snapshot metadata, delegations would be slow-moving, as they would only change when new packages are added to the registry. Even so, a single level of intermediate delegations ( |
Right. Most vendors wouldn't have this. The few that would could use hash bin delegations within their role to avoid this cost. |
Reviewing TAP 4, I note that it describes the core use-case of TAP-21:
So, it looks like there's already an accepted mechanism to delegate targets to separate repos. It also provides guidance for where the client should look for sub-repository root metadata:
I haven't found any guidance (yet) about how to provide this root metadata in a trustworthy fashion. For a handful of well-known or private repos, I suppose this could be done manually. However, to manage this for thousands of sub-repos, as we'd need for TAP-21, that's completely unmanageable. This leads back to @jku's concern. Based on subsequent offline discussions, I believe that this concern amounts to:
However, since the root metadata format is prescriptively defined in TUF, we don't need to treat it as simple target. Instead of the top-level containing only targets metadata, we could perhaps add a role (eg. |
TAP-21 originated with theupdateframework/specification/issues/309. Following discussion at the TUF community meeting on 2024-11-01, we've drafted TAP-21 and submitted a PR (#189).
In addition to broader discussions of TAP-21, we are specifically seeking feedback to:
We have thoughts on the sparse sections, but have not yet been able to articulate them clearly enough to include in the TAP.
The text was updated successfully, but these errors were encountered: