Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stop escaped compiler args from being mangled by MSBuild #76888

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

JoeRobich
Copy link
Member

@JoeRobich JoeRobich commented Jan 23, 2025

The TaskItem implementation from MSBuild treats the ItemSpec as file path and tries to normalize the path separators. When running on non-windows machines this means changing '\' to '/'. However this breaks how double quotes are escaped in the compiler args. By using our own implementation of TaskItem we can use the compiler args as the ItemSpec without any modification.

Resolves #72014

The TaskItem implementation from MSBuild treats the ItemSpec as file path and tries to normalize the path separators. When running on non-windows machines this means changing '\' to '/'.  However this breaks how double quotes are escaped in the compiler args. By using our own implemetation of TaskItem we can use the compiler args as the ItemSpec without any modification.

Resolves #72014
@JoeRobich JoeRobich requested a review from a team as a code owner January 23, 2025 18:59
@dotnet-issue-labeler dotnet-issue-labeler bot added Area-Compilers untriaged Issues and PRs which have not yet been triaged by a lead labels Jan 23, 2025
src/Compilers/Core/MSBuildTask/ArgsTaskItem.cs Outdated Show resolved Hide resolved
@@ -159,7 +159,7 @@ protected static ITaskItem[] GenerateCommandLineArgsTaskItems(List<string> comma
var items = new ITaskItem[commandLineArgs.Count];
for (var i = 0; i < commandLineArgs.Count; i++)
{
items[i] = new TaskItem(commandLineArgs[i]);
items[i] = new ArgsTaskItem(commandLineArgs[i]);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will change from us unconditionally escaping to unconditionally not escaping. Won't this regress scenarios where escaping is making builds work today? Consider for example that /keyfile: is passed via commandLineArgs. If that was using \ today it would be / on unix but after this change it woudl still be \ correct?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

oof. Luckily many args such as Analyzers, References, Source all come from TaskItems of their own which apply the path correction. So I need to look at args which come from properties and may contain file paths.

{
get
{
var clone = new List<string>(_metadata.Keys);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
var clone = new List<string>(_metadata.Keys);
var clone = new List<string>(capacity: MetadataCount);
clone.AddRange(_metadata.Keys);

Avoid unnecessary allocations.

}
}

// Implementation notes that we should include the built-in metadata names as well as our custom ones.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Which implementation notes this?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will add links.


namespace Microsoft.CodeAnalysis.BuildTasks
{
internal sealed class ArgsTaskItem : ITaskItem
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add a comment summarizing that we're using this to avoid unnecessary path escaping?

destinationMetadataNames.Add((string)name);
}

foreach (var metadataName in _metadata.Keys)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why are we only using _metadata.Keys here and not MetadataNames? Seems like it's only copying our custom metadata not the full set.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In my reading of the interface comments. There seems to be a distinction made between "built-in" and "custom" metadata. The "built-in" metadata seems to be related to the ItemSpec which this method is not supposed to update.

}
}

public string GetMetadata(string metadataName)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rainersigwald, @baronfel can you help me understand the contract of ITaskItem a bit better? One part of this impl, and others I've seen, is that they have this pattern of exporting a set of names via MetadataNames that will throw when you pass them to GetMetadata. My mind sees that as a dictionary like type exporting a key but then throwing when you ask for the value. Why does ITaskItem behave this way?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you have a specific instance we could look at? From what we can see TaskItem MetadataNames + GetMetadata should be safe for all of the returned names, which include explicit Item Metadata as well as the historically-significant set of metadata that all Items have (because all items are files, don'tchanknow).

Copy link
Member Author

@JoeRobich JoeRobich Jan 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe I misinterpreted the comments on the interface and the implementation here. MetadataNames includes the "built-in" metadata which I wasn't sure would actually be in my custom metadata dictionary.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Area-Compilers untriaged Issues and PRs which have not yet been triaged by a lead
Projects
None yet
Development

Successfully merging this pull request may close these issues.

MSBuildWorkspace does not cleanly load VB project on Linux or MacOS
3 participants