There is a lot of code generated in layers/vulkan/generated/
. This is done to prevent errors forgetting to add support for new
values when the Vulkan Headers or SPIR-V Grammer is updated.
pyparsing
is required, so if it is not installed, you will need to call pip install pyparsing
- Linux:
scripts/generate_source.py external/Vulkan-Headers/registry/ external/SPIRV-Headers/include/spirv/unified1/
- Windows Powershell:
pwsh -Command { python3 scripts/generate_source.py external/Vulkan-Headers/registry/ external/SPIRV-Headers/include/spirv/unified1/ }
- Windows Command:
cmd /C "python3 scripts/generate_source.py external/Vulkan-Headers/registry/ external/SPIRV-Headers/include/spirv/unified1/"
When making change to the scripts/
folder, make sure to run generate_source.py
and check in both the changes to
scripts/
and layers/vulkansc/generated/
in any PR. (Code generation does not happen automatically at build time.)
Note: All generated code is formatted with
clang-format
after it generates (see PR #6480 for details)
A helper CMake target vvl_codegen
is also provided to simplify the invocation of scripts/generate_source.py
from the build directory:
cmake -S . -B build -D VVL_CODEGEN=ON
cmake --build build --target vvl_codegen
NOTE: VVL_CODEGEN
is OFF
by default to allow users to build VVL
via add_subdirectory
and to avoid potential issues for system/language package managers.
If only dealing with a single file, run scripts/generate_source.py
with --target
# Example - only generates chassis.h
scripts/generate_source.py external/Vulkan-Headers/registry/ external/SPIRV-Headers/include/spirv/unified1/ --target chassis.h
Make sure to look at the python coding style guide
The base_generator.py
and vulkan_object.py
are the core of all generated code
BaseGenerator
- This is the only file that understands the
reg.py
flow in theregistry
- most developers will never need to touch this file
- This is the only file that understands the
VulkanObject
- Can be accessed with
self.vk
- "C Header" like file that describes what information can be used when generating code
- Uses the Python 3.7 Dataclasses to enforce a schema so developers
- Can be accessed with
Every "Generator" that extends BaseGenerator
has a def generate(self)
which is the "main" function
The following are examples of helpful things that can be done with VulkanObject
#
# Loop structs that have a sType
for struct in [x for x in self.vk.structs.values() if x.sType]:
print(struct.name)
#
# Print each command parameter C string
for command in self.vk.commands.value():
for param in command.params:
print(param.cDeclaration)
#
# Loop commands with Transfer Queues
for command in [x for x in self.vk.commands.value() if Queues.TRANSFER & x.queues]:
print(command.name)
#
# Find enums that are extended with an Instance extension
for enum in self.vk.enum.values():
for extension in [x for x in enum.extensions if x.instance]:
print(f'{enum.name} - {extension.name}')
#
# List all VkImageViewType enum flags
for field in self.vk.enums['VkImageViewType'].fields:
print(field.name)
Written by someone who has written bad Vulkan code gen, debugged other's bad code gen, and rewrote all the scripts.
All code gen has a single function that outputs one large string, there is zero dynamic control flow that occurs. (Generators don't take any runtime arguments other then file locations)
While it seems useful to group your logic into a single function, it because hard to debug where all the sub-strings are appearing from in the final file.
The few times it is good to use functions is
- It returns a non-string value. (ex. sorting logic)
- There is recursion needed (ex. walking down structs with more structs in it)
If you find yourself having many functions that are just written as python strings, it might be worth looking into having the file live in the actual layer code.
Code generation is not a bottleneck for performance, but trying add/edit/debug code generation scripts is a bottleneck for developer time. The main goal is make any python generating code as easy to understand as possible.
generate_source.py
sets up the environment and then calls into run_generator.py
where each file is generated at a time. Many of the generation scripts will generate both the .cpp
source and .h
header.
The Vulkan code is generated from vk.xml and uses the python helper functions in the Vulkan-Headers/registry
folder.
The SPIR-V code is generated from SPIR-V Grammer
The Vulkan-Headers/registry
generation scripts biggest issue is it's designed to generate one file at a time.
The Validation Layers became very messy as each generated file had to re-parse this and try to create its own containers.
The new flow was designed to still make use of the registry
generation file, but allow a more maintainable way to find data when one only wants to add a little extra code to generation.
The base_generator.py
and vulkan_object.py
are were added to help reduce the work needed for each script.
Before the workflow was:
SomethingOutputGenerator::beginFile()
(in./scripts/
)OutputGenerator::beginFile()
(in./external/Vulkan-Headers/registry/
)SomethingOutputGenerator::beginFeatures()
OutputGenerator::beginFeatures()
SomethingOutputGenerator::genCmd()
(orgenGroup
,genStruc
,genTyp
,genEnum
, etc)OutputGenerator::genCmd()
- repeat step 3-6
SomethingOutputGenerator::endFile()
OutputGenerator::endFile()
This is an issue because having to decide to write things out to the file during a genCmd
or endFile
call gets messy.
The new flow creates a seperate base class so the workflow now looks like:
BaseGenerator::beginFile()
OutputGenerator::beginFile()
(in./external/Vulkan-Headers/registry/
)BaseGenerator::beginFeatures()
OutputGenerator::beginFeatures()
BaseGenerator::genCmd()
OutputGenerator::genCmd()
- repeat step 3-6
SomethingOutputGenerator::generate()
(single function per generator)OutputGenerator::endFile()
The big difference is SomethingOutputGenerator
(ex. CommandValidationOutputGenerator
) only has to be called once at the end.
This means each generator script doesn't have to worry about understanding how the registry
file works.
This is possible because of the new class VulkanObject()
The VulkanObject
makes use of the Python 3.7 Dataclasses to enforce a schema so developers don't have to go guessing what each random object from the various registry
is. Most of the schema is derived from the Spec's registry.rnc file. This file is used to enforce the vk.xml
schema and serves as a good understanding of what will be in each element of the XML.
If a developer needs something new, it can be added in the VulkanObject
class. This provides 2 large advantages
- Code to create a container around the XML is properly reused between scripts
- Only one file (
base_generator.py
) use to understand the inner working of theregistry
. A developer can just view thevulkan_object.py
file to see what it can grab in the single pass