performance.md


Dependency resolution

Cache dependency descriptors in memory

Useful for regular builds. Very useful for daemon builds, as the daemon can keep the descriptors in memory.

User visible changes

  • Faster builds, especially with hot daemon
  • Create a property that allows turning off this feature

Sad day cases

  • Someone tinkers with the artifact cache (deletes files, etc.) while the daemon has a hot cache

Test coverage

  • Changing modules are cached only within a given build
  • Dynamic versions are cached only within a given build
  • Static versions are cached forever (until daemon exits)
  • Modules/artifacts from different repositories are separated
  • Building with --refresh-dependencies throws away cached data

Implementation plan

  • Cache should use soft maps (see Guava MapMaker)
  • Use ModuleVersionRepository
  • Cache's state should be based on TopLevelProjectRegistry
  • Document the breaking changes: we no longer check local repos with every resolve. The remote repo cache may no longer expire in the middle of the build.
  • For descriptor caching we should cache states like missing(), probablyMissing() and resolved() (see the sketch after this list)
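
A minimal sketch of the in-memory layer described in the plan above, assuming Guava's CacheBuilder for the soft-valued map (the successor to MapMaker's softValues()) and a simplified key/value shape; the state and key types are illustrative, not Gradle's actual ModuleVersionRepository API:

```java
import com.google.common.cache.Cache;
import com.google.common.cache.CacheBuilder;

// Sketch only: the key format and CachedResult shape are illustrative.
public class InMemoryDescriptorCache {

    // The cached states described above.
    public enum State { MISSING, PROBABLY_MISSING, RESOLVED }

    public static class CachedResult {
        public final State state;
        public final Object descriptor; // non-null only when RESOLVED
        public CachedResult(State state, Object descriptor) {
            this.state = state;
            this.descriptor = descriptor;
        }
    }

    // Soft values let the JVM reclaim descriptors under memory pressure,
    // so a long-lived daemon does not pin the whole descriptor set.
    private final Cache<String, CachedResult> cache =
            CacheBuilder.newBuilder().softValues().build();

    // The key combines the repository id and the module version, so that
    // modules/artifacts from different repositories stay separated.
    public CachedResult get(String repositoryId, String moduleVersionId) {
        return cache.getIfPresent(repositoryId + ":" + moduleVersionId);
    }

    public void put(String repositoryId, String moduleVersionId, CachedResult result) {
        cache.put(repositoryId + ":" + moduleVersionId, result);
    }

    // Called for --refresh-dependencies: throw away all cached data.
    public void invalidateAll() {
        cache.invalidateAll();
    }
}
```

A per-build cache instance, or per-build invalidation, would cover the changing-module and dynamic-version cases listed under test coverage, while static versions could stay cached until the daemon exits.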

Questions

  • Sensitive to --offline builds?

Other stories

  1. Use finer-grained cache locking to allow concurrent dependency resolution.
  2. Use a Gradle artifact id rather than an Ivy Artifact to refer to an artifact to be resolved, to reduce heap usage.
  3. Perform all artifact resolutions in one batch, rather than on-demand.
  4. Use a binary format for cached module meta-data.
  5. Perform network accesses concurrently. For example, queue up network requests and use a pool of worker threads. Traverse the graph as results are made available (see the sketch after this list).
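
A rough sketch of story 5, using a fixed worker pool and a completion service; DescriptorRequest and DescriptorResult are hypothetical types standing in for whatever the resolver actually downloads:

```java
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorCompletionService;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Sketch only: queue network requests, let worker threads download them,
// and traverse the dependency graph as results become available.
public class ConcurrentResolver {
    private final ExecutorService workers = Executors.newFixedThreadPool(4);
    private final ExecutorCompletionService<DescriptorResult> completion =
            new ExecutorCompletionService<>(workers);
    private int pending = 0;

    // Queue up a network request; a worker thread will pick it up.
    public void queue(DescriptorRequest request) {
        completion.submit(request::download);
        pending++;
    }

    // Traverse the graph as results arrive: each result may queue
    // further requests for its own dependencies.
    public void resolveAll() throws InterruptedException, ExecutionException {
        while (pending > 0) {
            DescriptorResult result = completion.take().get();
            pending--;
            for (DescriptorRequest dependency : result.dependencies()) {
                queue(dependency);
            }
        }
        workers.shutdown();
    }

    // Illustrative types, not Gradle APIs.
    public interface DescriptorRequest {
        DescriptorResult download();
    }

    public interface DescriptorResult {
        List<DescriptorRequest> dependencies();
    }
}
```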

Persistent caches

Improve cache locking efficiency when running in parallel mode

Improve our locking implementation so that we can hold the lock across long-running operations and release it only if it is required by another process. The potential downside is that we make it more expensive to acquire the lock when it is contended. Mostly this would affect the artifact cache, but we can offset that by caching things in memory. Alternatively, use different lock implementations for things that are likely to be shared by multiple processes and for things that are unlikely to be shared.

Potential implementation plan:

  • the first process takes the lock, writes its address, and hangs on to the lock
  • another process reads the address and sends a message 'I may need the lock'
  • after receiving this message, the first process enters 'share the lock' mode
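
A rough sketch of that protocol, assuming the lock owner advertises a loopback port in a file next to the lock file; the class and message names are illustrative, not Gradle's actual lock implementation:

```java
import java.io.IOException;
import java.io.OutputStream;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.nio.channels.FileChannel;
import java.nio.channels.FileLock;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Sketch only: a lock owner that can be asked to share the lock.
public class ContendableLock {
    private final Path lockFile;
    private FileChannel channel;
    private FileLock lock;
    private ServerSocket contentionSocket;

    public ContendableLock(Path lockFile) {
        this.lockFile = lockFile;
    }

    // First process: take the lock, write our address, hang on to the lock.
    public void acquire() throws IOException {
        channel = FileChannel.open(lockFile,
                StandardOpenOption.CREATE, StandardOpenOption.READ, StandardOpenOption.WRITE);
        lock = channel.lock();
        contentionSocket = new ServerSocket(0, 1, InetAddress.getLoopbackAddress());
        Files.write(addressFile(lockFile),
                String.valueOf(contentionSocket.getLocalPort()).getBytes(StandardCharsets.UTF_8));
        // A background thread would accept() connections here; on receiving
        // 'I may need the lock' it flips this process into 'share the lock'
        // mode, i.e. release the lock at the next safe point.
    }

    public void release() throws IOException {
        lock.release();
        channel.close();
        contentionSocket.close();
    }

    // Other process: read the owner's address and signal that we need the lock.
    public static void signalContention(Path lockFile) throws IOException {
        int port = Integer.parseInt(
                Files.readAllLines(addressFile(lockFile)).get(0).trim());
        try (Socket socket = new Socket(InetAddress.getLoopbackAddress(), port);
             OutputStream out = socket.getOutputStream()) {
            out.write("I may need the lock".getBytes(StandardCharsets.UTF_8));
        }
    }

    private static Path addressFile(Path lockFile) {
        return lockFile.resolveSibling(lockFile.getFileName() + ".address");
    }
}
```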

Other stories

  1. Switch the b-tree implementation so that it is append-only, so that we can read from multiple threads and processes without locking. We'd only need to serialise the writes (and the periodic garbage collection). We can get rid of the free space tracking from the implementation, which will make writing faster.
  2. Write back to the cache asynchronously from a worker thread.
  3. Make our locking implementation more efficient for the b-trees, so that we can avoid an additional read and write on open and on close. We might look at some native synchronisation primitives, too.

Incremental build

Cache task history in memory across builds

Results from a spike show a 30% speed improvement for the fully incremental build.

User visible changes

Faster builds when the daemon is used, at the cost of a reasonable increase in heap consumption.

Implementation

  • provide an implementation of TaskArtifactStateCacheAccess that adds in-memory caching capabilities
  • expire the cached data when the cache file's last-modified time changes. Check for expiration before locking the file, and remember the last-modified time before unlocking.
  • the implementation can be improved in various ways (e.g. stop relying on last-modified time, since we know when the cross-process lock is requested by other processes)
  • the cache should have some bounds, otherwise it will use a lot of memory for gigantic builds. Initially, we will cap the cache size.
  • enable caching only when the daemon is used (see the sketch after this list)
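
A minimal sketch of that in-memory layer; TaskHistory and the beforeLock/beforeUnlock hooks are illustrative, standing in for wherever TaskArtifactStateCacheAccess actually acquires and releases the cross-process lock:

```java
import com.google.common.cache.Cache;
import com.google.common.cache.CacheBuilder;
import java.io.File;

// Sketch only: an in-memory cache of task history, invalidated when the
// persistent cache file changes and capped to bound heap use.
public class InMemoryTaskHistoryCache {
    private final File cacheFile;
    private long lastSeenModified;

    // Capped, as described above, so gigantic builds do not exhaust the heap.
    private final Cache<String, TaskHistory> entries =
            CacheBuilder.newBuilder().maximumSize(10_000).build();

    public InMemoryTaskHistoryCache(File cacheFile) {
        this.cacheFile = cacheFile;
        this.lastSeenModified = cacheFile.lastModified();
    }

    // Check for expiration before locking the file: if another process has
    // written to the cache file since we last saw it, drop the in-memory data.
    public void beforeLock() {
        if (cacheFile.lastModified() != lastSeenModified) {
            entries.invalidateAll();
        }
    }

    // Remember the last-modified time before unlocking, so our own writes
    // do not invalidate the in-memory cache on the next build.
    public void beforeUnlock() {
        lastSeenModified = cacheFile.lastModified();
    }

    public TaskHistory get(String taskPath) {
        return entries.getIfPresent(taskPath);
    }

    public void put(String taskPath, TaskHistory history) {
        entries.put(taskPath, history);
    }

    // Placeholder for the persisted per-task execution state.
    public static class TaskHistory {}
}
```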

Test coverage

  • add performance tests with the daemon

Other stories

Potential spikes/stories:

  1. Don't scan the output directories at the end of the task execution, for the compile and copy tasks. These task types know exactly what their output files were.
  2. Make reading and writing task history more efficient. Currently, we're doing 6 reads to load the history (3 x index lookups and 3 x data chunks), and 7 reads and 8 writes to write the history (4 x reads and 4 x writes for the ids, 3 x index lookups, 3 x index writes and 3 x data chunk writes). Instead, we should stream the history, so that for reading there's 1 index lookup and one or more data chunk reads (one for most tasks) and for writing there's 1 index lookup, one index write and one or more data chunk writes (one for most tasks).
  3. When using an input FileCollection which is contained in some other resource (eg a ZipTree), then use the container resource for up-to-date checks, rather than the individual elements of the collection.
  4. Reuse the file hash information included in the input/output file snapshots, rather than attempting to read it from the file hash cache.
  5. Separate the task up-to-date checks from executing the actions, so that we can be resolving dependencies and doing up-to-date checks in parallel to executing the actions.

Daemon

Hint the VM to GC after a build in the daemon completes

Full GC scans are unavoidable when the daemon is used, because at the end of a build there will be lots of objects in the tenured space. It is better if the full scan happens 'outside' of the user's build, because a full scan pauses the entire VM.

User visible changes

It will be hard to prove, but in theory daemon builds should be more consistent and faster.

Implementation

  • Add new DaemonCommandAction, say DaemonHygiene.
  • Slot it into the actions chain so that it is executed after the build has completed and the user has received the 'build successful' message
  • This action needs to be stateful (currently all actions are created per build request)
  • This action may perform a GC at the end of the build, but not too often, say at most once every 2 minutes (see the sketch after this list).
  • This action may perform other hygiene actions, monitor memory usage, etc.
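
A minimal sketch of such an action; DaemonHygiene and the afterBuildCompleted() hook are illustrative names rather than existing daemon APIs, and the 2-minute throttle matches the note above:

```java
import java.util.concurrent.TimeUnit;

// Sketch only: a stateful hygiene action that must outlive individual
// build requests, unlike the current per-request daemon actions.
public class DaemonHygiene {
    private static final long MIN_INTERVAL_NANOS = TimeUnit.MINUTES.toNanos(2);
    private long lastGcNanos;

    // Called after the build has completed and the user has already received
    // the 'build successful' message, so any pause happens outside the build.
    public void afterBuildCompleted() {
        long now = System.nanoTime();
        if (lastGcNanos == 0 || now - lastGcNanos >= MIN_INTERVAL_NANOS) {
            lastGcNanos = now;
            System.gc(); // only a hint; the VM may ignore it
        }
        // Other hygiene work (e.g. monitoring memory usage) could go here.
    }
}
```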

Other potential spikes/stories:

  1. Daemon rebuilds and caches the model after the build. This way, next time we run the build, the configured model is already built and configuration time is zero. Tasks selected for execution also determine the model. We'd need to start watching for changes to the files that are inputs to the model. This includes external classpath dependencies that can change remotely, all the source files of buildSrc and its dependencies (and its model inputs), the build environment settings in various places, and so on.

Implementation notes:

  • projects can declare inputs for model caching
  • the feature is not enabled by default (can be turned on)

  2. Reuse build script and plugin classloaders.

Model configuration

Potential spikes/stories:

  1. More profiling of configuration time to look for hot-spots.
  2. Push implicit plugin application, so that plugins are only applied at configuration time when they are required.
  3. Only create those tasks that are required for the build.
  4. Only configure those domain objects that are required for the build.
  5. Parallel configuration.