AMF-Placer 2.0 is an open-source comprehensive timing-driven Analytical Mixed-size FPGA placer. It supports mixed-size placement of heterogeneous resources (e.g., LUT/FF/LUTRAM/MUX/CARRY/DSP/BRAM) on FPGA. To speed up the convergence and improve the timing quality of the placement, standing upon the shoulders of AMF-Placer 1.0, AMF-Placer 2.0 is equipped with a series of new techniques for timing optimization, including an effective regression-based timing model, placement-blockage-aware anchor insertion, WNS-aware timing-driven quadratic placement, and sector-guided detailed placement. Based on a set of the latest large open-source benchmarks from various domains for Xilinx Ultrascale FPGAs, experimental results indicate that critical path delays realized by AMF-Placer 2.0 are averagely 2.2% and 0.59% higher than those achieved by commercial tool Xilinx Vivavo 2020.2 and 2021.2 respectively. Meanwhile, the average runtime of placement procedure of AMF-Placer 2.0 is 6.7% and 11.4% lower than Xilinx Vivavo 2020.2 and 2021.2 respectively. Although limited by the absence of the exact timing model of the device, the information of design hierarchy and accurate routing feedback, AMF-Placer 2.0 is the first open-source FPGA placer which can handle the timing-driven mixed-size placement of practical complex designs with various FPGA resources and achieves the comparable quality to the latest commercial tools. Detailed Doxygen-based documentation (e.g, introduction, usage, implementation and experimental results) can be accessed here.
This project is developed by Reconfiguration Computing Systems Lab, Hong Kong University of Science and Technology (HKUST). Tingyuan Liang ([email protected]), Gengjie Chen ([email protected]), Jieru Zhao ([email protected]), Sharad Sinha ([email protected]) and Wei Zhang ([email protected]) are the major contributors of this project.
In this repo, we provide the basic implementation of AMF-Placer 2.0, under the Apache License 2.0, supporting comprehensive timing-driven placement with critical path delay and runtime which are downgraded slightly by ~5% on average. If you want to obtain the advanced version of AMF-Placer 2.0 to reproduce the experimental results in the paper of AMF-Placer 2.0 for academic evaluation or commercial usage, you are required to contact the authors Tingyuan Liang ([email protected]) and Wei ZHANG ([email protected]) with your offcial instituation email and we will response in 72 hours. If you are commercial entities, you can also contact [email protected] for licensing opportunities of the advanced version. The authors acknowledge support from the Hong Kong Research Grants Council (HK-RGC) General Research Funds (GRF No. 16213521) and thank the anonymous reviewers for their valuable comments to improve the quality of the papers.
If our works are involved in your works, please cite our papers:
[1] AMF-Placer: High-Performance Analytical Mixed-size Placer for FPGA(pre-print)
@INPROCEEDINGS{AMFPlacer,
title={AMF-Placer: High-Performance Analytical Mixed-size Placer for FPGA},
author={Liang, Tingyuan and Chen, Gengjie and Zhao, Jieru and Sinha, Sharad and Zhang, Wei},
booktitle={2021 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)},
year={2021}, volume={}, number={}, pages={1-6},
}
[2] AMF-Placer 2.0: Open Source Timing-driven Analytical Mixed-size Placer for Large-scale Heterogeneous FPGA(pre-print)
@ARTICLE{liang2024amf,
author={Liang, Tingyuan and Chen, Gengjie and Zhao, Jieru and Sinha, Sharad and Zhang, Wei},
journal={IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems},
title={AMF-Placer 2.0: Open Source Timing-Driven Analytical Mixed-Size Placer for Large-Scale Heterogeneous FPGA},
year={2024},
volume={},
number={},
pages={1-1},
keywords={Field programmable gate arrays;Timing;Computer architecture;Microprocessors;Table lookup;Routing;Standards;timing-driven placement;analytical placement;mixed-size placement;FPGA},
doi={10.1109/TCAD.2024.3373357}}
- Just reinvent the wheel for fun, try to build a complete flow and reproduce/improve some state-of-art techniques in the latest papers.
- Resolve some existing constraints in some previous works and consider more practical situations, like FPGA mixed-size placement with a series of optimization from the perspectives of timing, clocking, routability-aware and parallelism.
- A beginner-friendly placement framework with clear hierarchy and detailed Doxygen-based documentation. We hope that it can lower the overhead for people who are also interested in this research area.
- Currently, this framework is under development and it is still far from our goals and the practical demands, but we are happy to share our progress in this GitHub repository. If you have any questions/problems/suggestions, please contact feel free to contact us (Tingyuan LIANG, [email protected])
- supports placeemnt with a large number of mixed-size macros with shape constraints in practical FPGA applications.
- wirelength-driven, routability-aware, packing-aware, clock-aware, region-aware.
- initially timing-driven with basic embedded static timing analysis, WNS-aware global placement, efficient detailed placement
- a set of optional optimization techniques to improve mixed-size FPGA placement QoR
- parallelizes the implementation of each stage of placement based on multi-threading
- modularized function implementation for easier further development
- flexible and extensible JSON-based placement configuration
- supports placement check-point importing/exporting
- a set of pre-implementation benchmarks from latest practical FPGA applications
- provides a set of Tcl scripts which extracts design netlist from Vivado and exports post-placement information to Vivado
- A basic GUI for user to analyze the placement procedure to optimize the implementation
Below is a hiearchy tree of this project. As for the details, please refer to the class information and collaboration diagram in the Doxygen documentation and trace the implementation, e.g., AMFPlacer, GlobalPlacer, and PlacementInfo.
├── benchmarks // benchmark information
│ ├── analysisScripts // experimental result analysis Python scripts
│ ├── testConfig // some test settings of placer
│ ├── VCU108 // information of design and device for VCU108
│ │ ├── compatibleTable // mapping information between design and device
│ │ ├── design // design information
│ │ └── device // device information
│ └── vivadoScripts // some Vivado Tcl scripts to extract design/device information
├── build // potential target directory for built output
├── doc // documentation-related materials
└── src // source code of the project
├── app // application (e.g., AMFPlacer)
│ └── AMFPlacer // A High-Performance Analytical Mixed-size Placer for FPGA
└── lib // libraries for application implementation
├── 3rdParty // third-party libraries
├── HiFPlacer // our placement function modules
│ ├── designInfo
│ ├── deviceInfo
│ ├── placement
│ │ ├── globalPlacement
│ │ ├── legalization
│ │ ├── packing
│ │ ├── placementInfo
│ │ └── placementTiming
│ └── problemSolvers
└── utils // some minor utility functions
AMF-Placer 2.0 relies on the following dependencies and we sincerely appreciate the efforts of the authors of the related open-source project. Each of them has corresponding licensing policy and please confirm that you are following the rules mentioned in the licenses. In the source code of AMF-Placer 2.0, we only include the open-source projects under the Apache License 2.0 or MIT License. The other dependencies will be downloaded when you are building the project and interact with the main body of AMF-Placer 2.0 via libraries and APIs.
- eigen 3.3.8 (MPL2 license, source code will be downlaoded into the building directory during building if you have not installed it)
- PaToH (academic use only, library will be downlaoded into the building directory)
- Asmjit (Apache License, source code included in src/lib/3rdParty/asmjit for fast placement rendering)
- Blend2d (Apache License, source code included in src/lib/3rdParty/blend2d for fast placement rendering)
- Qt5 (for GUI, you can install it on Ubuntu by: sudo apt-get install qt5-default )
- KDTree (MIT License, source code included in src/lib/3rdParty/KDTree)
- Maximal Cardinality Matching (MIT License, source code included in src/lib/3rdParty/MaximalCardinalityMatching))
- Min Cost Flow (MIT License, source code included in src/lib/3rdParty/minCostFlow
- clock tree synthesis
- ckock-related optimization
- reduce the hyperparameters in the algorithms and make them adaptive to design features
This project is under active development and far from perfect. We do want to make the placer useful for people in the community. Therefore,
- If you have any question/problem, please feel free to create an issue in the GitHub Issue or email us (Tingyuan LIANG, [email protected])
- We sincerely welcome code contribution to this project or suggestion in any approach!
(last updated Oct 26, 2022)