Mlp example #172

jianyicheng · 2024-04-24T12:52:37Z

No description provided.

…ners)

* new function to update meta after quantization [only works on fixed] * comment out manual precision & dtype assignment --------- Co-authored-by: Jianyi Cheng <[email protected]>

…side hardware components

…sults

* Created test case with errors * cache quantized weight after the first inference * Removed quantization check --------- Co-authored-by: Cheng Zhang <[email protected]>

pgimenes

Then changes to chop/models/manual seem fine (although maybe unnecessary with the new HFTracer update), but I almost all the changes in chop/passes/graph/analysis (i.e. the test_verilog pass) are reimplementations of functionality already existing in chop/passes/graph/transforms/verilog, but less general, with no tests under machop/test to evaluate, and using deprecated components. The same goes for all the changes in mase_components. I agree with verilog testing being an analysis pass, but I think it would be best to drop these changes and move the existing code to passes/graph/analysis instead. The refactoring of emit verilog needs work and could have been implemented without adding 1000 lines of code. I would suggest refactoring this feature since there is time and this feature is not time pressured.

pgimenes · 2024-05-14T21:31:38Z

machop/chop/passes/graph/analysis/verilog/cocotb.py

+
+
+# DUT test specifications
+class VerificationCase:


verification classes should inherit from mase_cocotb.testbench.Testbench

pgimenes · 2024-05-14T21:34:50Z

machop/chop/passes/graph/analysis/verilog/cocotb.py

+    dut.data_in_0_valid.value = 0
+    dut.data_out_0_ready.value = 1
+    debug_state(dut, "Pre-clk")
+    await FallingEdge(dut.clk)


Driving handshakes manually by assigning signals and awaiting clock edges is too verbose and error prone. We should use mase_cocotb.interfaces.streaming.StreamDriver instead

pgimenes · 2024-05-14T21:35:42Z

machop/chop/passes/graph/analysis/verilog/cocotb.py

+
+
+@cocotb.test()
+async def test_top(dut):


This functionality is already implemented in chop.passes.graph.transforms.verilog.emit_tb

pgimenes · 2024-05-14T21:37:04Z

machop/chop/passes/graph/analysis/verilog/test_verilog.py


-def get_test_parameters(mg):
+def hardware_reshape(input_data, input_shape, tiling):


A good implementation is already available in chop.mase_cocotb.utils.fixed_preprocess_tensor

pgimenes · 2024-05-14T21:37:39Z

machop/chop/passes/graph/analysis/verilog/test_verilog.py


+class VerificationCase:


As previously mentioned, verification classes should inherit from mase_cocotb.Testbench

pgimenes · 2024-05-14T21:54:07Z

machop/chop/passes/graph/analysis/verilog/test_verilog.py

+    return parameter_map
+
+
+def runner(mg, project_dir, top_name):


Already implemented in mase_cocotb.runner

pgimenes · 2024-05-14T21:58:54Z

machop/chop/passes/graph/transforms/verilog/emit_bram.py

    total_size = math.prod(
        node.meta["mase"].parameters["common"]["args"][param_name]["shape"]
    )
+
+    dim = len(node.meta["mase"].parameters["common"]["args"][param_name]["shape"])


The main-adls-2324 branch has a more clean, updated implementation of this handling of the size/dim/depth for the emitted BRAM. I would merge that into mlp_example branch and then see if any further changes are required here

pgimenes · 2024-05-14T22:02:51Z

machop/chop/passes/graph/transforms/verilog/emit_top.py

Separating the dataflow emitter will be useful (to enable using a memory shell later on instead of local BRAM), but it seems like the way it's been implemented has a lot of repeated and redundant code without any inheritance, and the size of the file has gotten huge. I would refactor this so the SignalEmitter, InterfaceEmitter etc are all agnostic of whether it's dataflow or memory mapped

pgimenes · 2024-05-14T22:05:18Z

machop/mase_components/linear/rtl/fixed_linear.sv

This is an old implementation of linear, which should not be merged back into main. The new implementation without parametrization restrictions is already available in main-adls-2324, which is ready to be merged

pgimenes · 2024-05-14T22:06:50Z

machop/mase_components/linear/test/fixed_linear_tb.py

I don't understand any of the changes here. We're reverting back to using deprecated tb components, without any new functionality, when the current implementation is already working and general. I would discard this entire file

Jianyi Cheng added 7 commits April 24, 2024 10:26

Updated Docker to main for local build

2e28da6

Sync scripts for updated docker setups (seperating CPU and GPU contai…

08c5fda

…ners)

removed docker as submodule

8537c2c

Pull from Makefile to avoid repeated sync on submodule

c2e47dc

Added instructions to log for better readability on CI log

b3081ba

Initially seperate dataflow sv out of top

9fd3d67

Split two files initially

cf6f26c

jianyicheng marked this pull request as draft April 24, 2024 12:52

Refactored dataflow level

3bfe064

Base automatically changed from docker-update to main April 24, 2024 15:47

Jianyi Cheng and others added 20 commits April 24, 2024 17:04

Added missing dependences and updated file paths

caad0b1

Rafactored memory map emit

5b8990f

Added rounding at the output of relu

0509014

Merge branch 'main' into mlp_example

7aaba88

Fixed relu var name typos

181b537

Updated linear layer with the right parameter names

f93c976

Remove temporary code for parallelism

0a8b4dd

Added missing component interface

35afa85

removed extra comma in parameter map

8ef71cf

Updated parallelism parameter formats

9210756

Updated bram width calculation with the latest parallelism parameters

3c272a7

Fix quantization meta data for fixed-point quantization (#173)

29faf71

* new function to update meta after quantization [only works on fixed] * comment out manual precision & dtype assignment --------- Co-authored-by: Jianyi Cheng <[email protected]>

reverted changes made by the quantization PR

50816af

reduced test case

c1d1fd7

Fetched previous verilog analysis pass for testing

8535c57

Fetched the latest version of draft

590f20e

Pass syntax error in Python

8076bc1

refactored test pass format

5d26837

format quantize PR #173

cfb256e

Added bit truncation in bram param to avoid verilator warnings

d825cbe

Jianyi Cheng and others added 4 commits April 28, 2024 17:38

Fixed bitwidth error (this is temporary for the version of casting in…

3a23cb2

…side hardware components

Fixed minor parallelism parameter shapes

80c671f

Get working flow for hardware testing - but need to check hardware re…

8b5309c

…sults

Mlp quantization error (#178)

047f27b

* Created test case with errors * cache quantized weight after the first inference * Removed quantization check --------- Co-authored-by: Cheng Zhang <[email protected]>

pgimenes requested changes May 14, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mlp example #172

Mlp example #172

jianyicheng commented Apr 24, 2024

pgimenes left a comment •

edited

Loading

pgimenes May 14, 2024

pgimenes May 14, 2024

pgimenes May 14, 2024

pgimenes May 14, 2024

pgimenes May 14, 2024

pgimenes May 14, 2024

pgimenes May 14, 2024

pgimenes May 14, 2024

pgimenes May 14, 2024

pgimenes May 14, 2024


		def get_test_parameters(mg):
		def hardware_reshape(input_data, input_shape, tiling):



		@cocotb.test()
		async def test_top(dut):


		class VerificationCase:

Mlp example #172

Are you sure you want to change the base?

Mlp example #172

Conversation

jianyicheng commented Apr 24, 2024

pgimenes left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pgimenes left a comment •

edited

Loading