Issues: ELS-RD/transformer-deploy
Issues list
Speed difference ONNX vs TensorRT with samples sorted by sequence length
#55 opened Feb 23, 2022 by v1nc3nt27
Execute T5 inference with TensorRT
enhancement
New feature or request
#73 opened May 17, 2022 by ayoub-louati
Optimising ONNX Graph either takes too long or doesn't seem to work
bug
Something isn't working
#109 opened Jul 4, 2022 by accountForIssues
[ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(int64)) , expected: (tensor(int32))
question
Further information is requested
#110 opened Jul 5, 2022 by Matthieu-Tinycoaching
t5_bf16 notebooks fails with [ONNXRuntimeError] : 10 : INVALID_GRAPH
#118 opened Jul 30, 2022 by michaelroyzen
Encounter Error: ValueError: Message onnx.ModelProto exceeds maximum protobuf size of 2GB
#183 opened Mar 4, 2024 by illumination-k
GPU quantization for sentence-transformer: ONNX quantized model
#124 opened Aug 9, 2022 by Matthieu-Tinycoaching
[Question] Documentation for generative model API and parameters?
#129 opened Aug 20, 2022 by tanmayb123
Using t5-large in t5 notebook, the translation result is invalid
#135 opened Sep 8, 2022 by brevity2021
split model between host memory and GPU
enhancement
New feature or request
ONNX Runtime
#113 opened Jul 7, 2022 by pommedeterresautee