ELS-RD / transformer-deploy Public

Issues: ELS-RD/transformer-deploy

52 Open 69 Closed

Issues list

Unable To use batching

#180 opened Oct 13, 2023 by rahulmate

convert_model command not found

#173 opened May 23, 2023 by pint1022

GPT-J support

#40 opened Jan 5, 2022 by JonathanLehner

Out of memory error for batch size more than 1 for T5 models.

#60 opened Mar 18, 2022 by Ki6an

Support for gpt2 quantization

#52 opened Feb 15, 2022 by kobzaond

Speed difference ONNX vs TensorRT with samples sorted by sequence length

#55 opened Feb 23, 2022 by v1nc3nt27

Feature extraction/dense embeddings Query inference error

#61 opened Mar 24, 2022 by 13604099691

Execute T5 inference with TensorRT

Label: enhancement (New feature or request)

#73 opened May 17, 2022 by ayoub-louati

big performance difference on tensorRT

#85 opened May 31, 2022 by HireezShanPeng

Mixed precision conversion getting Assertion Error

#89 opened Jun 2, 2022 by caffeinetoomuch

Error Nodes in a graph must be topologically sorted, however input 'encoder_hidden_states' of node: name: MatMul_173_input_cast0 OpType: Cast is not output of any previous nodes

#97 opened Jun 15, 2022 by dheerajiiitv

Optimising ONNX Graph either takes too long or doesn't seem to work

Label: bug (Something isn't working)

#109 opened Jul 4, 2022 by accountForIssues

[ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(int64)) , expected: (tensor(int32))

Label: question (Further information is requested)

#110 opened Jul 5, 2022 by Matthieu-Tinycoaching

HF pipeline based inference

Label: enhancement (New feature or request)

#111 opened Jul 5, 2022 by kamalkraj

Cannot build TensorRT engine for classification models

#115 opened Jul 12, 2022 by CecileGiang

t5_bf16 notebooks fails with [ONNXRuntimeError] : 10 : INVALID_GRAPH

#118 opened Jul 30, 2022 by michaelroyzen

Encounter Error: ValueError: Message onnx.ModelProto exceeds maximum protobuf size of 2GB

#183 opened Mar 4, 2024 by illumination-k

How to Run With Polygraphy Graph Surgeon

#121 opened Aug 7, 2022 by sam-h-bean

Question about generative model notebook

#123 opened Aug 9, 2022 by hyunwoongko

GPU quantization for sentence-transformer: ONNX quantized model

#124 opened Aug 9, 2022 by Matthieu-Tinycoaching

Unable to pull docker image

#127 opened Aug 12, 2022 by brevity2021

[Question] Documentation for generative model API and parameters?

#129 opened Aug 20, 2022 by tanmayb123

t5 notebook broken with transformer-deploy 0.5.0

#130 opened Aug 20, 2022 by michaelroyzen

Using t5-large in t5 notebook, the translation result is invalid

#135 opened Sep 8, 2022 by brevity2021

split model between host memory and GPU

Labels: enhancement (New feature or request), ONNX Runtime

#113 opened Jul 7, 2022 by pommedeterresautee
