-
Notifications
You must be signed in to change notification settings - Fork 2.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support huggingface export to tensorrtllm #12889
Conversation
Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>
Signed-off-by: pthombre <pthombre@users.noreply.github.com>
3499e13
to
49463ee
Compare
Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>
[🤖]: Hi @pthombre 👋, We wanted to let you know that a CICD pipeline for this PR just finished successfully So it might be time to merge this PR or get some approvals I'm just a bot so I'll leave it you what to do next. //cc @pablo-garay @ko3n1g |
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approved per @oyilmaz-nvidia
The CI tests passed. The final call to upload code coverage failed because it included a -branch
param which is not consistent with how other tests are invoked.
Support for exporting huggingface models to trtllm, and deploying it on a triton server