[Tracker] autoquant v2 tracker #1215

Open
3 of 12 tasks
jerryzh168 opened this issue Nov 1, 2024 · 0 comments

jerryzh168 commented Nov 1, 2024

This issue tracks the further development of the autoquant tool.

Goal

The goal of autoquant is to speed up a broad set of models that we care about with minimal accuracy degradation (configurable by the user), by reliably selecting the most performant quantization method and kernel implementation for the given input shape, for each quantizable layer in the model. It could also be used to select handwritten kernels that are optimized for best performance on a specific model, runtime, and device.
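
The selection step described in the goal can be sketched roughly as below. This is a minimal illustration only, not the autoquant implementation or the torchao API: `Candidate`, `select_candidate`, the layer names, the candidate labels, and all numbers are hypothetical, and the latency and error values are assumed to have been measured beforehand for each layer and input shape.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str          # hypothetical label for a quantization method or kernel
    latency_ms: float  # assumed measured runtime for this layer and input shape
    error: float       # assumed measured accuracy degradation vs. the baseline


def select_candidate(candidates, max_error):
    """Return the fastest candidate whose error is within max_error, or None."""
    allowed = [c for c in candidates if c.error <= max_error]
    if not allowed:
        return None  # nothing fits the budget; the caller leaves the layer as-is
    return min(allowed, key=lambda c: c.latency_ms)


# Illustrative numbers only, keyed by (layer name, input shape).
measurements = {
    ("layers.0.linear", (1, 4096)): [
        Candidate("baseline", 1.00, 0.0),
        Candidate("int8_weight_only", 0.70, 0.002),
        Candidate("int4_weight_only", 0.45, 0.02),
    ],
    ("layers.0.linear", (32, 4096)): [
        Candidate("baseline", 2.00, 0.0),
        Candidate("int8_weight_only", 2.30, 0.002),
        Candidate("int8_dynamic", 1.20, 0.005),
    ],
}

# One decision per (layer, input shape), under a user-configurable error budget.
plan = {
    key: select_candidate(cands, max_error=0.01)
    for key, cands in measurements.items()
}

for (layer, shape), choice in plan.items():
    print(layer, shape, choice.name)
```

Keying the decision on both the layer and the input shape reflects the point in the goal that the fastest option can differ between shapes. Including the unquantized baseline as a candidate means a layer is only quantized when doing so is actually faster within the error budget.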

Performance

Accuracy

jerryzh168 changed the title from "[Tracker] autoquant tracker" to "[Tracker] autoquant v2 tracker" on Nov 1, 2024