The performance (runtime) of any AI model is influenced by its size and precision. AI model developers spend time in optimizing the model size/architecture and precision to achieve better runtime performance. However, there is a limit to reducing model size and precision without losing model quality.
mjayw2014/rvm_perf_inference
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
