List model specifications including quantization

The web UI shows the model and context length, but it would be helpful to know the quantization too. Maybe quantization info fits in the web UI, but a separate page listing the model specs would also be a good idea (also for API users).

This post was merged into

Go to new post
Date

About 1 year ago

Author

arjan