The web UI shows the model and context length, but it would be helpful to see the quantization as well. The quantization could be shown in the web UI itself, but a separate page listing the model specs would also be a good idea, since API users would benefit from it too.

arjan
About 1 year ago
