A local, private AI chat platform that allows developers to measure and compare the performance of various large language model (LLM) endpoints in terms of latency and throughput. Test the capabilities of your LLM endpoints and optimize their performance with this tool.




































