Baseten is a high-performance inference platform for running open source LLMs. This API provides an OpenAI-compatible chat completions endpoint, so it works as a drop-in replacement with any OpenAI SDK or client. Just
POST /v1/chat/completionsSend a conversation to a model and get a completion back. Works exactly like the OpenAI chat completions endpoint. Passhttps://mpp.orthogonal.com/baseten