
LLM Wrapper Makes Deployment with Nvidia Triton Inference Server Easier

1 point by agcat 10 months ago

1 comment

agcat 10 months ago
An internal hackathon project to help you deploy with Triton easily. You just write your model logic, run a command, and Triton Co-Pilot does the rest. It automatically generates everything you need, uses AI models to configure inputs and outputs, and handles all the tedious parts. You get your Docker container ready to go in seconds.
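The comment doesn't show what Triton Co-Pilot generates, but for context, the "model logic" you hand to Triton's Python backend typically looks something like the sketch below. This is a minimal hand-written example, not the tool's output; the tensor names and the toy transformation are hypothetical.

```python
# Minimal sketch of a Triton Python-backend model (the "model logic"
# the comment refers to). Tensor names and the echo logic are placeholders.
import numpy as np
import triton_python_backend_utils as pb_utils


class TritonPythonModel:
    """Toy model: returns the input text upper-cased."""

    def initialize(self, args):
        # Called once when the model loads; heavy setup such as loading
        # an actual LLM would normally happen here.
        self.model_config = args["model_config"]

    def execute(self, requests):
        responses = []
        for request in requests:
            # "INPUT_TEXT" / "OUTPUT_TEXT" are hypothetical tensor names.
            in_tensor = pb_utils.get_input_tensor_by_name(request, "INPUT_TEXT")
            text = in_tensor.as_numpy()
            out_tensor = pb_utils.Tensor(
                "OUTPUT_TEXT", np.char.upper(text.astype("U"))
            )
            responses.append(
                pb_utils.InferenceResponse(output_tensors=[out_tensor])
            )
        return responses

    def finalize(self):
        # Called when the model is unloaded.
        pass
```

Alongside this file, Triton normally also needs a `config.pbtxt` describing the input and output tensors and a Docker image with the server; those are the tedious parts the comment says the tool generates for you.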