NV

Nano vLLM

Efficient vLLM-style inference for smaller language models.

LLMs
306.3K views
0
Launch App

About Nano vLLM

Nano vLLM is a software application that implements a vLLM-style inference engine, focusing on efficiency and reduced resource consumption. It's designed for deploying smaller language models effectively, making them accessible on devices with limited computing power.

App Information

Version1.0.0
Category
LLMs
PricingFree
Published

Developer

UN

Unknown