Nano vLLM
What it does
A lightweight vLLM implementation built from scratch.

- 🚀 Fast offline inference: inference speeds comparable to vLLM
- 📖 Readable codebase: a clean implementation in ~1,200 lines of Python
- ⚡ Optimization suite: prefix caching, tensor parallelism, Torch compilation, CUDA graphs, etc.

The API mirrors vLLM's; see the example script in the repository for usage. Model weights can be downloaded manually; see the repository README for the exact command.
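Since the API mirrors vLLM's, a minimal offline-inference call might look like the sketch below. The `nanovllm` import path, the model directory, the constructor keywords, and the shape of the returned outputs are assumptions modeled on vLLM's interface, not verified against this codebase; a CUDA-capable GPU and downloaded weights are required.

```python
# Sketch only: names below are assumed to follow vLLM's offline API.
from nanovllm import LLM, SamplingParams  # assumed import path

# Point at a locally downloaded model directory (path is illustrative).
llm = LLM("/path/to/model", tensor_parallel_size=1)

sampling_params = SamplingParams(temperature=0.6, max_tokens=256)
prompts = ["Explain tensor parallelism in one sentence."]

# Batch generation, as in vLLM's offline mode.
outputs = llm.generate(prompts, sampling_params)
print(outputs[0])  # one result per input prompt
```

Because the interface tracks vLLM, existing vLLM offline scripts should port over with little more than an import change.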
Getting Started
```shell
git clone https://github.com/GeeeekExplorer/nano-vllm
```
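After cloning, a typical next step is to install the package into the active environment and fetch model weights with the Hugging Face CLI. The editable install and the model name below are assumptions for illustration; substitute whichever model the project's examples expect.

```shell
# Install the cloned package into the current environment
# (assumes the repo ships a setup.py or pyproject.toml).
cd nano-vllm
pip install -e .

# Download model weights manually via the Hugging Face CLI
# (model repo and target directory are illustrative).
huggingface-cli download --resume-download Qwen/Qwen3-0.6B \
  --local-dir ~/huggingface/Qwen3-0.6B
```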