QERL

// QeRL enables RL for 32B LLMs on a single H100 GPU.

QeRL

QeRL enables RL for 32B LLMs on a single H100 GPU.

13EmergingUnknown

See Alternatives Compare...

License

Apache-2.0

Updated

1mo ago

What it does

https://github.com/user-attachments/assets/3c9b5b04-0d44-4b68-a4af-059b3d834fc3 QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs [Paper] Wei Huang, Yi Ge, Shuai Yang, Yicheng Xiao, Huizi Mao, Yujun Lin, Hanrong Ye, Sifei Liu, Ka Chun Cheung, Hongxu Yin, Yao Lu, Xiaojuan Qi, Song Han, Yukang Chen We propose QeRL, a Quantization-enhanced Reinforcement Learning

Getting Started

git

git clone https://github.com/NVlabs/QeRL

Links

Platforms

🪟windows🍎mac🐧linux

Install Difficulty

moderate

Built With

python

Community Reactions

Similar Tools

See all alternatives →

36Emerging

Openvpn_webpanel_manager

A powerful, self-hosted web panel for managing OpenVPN servers, users, resellers (sub-admins), and multi-node deploym...

Active190open-source

36Emerging

Hink

Link Shortener for Hackers

Active158open-source

36Emerging

HackerBook

Hacker Book - COMMUNITY, ALL THE HN ARE BELONG TO YOU. An unkillable, static offline archive of all of Hacker News.

Active187open-source

36Emerging

Obsidian Life Tracker Base View

Capture and visualize the data that matters in your life

Active179open-source

36Emerging

Vyuh_node_flow

A flexible, high-performance node-based flow editor for Flutter. Build visual programming interfaces, workflow editor...

Active171open-source

36Emerging

Db Studio

The modern pgAdmin alternative that works with every database.

Active171open-source