# Flash Sparse Attention

🚀🚀 Efficient implementations of Native Sparse Attention
## What it does

This repository provides the official implementation of Flash Sparse Attention (FSA), which includes a novel kernel design that enables efficient Native Sparse Attention (NSA) across a wide range of popular LLMs on modern GPUs.

- News
- Method
- Advantages
- Features
- Installation
- Usage
  - Instantiate FSA Module
  - Train with FSA
- Evaluation
  - Benchmark FSA Module
  - Benchmark FSA Selected
## Getting Started

```shell
git clone https://github.com/Relaxed-System-Lab/Flash-Sparse-Attention
```