Repositories
Open-source Agent Operating System
Rust
Stars: 7.4k
Forks: 792
Updated today
Run a 1-billion parameter LLM on a $10 board with 256MB RAM
C
Stars: 1.1k
Forks: 119
Updated today
Minimal TPU implementation with 8x8 systolic array and PyTorch integration
Python
Stars: 49
Forks: 5
Updated today
Claude Code for CUDA. Free AI assistant that actually understands GPU architecture
Python
Stars: 82
Forks: 17
Updated today
Forge: Swarm Agents That Turn Slow PyTorch Into Fast CUDA/Triton Kernels
TypeScript
Stars: 7
Forks: 1
Updated today
Pure Triton kernels for Qwen3.5-27B inference on NVIDIA B200
Python
Stars: 6
Forks: 1
Updated today
Open-source transpiler for CUDA Tile (13.1) migration
TypeScript
Stars: 16
Forks: 3
Updated today
Comprehensive GPU specifications database with 2,824 GPUs across NVIDIA, AMD, and Intel
Stars: 57
Forks: 9
Updated today
GPU CI/CD tool that tests CUDA kernels across multiple GPUs in parallel - Part of RightNow
Python
Stars: 13
Updated 2 days ago
Open-source web-based GPU performance visualization tool that transforms NVIDIA profiling data into interactive insights for CUDA engineers. Features timeline views, flame graphs, heatmaps, and AI-powered bottleneck detection.
TypeScript
Stars: 10
Forks: 2
Updated 3 days ago
RightNow Arabic LLM Corpus - One of the largest high-quality Arabic text datasets for LLM training
Stars: 3
Forks: 1
Updated 4 months ago