bzhng-development
Popular repositories Loading
-
-
sglang
sglang PublicForked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python
-
flashinfer
flashinfer PublicForked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Python
-
-
LeetCUDA
LeetCUDA PublicForked from xlite-dev/LeetCUDA
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Cuda
-
DeepGEMM
DeepGEMM PublicForked from deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda
Repositories
- sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
bzhng-development/sglang’s past year of commit activity - triton-but-documented Public
bzhng-development/triton-but-documented’s past year of commit activity - triton Public Forked from triton-lang/triton
Development repository for the Triton language and compiler
bzhng-development/triton’s past year of commit activity - torchtitan Public Forked from pytorch/torchtitan
A PyTorch native platform for training generative AI models
bzhng-development/torchtitan’s past year of commit activity - Model-Optimizer Public Forked from NVIDIA/Model-Optimizer
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
bzhng-development/Model-Optimizer’s past year of commit activity - SpecForge Public Forked from sgl-project/SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
bzhng-development/SpecForge’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…