Popular repositories Loading
-
Qwen3.6-27B-AEON-Ultimate-Uncensored-DFlash
Qwen3.6-27B-AEON-Ultimate-Uncensored-DFlash PublicLossless abliteration of Qwen3.6-27B with NVFP4 hardware quantization for DGX Spark / Blackwell. BF16 (51 GB) + NVFP4 (26 GB) deployment guide, docker-compose, and QuickStart.
-
Qwen3.6-NVFP4-DFlash
Qwen3.6-NVFP4-DFlash PublicQwen3.6-35B-A3B-heretic NVFP4 + DFlash speculative decoding on DGX Spark (GB10/sm_121a). Source-built vLLM image + 7 patches + comprehensive deployment guide.
-
vllm-dflash
vllm-dflash PublicDFlash vLLM for DGX Spark — Plug & Play Block-Diffusion Speculative Decoding
-
comfyui-aeon-spark
comfyui-aeon-spark PublicBleeding-edge ComfyUI for NVIDIA DGX Spark (GB10/Blackwell/sm_121a). CUDA 13 + SageAttention v3 (sm_121a) + NVFP4 + 14 custom-node packs + Flux 2 Dev / LTX 2.3 22B / ACE-Step v1.5 XL Turbo pre-bund…
-
supergemma4-26b-abliterated-multimodal-nvfp4
supergemma4-26b-abliterated-multimodal-nvfp4 PublicNVFP4 AWQ Full quantization of SuperGemma4-26B-Abliterated-Multimodal for Blackwell GPUs — pre-built vLLM container + patches included
Python 6
-
Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored
Nemotron-3-Nano-Omni-AEON-Ultimate-Uncensored PublicNemotron-3-Nano-Omni AEON Ultimate Uncensored: 12-D abliterated multimodal reasoning model (BF16 + NVFP4) for DGX Spark / Blackwell. Source-built vLLM v0.20.0 image + 4 patches + bench + DGX Spark …
Python 6
If the problem persists, check the GitHub status page or contact support.
