nakshatra

Distributed LLM inference across heterogeneous workers (NVIDIA / AMD / Apple Silicon / CPU). Splits one model by layer ranges; patched llama.cpp + gRPC chain protocol.
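
As an illustration of what "splits one model by layer ranges" could look like, here is a minimal sketch that assigns contiguous layer ranges to workers in proportion to a per-worker capacity weight. The function name `split_layers`, the weight parameter, and the half-open range convention are assumptions made for this example; they are not taken from the nakshatra code base, which may partition layers differently.

```python
from typing import List, Tuple


def split_layers(num_layers: int, weights: List[float]) -> List[Tuple[int, int]]:
    """Return one half-open range [start, end) of layer indices per worker.

    Hypothetical helper: each worker receives a contiguous slice whose size is
    roughly proportional to its weight (e.g. relative memory or throughput).
    """
    if num_layers <= 0:
        raise ValueError("num_layers must be positive")
    if not weights or any(w <= 0 for w in weights):
        raise ValueError("weights must be a non-empty list of positive numbers")

    total = sum(weights)
    ranges: List[Tuple[int, int]] = []
    start = 0
    cumulative = 0.0
    for i, w in enumerate(weights):
        cumulative += w
        if i == len(weights) - 1:
            end = num_layers  # the last worker always takes the remaining layers
        else:
            end = round(num_layers * cumulative / total)
        end = max(end, start)  # keep ranges monotonic even for tiny weights
        ranges.append((start, end))
        start = end
    return ranges


if __name__ == "__main__":
    # Four equally weighted workers sharing a 32-layer model.
    print(split_layers(32, [1, 1, 1, 1]))
    # One worker with three times the capacity of the other.
    print(split_layers(32, [3, 1]))
```

In a chain arrangement, each worker would run its own range and forward the resulting activations to the worker holding the next range; the sketch covers only the partitioning step, not the transport.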

Python · 3 stars · 0 forks

Files

.gitignore 1.9 KB
CLAUDE.md 2.7 KB
Dockerfile 926 B
LICENSE 1.1 KB
pyproject.toml 333 B
README_PETALS.md 8.9 KB
README.md 10.6 KB
setup.cfg 1.9 KB

The pulse