- unbounded edge streams - gpu support - mmap model memory / nbr sampler - async data loading / prefetch thread - docs: mkdocs site, show tguf spec, api ref - conan setup