Skip to content

v0.2.0

Compare
Choose a tag to compare
@mikepapadim mikepapadim released this 04 Sep 12:42
· 11 commits to main since this release

Model Support

  • Mistral – support for GGUF-format Mistral models with optimized GPU execution.
  • Qwen2.5 – GGUF-format Qwen2.5 models supported, including performance improvements for attention layers.
  • Qwen3 – compatible with GGUF-format Qwen3 models and updated integration.
  • DeepSeek-R1-Distill-Qwen-1.5B – GGUF-format DeepSeek distilled models supported for efficient inference.
  • Phi-3 – full support for GGUF-format Microsoft Phi-3 models for high-performance workloads.

What's Changed

New Contributors

Full Changelog: v0.1.0-beta...v0.2.0