Description
I first converted my model to the PowerInfer GGUF format by following the steps in the README.
However, my converted model generates strange text output, so I compared my generated GGUF file, weight by weight, with your published GGUF file: https://huggingface.co/PowerInfer/prosparse-llama-2-7b-gguf/blob/main/prosparse-llama-2-7b.gguf. Below is the result of the weight comparison. It looks like LoRA weights have been fused into the Q and K weight matrices. If possible, could you release your fully fused prosparse-llama-2-7b model?
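For anyone who wants to reproduce this kind of check, here is a minimal sketch of a weight comparison. It assumes the tensors from both files have already been loaded into plain dictionaries mapping tensor name to a flat list of floats (for example with the `GGUFReader` class from the `gguf` Python package, which I have left out here). The function name `compare_weights`, the tolerance, and the toy tensor names and values are all hypothetical and only for illustration; they are not the actual comparison result.

```python
from typing import Dict, List, Sequence, Tuple


def compare_weights(
    mine: Dict[str, Sequence[float]],
    reference: Dict[str, Sequence[float]],
    tol: float = 1e-6,
) -> Tuple[List[str], Dict[str, float]]:
    """Return (names present in only one file, {name: max abs difference})."""
    # Tensors that exist in one file but not the other.
    missing = sorted(set(mine) ^ set(reference))

    differing: Dict[str, float] = {}
    for name in sorted(set(mine) & set(reference)):
        a, b = mine[name], reference[name]
        if len(a) != len(b):
            # A size mismatch is reported as an infinite difference.
            differing[name] = float("inf")
            continue
        max_diff = max((abs(x - y) for x, y in zip(a, b)), default=0.0)
        if max_diff > tol:
            differing[name] = max_diff
    return missing, differing


# Toy data (hypothetical): only the Q and K projections differ.
mine = {
    "blk.0.attn_q.weight": [0.10, 0.20],
    "blk.0.attn_k.weight": [0.30, 0.40],
    "blk.0.ffn_up.weight": [0.50, 0.60],
}
reference = {
    "blk.0.attn_q.weight": [0.10, 0.25],
    "blk.0.attn_k.weight": [0.35, 0.40],
    "blk.0.ffn_up.weight": [0.50, 0.60],
}

missing, differing = compare_weights(mine, reference)
for name, diff in differing.items():
    print(f"{name}: max abs diff = {diff:.4g}")
```

If only the `attn_q` and `attn_k` tensors show up in `differing` while every other tensor matches, that pattern would be consistent with something extra (such as a LoRA update) having been merged into just those matrices, though this check alone cannot confirm it.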