Manual QNN Model Download ~ AnythingLLM

What is this?

Sometimes you need to download the NPU models manually due to connection issues. This is a manual process but it's quite simple to do and should only be done if you are unable to download the models automatically from selecting them in the GUI on the desktop app.

Download the models

You can download the models from the following links:

Once your zip file is downloaded

Open the models/QNN folder (or create it if it doesn't exist) in the desktop storage folder.
Move the zip file into this folder.
Extract the zip file.

You should now have a folder named with the same name as the zip file and inside it will be the model files.

# Example folder structure
models/QNN/
└── llama_v3_2_3b_chat_8k/
    ├── genie_config.json
    ├── htp_backend_etc.bin
    ├── related-model-bin-file.bin
    └── tokenizer.json

Restart the desktop app. Now the model should be available in the GUI to be selected and used for inference.

Fetch failed error on embed What are beta previews?