From 2045af58c8b3b111be74d064ba4e4f5a251e69c8 Mon Sep 17 00:00:00 2001
From: Angel-HF <49835792+Pelochus@users.noreply.github.com>
Date: Sat, 6 Apr 2024 12:51:36 +0200
Subject: [PATCH] Update README.md with conversion

---
 README.md | 20 ++++++++++++++++++++
 1 file changed, 20 insertions(+)

diff --git a/README.md b/README.md
index 95d51cf..b49f8b6 100755
--- a/README.md
+++ b/README.md
@@ -21,6 +21,26 @@ Run (cd is required):
 # TODO
 ```
 
+## Converting LLMs for Rockchip's NPUs
+### Docker
+To convert a model, you need an x86 Linux PC (Intel or AMD). Rockchip currently does not provide ARM support for converting models, so this can't be done on an Orange Pi or similar board.
+Run:
+
+`docker run -it pelochus/ezrkllm-toolkit:1.0 bash`
+
+Then, inside the Docker container:
+
+```bash
+apt install -y python3-tk # This requires some configuration on your part
+cd ezrknn-llm/rkllm-toolkit/examples/huggingface/
+```
+
+Now edit `test.py` to select your preferred model. This container provides Qwen-1.8B and LLaMa2 Uncensored. By default, Qwen-1.8B is selected. To convert the model, run:
+
+`python3 test.py`
+
+I currently cannot convert the models, so I don't know what the output will be. I believe this is Rockchip's fault. Let me know if the conversion works for you, or what error you get.
+
 # Original README starts below