chore(readme): add notes on HF_TOKEN
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
aarnphm committed Oct 17, 2024
1 parent 0879968 commit 14bdfdf
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions README.md
@@ -162,6 +162,9 @@ OpenLLM supports LLM cloud deployment via BentoML, the unified model serving fra
openllm deploy llama3:8b
```

> [!NOTE]
> If you are deploying a gated model, make sure to add `--env HF_TOKEN=$HF_TOKEN` to the deploy command.

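
As a minimal sketch of the note above, the token check can be wrapped in a small shell function. `deploy_gated` is a hypothetical helper, not part of OpenLLM. It assumes `HF_TOKEN` is already exported in your shell and only combines the `openllm deploy` command with the `--env` flag shown above.

```shell
# Hypothetical helper (not part of OpenLLM): deploy a model tag and
# forward the Hugging Face token to the deployment.
deploy_gated() {
  # Stop with an error if HF_TOKEN is unset or empty.
  : "${HF_TOKEN:?HF_TOKEN must be set to deploy a gated model}"
  openllm deploy "$1" --env HF_TOKEN="$HF_TOKEN"
}
```

Calling `deploy_gated llama3:8b` would then run the same deploy command as above with the token attached, and fail early if the token is missing.
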
Once the deployment is complete, you can run model inference on the BentoCloud console:

<img width="800" alt="bentocloud_ui" src="https://github.com/bentoml/OpenLLM/assets/65327072/4f7819d9-73ea-488a-a66c-f724e5d063e6">