Skip to content

Run Generative AI models using native OpenVINO C++ API

License

Notifications You must be signed in to change notification settings

rk119/openvino.genai

 
 

Repository files navigation

OpenVINO™ GenAI

The OpenVINO™ GenAI repository consists of the GenAI library and additional GenAI samples.

OpenVINO™ GenAI Library

OpenVINO™ GenAI is a flavor of OpenVINO, aiming to simplify running inference of generative AI models. It hides the complexity of the generation process and minimizes the amount of code required.

For installation and usage instructions, refer to the GenAI Library README.

OpenVINO™ GenAI Samples

The OpenVINO™ GenAI repository contains pipelines that implement image and text generation tasks. The implementation uses OpenVINO capabilities to optimize the pipelines. Each sample covers a family of models and suggests certain modifications to adapt the code to specific needs. It includes the following pipelines:

  1. Benchmarking script for large language models
  2. Text generation samples that support most popular models like LLaMA 2:
  3. Stable Diffuison (with LoRA) C++ image generation pipeline
  4. Latent Consistency Model (with LoRA) C++ image generation pipeline

Requirements

Requirements may vary for different samples. See respective readme files for more details, and make sure to install the OpenVINO version listed there. Refer to documentation to see how to install OpenVINO.

The supported devices are CPU and GPU including Intel discrete GPU.

See also: https://docs.openvino.ai/2023.3/gen_ai_guide.html.

License

The OpenVINO™ GenAI repository is licensed under Apache License Version 2.0. By contributing to the project, you agree to the license and copyright terms therein and release your contribution under these terms.

About

Run Generative AI models using native OpenVINO C++ API

Resources

License

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 53.2%
  • C++ 44.7%
  • CMake 1.8%
  • Dockerfile 0.3%