Use OpenVINO with your Intel integrated GPU
To use OpenVINO with your Intel integrated GPU, you typically follow a few main steps: install it, check your hardware, get a model, convert the model to OpenVINO format, and run inference on CPU/GPU.
Below is a simple beginner workflow.
1️⃣ Install OpenVINO
The easiest way is with Python + pip.
Install core OpenVINO
pip install openvino
Install extra tools (model conversion, optimization)
pip install openvino-dev
Verify installation:
python -c "import openvino as ov; print(ov.__version__)"
2️⃣ Check available hardware
OpenVINO can use:
- CPU
- Intel integrated GPU
- Intel NPU (on new chips)
Run this Python script:
import openvino as ov
core = ov.Core()
print(core.available_devices)
Example output:
['CPU', 'GPU']
If you see GPU, your Intel iGPU is supported.
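Since `core.available_devices` is just a list of strings, you can encode the "prefer GPU, fall back to CPU" choice in a small helper. `pick_device` below is a hypothetical convenience function, not part of the OpenVINO API; it only implements the selection logic described above.

```python
def pick_device(available, preferred=("GPU", "CPU")):
    """Return the first preferred device present in `available`.

    `available` is the list returned by core.available_devices.
    Falls back to "CPU", which OpenVINO always provides.
    """
    for dev in preferred:
        if dev in available:
            return dev
    return "CPU"

print(pick_device(["CPU", "GPU"]))  # GPU
print(pick_device(["CPU"]))         # CPU
```

You could then pass the result straight to `core.compile_model(model, pick_device(core.available_devices))`.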
3️⃣ Get a model
OpenVINO works with models from:
- Hugging Face
- PyTorch
- TensorFlow
- ONNX
Example small models:
- Phi-3 Mini
- Qwen2 3B
These smaller models are a better fit for the limited memory of integrated GPUs.
4️⃣ Convert the model to OpenVINO format
OpenVINO uses IR format (.xml + .bin).
Example conversion in Python (the mo tool ships with the openvino-dev package; newer releases also offer ov.convert_model):
from openvino.tools import mo
from openvino.runtime import serialize

ov_model = mo.convert_model("model.onnx")  # load and convert the ONNX model
serialize(ov_model, "model.xml")           # writes model.xml and model.bin
Or from the CLI:
mo --input_model model.onnx
Output:
model.xml
model.bin
These files are optimized for OpenVINO.
5️⃣ Run inference on GPU
Example Python code:
import numpy as np
import openvino as ov

core = ov.Core()
model = core.read_model("model.xml")
compiled_model = core.compile_model(model, "GPU")

# input_data must match the model's input shape and dtype, e.g.:
input_data = np.zeros(tuple(compiled_model.input(0).shape), dtype=np.float32)

infer_request = compiled_model.create_infer_request()
result = infer_request.infer({0: input_data})  # key is the input's index or name
print(result)
Device options:
"CPU"
"GPU"
"AUTO"
A convenient default:
compiled_model = core.compile_model(model, "AUTO")
With "AUTO", OpenVINO selects the best available device for you.
6️⃣ Example architecture
Typical workflow:
HuggingFace model
↓
convert to ONNX
↓
convert to OpenVINO IR
↓
OpenVINO runtime
↓
CPU + Intel GPU acceleration
⚠️ Important for LLMs
Intel integrated GPUs have limited memory.
Typical limits:
| Model size | Feasibility on an iGPU |
|---|---|
| 3B | good |
| 7B | possible |
| 13B | usually too large |
So you should prefer small models.
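A back-of-the-envelope estimate shows why: model weights take roughly parameters × bits-per-weight ÷ 8 bytes. The `weight_memory_gb` helper below is illustrative arithmetic for weights only (it ignores activations and the KV cache), not a measurement of any particular runtime.

```python
def weight_memory_gb(params_billion, bits_per_weight):
    """Rough weight footprint in GB: params × bits / 8 (decimal GB)."""
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9

for size in (3, 7, 13):
    print(f"{size}B  FP16: {weight_memory_gb(size, 16):5.1f} GB"
          f"  INT4: {weight_memory_gb(size, 4):5.1f} GB")
```

By this estimate a 7B model needs about 14 GB at FP16 but only about 3.5 GB at INT4, which is why small quantized models are the practical choice on integrated GPUs.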
✅ If your goal is an OpenClaw replacement
A good stack is:
OpenClaw
↓
OpenVINO inference server
↓
Phi-3 / Qwen2
↓
Intel CPU + iGPU