Tier A · smooth
Runs Phi-3.5-mini (3.8B) comfortably流畅跑 Phi-3.5-mini (3.8B)
Cortex-A76/A78 big cores + A55, ARMv8.2 dotprod + FP16, often i8mm, with NPU, 4–8 GB RAM
- Qualcomm: SD 6 Gen 1, 7 Gen 1, 8-series, QCM6490 (IoT), QCM8550
- MediaTek: Dimensity 700 / 800 / 900 / 9000, Helio G99
- Rockchip: RK3588 / RK3588S (6 TOPS NPU — industrial / edge flagship)
- Amlogic: A311D2, S905X4
Measured Phi-3.5-mini Q4_K_M: 6–15 tok/s on CPU, 20–40 tok/s on Hexagon / NPU.
Tier B · feasible
Fits Granite 3.2-2B or SmolLM2-1.7B跑 Granite 3.2-2B 或 SmolLM2-1.7B
All-A55 (dotprod + FP16, no big cores) or A73+A53, 3–4 GB RAM, weak or no NPU
- Qualcomm: QCM4290, SD 4 Gen 2, SD 680 / 685
- MediaTek: Helio G85, G37
- Rockchip: RK3566 / RK3568 (0.8 TOPS NPU)
- UNISOC: T610 / T618 / T620 / T606 / T616 (global mid-tier workhorses)
2B Q4 on CPU: 3–7 tok/s. Phi-3.5-mini fits but 4 GB RAM frequently OOMs.
Tier C · skip
Do not ship on-device LLM here不要上 on-device LLM
Cortex-A53 only (no dotprod) or older, or RAM < 3 GB
- Qualcomm: SD 439 / 450 / 460 and older 4xx
- MediaTek: MT6765 / 6762, Helio A22 / A25
- UNISOC: SC9863A
- Rockchip: RK3326, PX30, RK3399 (A72+A53 — transitional)
- Allwinner: A133, A64, H616 — all A53, skip
It technically boots, but 1–2 tok/s is not an interactive experience. Route to host via adb / ssh / UART instead.