Qualcomm® AI Hub Models

June 30, 2026 · View on GitHub

Release Tag PyPi Python 3.10, 3.11, 3.12, 3.13

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for deployment on Qualcomm® devices.

See supported: On-Device Runtimes, Hardware Targets & Precision, Chipsets, Devices

 

NEW: Quick Start with the AI Hub Models CLI

Use our lightweight command-line interface to browse and download from the collection of Qualcomm® AI Hub Models.

pip install qai_hub_models_cli # (the CLI is also available with the qai-hub-models package)

qai-hub-models models                                                # browse the catalog
qai-hub-models info mobilenet_v2                                     # model details + download options
qai-hub-models fetch mobilenet_v2 --runtime tflite --precision float # download a deployable asset
# ... and more

The CLI also offers a Python API.

See the CLI README for full usage instructions.

 

Setup

1. Install Python Package

The package is available via pip:

# NOTE for Snapdragon X Elite and Snapdragon X2 Elite users:
# Only AMDx64 (64-bit) Python is supported on Windows.
# Installation will fail when using Windows ARM64 Python.

pip install qai_hub_models

Some models (e.g. YOLOv7) require additional dependencies. View the model README (at qai_hub_models/models/model_id) for installation instructions.

 

2. Configure AI Hub Workbench Access

Many features of AI Hub Models (such as model compilation, on-device profiling, etc.) require access to Qualcomm® AI Hub Workbench:

 

Getting Started

Export and Run A Model on a Physical Device

All models in our directory can be compiled and profiled on a hosted Qualcomm® device:

pip install "qai_hub_models[yolov7]"

python -m qai_hub_models.models.yolov7.export [--target-runtime ...] [--device ...] [--help]

Using Qualcomm® AI Hub Workbench, the export script will:

  1. Compile the model for the chosen device and target runtime (see: Compiling Models on AI Hub Workbench).
  2. If applicable, Quantize the model (see: Quantization on AI Hub Workbench)
  3. Profile the compiled model on a real device in the cloud (see: Profiling Models on AI Hub Workbench).
  4. Run inference with a sample input data on a real device in the cloud, and compare on-device model output with PyTorch output (see: Running Inference on AI Hub Workbench)
  5. Download the compiled model to disk.

 

End-To-End Model Demos

Most models in our directory contain CLI demos that run the model end-to-end:

pip install "qai_hub_models[yolov7]"
# Predict and draw bounding boxes on the provided image
python -m qai_hub_models.models.yolov7.demo [--image ...] [--eval-mode {fp,on-device}] [--help]

End-to-end demos:

  1. Preprocess human-readable input into model input
  2. Run model inference
  3. Postprocess model output to a human-readable format

Many end-to-end demos use AI Hub Workbench to run inference on a real cloud-hosted device (with --eval-mode on-device). All end-to-end demos can also run locally via PyTorch (with --eval-mode fp).

 

Sample Applications

Native applications that can run our models (with pre- and post-processing) on physical devices are published in the AI Hub Apps repository.

Python applications are defined for all models (from qai_hub_models.models.<model_name> import App). These apps wrap model inference with pre- and post-processing steps written using torch & numpy. These apps are optimized to be an easy-to-follow example, rather than to minimize prediction time.

 

Model Support Data

On-Device Runtimes

RuntimeSupported OS
Qualcomm AI Engine DirectAndroid, Linux, Windows
LiteRT (TensorFlow Lite)Android, Linux
ONNXAndroid, Linux, Windows

Device Hardware & Precision

Device Compute UnitSupported Precision
CPUFP32, INT16, INT8
GPUFP32, FP16
NPU (includes Hexagon DSP, HTP)FP16*, INT16, INT8

*Some older chipsets do not support fp16 inference on their NPU.

Chipsets

and many more.

Devices

  • Samsung Galaxy S21, S22, S23, S24, and S25 Series
  • Xiaomi 12, 13, 15, and 17
  • Snapdragon X Elite CRD and Snapdragon X2 Elite CRD (Compute Reference Device)
  • Qualcomm RB3 Gen 2, RB5 Gen 2, IQ-8, IQ-9

and many more.

 

Model Directory

Computer Vision

ModelREADME
Image Classification
Beitqai_hub_models.models.beit
ConvNext-Baseqai_hub_models.models.convnext_base
ConvNext-Tinyqai_hub_models.models.convnext_tiny
DLA-102-Xqai_hub_models.models.dla102x
DenseNet-121qai_hub_models.models.densenet121
EfficientFormerqai_hub_models.models.efficientformer
EfficientNet-B0qai_hub_models.models.efficientnet_b0
EfficientNet-B4qai_hub_models.models.efficientnet_b4
EfficientNet-V2-sqai_hub_models.models.efficientnet_v2_s
EfficientViT-b2-clsqai_hub_models.models.efficientvit_b2_cls
EfficientViT-l2-clsqai_hub_models.models.efficientvit_l2_cls
GPUNetqai_hub_models.models.gpunet
GoogLeNetqai_hub_models.models.googlenet
Inception-v3qai_hub_models.models.inception_v3
InternImageqai_hub_models.models.internimage
LeViTqai_hub_models.models.levit
MNASNet05qai_hub_models.models.mnasnet05
Mobile-VITqai_hub_models.models.mobile_vit
MobileNet-v2qai_hub_models.models.mobilenet_v2
MobileNet-v3-Largeqai_hub_models.models.mobilenet_v3_large
MobileNet-v3-Smallqai_hub_models.models.mobilenet_v3_small
NASNetqai_hub_models.models.nasnet
RegNetqai_hub_models.models.regnet
RegNet-Y-800MFqai_hub_models.models.regnet_y_800mf
ResNeXt101qai_hub_models.models.resnext101
ResNeXt50qai_hub_models.models.resnext50
ResNet101qai_hub_models.models.resnet101
ResNet18qai_hub_models.models.resnet18
ResNet50qai_hub_models.models.resnet50
Sequencer2Dqai_hub_models.models.sequencer2d
Shufflenet-v2qai_hub_models.models.shufflenet_v2
SqueezeNet-1.1qai_hub_models.models.squeezenet1_1
Swin-Baseqai_hub_models.models.swin_base
Swin-Smallqai_hub_models.models.swin_small
Swin-Tinyqai_hub_models.models.swin_tiny
SwinV2-Baseqai_hub_models.models.swinv2_base
VITqai_hub_models.models.vit
WideResNet50qai_hub_models.models.wideresnet50
Image Editing
AOT-GANqai_hub_models.models.aotgan
DDColorqai_hub_models.models.ddcolor
DnCNNqai_hub_models.models.dncnn
LaMa-Dilatedqai_hub_models.models.lama_dilated
NAFNet-DeBlurqai_hub_models.models.nafnet_deblur
NAFNet-DeNoiseqai_hub_models.models.nafnet_denoise
Super Resolution
ESRGANqai_hub_models.models.esrgan
NAFSSRqai_hub_models.models.nafssr
QuickSRNetLargeqai_hub_models.models.quicksrnetlarge
QuickSRNetMediumqai_hub_models.models.quicksrnetmedium
QuickSRNetSmallqai_hub_models.models.quicksrnetsmall
Real-ESRGAN-General-x4v3qai_hub_models.models.real_esrgan_general_x4v3
Real-ESRGAN-x4plusqai_hub_models.models.real_esrgan_x4plus
SESR-M5qai_hub_models.models.sesr_m5
XLSRqai_hub_models.models.xlsr
Semantic Segmentation
DDRNet23-Slimqai_hub_models.models.ddrnet23_slim
DeepLabV3-Plus-MobileNetqai_hub_models.models.deeplabv3_plus_mobilenet
DeepLabXceptionqai_hub_models.models.deeplab_xception
EdgeTAMqai_hub_models.models.edgetam
FCN-ResNet50qai_hub_models.models.fcn_resnet50
FFNet-122NS-LowResqai_hub_models.models.ffnet_122ns_lowres
FFNet-40Sqai_hub_models.models.ffnet_40s
FFNet-54Sqai_hub_models.models.ffnet_54s
FFNet-78Sqai_hub_models.models.ffnet_78s
FFNet-78S-LowResqai_hub_models.models.ffnet_78s_lowres
FastSam-Sqai_hub_models.models.fastsam_s
FastSam-Xqai_hub_models.models.fastsam_x
HRNet-W48-OCRqai_hub_models.models.hrnet_w48_ocr
Mask2Formerqai_hub_models.models.mask2former
MaskRCNNqai_hub_models.models.maskrcnn
MediaPipe-Selfie-Segmentationqai_hub_models.models.mediapipe_selfie
MobileSamqai_hub_models.models.mobilesam
PSPNetqai_hub_models.models.pspnet
PidNetqai_hub_models.models.pidnet
PointNetqai_hub_models.models.pointnet
SINetqai_hub_models.models.sinet
SalsaNextqai_hub_models.models.salsanext
Segformer-Baseqai_hub_models.models.segformer_base
Segment-Anything-Model-2qai_hub_models.models.sam2
Segment-Anything-Model-3qai_hub_models.models.sam3
Unet-Segmentationqai_hub_models.models.unet_segmentation
YOLO26-Segmentationqai_hub_models.models.yolo26_seg
YOLOE-Segmentationqai_hub_models.models.yoloe_seg
YOLOv11-Segmentationqai_hub_models.models.yolov11_seg
YOLOv8-Segmentationqai_hub_models.models.yolov8_seg
Video Classification
ResNet-2Plus1Dqai_hub_models.models.resnet_2plus1d
ResNet-3Dqai_hub_models.models.resnet_3d
ResNet-Mixed-Convolutionqai_hub_models.models.resnet_mixed
Video-MAEqai_hub_models.models.video_mae
Video Generation
First-Order-Motion-Modelqai_hub_models.models.fomm
Video Object Tracking
Track-Anythingqai_hub_models.models.track_anything
Object Detection
3D-Deep-BOXqai_hub_models.models.deepbox
CavaFaceqai_hub_models.models.cavaface
CenterNet-2Dqai_hub_models.models.centernet_2d
Conditional-DETR-ResNet50qai_hub_models.models.conditional_detr_resnet50
DETR-ResNet101qai_hub_models.models.detr_resnet101
DETR-ResNet101-DC5qai_hub_models.models.detr_resnet101_dc5
DETR-ResNet50qai_hub_models.models.detr_resnet50
DETR-ResNet50-DC5qai_hub_models.models.detr_resnet50_dc5
Detectron2-Detectionqai_hub_models.models.detectron2_detection
Facial-Attribute-Detectionqai_hub_models.models.face_attrib_net
HRNetFaceqai_hub_models.models.hrnet_face
Lightweight-Face-Detectionqai_hub_models.models.face_det_lite
MediaPipe-Face-Detectionqai_hub_models.models.mediapipe_face
MediaPipe-Hand-Detectionqai_hub_models.models.mediapipe_hand
MediaPipe-Hand-Gesture-Recognitionqai_hub_models.models.mediapipe_hand_gesture
PPE-Detectionqai_hub_models.models.gear_guard_net
Person-Foot-Detectionqai_hub_models.models.foot_track_net
RF-DETRqai_hub_models.models.rf_detr
RTMDetqai_hub_models.models.rtmdet
ResNet34-SSDqai_hub_models.models.resnet34_ssd1200
YOLO26-Detectionqai_hub_models.models.yolo26_det
YOLOv10-Detectionqai_hub_models.models.yolov10_det
YOLOv11-Detectionqai_hub_models.models.yolov11_det
YOLOv8-Detectionqai_hub_models.models.yolov8_det
YOLOv8-OBBqai_hub_models.models.yolov8_obb
YOLOv9-Detectionqai_hub_models.models.yolov9_det
Yolo-Rqai_hub_models.models.yolor
Yolo-Xqai_hub_models.models.yolox
Yolo-v3qai_hub_models.models.yolov3
Yolo-v5qai_hub_models.models.yolov5
Yolo-v6qai_hub_models.models.yolov6
Yolo-v7qai_hub_models.models.yolov7
Pose Estimation
CenterNet-Poseqai_hub_models.models.centernet_pose
Facial-Landmark-Detectionqai_hub_models.models.facemap_3dmm
HRNetPoseqai_hub_models.models.hrnet_pose
LiteHRNetqai_hub_models.models.litehrnet
MediaPipe-Pose-Estimationqai_hub_models.models.mediapipe_pose
Posenet-Mobilenetqai_hub_models.models.posenet_mobilenet
RTMPose-Body2dqai_hub_models.models.rtmpose_body2d
SixDRepNetqai_hub_models.models.sixd_repnet
YOLO26-Poseqai_hub_models.models.yolo26_pose
YOLOv11-Poseqai_hub_models.models.yolov11_pose
Gaze Estimation
EyeGazeqai_hub_models.models.eyegaze
Depth Estimation
CREStereoqai_hub_models.models.crestereo
Depth-Anythingqai_hub_models.models.depth_anything
Depth-Anything-V2qai_hub_models.models.depth_anything_v2
Depth-Anything-V3qai_hub_models.models.depth_anything_v3
Midas-V2qai_hub_models.models.midas
StereoNetqai_hub_models.models.stereonet
Driver Assistance
BEVDetqai_hub_models.models.bevdet
BEVFusionqai_hub_models.models.bevfusion_det
CVTqai_hub_models.models.cvt
CenterNet-3Dqai_hub_models.models.centernet_3d
CenterPointqai_hub_models.models.centerpoint
GKTqai_hub_models.models.gkt
RangeNet-Plus-Plusqai_hub_models.models.rangenet_plus_plus
StateTransformerqai_hub_models.models.statetransformer
Robotics
ACTqai_hub_models.models.act

Multimodal

ModelREADME
EasyOCRqai_hub_models.models.easyocr
MiniLM-v2qai_hub_models.models.minilm_v2
Nomic-Embed-Textqai_hub_models.models.nomic_embed_text
OpenAI-Clipqai_hub_models.models.openai_clip
OpusMT-En-Esqai_hub_models.models.opus_mt_en_es
OpusMT-En-Zhqai_hub_models.models.opus_mt_en_zh
OpusMT-Es-Enqai_hub_models.models.opus_mt_es_en
OpusMT-Zh-Enqai_hub_models.models.opus_mt_zh_en
Pi0.5qai_hub_models.models.pi05
TrOCRqai_hub_models.models.trocr

Audio

ModelREADME
Speech Recognition
Distil-Whisperqai_hub_models.models.distil_whisper
Whisper-Baseqai_hub_models.models.whisper_base
Whisper-Large-V3-Turboqai_hub_models.models.whisper_large_v3_turbo
Whisper-Mediumqai_hub_models.models.whisper_medium
Whisper-Smallqai_hub_models.models.whisper_small
Whisper-Small-Quantizedqai_hub_models.models.whisper_small_quantized
Whisper-Tinyqai_hub_models.models.whisper_tiny
Zipformerqai_hub_models.models.zipformer
Audio Classification
YamNetqai_hub_models.models.yamnet
Audio Generation
MeloTTS-ENqai_hub_models.models.melotts_en
MeloTTS-ESqai_hub_models.models.melotts_es
MeloTTS-ZHqai_hub_models.models.melotts_zh
PiperTTS-DEqai_hub_models.models.pipertts_de
PiperTTS-ENqai_hub_models.models.pipertts_en
PiperTTS-ITqai_hub_models.models.pipertts_it

Generative AI

ModelREADME
Image Generation
ControlNet-Cannyqai_hub_models.models.controlnet_canny
Stable-Diffusion-v1.5qai_hub_models.models.stable_diffusion_v1_5
Stable-Diffusion-v2.1qai_hub_models.models.stable_diffusion_v2_1
Text Generation
Albert-Base-V2-Hfqai_hub_models.models.albert_base_v2_hf
Bert-Base-Uncased-Hfqai_hub_models.models.bert_base_uncased_hf
Distil-Bert-Base-Uncased-Hfqai_hub_models.models.distil_bert_base_uncased_hf
Electra-Bert-Base-Discrim-Googleqai_hub_models.models.electra_bert_base_discrim_google
Falcon3-7B-Instructqai_hub_models.models.falcon_v3_7b_instruct
IBM-Granite-v3.1-8B-Instructqai_hub_models.models.ibm_granite_v3_1_8b_instruct
IndusQ-1.1Bqai_hub_models.models.indus_1b
JAIS-6p7b-Chatqai_hub_models.models.jais_6p7b_chat
Llama-SEA-LION-v3.5-8B-Rqai_hub_models.models.llama_v3_1_sea_lion_3_5_8b_r
Llama-v3-8B-Instructqai_hub_models.models.llama_v3_8b_instruct
Llama-v3-ELYZA-JP-8Bqai_hub_models.models.llama_v3_elyza_jp_8b
Llama-v3.1-8B-Instructqai_hub_models.models.llama_v3_1_8b_instruct
Llama-v3.2-1B-Instructqai_hub_models.models.llama_v3_2_1b_instruct
Llama-v3.2-3B-Instructqai_hub_models.models.llama_v3_2_3b_instruct
Llama-v3.2-3B-Instruct-SSDqai_hub_models.models.llama_v3_2_3b_instruct_ssd
Llama3-TAIDE-LX-8B-Chat-Alpha1qai_hub_models.models.llama_v3_taide_8b_chat
Mistral-7B-Instruct-v0.3qai_hub_models.models.mistral_7b_instruct_v0_3
Mobile-Bert-Uncased-Googleqai_hub_models.models.mobile_bert_uncased_google
PLaMo-1Bqai_hub_models.models.plamo_1b
Phi-3.5-Mini-Instructqai_hub_models.models.phi_3_5_mini_instruct
Qwen2-7B-Instructqai_hub_models.models.qwen2_7b_instruct
Qwen2.5-VL-7B-Instructqai_hub_models.models.qwen2_5_vl_7b_instruct
Qwen3-4Bqai_hub_models.models.qwen3_4b
Qwen3-4B-Instruct-2507qai_hub_models.models.qwen3_4b_instruct_2507
Qwen3-8Bqai_hub_models.models.qwen3_8b

Need help?

Slack: https://aihub.qualcomm.com/community/slack

GitHub Issues: https://github.com/qualcomm/ai-hub-models/issues

Email: ai-hub-support@qti.qualcomm.com.

LICENSE

Qualcomm® AI Hub Models is licensed under BSD-3. See the LICENSE file.