Qt, OpenCV, PyTorch: The Central Dogma of GUI CV Applications

Just as the central dogma in molecular biology describes genetic information flow from DNA to RNA to protein, we can use a similar terminology to describe how pixels flow from GUI to classic CV algorithms to deep learning-based CV models in GUI CV applications.

The Central Dogma in Molecular Biology

The Central Dogma of GUI CV Applications:

graph TD
    QI[RGB32/ARGB32 QImage]
    CV[HWC BGRX 8888 ndarray]
    CHWRGB[CHW RGB 0-1 Tensor]
    PI[HWC RGB 888 PIL.Image]
    
    QI -- copy or zero-copy --> CV
    CV -- copy or zero-copy --> QI
    CV -- OpenCV processing --> CV
    CV -- convert --> CHWRGB
    CV -- convert --> PI

Step-by-Step

RGB32/ARGB32 QImage <-> HWC BGRX 8888 ndarray

Both copy and zero-copy are possible. However, pay attention to memory lifetime and whether the ndarray is contiguous!

HWC BGRX 8888 ndarray -> HWC BGRX 8888 ndarray

A common preprocessing step is letterboxing, such as for YOLO object detection models.

HWC BGRX 8888 ndarray -> CHW RGB 0-1 Tensor

A conversion procedure is required to feed images to PyTorch models, which often uses CHW RGB tensors with pixel values normalized to [0, 1].

HWC BGRX 8888 ndarray -> HWC RGB 888 PIL.Image

A conversion procedure is also required to target PIL.Images.

Data Science, Multimedia, and Process Automation

Qt, OpenCV, PyTorch: The Central Dogma of GUI CV Applications

https://jifengwu2k.github.io/2026/01/11/Qt-OpenCV-PyTorch-The-Central-Dogma-of-GUI-CV-Applications/

Author

Jifeng Wu

Posted on

January 11, 2026

Licensed under

Building an OCaml Project with Dependencies in a Conda Environment Using Dune Previous

Automate Your Workflow with GitHub Actions Next