Computer Vision Toolbox Model for OpenAI CLIP Network

The Contrastive Learning Image Pre-Training (CLIP) network is a vision language model that can be used for joint image-text classification.

MathWorks Computer Vision Toolbox Team

58 Downloads

(0)

17. Jun 2026

Herunterladen

Verfolgen

Herunterladen

Verfolgen

The CLIP network uses contrastive learning to encode image and textual data into a shared feature space for joint classification. Images and text with high similarity will be close in this feature space, and have a high CLIP score. This further enables image search from input text, and text search from an input image.

Kompatibilität der MATLAB-Version

Kompatibel mit R2026a bis R2026b

Plattform-Kompatibilität

Windows
macOS (Apple Silicon)
macOS (Intel)
Linux

Computer Vision Toolbox Model for OpenAI CLIP Network

Tags

Erfordert

Kompatibilität der MATLAB-Version

Plattform-Kompatibilität