David's raw ML reference notes

❯

01 Statistical (machine) learning (data science)

❯

30 Model classes

❯

10 Neural networks

❯

03 Architectures

❯

Two-tower neural network

Two-tower neural network

Feb 14, 20251 min read

A two-tower neural network is not really a single neural network. Rather, it is a pair of embedding models that are aligned via a contrastive loss function such that resulting pairs of embeddings can be compared by a distance metric. It is widely used for collaborative filtering, where the embedding models are usually a stack of fully connected layers. However, it is also used in other contexts. For example, CLIP employs a text transformer and a vision transformer to align text to images.

Graph View

Backlinks

Comparing Siamese, triplet, and two-tower networks
Triplet loss
BLIP-2
CLIP, BLIP, and BLIP-2
Contrastive image-language pre-training (CLIP)
Collaborative filtering using deep learning
Collaborative filtering
Vertex Matching Engine narrative

Created with Quartz v4.4.0 © 2025

Terms of Use
LinkedIn
Buy me a coffee