Cross-Lingual Knowledge Alignment Analysis

February 6, 2026 · View on GitHub

Research Question: How consistently is the same concept represented across different languages in a multilingual embedding model?

Motivation

Multilingual models map text into a shared embedding space where semantically equivalent concepts should have similar vectors regardless of language. This cross-lingual alignment enables cross-lingual retrieval, zero-shot transfer, and multilingual search—but alignment quality varies by concept type, language pair, and model architecture.

This experiment quantifies alignment using FAISS for efficient similarity search across 15 concepts in 8 languages.

Experimental Setup

Model: sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 (384-dim, 50+ languages)

Languages: English, Spanish, French, German (Latin); Chinese (Hanzi); Japanese (Kanji/Kana); Arabic; Russian (Cyrillic)

Concepts (15): Abstract (freedom, love, justice, knowledge, peace) · Concrete (water, sun, tree, house, mountain) · Scientific (gravity, energy) · Emotion (happiness, fear, anger)

Methodology

Generate embeddings for each concept × language (120 vectors)
L2-normalize for cosine similarity via inner product
Build FAISS IndexFlatIP index
Compute metrics:
- Intra-concept similarity: Avg pairwise cosine similarity between translations
- Nearest neighbor accuracy: % of k-NN that are same concept
- Cross-lingual retrieval: Accuracy retrieving correct concept across language pairs

Results

Overall Alignment Score: 0.938 (Excellent)

The model demonstrates strong cross-lingual alignment, with concepts clustering by meaning rather than by language.

Metric Breakdown

Metric	Score	Description
Intra-Concept Similarity	0.914	Avg cosine similarity within same concept across languages
Nearest Neighbor Accuracy (k=3)	96.1%	% of nearest neighbors that are the same concept
Cross-Lingual Retrieval	93.9%	Accuracy retrieving correct concept across language pairs

Intra-Concept Similarity by Concept

Concept          Similarity    Quality
─────────────────────────────────────────
justice          0.9894        Excellent
water            0.9856        Excellent
knowledge        0.9840        Excellent
energy           0.9800        Excellent
freedom          0.9798        Excellent
fear             0.9692        Excellent
happiness        0.9582        Excellent
love             0.9365        Excellent
sun              0.9349        Excellent
house            0.9330        Excellent
peace            0.8749        Good
gravity          0.8528        Good
tree             0.7966        Moderate
mountain         0.7875        Moderate
anger            0.7456        Moderate
─────────────────────────────────────────
Average          0.9139

Language Centroid Distances

Distances between language centroids (lower = more overlap in embedding space):

Closest pairs (best aligned):

French ↔ Russian: 0.004
Chinese ↔ Japanese: 0.006
Chinese ↔ Arabic: 0.011
English ↔ French: 0.012

Most distant pairs:

English ↔ Japanese: 0.045
German ↔ Japanese: 0.036
German ↔ Chinese: 0.034

Misaligned Concept Pairs

Pairs with similarity < 0.5 (indicating poor alignment):

Concept	Language Pair	Similarity
tree	English ↔ German	0.308
tree	German ↔ Arabic	0.315
tree	Spanish ↔ German	0.353
mountain	Spanish ↔ German	0.363
tree	French ↔ German	0.366
mountain	English ↔ German	0.367
tree	German ↔ Chinese	0.378
tree	German ↔ Russian	0.379
mountain	German ↔ Arabic	0.399
anger	English ↔ Spanish	0.402

Findings

1. Concept Type Matters

Best aligned (>0.95): Scientific/universal concepts (water, energy, justice, knowledge) and basic emotions (fear). Worst aligned (<0.85): Nature concepts with cultural variation (tree, mountain) and complex emotions (anger, peace). Universal, unambiguous meanings align better than culturally-loaded terms.

python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python cross_lingual_alignment.py

To test different models, pass model_name to CrossLingualAlignmentExperiment.

Files

File	Description
`cross_lingual_alignment.py`	Main experiment code
`fig/experiment_diagram.png`	Experiment design diagram
`fig/cross_lingual_alignment_figure.png`	Results visualization
`alignment_results.json`	Raw results (JSON)
`requirements.txt`	Dependencies

References

License

MIT

Cross-Lingual Knowledge Alignment Analysis

Motivation

Experimental Setup

Methodology

Results

Overall Alignment Score: 0.938 (Excellent)

Metric Breakdown

Intra-Concept Similarity by Concept

Language Centroid Distances

Misaligned Concept Pairs

Findings

1. Concept Type Matters

2. German Embeddings Show Systematic Divergence

3. Script Similarity ≠ Embedding Similarity

4. Strong Discriminability Between Concepts

5. Emotion Concepts Show Cultural Variation

Visualization

Implications

Reproducing This Experiment

Files

References

License