Neither Here Nor There: Cross-Lingual Representation Dynamics of Code-Mixed Text in Multilingual Encoders

arXiv:2603.19771

Summary

This study investigates how multilingual language models (MLMs) represent Hindi–English code-mixed (CM) text and its relationship with the constituent languages. Using a trilingual corpus of parallel English, Hindi (Devanagari), and Romanized CM sentences, the authors analyze cross-lingual alignment via CKA, token-level saliency, and entropy-based uncertainty. They find that standard MLMs align English and Hindi well, but CM representations remain loosely connected to both. Continued pretraining on CM data improves English–CM alignment at the cost of English–Hindi alignment. Interpretability reveals that models process CM text through an English-dominant subspace, with native-script Hindi reducing uncertainty. To address this, the authors propose a trilingual post-training alignment objective that simultaneously aligns English, Hindi, and CM representations. This approach yields more balanced cross-lingual alignment and improves downstream performance on sentiment analysis and hate speech detection, demonstrating that grounding CM representations in their constituent languages enhances cross-lingual understanding.

PDF viewer

Chunks(54)

Chunk 0 · 1,992 chars

Neither Here Nor There: Cross-Lingual Representation Dynamics of
Code-Mixed Text in Multilingual Encoders
Debajyoti Mazumder1, Divyansh Pathak1, Prashant Kodali2, Jasabanta Patro1
1Indian Institute of Science Education and Research, Bhopal, India
2Microsoft Corporation
debajyoti22@iiserb.ac.in, divyansh22@iiserb.ac.in, kodali.prashant@gmail.com,
jpatro@iiserb.ac.in
Abstract
Multilingual encoder-based language models
are widely adopted for code-mixed analysis
tasks, yet we know surprisingly little about
how they represent code-mixed inputs inter-
nally — or whether those representations mean-
ingfully connect to the constituent languages
being mixed. Using Hindi–English as a case
study, we construct a unified trilingual corpus
of parallel English, Hindi (Devanagari), and
Romanized code-mixed sentences, and probe
cross-lingual representation alignment across
standard multilingual encoders and their code-
mixed adapted variants via CKA, token-level
saliency, and entropy-based uncertainty analy-
sis. We find that while standard models align
English and Hindi well, code-mixed inputs re-
main loosely connected to either language —
and that continued pre-training on code-mixed
data improves English–code-mixed alignment
at the cost of English–Hindi alignment. In-
terpretability analyses further reveal a clear
asymmetry: models process code-mixed text
through an English-dominant semantic sub-
space, while native-script Hindi provides com-
plementary signals that reduce representational
uncertainty. Motivated by these findings, we in-
troduce a trilingual post-training alignment ob-
jective that brings code-mixed representations
closer to both constituent languages simulta-
neously, yielding more balanced cross-lingual
alignment and downstream gains on sentiment
analysis and hate speech detection — showing
that grounding code-mixed representations in
their constituent languages meaningfully helps
cross-lingual understanding.
1 Introduction:
Code-mixing is a linguistic

Chunk 1 · 1,997 chars

languages simulta-
neously, yielding more balanced cross-lingual
alignment and downstream gains on sentiment
analysis and hate speech detection — showing
that grounding code-mixed representations in
their constituent languages meaningfully helps
cross-lingual understanding.
1 Introduction:
Code-mixing is a linguistic phenomenon in mul-
tilingual societies where speakers combine words
or phrases from multiple languages within a single
CM denotes code-mixed text. The terms participating
and constituent languages are used interchangeably.
Batman I know , was humble 	बैटमैन मुझे पता है , िवनम्र था
Batman mujhe pta hai , humble tha
S1
S2 	S3
Vanilla Multilingual
model
S1 > S2 > S3
Code-mixed Adapted
Multilingual model
S2 > S1 > S3
Figure 1: Vanilla multilingual model versus code-mixed
adapted multilingual model. Here, S represents similar-
ity.
sentence (Winata et al., 2021). Despite its preva-
lence, code-mixed text poses challenges for multi-
lingual language models (MLMs), which are typi-
cally pretrained on predominantly monolingual cor-
pora (Devlin et al., 2019; CONNEAU and Lample,
2019; Conneau et al., 2020a; Zhang et al., 2023).
A widely studied example is Hindi–English code-
mixing, often written in Roman script and com-
monly referred to as Hinglish.
To support multilingual understanding, encoder-
based MLMs such as mBERT and XLM-R learn
shared representation spaces in which semantically
equivalent sentences across languages tend to align
(Pallucchini et al., 2025). Such cross-lingual align-
ment is crucial for multilingual NLP because it en-
ables knowledge learned in one language to transfer
to others in downstream tasks such as sentiment
analysis and hate speech detection (de Varda and
Marelli, 2024; Hong et al., 2025). Code-mixed
text provides a natural testbed for studying cross-
lingual alignment because lexical and semantic
cues from multiple constituent languages appear
within a single utterance, requiring models to in-
tegrate signals from different

Chunk 2 · 1,993 chars

ysis and hate speech detection (de Varda and
Marelli, 2024; Hong et al., 2025). Code-mixed
text provides a natural testbed for studying cross-
lingual alignment because lexical and semantic
cues from multiple constituent languages appear
within a single utterance, requiring models to in-
tegrate signals from different languages into a co-
herent representation. Similar to how English can
serve as a pivot language in multilingual models
(Kargaran et al., 2025), linguistically, code-mixed
text rests on the twin foundations of its constituent
monolingual languages. However, most studies on
arXiv:2603.19771v1 [cs.CL] 20 Mar 2026

-- 1 of 24 --

cross-lingual alignment examine representations of
monolingual sentences across languages, leaving
it unclear how multilingual encoders align code-
mixed inputs with their constituent monolingual
languages (Hämmerl et al., 2024). As a result, the
relationship between code-mixed representations
and those of their corresponding monolingual coun-
terparts remains underexplored.
In this study, we examine the representational
dynamics of multilingual encoders when process-
ing code-mixed text relative to their monolingual
counterparts. To enable this analysis, we con-
struct a trilingual corpus of parallel English, Hindi,
and Hindi–English Code-Mixed (CM) sentences
from existing resources. Our analysis examines
how constituent language signals influence MLM
understanding of code-mixed inputs using inter-
pretability tools such as Centered Kernel Align-
ment (CKA), token-level saliency, and entropy-
based uncertainty reduction.
Our observations reveal several key findings.
First, while multilingual models align monolin-
gual English and Hindi representations well, code-
mixed representations remain weakly anchored to
both languages. Second, continued pretraining
on code-mixed data improves English–CM align-
ment but degrades English–Hindi alignment, re-
vealing a trade-off between code-mixed adaptation
and monolingual alignment.

Chunk 3 · 1,996 chars

lin-
gual English and Hindi representations well, code-
mixed representations remain weakly anchored to
both languages. Second, continued pretraining
on code-mixed data improves English–CM align-
ment but degrades English–Hindi alignment, re-
vealing a trade-off between code-mixed adaptation
and monolingual alignment. Third, interpretabil-
ity analyses show that code-mixed representations
are largely explained by the English representa-
tion space, while native-script Hindi contributes
complementary signals that reduce representational
uncertainty.
Building on these insights, we further explore
whether explicit supervision across the participat-
ing languages can improve cross-lingual consis-
tency. Specifically, we introduce a trilingual post-
training alignment stage that encourages represen-
tations of parallel English, Hindi, and code-mixed
sentences to occupy nearby regions in the shared
embedding space. Experiments show that incorpo-
rating trilingual supervision during post-training
leads to more balanced cross-lingual alignment
across language pairs while largely preserving
strong monolingual consistency. To assess whether
improved alignment yields practical benefits, we
additionally evaluate the proposed approach on
downstream tasks for code-mixed text. Our main
contributions are as follows:
• We construct a trilingual corpus from exist-
ing resources comprising parallel EN, HI (De-
vanagari), and CM (Roman) sentences, en-
abling systematic analysis of cross-lingual
alignment.
• Using this corpus, we analyze how multilin-
gual encoders represent code-mixed text rela-
tive to their monolingual counterparts through
CKA-based layer-wise similarity, token-level
saliency, and entropy-based uncertainty analy-
sis.
• Motivated by these insights, we introduce a
trilingual post-training alignment stage that
leverages parallel monolingual and code-
mixed supervision to improve alignment
across English, Hindi, and code-mixed rep-
resentations. We further propose a

Chunk 4 · 1,998 chars

rity, token-level
saliency, and entropy-based uncertainty analy-
sis.
• Motivated by these insights, we introduce a
trilingual post-training alignment stage that
leverages parallel monolingual and code-
mixed supervision to improve alignment
across English, Hindi, and code-mixed rep-
resentations. We further propose a metric,
the Cross-Lingual Alignment Score (CLAS),
to measure balanced cross-lingual alignment
across language pairs.
• We evaluate the proposed alignment stage
on downstream sentiment analysis and hate
speech detection tasks for code-mixed text,
demonstrating improved cross-lingual consis-
tency performance.
2 Related work:
Multilingual Representation Learning. Multilin-
gual language models, including mBERT (Devlin
et al., 2019) and XLM-R (Conneau et al., 2020a),
learn shared representation spaces during multilin-
gual pretraining, enabling effective cross-lingual
transfer. Prior work shows that semantically similar
sentences across languages tend to align in these
spaces even without explicit supervision (Conneau
et al., 2020b; Pallucchini et al., 2025). This cross-
lingual alignment ensures knowledge learned in
one language benefits others in downstream tasks
like NER and sentiment analysis, with methods
like contrastive objectives further enhancing trans-
fer (Kulshreshtha et al., 2020). While this property
supports cross-lingual transfer, most studies (Häm-
merl et al., 2024) focus on monolingual text, leav-
ing the behavior of these models under code-mixed
inputs relatively underexplored.
Code-mixed Setting. Code-mixed language pro-
cessing has received increasing attention in recent
years (Sheth et al., 2025). This growing inter-
est has led to the development of several datasets
and benchmarks for Hindi–English and other code-
mixed language pairs. Resources such as PHINC
2

-- 2 of 24 --

(Srivastava and Singh, 2020) and the LinCE bench-
mark (Aguilar et al., 2020) provide parallel anno-
tated corpora for studying code-mixing in social
media

Chunk 5 · 1,999 chars

est has led to the development of several datasets
and benchmarks for Hindi–English and other code-
mixed language pairs. Resources such as PHINC
2

-- 2 of 24 --

(Srivastava and Singh, 2020) and the LinCE bench-
mark (Aguilar et al., 2020) provide parallel anno-
tated corpora for studying code-mixing in social
media text. Using these datasets, prior work has
primarily focused on downstream tasks such as sen-
timent analysis, hate speech detection, and machine
translation for code-mixed text (Niederreiter and
Gromann, 2025; Appicharla et al., 2025; Shanmu-
gavadivel et al., 2024). Recent studies also explore
continued pretraining on large code-mixed corpora
to improve task performance (Nayak and Joshi,
2022; Yoo et al., 2025). While these approaches
improve task-level performance, they provide lim-
ited insight into how multilingual models internally
represent code-mixed inputs.
Constituent Language Influence. Several prior
works leverage monolingual language data to im-
prove code-mixed sequence labeling and sequence
classification tasks (Jayanthi et al., 2021; Gau-
tam et al., 2021; Kumar et al., 2022; Ògúnr e. `mí
et al., 2023; Mazumder et al., 2025a,b). These ap-
proaches consistently report performance gains on
downstream tasks, demonstrating the practical util-
ity of monolingual signals. Past studies also show
that introducing code-mixing in in-context learn-
ing can improve performance, especially for low-
resource languages (Shankar et al., 2024), while de-
coder LLMs mix languages during their reasoning
process (Wang et al., 2025). This marks a broader
synergy between code-mixed settings and their con-
stituent languages.
Interpretability. Following the success of mul-
tilingual pretraining and fine-tuning approaches,
subsequent works (Rogers et al., 2020; Resck et al.,
2025) have investigated the internal mechanisms of
multilingual language models using interpretability
tools. In particular, representation similarity met-
rics such as CKA and SVCCA have

Chunk 6 · 1,989 chars

ollowing the success of mul-
tilingual pretraining and fine-tuning approaches,
subsequent works (Rogers et al., 2020; Resck et al.,
2025) have investigated the internal mechanisms of
multilingual language models using interpretability
tools. In particular, representation similarity met-
rics such as CKA and SVCCA have been used to an-
alyze how cross-lingual alignment emerges across
transformer layers (Kornblith et al., 2019; Wu et al.,
2020). Similar analyses have also examined how ar-
chitectural factors and training dynamics influence
representational alignment across independently
trained networks (Morcos et al., 2018; Raghu et al.,
2017). Complementary studies (Banerjee et al.,
2025) explore attribution-based methods to un-
derstand token importance and language reliance
within multilingual models.
Despite advances in multilingual modeling and
interpretability, little work has examined how mul-
tilingual models internally represent code-mixed
inputs (Santy et al., 2021). To our knowledge, exist-
Dataset 	# Samples Languages
CM-En parallel (Dhar et al., 2018) 6,096 CM, EN
PHINC (Srivastava and Singh, 2020) 13,738 CM, EN
LINCE 20211 	8060 CM, EN, HI
Table 1: Summary of the datasets used in this work,
where EN = English, HI = Hindi (Devanagari script),
and CM = Hindi (Roman-script) - English code-mixed
text.
ing studies do not systematically compare the repre-
sentations of code-mixed inputs with those of their
monolingual counterparts. Our work addresses this
gap by performing cross-lingual alignment evalua-
tion driven by mechanistic interpretability analyses.
3 Dataset:
We construct a unified trilingual corpus from ex-
isting resources by merging three publicly avail-
able datasets containing Hindi–English code-mixed
text and their monolingual counterparts: CM-En
Parallel (Dhar et al., 2018), PHINC (Srivastava
and Singh, 2020), and the LinCE 2021 multiview
Hinglish dataset1 (Aguilar et al., 2020; Chen et al.,
2022). These resources contain code-mixed

Chunk 7 · 1,996 chars

y merging three publicly avail-
able datasets containing Hindi–English code-mixed
text and their monolingual counterparts: CM-En
Parallel (Dhar et al., 2018), PHINC (Srivastava
and Singh, 2020), and the LinCE 2021 multiview
Hinglish dataset1 (Aguilar et al., 2020; Chen et al.,
2022). These resources contain code-mixed sen-
tences together with English (and/or Hindi) transla-
tions.
Since the first two datasets do not provide
Hindi translations, we generate the missing Hindi
sentences using the Google Translate API2
and verify consistency of our results using the
IndicTrans2 model (Gala et al., 2023), special-
ized English-to-Indic translation tool. After ob-
taining the translations, we unify all datasets
into a trilingual corpus consisting of parallel En-
glish, Hindi (Devanagari), and Hindi–English code-
mixed sentences.
Following prior work (Nayak and Joshi, 2022),
we remove duplicate entries and filter out short
sentences with fewer than five tokens to ensure
sufficient code-mixing. The final dataset con-
tains 21,139 aligned sentence triples. Each in-
stance contains three fields: (i) CM, the Roman-
ized Hindi–English code-mixed sentence, (ii) En-
glish, its monolingual English counterpart, and (iii)
Hindi, the corresponding Hindi sentence written
in Devanagari script. Additional dataset details are
provided in Appendix A.
1https://github.com/devanshg27/cm_translation/
tree/main
2https://cloud.google.com/translate?hl=en
The terms alignment accuracy and retrieval accuracy are
used interchangeably.
3

-- 3 of 24 --

4 Experiments:
We design our experiments to systematically evalu-
ate cross-lingual alignment in multilingual encoder
language models using our trilingual corpus. Our
observations are supported by mechanistic inter-
pretability tools to connect observed behaviors to
internal representation dynamics.
Models. We evaluate a set of encoder-based
multilingual language models (MLMs), includ-
ing general-purpose multilingual encoders trained
on more than

Chunk 8 · 1,997 chars

s using our trilingual corpus. Our
observations are supported by mechanistic inter-
pretability tools to connect observed behaviors to
internal representation dynamics.
Models. We evaluate a set of encoder-based
multilingual language models (MLMs), includ-
ing general-purpose multilingual encoders trained
on more than one hundred languages (mBERT
and XLM-R) and their Hindi–English code-mixed
adapted variants from the Hing family (Nayak and
Joshi, 2022) (see Table 2). This setup allows us to
examine how different pretraining strategies influ-
ence cross-lingual alignment behavior.
Model 	Version
mBERT 	bert-base-multilingual-cased
Hing-mBERT 	l3cube-pune/hing-bert
Hing-mBERT-Mixed l3cube-pune/hing-bert-mixed
XLM-R 	xlm-roberta-base
Hing-RoBERTa 	l3cube-pune/hing-roberta
Hing-RoBERTa-Mixed l3cube-pune/hing-roberta-mixed
Table 2: Multilingual encoder models used in our ex-
periments. Hing models denote mBERT and XLM-R
variants further trained on the L3Cube-HingCorpus.
4.1 Cross-Lingual Alignment
Here, we evaluate whether multilingual models
learn structurally consistent representations be-
tween code-mixed text and its parallel monolin-
gual counterparts. Structural consistency refers to
the extent to which representations of parallel sen-
tences occupy nearby regions in the latent space,
independent of downstream task supervision. To
assess this property, we perform representation
similarity analysis across three language pairings:
English–Hindi, English–code-mixed, and Hindi–
code-mixed.
Our core evaluation method is a dot product-
based parallel sentence retrieval protocol. For
each sentence representation hsrc
i in the source lan-
guage, we compute dot products with a pool of
candidate sentence representations in the target lan-
guage consisting of one parallel (positive) sentence
htgt
i and 10 sampled negative sentences {htgt
j }j̸ =i
(Rahamim and Belinkov, 2024). Negative samples
are drawn from a length-normalized candidate pool
using percentile-based sentence

Chunk 9 · 1,995 chars

dot products with a pool of
candidate sentence representations in the target lan-
guage consisting of one parallel (positive) sentence
htgt
i and 10 sampled negative sentences {htgt
j }j̸ =i
(Rahamim and Belinkov, 2024). Negative samples
are drawn from a length-normalized candidate pool
using percentile-based sentence length matching to
avoid trivial mismatches. Formally,
score(hsrc
i , htgt
k ) = hsrc
i · htgt
k (1)
We retrieve the sentence with the highest similarity
score:
ˆk = arg max
k∈{i}∪{j1,...,j10}
score(hsrc
i , htgt
k ) (2)
Retrieval accuracy, defined as the proportion of
queries for which ˆk = i, measures representa-
tional coherence across languages. To verify that
results are not sensitive to the size of the negative
pool, we additionally repeat the evaluation using
100 negatives. Details of the negative sample scal-
ing, length-percentile sampling, and FAISS-based
negative sampling procedure are provided in Ap-
pendix D.
Observations: Here are our observations,
• Off-the-shelf model alignment. Across base
multilingual encoders, the EN–HI pair shows
the strongest alignment (average of forward
and backward) (see Figure 2). Code-mixed
(CM) representations align weakly with both
languages, yielding the ordering EN ↔ HI >
EN ↔ CM > HI ↔ CM . This suggests that
under standard multilingual pretraining, CM
representations occupy a peripheral subspace
that is weakly anchored to either monolingual
language.
• Trade-off between code-mixed and mono-
lingual alignment. Continued pretraining on
Hinglish data produces a systematic trade-off:
(i) EN↔CM alignment improves substantially
(ii) while EN↔HI alignment degrades—a pair
that the model had previously learned to align
well through standard multilingual pretraining.
After code-mixed adaptation, the alignment
ordering inverts to: EN ↔ CM > EN ↔
HI > HI ↔ CM. This occurs because code-
mixed data provides alignment signals primar-
ily between English and code-mixed setting,
causing English representations to be

Chunk 10 · 1,991 chars

viously learned to align
well through standard multilingual pretraining.
After code-mixed adaptation, the alignment
ordering inverts to: EN ↔ CM > EN ↔
HI > HI ↔ CM. This occurs because code-
mixed data provides alignment signals primar-
ily between English and code-mixed setting,
causing English representations to be pulled
toward the code-mixed manifold.
• Script-dependent modulation of Hindi–
code-mixed alignment. When Hindi is
Romanized during pretraining, HI↔CM
alignment degrades (Hing-mBERT: ↓23.33%,
Hing-RoBERTa: ↓64.3% backward). In
contrast, preserving Hindi in Devanagari
4

-- 4 of 24 --

improves alignment (Hing-mBERT-Mixed:
↑36.6%, Hing-RoBERTa-Mixed: ↑92.0%).
These results suggest that script-level fea-
tures influence representation geometry be-
yond lexical overlap, with Devanagari acting
as a stronger cross-lingual anchor for Hindi–
code-mixed alignment.
• Reshaping anchor choice. Beyond pretrain-
ing script choice, code-mixed adaptation also
reshapes query anchor preference at evalua-
tion time. In vanilla mBERT, the model fa-
vors Roman-script & English-language inputs
as the dominant alignment source EN→CM
(55.7%) and EN→HI (72.3%) substantially
outperform their reverse directions. After
Hinglish adaptation, this preference shifts
toward Roman-script & Hindi-language an-
chors: the backward directions (CM→EN:
76.2%, HI→CM: 51.5%) now dominate,
while forward English-as-source directions de-
grade.
• Average layer-wise cross-lingual alignment.
When averaged across layers, base models
(mBERT and XLM-R) show the highest align-
ment for EN↔HI, followed by EN↔CM and
HI↔CM (Figure 4; detailed scores in Ap-
pendix Table 12). After code-mixed adapta-
tion, this ordering typically shifts to EN↔CM
> EN↔HI > HI↔CM, reflecting a drop in
EN↔HI alignment.
Finding
Code-mixed adaptation of multilingual en-
coders improve EN-CM alignment at the
cost of EN-HI. It reorients the anchor choice
for crosslingual alignment.
4.2 Interpretability Analysis
To better

Chunk 11 · 1,987 chars

de-mixed adapta-
tion, this ordering typically shifts to EN↔CM
> EN↔HI > HI↔CM, reflecting a drop in
EN↔HI alignment.
Finding
Code-mixed adaptation of multilingual en-
coders improve EN-CM alignment at the
cost of EN-HI. It reorients the anchor choice
for crosslingual alignment.
4.2 Interpretability Analysis
To better understand cross-lingual alignment
among HI↔CM, EN↔CM, and EN↔HI, we ana-
lyze internal representations of MLMs across lay-
ers. Instead of relying only on final-layer em-
beddings, we examine how cross-lingual structure
evolves across network layers. For this analysis,
we use Centered Kernel Alignment (CKA), which
measures similarity between representation spaces
and correlates well with sentence-level alignment
(Conneau et al., 2020b). This allows us to track
mBERT family
XLM-R family
Figure 2: Directional cross-lingual alignment accuracy
across model families. Solid bars show retrieval from
language A→B, hatched bars show B→A.
mBERT family
XLM-R family
Figure 3: t-SNE visualization of sentence representa-
tions using mBERT and XLM-R-family models.
how monolingual and code-mixed representations
interact across layers.
Layer-wise CKA evolution. Layer-wise CKA
(Figure 5) reveals different alignment trajectories
before and after code-mixed adaptation. In vanilla
multilingual models (mBERT and XLM-R), CKA
similarity typically peaks in the middle layers
(around layers 3–6). In mBERT, EN–HI align-
ment dominates and peaks around layers 6–8 (CKA
∼0.56), while EN–CM and HI–CM similarities
decline after the early layers (CKA ∼0.52 and
∼0.33). This partially contrasts with prior findings
that cross-lingual similarity peaks in early layers
(Conneau et al., 2020b), but aligns with Liu and
Niehues (2025), who report middle-layer transfer
behavior.
5

-- 5 of 24 --

Figure 4: Directional dot-product retrieval accuracy
across encoder models for EN–HI, EN–CM, and
HI–CM, averaged across all layers. Solid bars denote
A→B and hatched bars denote B→A.
mBERT

Chunk 12 · 1,998 chars

yers
(Conneau et al., 2020b), but aligns with Liu and
Niehues (2025), who report middle-layer transfer
behavior.
5

-- 5 of 24 --

Figure 4: Directional dot-product retrieval accuracy
across encoder models for EN–HI, EN–CM, and
HI–CM, averaged across all layers. Solid bars denote
A→B and hatched bars denote B→A.
mBERT family
XLM-R family
Figure 5: Layer-wise CKA alignment for mBERT and
XLM-R family. Each subplot shows cross-lingual repre-
sentation alignment for EN-CM, EN-HI, and HI-CM
across layers.
After continued pretraining on code-mixed data,
the pattern changes. In Hing-mBERT, EN–CM
similarity strengthens and peaks in the middle lay-
ers (CKA ∼0.78 around layers 4–5), while EN–HI
similarity decreases. In the mixed-script variant
(Hing-mBERT-Mixed), EN–CM similarity remains
strong (CKA ∼0.75 at layer 4) and persists into
deeper layers, while EN–HI pair remains compara-
tively weaker. Similar trends appear in the XLM-R
family, where Hing-RoBERTa-Mixed maintains
high similarity (CKA>∼0.50) until approximately
the 9th layer. SVCCA analysis also shows similar
trends (Figure 9). These results indicate that code-
mixed pretraining shifts the representation space
toward stronger EN–CM similarity.
Language reliance in code-mixed understand-
ing. We next analyze token-level language im-
portance using the Relative Importance (RI) score
(Banerjee et al., 2025) (detailed in Appendix E.3).
This analysis measures how much models rely on
English versus Hindi tokens when processing code-
mixed inputs. As shown in Figure 6, all models
assign substantially higher importance to English
tokens. This suggests that code-mixed inputs are
largely interpreted through an English-dominant
semantic space, explaining the stronger EN–CM
Figure 6: Language-wise per-token saliency for encoder-
only models on Hinglish inputs. For each model, we
report the average Relative Importance (RI) score as-
signed to English and Hindi tokens Special tokens (e.g.,
<bos>, <eos>) are excluded.
alignment

Chunk 13 · 1,997 chars

English-dominant
semantic space, explaining the stronger EN–CM
Figure 6: Language-wise per-token saliency for encoder-
only models on Hinglish inputs. For each model, we
report the average Relative Importance (RI) score as-
signed to English and Hindi tokens Special tokens (e.g.,
<bos>, <eos>) are excluded.
alignment compared to HI–CM.
Geometric organization of multilingual rep-
resentations. We further examine representation
geometry using t-SNE projections (Figure 3). In
vanilla mBERT, EN, HI, and CM form clearly sep-
arated clusters. After code-mixed adaptation, EN
and CM representations collapse into a shared re-
gion, while Hindi remains relatively distant. Simi-
lar patterns appear in XLM-R and Hing-RoBERTa.
However, the strongest alignment scores (e.g.,
Hing-RoBERTa-Mixed with 86.6% EN↔CM and
72.7% HI↔CM retrieval) correspond to a different
geometry where CM representations lie between
EN and HI with partial overlap. This suggests that
effective alignment emerges when CM acts as a
bridge between EN and HI.
Entropy-based uncertainty reduction. Finally,
we analyze how monolingual signals explain code-
mixed representations using entropy-based uncer-
tainty reduction. For each layer, we compute the
entropy of CM representations and the reduction
in entropy when conditioned on English, Hindi, or
both languages (details in Appendix E.2). Larger
reductions indicate stronger explanatory power.
Across models, conditioning on English consis-
tently yields greater uncertainty reduction than con-
ditioning on Hindi, indicating that CM represen-
tations are more strongly explained by English.
However, conditioning jointly on both languages
produces the largest reduction. This suggests that
Hindi still provides complementary information for
understanding CM inputs.
These findings highlight the importance of
native-script information. Hindi written in Devana-
gari preserves orthographic and lexical cues that
6

-- 6 of 24 --

are often lost in Romanized CM text, helping

Chunk 14 · 1,994 chars

rgest reduction. This suggests that
Hindi still provides complementary information for
understanding CM inputs.
These findings highlight the importance of
native-script information. Hindi written in Devana-
gari preserves orthographic and lexical cues that
6

-- 6 of 24 --

are often lost in Romanized CM text, helping an-
chor code-mixed representations in the multilingual
space.
mBERT family
XLM-R family
Figure 7: Uncertainty reduction in code-mixed (CM)
representations across encoder layers in MLMs.
Finding
Code-mixed representations are largely
aligned with English in multilingual mod-
els, while native-script Hindi provides com-
plementary signals that help reduce uncer-
tainty.
4.3 Our proposed: Trilingual Post-Training
Alignment Stage
Let D = {(xen
i , xhi
i , xcm
i )}N
i=1 be a trilingual cor-
pus of aligned sentence triples in English, Hindi,
and romanized code-mixed. The corpus is split into
80% training and 20% test data for post-training
and alignment evaluation. We introduce a trilingual
post-training alignment stage that adapts pretrained
multilingual encoders using the objective:
L = Lbase + λ · Lalign (3)
where λ = 0.05 and Lbase denotes the original pre-
training objective of the underlying encoder. For
mBERT, this includes the Masked Language Mod-
eling (MLM) and Next Sentence Prediction (NSP)
losses (Devlin et al., 2019), while for XLM-R it
corresponds to the MLM objective alone (Conneau
et al., 2020a).
For architectures with a sentence-level objec-
tive (e.g., NSP in mBERT), we apply it to cross-
lingual sentence pairs from (xen, xhi), (xen, xcm),
and (xhi, xcm), treating translation pairs as posi-
tives and randomly sampled sentences as negatives.
Models without such objectives (e.g., XLM-R) are
trained using MLM together with the proposed
alignment loss.
Cross-Lingual alignment loss. The key compo-
nent of our method is Lalign, which encourages se-
mantically equivalent sentences across EN, HI, and
CM to occupy nearby regions in the shared

Chunk 15 · 1,997 chars

tences as negatives.
Models without such objectives (e.g., XLM-R) are
trained using MLM together with the proposed
alignment loss.
Cross-Lingual alignment loss. The key compo-
nent of our method is Lalign, which encourages se-
mantically equivalent sentences across EN, HI, and
CM to occupy nearby regions in the shared embed-
ding space. For each example i in a batch of size
B, we obtain ℓ2-normalized [CLS] embeddings:
ei = fθ(xen
i )
∥fθ(xen
i )∥ , hi = fθ(xhi
i )
∥fθ(xhi
i )∥ ,
ci = fθ(xcm
i )
∥fθ(xcm
i )∥
(4)
where fθ(·) is the encoder. The alignment loss is
the mean cosine distance across all three language
pairs:
Lalign = 1
3
X
(u,v)
1
B
B	X
i=1

1 − u⊤
i vi

,
where (u, v) ∈ {(e, h), (e, c), (h, c)}.
(5)
Note that the alignment objective does not ex-
plicitly optimize directional retrieval symmetry or
variance across language pairs. Instead, by jointly
aligning EN, HI, and CM sentence representations,
the loss encourages a shared embedding geome-
try that implicitly promotes more balanced cross-
lingual alignment across language pairs. Rest of
the training details are provided in Appendix B,
and the pseudocode for mBERT training is shown
in Algorithm 1.
Cross-Lingual alignment evaluation. Beyond
forward and backward accuracies, we evalu-
ate whether the model learns a balanced cross-
lingual representation space across the three setups:
EN↔CM, EN↔HI, and HI↔CM. Let A→
xy and
A←
xy denote forward and backward alignment accu-
racies between languages x and y. We define the
CROSS-LINGUAL ALIGNMENT SCORE (CLAS)
as
CLAS = MeanAcc − DirBias − SetupStd.
Here, MeanAcc = 1
6
P
(x,y)(A→
xy + A←
xy) mea-
sures the average bidirectional retrieval accuracy
across language pairs, DirBias = 1
3
P
(x,y) |A→
xy −
A←
xy| captures directional asymmetry between lan-
guages, and SetupStd = Std
 A→
xy +A←
xy
2

measures
performance variance across the three cross-lingual
setups. Since each accuracy term lies in [0, 100],
CLAS is bounded in the range [−150, 100]. Higher
values

Chunk 16 · 1,996 chars

across language pairs, DirBias = 1
3
P
(x,y) |A→
xy −
A←
xy| captures directional asymmetry between lan-
guages, and SetupStd = Std
 A→
xy +A←
xy
2

measures
performance variance across the three cross-lingual
setups. Since each accuracy term lies in [0, 100],
CLAS is bounded in the range [−150, 100]. Higher
values indicate strong bidirectional alignment with
low directional bias and balanced performance
across language pairs.
7

-- 7 of 24 --

EN↔CM 	EN↔HI 	HI↔CM 	CLAS
Model 	→ 	← 	→ 	← 	→ 	←
Last Layer Retrieval
mBERT Family
mBERT 	54.75 42.90 74.87 33.80 26.30 39.05 14.20
mBERT Trilingual 	69.43 63.49 71.52 73.23 54.49 49.69 50.97
Hing-mBERT 	52.78 71.86 32.30 33.43 24.70 28.99 17.01
Hing-mBERT Trilingual 	57.73 62.96 78.05 76.40 53.00 59.70 51.07
Hing-mBERT-Mixed 	54.26 74.63 41.81 60.73 49.51 48.94 34.95
Hing-mBERT-Mixed Trilingual 	54.64 62.42 71.52 70.06 41.96 49.15 42.51
XLM-R Family
XLM-R 	50.56 50.67 58.81 54.33 38.37 40.46 39.53
XLM-R Trilingual 	56.06 54.18 53.43 54.50 48.37 46.93 47.50
Hing-RoBERTa 	64.94 72.14 15.02 27.54 26.19 14.48 	3.74
Hing-RoBERTa Trilingual 	81.43 89.85 69.11 76.70 63.84 62.58 58.98
Hing-RoBERTa-Mixed 	86.95 92.37 83.20 84.98 72.88 77.78 73.09
Hing-RoBERTa-Mixed Trilingual 86.40 93.31 84.65 84.08 73.27 80.96 73.50
Mean Retrieval
mBERT Family
mBERT 	58.77 57.51 59.35 60.15 45.10 42.73 45.35
mBERT Trilingual 	65.15 63.47 71.05 70.55 48.26 49.20 50.98
Hing-mBERT 	65.76 68.57 44.43 49.58 50.64 46.15 40.84
Hing-mBERT Trilingual 	64.87 65.90 58.17 60.18 60.86 59.98 57.67
Hing-mBERT-Mixed 	60.57 62.31 48.89 50.63 56.25 55.72 49.62
Hing-mBERT-Mixed Trilingual 	66.78 67.82 56.11 60.56 58.01 55.44 53.45
XLM-R Family
XLM-R 	67.77 68.34 68.61 68.24 66.24 67.04 66.36
XLM-R Trilingual 	72.26 72.16 71.96 72.11 71.07 70.84 71.02
Hing-RoBERTa 	69.40 68.93 54.33 56.81 55.75 52.45 50.75
Hing-RoBERTa Trilingual 	71.62 73.85 67.55 69.61 64.36 64.21 63.60
Hing-RoBERTa-Mixed 	72.05 70.72 64.73 66.92 65.41 63.02 62.10
Hing-RoBERTa-Mixed Trilingual

Chunk 17 · 1,993 chars

R 	67.77 68.34 68.61 68.24 66.24 67.04 66.36
XLM-R Trilingual 	72.26 72.16 71.96 72.11 71.07 70.84 71.02
Hing-RoBERTa 	69.40 68.93 54.33 56.81 55.75 52.45 50.75
Hing-RoBERTa Trilingual 	71.62 73.85 67.55 69.61 64.36 64.21 63.60
Hing-RoBERTa-Mixed 	72.05 70.72 64.73 66.92 65.41 63.02 62.10
Hing-RoBERTa-Mixed Trilingual 70.41 68.01 65.57 65.87 65.68 62.90 62.51
Table 3: Bidirectional cross-lingual alignment accuracy
(%) using dot-product similarity with 10 negative sam-
ples evaluated on the test split. All scores are average
of three random seeds.
Observations The crosslingual alignment perfor-
mances are presented in Table 3. Here, we present
our observations:
• Our proposed method yields large improve-
ments in CLAS, particularly for mBERT
(14.20 → 50.97) and XLM-R (39.53 →
47.50), indicating reduced directional bias and
more balanced alignment across the three lan-
guage pairs. The alignment objective also mit-
igates the degradation of EN↔HI alignment
observed after code-mixed adaptation.
• After code-mixed adaptation, preserving
cross-lingual capability becomes crucial; our
method improves the Hing models by restor-
ing balanced alignment across languages.
• Similar trends hold when the negative sample
pool is increased to 100 (Table 5). A variant
trained only with parallel supervision (with-
out the alignment loss) also performs compet-
itively (Table 4).
From an interpretability perspective, Figure 8
shows layer-wise CKA alignment where the lan-
guage pairs exhibit closer and more consistently
high alignment across layers.
Downstream task validation. To examine
whether improved alignment translates to practi-
cal benefits, we evaluate the proposed post-training
alignment stage on two code-mixed classification
tasks: sentiment analysis and hate speech detec-
tion. Detailed experimental settings and results
are provided in Appendix F. Overall, the aligned
models, in majority of the cases, improve cross-
lingual prediction consistency across CM, EN, and
HI

Chunk 18 · 1,989 chars

roposed post-training
alignment stage on two code-mixed classification
tasks: sentiment analysis and hate speech detec-
tion. Detailed experimental settings and results
are provided in Appendix F. Overall, the aligned
models, in majority of the cases, improve cross-
lingual prediction consistency across CM, EN, and
HI inputs. This trend holds across both tasks and
multiple model families, suggesting that trilingual
alignment encourages stable multilingual represen-
tations and improves cross-lingual robustness.
Finding
Trilingual supervision improves cross-
lingual alignment across EN–HI–CM and
leads to better cross-lingual performance on
downstream tasks.
5 Conclusion:
In this work, we studied how multilingual encoders
represent Hindi–English code-mixed text and how
these representations relate to their constituent lan-
guages. Our analysis shows that in multilingual
language models (MLMs), code-mixed representa-
tions often lie on the periphery of the representation
space and are weakly anchored to both languages.
Continued adaptation on code-mixed data improves
alignment between English and code-mixed inputs
but can degrade alignment between the original
monolingual languages. Our interpretability anal-
yses further reveal that multilingual models tend
to interpret code-mixed text through an English-
dominant semantic space, while native-script Hindi
provides complementary signals that help reduce
representational uncertainty. Motivated by these
observations, we introduce a simple yet effective
trilingual post-training alignment stage that en-
courages consistent representations across English,
Hindi, and code-mixed inputs. Experiments demon-
strate improved cross-lingual alignment while re-
ducing degradation in EN–HI alignment. As a re-
sult, downstream sentiment and hate speech tasks
also show improved cross-lingual consistency.
Limitations:
Here in this section, we list down the limitations
and future directions of our work. This study fo-
cuses on

Chunk 19 · 1,999 chars

on-
strate improved cross-lingual alignment while re-
ducing degradation in EN–HI alignment. As a re-
sult, downstream sentiment and hate speech tasks
also show improved cross-lingual consistency.
Limitations:
Here in this section, we list down the limitations
and future directions of our work. This study fo-
cuses on Hindi–English code-mixing due to our
linguistic expertise. While this pair serves as a rep-
8

-- 8 of 24 --

resentative case study, the mechanisms we analyze
are expected to generalize to other code-mixed lan-
guage pairs. Future work can extend the proposed
analysis and post-training alignment approach to
additional languages to further validate these find-
ings across diverse multilingual settings.
Our post-training alignment stage uses a rela-
tively small trilingual corpus (21K sentence triples).
Although the results show improved alignment,
larger and more diverse code-mixed corpora may
lead to stronger and more stable gains. Future work
could scale the alignment stage with larger datasets
or explore data augmentation strategies for gener-
ating additional code-mixed samples.
Our study focuses on encoder-based multilingual
models (e.g., mBERT, XLM-R). While these mod-
els remain widely used for multilingual representa-
tion learning, extending the proposed alignment ap-
proach to other encoders and larger decoder-based
multilingual LLMs remains an open direction for
future work.
Another limitation arises from the use of auto-
matic translations when constructing the trilingual
corpus and preparing certain evaluation data. Al-
though machine translation systems provide reason-
able approximations, they may not fully preserve
the linguistic nuances, stylistic variations, or prag-
matic cues present in the original text. This issue
is particularly relevant for code-mixed language,
where subtle interactions between languages may
be altered during translation. Future work could
address this limitation by incorporating human-
verified translations or

Chunk 20 · 1,988 chars

guistic nuances, stylistic variations, or prag-
matic cues present in the original text. This issue
is particularly relevant for code-mixed language,
where subtle interactions between languages may
be altered during translation. Future work could
address this limitation by incorporating human-
verified translations or collecting larger trilingual
corpora directly from multilingual speakers.
Ethical considerations:
This work studies social media text and may con-
tain potentially offensive or harmful language
present in the original datasets. These instances
are included solely for research purposes in order
to study linguistic phenomena in code-mixed set-
tings, and the authors do not endorse or promote
any harmful or abusive content. AI-based writing
assistants were used for grammar checking and
language polishing during manuscript preparation.
References
Gustavo Aguilar, Sudipta Kar, and Thamar Solorio.
2020. LinCE: A centralized benchmark for linguis-
tic code-switching evaluation. In Proceedings of the
Twelfth Language Resources and Evaluation Confer-
ence, pages 1803–1813, Marseille, France. European
Language Resources Association.
Ramakrishna Appicharla, Kamal Kumar Gupta, Asif
Ekbal, and Pushpak Bhattacharyya. 2025. Improv-
ing neural machine translation through code-mixed
data augmentation. Computational Intelligence,
41(2):e70033.
Somnath Banerjee, Pratyush Chatterjee, Shanu Kumar,
Sayan Layek, Parag Agrawal, Rima Hazra, and Ani-
mesh Mukherjee. 2025. Attributional safety failures
in large language models under code-mixed perturba-
tions. arXiv preprint arXiv:2505.14469.
Aditya Bohra, Deepanshu Vijay, Vinay Singh, Syed Sar-
faraz Akhtar, and Manish Shrivastava. 2018. A
dataset of Hindi-English code-mixed social media
text for hate speech detection. In Proceedings of
the Second Workshop on Computational Modeling
of People’s Opinions, Personality, and Emotions in
Social Media, pages 36–41, New Orleans, Louisiana,
USA. Association for Computational

Chunk 21 · 1,995 chars

htar, and Manish Shrivastava. 2018. A
dataset of Hindi-English code-mixed social media
text for hate speech detection. In Proceedings of
the Second Workshop on Computational Modeling
of People’s Opinions, Personality, and Emotions in
Social Media, pages 36–41, New Orleans, Louisiana,
USA. Association for Computational Linguistics.
Shuguang Chen, Gustavo Aguilar, Anirudh Srinivasan,
Mona Diab, and Thamar Solorio. 2022. Calcs 2021
shared task: Machine translation for code-switched
data. arXiv preprint arXiv:2202.09625.
Alexis Conneau, Kartikay Khandelwal, Naman Goyal,
Vishrav Chaudhary, Guillaume Wenzek, Francisco
Guzmán, Edouard Grave, Myle Ott, Luke Zettle-
moyer, and Veselin Stoyanov. 2020a. Unsupervised
cross-lingual representation learning at scale. In Pro-
ceedings of the 58th Annual Meeting of the Asso-
ciation for Computational Linguistics, pages 8440–
8451, Online. Association for Computational Lin-
guistics.
Alexis CONNEAU and Guillaume Lample. 2019.
Cross-lingual language model pretraining. In Ad-
vances in Neural Information Processing Systems,
volume 32. Curran Associates, Inc.
Alexis Conneau, Shijie Wu, Haoran Li, Luke Zettle-
moyer, and Veselin Stoyanov. 2020b. Emerging
cross-lingual structure in pretrained language mod-
els. In Proceedings of the 58th Annual Meeting of
the Association for Computational Linguistics, pages
6022–6034, Online. Association for Computational
Linguistics.
Andrea Gregor de Varda and Marco Marelli. 2024. The
emergence of semantic units in massively multilin-
gual models. In Proceedings of the 2024 Joint In-
ternational Conference on Computational Linguis-
tics, Language Resources and Evaluation (LREC-
COLING 2024), pages 15910–15921, Torino, Italia.
ELRA and ICCL.
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and
Kristina Toutanova. 2019. BERT: Pre-training of
deep bidirectional transformers for language under-
standing. In Proceedings of the 2019 Conference of
the North American Chapter of the Association for
9

-- 9 of 24 --

Chunk 22 · 1,998 chars

24), pages 15910–15921, Torino, Italia.
ELRA and ICCL.
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and
Kristina Toutanova. 2019. BERT: Pre-training of
deep bidirectional transformers for language under-
standing. In Proceedings of the 2019 Conference of
the North American Chapter of the Association for
9

-- 9 of 24 --

Computational Linguistics: Human Language Tech-
nologies, Volume 1 (Long and Short Papers), pages
4171–4186, Minneapolis, Minnesota. Association for
Computational Linguistics.
Mrinal Dhar, Vaibhav Kumar, and Manish Shrivastava.
2018. Enabling code-mixed translation: Parallel cor-
pus creation and MT augmentation approach. In
Proceedings of the First Workshop on Linguistic Re-
sources for Natural Language Processing, pages 131–
140, Santa Fe, New Mexico, USA. Association for
Computational Linguistics.
Jay Gala, Pranjal A Chitale, A K Raghavan, Varun
Gumma, Sumanth Doddapaneni, Aswanth Kumar M,
Janki Atul Nawale, Anupama Sujatha, Ratish Pudup-
pully, Vivek Raghavan, Pratyush Kumar, Mitesh M
Khapra, Raj Dabre, and Anoop Kunchukuttan. 2023.
Indictrans2: Towards high-quality and accessible ma-
chine translation models for all 22 scheduled indian
languages. Transactions on Machine Learning Re-
search.
Devansh Gautam, Kshitij Gupta, and Manish Shrivas-
tava. 2021. Translate and classify: Improving se-
quence level classification for English-Hindi code-
mixed data. In Proceedings of the Fifth Workshop
on Computational Approaches to Linguistic Code-
Switching, pages 15–25, Online. Association for
Computational Linguistics.
Katharina Hämmerl, Jindˇrich Libovický, and Alexan-
der Fraser. 2024. Understanding cross-lingual
Alignment—A survey. In Findings of the Associa-
tion for Computational Linguistics: ACL 2024, pages
10922–10943, Bangkok, Thailand. Association for
Computational Linguistics.
Jiwoo Hong, Noah Lee, Rodrigo Martínez-Castaño,
César Rodríguez, and James Thorne. 2025. Cross-
lingual transfer of reward models in multilingual
alignment. In Proceedings

Chunk 23 · 1,998 chars

ings of the Associa-
tion for Computational Linguistics: ACL 2024, pages
10922–10943, Bangkok, Thailand. Association for
Computational Linguistics.
Jiwoo Hong, Noah Lee, Rodrigo Martínez-Castaño,
César Rodríguez, and James Thorne. 2025. Cross-
lingual transfer of reward models in multilingual
alignment. In Proceedings of the 2025 Conference
of the Nations of the Americas Chapter of the Asso-
ciation for Computational Linguistics: Human Lan-
guage Technologies (Volume 2: Short Papers), pages
82–94, Albuquerque, New Mexico. Association for
Computational Linguistics.
Sai Muralidhar Jayanthi, Kavya Nerella, Khy-
athi Raghavi Chandu, and Alan W Black. 2021.
CodemixedNLP: An extensible and open NLP toolkit
for code-mixing. In Proceedings of the Fifth Work-
shop on Computational Approaches to Linguistic
Code-Switching, pages 113–118, Online. Associa-
tion for Computational Linguistics.
Jeff Johnson, Matthijs Douze, and Herve Jegou. 2021.
Billion-Scale Similarity Search with GPUs . IEEE
Transactions on Big Data, 7(03):535–547.
Amir Hossein Kargaran, Ali Modarressi, Nafiseh
Nikeghbal, Jana Diesner, François Yvon, and Hinrich
Schuetze. 2025. MEXA: Multilingual evaluation of
English-centric LLMs via cross-lingual alignment.
In Findings of the Association for Computational
Linguistics: ACL 2025, pages 27001–27023, Vienna,
Austria. Association for Computational Linguistics.
Simon Kornblith, Mohammad Norouzi, Honglak Lee,
and Geoffrey Hinton. 2019. Similarity of neural net-
work representations revisited. In Proceedings of
the 36th International Conference on Machine Learn-
ing, volume 97 of Proceedings of Machine Learning
Research, pages 3519–3529. PMLR.
Saurabh Kulshreshtha, Jose Luis Redondo Garcia, and
Ching-Yun Chang. 2020. Cross-lingual alignment
methods for multilingual BERT: A comparative
study. In Findings of the Association for Compu-
tational Linguistics: EMNLP 2020, pages 933–942,
Online. Association for Computational Linguistics.
Vishwajeet Kumar, Rudra Murthy, and

Chunk 24 · 1,992 chars

ulshreshtha, Jose Luis Redondo Garcia, and
Ching-Yun Chang. 2020. Cross-lingual alignment
methods for multilingual BERT: A comparative
study. In Findings of the Association for Compu-
tational Linguistics: EMNLP 2020, pages 933–942,
Online. Association for Computational Linguistics.
Vishwajeet Kumar, Rudra Murthy, and Tejas Dhamecha.
2022. On utilizing constituent language resources
to improve downstream tasks in Hinglish. In Find-
ings of the Association for Computational Linguistics:
EMNLP 2022, pages 3859–3865, Abu Dhabi, United
Arab Emirates. Association for Computational Lin-
guistics.
Danni Liu and Jan Niehues. 2025. Middle-layer rep-
resentation alignment for cross-lingual transfer in
fine-tuned LLMs. In Proceedings of the 63rd An-
nual Meeting of the Association for Computational
Linguistics (Volume 1: Long Papers), pages 15979–
15996, Vienna, Austria. Association for Computa-
tional Linguistics.
Debajyoti Mazumder, Aakash Kumar, and Jasabanta
Patro. 2025a. Improving code-mixed hate detection
by native sample mixing: A case study for hindi-
english code-mixed scenario. ACM Trans. Asian
Low-Resour. Lang. Inf. Process., 24(5).
Debajyoti Mazumder, Aakash Kumar, and Jasabanta Pa-
tro. 2025b. Revealing the impact of synthetic native
samples and multi-tasking strategies in Hindi-English
code-mixed humour and sarcasm detection. In Find-
ings of the Association for Computational Linguistics:
EMNLP 2025, pages 24077–24107, Suzhou, China.
Association for Computational Linguistics.
Ari S. Morcos, Maithra Raghu, and Samy Bengio.
2018. Insights on representational similarity in neu-
ral networks with canonical correlation. In Proceed-
ings of the 32nd International Conference on Neu-
ral Information Processing Systems, NIPS’18, page
5732–5741, Red Hook, NY, USA. Curran Associates
Inc.
Ravindra Nayak and Raviraj Joshi. 2022. L3Cube-
HingCorpus and HingBERT: A code mixed Hindi-
English dataset and BERT language models. In
Proceedings of the WILDRE-6 Workshop within

Chunk 25 · 1,991 chars

d International Conference on Neu-
ral Information Processing Systems, NIPS’18, page
5732–5741, Red Hook, NY, USA. Curran Associates
Inc.
Ravindra Nayak and Raviraj Joshi. 2022. L3Cube-
HingCorpus and HingBERT: A code mixed Hindi-
English dataset and BERT language models. In
Proceedings of the WILDRE-6 Workshop within the
13th Language Resources and Evaluation Confer-
ence, pages 7–12, Marseille, France. European Lan-
guage Resources Association.
Karin Niederreiter and Dagmar Gromann. 2025. Word-
level detection of code-mixed hate speech with mul-
tilingual domain transfer. In Findings of the Asso-
ciation for Computational Linguistics: ACL 2025,
10

-- 10 of 24 --

pages 21093–21104, Vienna, Austria. Association
for Computational Linguistics.
Tolúlo. p e. ´ Ògúnre. `mí, Christopher D. Manning, and
Dan Jurafsky. 2023. Multilingual self-supervised
speech representations improve the speech recog-
nition of low-resource African languages with
codeswitching. In Proceedings of the 6th Workshop
on Computational Approaches to Linguistic Code-
Switching, pages 83–88, Singapore. Association for
Computational Linguistics.
Filippo Pallucchini, Lorenzo Malandri, Fabio Merco-
rio, and Mario Mezzanzanica. 2025. Lost in align-
ment: A survey on cross-lingual alignment meth-
ods for contextualized representation. ACM Comput.
Surv., 58(5).
Parth Patwa, Gustavo Aguilar, Sudipta Kar, Suraj
Pandey, Srinivas PYKL, Björn Gambäck, Tanmoy
Chakraborty, Thamar Solorio, and Amitava Das.
2020. SemEval-2020 task 9: Overview of sentiment
analysis of code-mixed tweets. In Proceedings of the
Fourteenth Workshop on Semantic Evaluation, pages
774–790, Barcelona (online). International Commit-
tee for Computational Linguistics.
Maithra Raghu, Justin Gilmer, Jason Yosinski, and
Jascha Sohl-Dickstein. 2017. Svcca: singular vec-
tor canonical correlation analysis for deep learning
dynamics and interpretability. In Proceedings of the
31st International Conference on Neural Information
Processing

Chunk 26 · 1,970 chars

ational Commit-
tee for Computational Linguistics.
Maithra Raghu, Justin Gilmer, Jason Yosinski, and
Jascha Sohl-Dickstein. 2017. Svcca: singular vec-
tor canonical correlation analysis for deep learning
dynamics and interpretability. In Proceedings of the
31st International Conference on Neural Information
Processing Systems, NIPS’17, page 6078–6087, Red
Hook, NY, USA. Curran Associates Inc.
Adir Rahamim and Yonatan Belinkov. 2024. Con-
traSim – analyzing neural representations based on
contrastive learning. In Proceedings of the 2024
Conference of the North American Chapter of the
Association for Computational Linguistics: Human
Language Technologies (Volume 1: Long Papers),
pages 6325–6339, Mexico City, Mexico. Association
for Computational Linguistics.
Lucas Resck, Isabelle Augenstein, and Anna Korhonen.
2025. Explainability and interpretability of multilin-
gual large language models: A survey. In Proceed-
ings of the 2025 Conference on Empirical Methods in
Natural Language Processing, pages 20454–20486,
Suzhou, China. Association for Computational Lin-
guistics.
Anna Rogers, Olga Kovaleva, and Anna Rumshisky.
2020. A primer in BERTology: What we know about
how BERT works. Transactions of the Association
for Computational Linguistics, 8:842–866.
Sebastin Santy, Anirudh Srinivasan, and Monojit Choud-
hury. 2021. BERTologiCoMix: How does code-
mixing interact with multilingual BERT? In Proceed-
ings of the Second Workshop on Domain Adaptation
for NLP, pages 111–121, Kyiv, Ukraine. Association
for Computational Linguistics.
Bhavani Shankar, Preethi Jyothi, and Pushpak Bhat-
tacharyya. 2024. In-context mixing (ICM): Code-
mixed prompts for multilingual LLMs. In Proceed-
ings of the 62nd Annual Meeting of the Association
for Computational Linguistics (Volume 1: Long Pa-
pers), pages 4162–4176, Bangkok, Thailand. Associ-
ation for Computational Linguistics.
Kogilavani Shanmugavadivel, Sowbharanika J S,
Navbila K, and Malliga Subramanian.

Chunk 27 · 1,993 chars

mixed prompts for multilingual LLMs. In Proceed-
ings of the 62nd Annual Meeting of the Association
for Computational Linguistics (Volume 1: Long Pa-
pers), pages 4162–4176, Bangkok, Thailand. Associ-
ation for Computational Linguistics.
Kogilavani Shanmugavadivel, Sowbharanika J S,
Navbila K, and Malliga Subramanian. 2024.
Code_Makers@DravidianLangTech-EACL 2024 :
Sentiment analysis in code-mixed Tamil using ma-
chine learning techniques. In Proceedings of the
Fourth Workshop on Speech, Vision, and Language
Technologies for Dravidian Languages, pages 129–
133, St. Julian’s, Malta. Association for Computa-
tional Linguistics.
Rajvee Sheth, Samridhi Raj Sinha, Mahavir Patil, Hi-
manshu Beniwal, and Mayank Singh. 2025. Be-
yond monolingual assumptions: A survey of code-
switched nlp in the era of large language models.
arXiv e-prints, pages arXiv–2510.
Vivek Srivastava and Mayank Singh. 2020. PHINC:
A parallel Hinglish social media code-mixed cor-
pus for machine translation. In Proceedings of the
Sixth Workshop on Noisy User-generated Text (W-
NUT 2020), pages 41–49, Online. Association for
Computational Linguistics.
Mingyang Wang, Lukas Lange, Heike Adel, Yunpu
Ma, Jannik Strötgen, and Hinrich Schuetze. 2025.
Language mixing in reasoning language models: Pat-
terns, impact, and internal causes. In Proceedings of
the 2025 Conference on Empirical Methods in Natu-
ral Language Processing, pages 2637–2665, Suzhou,
China. Association for Computational Linguistics.
Genta Indra Winata, Samuel Cahyawijaya, Zihan Liu,
Zhaojiang Lin, Andrea Madotto, and Pascale Fung.
2021. Are multilingual models effective in code-
switching? In Proceedings of the Fifth Workshop
on Computational Approaches to Linguistic Code-
Switching, pages 142–153, Online. Association for
Computational Linguistics.
John Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Dur-
rani, Fahim Dalvi, and James Glass. 2020. Similar-
ity analysis of contextual word representation mod-
els. In Proceedings of the 58th

Chunk 28 · 1,998 chars

rkshop
on Computational Approaches to Linguistic Code-
Switching, pages 142–153, Online. Association for
Computational Linguistics.
John Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Dur-
rani, Fahim Dalvi, and James Glass. 2020. Similar-
ity analysis of contextual word representation mod-
els. In Proceedings of the 58th Annual Meeting of
the Association for Computational Linguistics, pages
4638–4655, Online. Association for Computational
Linguistics.
Haneul Yoo, Cheonbok Park, Sangdoo Yun, Alice Oh,
and Hwaran Lee. 2025. Code-switching curriculum
learning for multilingual transfer in LLMs. In Find-
ings of the Association for Computational Linguis-
tics: ACL 2025, pages 7816–7836, Vienna, Austria.
Association for Computational Linguistics.
Ruochen Zhang, Samuel Cahyawijaya, Jan Chris-
tian Blaise Cruz, Genta Winata, and Alham Fikri
Aji. 2023. Multilingual large language models are
11

-- 11 of 24 --

not (yet) code-switchers. In Proceedings of the 2023
Conference on Empirical Methods in Natural Lan-
guage Processing, pages 12567–12582, Singapore.
Association for Computational Linguistics.
Appendix
A Additional Dataset Details
A.1 CM-En parallel (Dhar et al., 2018)
We used the English–Hindi code-mixed parallel
corpus introduced by Dhar et al. (2018). The
dataset contained 6,096 English–Hindi code-mixed
sentences paired with gold-standard English trans-
lations, which are produced by fluent bilingual an-
notators and have achieved high inter-annotator
agreement (Fleiss’ κ ≈ 0.88). The sentences are
written in Roman script and reflect informal so-
cial communication. The corpus showed a Code-
Mixing Index (CMI) of 30.5, indicating a substan-
tial degree of code-mixing.
A.2 PHINC: Parallel Hinglish Social Media
Corpus (Srivastava and Singh, 2020)
The second dataset we integrated was the PHINC
corpus (Srivastava and Singh, 2020). The dataset
contained 13,738 Hindi–English code-mixed sen-
tences, each paired with a parallel human-written
English translation. The sentences

Chunk 29 · 1,991 chars

code-mixing.
A.2 PHINC: Parallel Hinglish Social Media
Corpus (Srivastava and Singh, 2020)
The second dataset we integrated was the PHINC
corpus (Srivastava and Singh, 2020). The dataset
contained 13,738 Hindi–English code-mixed sen-
tences, each paired with a parallel human-written
English translation. The sentences are collected
from social media platforms such as Twitter (now
X) and Facebook. Thus similar to the previous one,
the dataset reflected an informal style of writing.
The corpus is curated through extensive filtering
and manually annotated by 54 expert-level anno-
tators, with each annotator labeling 400 randomly
selected samples. The code-mixed variant here ex-
hibited a relatively higher degree of code-mixing
with CMI being 75.76.
A.3 LinCE Multiview Hinglish Dataset
(Aguilar et al., 2020)
We also used the Hindi–English code-mixed dataset
from the LinCE benchmark1 introduced by Aguilar
et al. (2020); Chen et al. (2022). It contained 8060
samples crawled from wikipedia article about a
particular movie. This subset is compiled by con-
solidating and re-curating existing Hindi–English
code-mixed corpora collected from social media
platforms such as Twitter (now X) and Facebook.
Each instance included (i) an English sentence, (ii)
its Hindi translation in Devanagari script, (iii) a
Hindi (Roman-script) - English code-mixed form,
and (iv) a Hindi (Devanagari-script) - English code-
mixed variant along with token-level language tags.
Here, the code-mixed corpus showed a CMI of
22.68.
B Trilingual Training Details
We post-train the base multilingual encoders
(mBERT and XLM-R) using the proposed trilin-
gual alignment objective on the curated English–
Hindi–code-mixed corpus. The dataset is split into
80% training and 20% test data. Models are trained
for up to 10 epochs using the AdamW optimizer
with a batch size of 16. We explore learning rates
from the set {5e−6, 6e−6, 6.5e−6, 1e−5} and
apply a linear warmup schedule over the first 10%
of the total

Chunk 30 · 1,997 chars

ated English–
Hindi–code-mixed corpus. The dataset is split into
80% training and 20% test data. Models are trained
for up to 10 epochs using the AdamW optimizer
with a batch size of 16. We explore learning rates
from the set {5e−6, 6e−6, 6.5e−6, 1e−5} and
apply a linear warmup schedule over the first 10%
of the total training steps. During training, we
follow the standard BERT masked language mod-
eling (MLM) protocol with 15% token masking,
where 80% of the selected tokens are replaced with
[MASK], 10% with random tokens, and 10% are
left unchanged. The alignment loss weight λ is
tuned over {0.05, 0.10, 0.15}. Each configuration
is trained with three random seeds, and we report
the average performance across runs. Although
training loss continues to decrease across epochs,
improvements in cross-lingual alignment become
marginal in later stages. Based on alignment per-
formance on the validation split, we select the 6th
epoch for XLM-R and the 8th epoch for mBERT
for final evaluation.
C Additional Results
This section presents additional results supporting
our analysis. We include an ablation study in which
the alignment loss is removed from the training ob-
jective to examine its impact on cross-lingual align-
ment. Table 4 reports the corresponding CLAS
values. Even without the alignment loss, train-
ing with parallel trilingual supervision alone yields
slightly better cross-lingual alignment compared
to the base models, in most of the cases. We also
report perplexity scores on the code-mixed test set
in Table 6. The perplexity of the trilingual models
remains lower than, or comparable to, that of the
base models, indicating that the alignment objec-
tive does not adversely affect language modeling
performance on code-mixed inputs. Finally, Fig-
ure 8 shows the layer-wise CKA representations of
the model trained without the alignment loss.
12

-- 12 of 24 --

EN↔CM 	EN↔HI 	HI↔CM 	CLAS
Model 	→ 	← 	→ 	← 	→ 	←
Last Layer Retrieval
mBERT Family
mBERT 	54.75 42.90

Chunk 31 · 1,998 chars

ec-
tive does not adversely affect language modeling
performance on code-mixed inputs. Finally, Fig-
ure 8 shows the layer-wise CKA representations of
the model trained without the alignment loss.
12

-- 12 of 24 --

EN↔CM 	EN↔HI 	HI↔CM 	CLAS
Model 	→ 	← 	→ 	← 	→ 	←
Last Layer Retrieval
mBERT Family
mBERT 	54.75 42.90 74.87 33.80 26.30 39.05 14.20
mBERT Trilingual 69.43 63.49 71.52 73.23 54.49 49.69 50.97
–w/o Alignment 52.90 47.49 66.83 30.40 22.91 36.55 15.06
XLM-R Family
XLM-R 	50.56 50.67 58.81 54.33 38.37 40.46 39.53
XLM-R Trilingual 56.06 54.18 53.43 54.50 48.37 46.93 47.50
–w/o Alignment 55.92 54.23 63.94 60.96 42.92 43.14 43.88
Mean Retrieval
mBERT Family
mBERT 	58.60 57.42 59.37 60.10 44.99 42.65 45.35
mBERT Trilingual 65.15 63.47 71.05 70.55 48.26 49.20 50.98
–w/o Alignment 55.94 53.24 68.33 63.32 42.74 43.10 42.40
XLM-R Family
XLM-R 	67.77 68.34 68.61 68.24 66.24 67.04 66.36
XLM-R Trilingual 72.26 72.16 71.96 72.11 71.07 70.84 71.02
–w/o Alignment 72.62 71.50 73.06 72.30 71.09 70.62 70.32
Table 4: Bidirectional alignment scores using dot prod-
uct similarity. Last layer accuracy (%), mean accuracy
(%) and CLAS (%) are reported for model families eval-
uated on the test split. All scores are average of three
random seeds.
EN↔CM 	EN↔HI 	HI↔CM 	CLAS
Model 	→ 	← 	→ 	← 	→ 	←
Last Layer Retrieval
mBERT Family
mBERT 	36.46 28.11 52.07 16.41 11.66 19.06 	1.68
mBERT Trilingual 	51.73 46.61 57.66 60.80 36.02 31.57 32.70
Hing-mBERT 	35.11 46.36 26.85 14.70 12.03 10.72 	3.82
Hing-mBERT Trilingual 	27.70 42.55 53.19 51.80 16.84 24.92 15.10
Hing-mBERT-Mixed 	38.86 57.58 23.51 43.53 32.36 29.23 15.91
Hing-mBERT-Mixed Trilingual 	31.91 42.67 62.98 62.13 30.13 39.83 	25.35
XLM-R Family
XLM-R 	30.62 30.27 36.73 34.23 19.76 20.65 21.11
XLM-R Trilingual 	47.72 45.17 46.19 47.07 40.28 38.60 39.12
Hing-RoBERTa 	50.43 57.43 	3.64 	10.10 	9.09 	3.35 	-6.39
Hing-RoBERTa Trilingual 	67.55 78.90 50.54 61.75 44.68 44.25 38.47
Hing-RoBERTa-Mixed 	72.58 81.60 71.84 74.72 58.15 63.56

Chunk 32 · 1,996 chars

.13 39.83 	25.35
XLM-R Family
XLM-R 	30.62 30.27 36.73 34.23 19.76 20.65 21.11
XLM-R Trilingual 	47.72 45.17 46.19 47.07 40.28 38.60 39.12
Hing-RoBERTa 	50.43 57.43 	3.64 	10.10 	9.09 	3.35 	-6.39
Hing-RoBERTa Trilingual 	67.55 78.90 50.54 61.75 44.68 44.25 38.47
Hing-RoBERTa-Mixed 	72.58 81.60 71.84 74.72 58.15 63.56 57.70
Hing-RoBERTa-Mixed Trilingual 85.43 93.35 86.07 83.99 71.33 82.71 71.56
Mean Retrieval
mBERT Family
mBERT 	34.70 32.39 44.49 44.24 22.74 20.85 22.54
mBERT Trilingual 	49.43 48.10 55.46 55.24 38.55 38.10 39.81
Hing-mBERT 	50.28 52.38 29.70 34.26 35.12 31.00 26.32
Hing-mBERT Trilingual 	51.18 53.00 40.22 43.82 41.10 38.96 36.91
Hing-mBERT-Mixed 	42.61 45.22 30.04 33.87 38.80 36.85 30.22
Hing-mBERT-Mixed Trilingual 	47.02 48.99 40.45 42.75 43.21 42.33 39.62
XLM-R Family
XLM-R 	56.97 58.12 57.11 57.05 55.53 56.49 55.51
XLM-R Trilingual 	63.37 63.41 63.06 63.50 62.57 62.24 62.32
Hing-RoBERTa 	53.69 56.13 36.62 38.38 35.28 35.65 32.28
Hing-RoBERTa Trilingual 	60.57 62.91 57.25 58.17 51.59 53.74 51.86
Hing-RoBERTa-Mixed 	59.95 59.37 52.45 54.55 53.61 51.50 50.49
Hing-RoBERTa-Mixed Trilingual 70.26 68.66 64.61 66.77 66.28 62.84 62.08
Table 5: Bidirectional cross-lingual alignment accuracy
(%) using dot-product similarity with 100 negative sam-
ples evaluated on the test split. All scores are average
of three random seeds.
Model Family Model 	PPL (CM)
mBERT
mBERT 	462.41
mBERT Trilingual 	119.28
Hing-mBERT 	10.98
Hing-mBERT Trilingual 	8.08
Hing-mBERT-Mixed 	10.40
Hing-mBERT-Mixed Trilingual 	8.37
XLM-R
XLM-R 	52.54
XLM-R Trilingual 	24.85
Hing-RoBERTa 	16.19
Hing-RoBERTa Trilingual 	9.19
Hing-RoBERTa-Mixed 	16.41
Hing-RoBERTa-Mixed Trilingual 13.97
Table 6: Perplexity scores computed on the code-mixed
test set.
D Negative Sample Scaling and Sampling
Strategies
This section provides additional implementation
details for the cross-lingual retrieval evaluation de-
scribed in Section 4.1. In particular, we describe
how candidate pools are constructed for

Chunk 33 · 1,993 chars

ngual 13.97
Table 6: Perplexity scores computed on the code-mixed
test set.
D Negative Sample Scaling and Sampling
Strategies
This section provides additional implementation
details for the cross-lingual retrieval evaluation de-
scribed in Section 4.1. In particular, we describe
how candidate pools are constructed for the re-
trieval task and the negative sampling strategies
used to generate challenging distractors. For each
query sentence representation in the source lan-
guage, the candidate set consists of the correct par-
allel sentence (positive sample) together with a set
of negative sentences drawn from the target lan-
guage corpus. Unless otherwise stated, the main
experiments use ten negative samples per query.
We further analyze the robustness of the retrieval
protocol by scaling the number of negatives to 100
and by evaluating different sampling strategies for
constructing the negative pool.
D.1 Scaling negative samples
To examine whether our findings are sensitive to
the size of the negative candidate pool in the re-
trieval protocol, we repeat the cross-lingual align-
ment evaluation using 100 randomly sampled neg-
ative sentences for each query instead of 10. This
increases the difficulty of the retrieval task and pro-
vides a stricter test of representational alignment.
The resulting alignment accuracies are presented in
Table 5. Overall, the trends remain consistent with
our earlier results, indicating that our observations
are robust to larger negative pools.
D.2 Length-Aware Percentile Sampling
The first strategy constructs candidate pools using
a length-aware sampling procedure. For each sen-
tence in the dataset, we compute the token length
and determine its percentile rank within the cor-
13

-- 13 of 24 --

pus. Given a query sentence with percentile rank
p, candidate negatives are restricted to sentences
whose percentile ranks fall within a window of
±5 percentiles around p. This constraint ensures
that negative examples have comparable

Chunk 34 · 1,994 chars

e token length
and determine its percentile rank within the cor-
13

-- 13 of 24 --

pus. Given a query sentence with percentile rank
p, candidate negatives are restricted to sentences
whose percentile ranks fall within a window of
±5 percentiles around p. This constraint ensures
that negative examples have comparable sentence
lengths, reducing trivial cues that could make re-
trieval artificially easy.
From this candidate pool, ten negative sentences
are sampled uniformly at random while excluding
the true parallel sentence. This approach creates re-
trieval settings where negative examples are length-
matched but otherwise randomly selected.
Layerwise retrieval accuracies obtained using
this sampling strategy are reported in Table 12.
D.3 FAISS-Based Hard Negative Sampling
To construct more challenging retrieval scenarios,
we additionally employ a FAISS(Johnson et al.,
2021) based negative sampling strategy. In this set-
ting, candidate pools are first constructed using the
same length-aware percentile filtering described
above. However, instead of sampling negatives
uniformly, we use the final-layer sentence represen-
tations to guide the sampling process.
Specifically, cosine similarities between the
query representation and all candidate sentences
are computed using a FAISS inner-product in-
dex. Negative examples are then sampled prob-
abilistically based on these similarity scores, with
lower-similarity sentences receiving higher sam-
pling probability. This procedure increases the like-
lihood of selecting semantically similar distractors
while still avoiding trivial matches.
Layerwise retrieval results obtained using
FAISS-based sampling are reported in Table 13.
These results provide a more stringent evaluation
of cross-lingual alignment by introducing harder
negative examples.
E Representation Similarity Analysis
We now formalize the representation-level metrics
and experimental protocols used in our analysis,
which operationalize the hypotheses and

Chunk 35 · 1,995 chars

are reported in Table 13.
These results provide a more stringent evaluation
of cross-lingual alignment by introducing harder
negative examples.
E Representation Similarity Analysis
We now formalize the representation-level metrics
and experimental protocols used in our analysis,
which operationalize the hypotheses and research
questions outlined in the preceding section.
E.1 Representation Similarity Metrics
We employ two complementary representation sim-
ilarity measures Centered Kernel Alignment (CKA)
and Singular Vector Canonical Correlation Anal-
ysis (SVCCA) to quantify layer-wise alignment
between sentence representations across languages.
Algorithm 1 Trilingual Post-training Alignment of
mBERT
1: Input: Trilingual dataset D =
{(xen
i , xhi
i , xcm
i )}N
i=1, pretrained encoder
fθ, hyperparameters λ, T
2: Output: Trilingual-aligned encoder fθ
3: for epoch = 1 to T do
4: for mini-batch B ⊂ D do
MLM & NSP:
5: Sample sentence pair from
{(en, hi), (en, cm), (hi, cm)}
6: Construct positive/negative NSP pair
and apply token masking
7: Compute LM LM and LN SP using fθ
Cross-lingual alignment:
8: Encode each language independently:
9: ei ← ℓ2-normalize(fθ(xen
i )[CLS])
10: hi ← ℓ2-normalize(fθ(xhi
i )[CLS])
11: ci ← ℓ2-normalize(fθ(xcm
i )[CLS])
12: Lalign = 1
3B
B	X
i=1
h
(1 − e⊤
i hi) + (1 −
e⊤
i ci) + (1 − h⊤
i ci)
i
13: L ← LM LM + LN SP + λ Lalign
14: end for
15: end for
16: return fθ
Centered Kernel Alignment (CKA). Given two
representation matrices X ∈ Rn×d1 and Y ∈
Rn×d2 , linear CKA is defined as:
CKA(X, Y) =
˜	X⊤ ˜	Y 2
F
˜	X⊤ ˜	X F
˜	Y⊤ ˜	Y F
, (6)
where ˜	X = X − E[X] and ˜	Y = Y − E[Y] de-
note mean-centered representations, and ∥·∥F is
the Frobenius norm.
CKA is invariant to isotropic scaling and or-
thogonal transformations, making it well-suited for
comparing representations across layers, languages,
and model architectures.
Singular Vector Canonical Correlation Anal-
ysis (SVCCA). To complement CKA, we also
employ SVCCA, which emphasizes alignment

Chunk 36 · 1,990 chars

he Frobenius norm.
CKA is invariant to isotropic scaling and or-
thogonal transformations, making it well-suited for
comparing representations across layers, languages,
and model architectures.
Singular Vector Canonical Correlation Anal-
ysis (SVCCA). To complement CKA, we also
employ SVCCA, which emphasizes alignment be-
tween dominant low-dimensional subspaces. Given
representations X and Y, we first apply PCA to
retain the top-k principal components:
X′ = PCAk(X), Y′ = PCAk(Y). (7)
14

-- 14 of 24 --

Canonical Correlation Analysis is then per-
formed on X′ and Y′, yielding canonical corre-
lation coefficients {ρ1, . . . , ρk}. The SVCCA sim-
ilarity is defined as:
SVCCA(X, Y) = 1
k
k	X
i=1
ρi. (8)
While CKA captures global representational
alignment, SVCCA focuses on shared subspaces,
providing a complementary view of cross-lingual
representation structure.
E.2 Entropy-Based Analysis of Code-Mixed
Representations
To quantify how much information monolingual
representations provide about code-mixed (CM)
representations across transformer layers, we adopt
an information-theoretic perspective based on dif-
ferential entropy. Our goal is to measure how un-
certainty in CM representations is reduced when
conditioning on Hindi and/or English representa-
tions.
For a given model and layer ℓ, let CMℓ, Hiℓ,
and Enℓ denote the hidden representations of code-
mixed, Hindi, and English inputs, respectively.
We estimate the entropy of CM representations as
H(CMℓ) and the conditional entropies H(CMℓ |
Hiℓ), H(CMℓ | Enℓ), and H(CMℓ | Hiℓ, Enℓ).
All entropy quantities are computed under a
linear-Gaussian assumption. Specifically, condi-
tional entropies are estimated by regressing CMℓ
onto the corresponding conditioning representa-
tions using ridge regression, and computing the
entropy of the resulting residuals via the log-
determinant of their covariance matrix. This proce-
dure captures how much variance in CM represen-
tations remains unexplained after accounting

Chunk 37 · 1,995 chars

re estimated by regressing CMℓ
onto the corresponding conditioning representa-
tions using ridge regression, and computing the
entropy of the resulting residuals via the log-
determinant of their covariance matrix. This proce-
dure captures how much variance in CM represen-
tations remains unexplained after accounting for
monolingual signals.
To facilitate comparison across conditions, we
define uncertainty reduction as
∆Hℓ = H(CMℓ) − H(CMℓ | ·), (9)
where larger values of ∆Hℓ indicate greater re-
duction in uncertainty of CM representations due
to conditioning. We compute ∆Hℓ separately for
conditioning on Hindi, English, and their joint rep-
resentation.
By examining uncertainty reduction across lay-
ers and architectures, we assess the relative and
complementary contributions of Hindi and English
in explaining code-mixed representations.
E.3 Language-Wise Saliency Analysis via
Rank-Inverse Attribution
To complement the entropy-based analysis in Ap-
pendix E.2, we analyze token-level saliency un-
der code-mixed inputs using a rank-based attri-
bution framework. Our approach is inspired by
Saliency Drift Attribution (SDA), which quantifies
how token-level importance shifts when semanti-
cally aligned inputs are perturbed via code-mixing.
In contrast to representation-level uncertainty mea-
sures, this analysis directly examines how attribu-
tional importance is distributed across tokens.
Rank-Inverse Saliency. Let an input sentence
be denoted by x = {w1, w2, . . . , wn}. For a given
model, we compute token-level importance scores
using a gradient-based attribution method A(·),
specifically Integrated Gradients. Given raw at-
tribution scores {A(wi)}n
i=1, we define the Rank-
Inverse (RI) score for token wi as
RI(wi) = 1
rank(A(wi)) , (10)
where rank(A(wi)) denotes the rank of token wi
when tokens are sorted in descending order of attri-
bution magnitude. This rank-based normalization
removes sensitivity to absolute attribution scales
and input length, enabling

Chunk 38 · 1,995 chars

)}n
i=1, we define the Rank-
Inverse (RI) score for token wi as
RI(wi) = 1
rank(A(wi)) , (10)
where rank(A(wi)) denotes the rank of token wi
when tokens are sorted in descending order of attri-
bution magnitude. This rank-based normalization
removes sensitivity to absolute attribution scales
and input length, enabling comparison across mod-
els and sentences.
For encoder-based architectures, RI scores are
computed with respect to the sentence-level repre-
sentation. Integrated Gradients is applied to the in-
put embeddings, and token attributions are defined
as the contribution of each token to the ℓ2 norm of
the [CLS] (or equivalent) representation. Encoder-
specific special tokens (e.g., [CLS], [SEP]) and
punctuation-only tokens are excluded.
Language-Wise Aggregation. Each input sen-
tence is annotated with word-level language labels
indicating English or Hindi. For a given model,
we compute language-wise average saliency by ag-
gregating RI scores over all tokens belonging to a
given language and normalizing by the number of
tokens of that language:
RIlang = 1
|Tlang|
X
w∈Tlang
RI(w), (11)
where Tlang denotes the set of tokens labeled with a
given language. This yields a corpus-level estimate
of how much attributional importance is assigned
to English versus Hindi tokens under code-mixed
inputs.
15

-- 15 of 24 --

Interpretation. Higher RI values indicate greater
influence of tokens on model behavior. Differences
between RIEnglish and RIHindi reflect attributional
bias in how models process code-mixed inputs. Un-
like entropy-based uncertainty reduction, which
captures representational dependence across lay-
ers, RI analysis directly measures how saliency is
allocated at the token level.
F Downstream Tasks
To evaluate the effectiveness of our post-training
alignment procedure, we conduct experiments on
two code-mixed downstream tasks: sentiment anal-
ysis and hate speech detection. We focus on
the Hindi–English code-mixed setting to assess
whether improved

Chunk 39 · 1,987 chars

how saliency is
allocated at the token level.
F Downstream Tasks
To evaluate the effectiveness of our post-training
alignment procedure, we conduct experiments on
two code-mixed downstream tasks: sentiment anal-
ysis and hate speech detection. We focus on
the Hindi–English code-mixed setting to assess
whether improved alignment benefits semantic clas-
sification tasks in this language pair. We do not
include tasks such as humor or sarcasm detection,
as prior work (Mazumder et al., 2025b) shows that
translations often fail to preserve the underlying
meaning in tasks that rely heavily on linguistic nu-
ances.
F.1 Task Descriptions
Sentiment Analysis. We use the Hinglish sub-
set of the SemEval-2020 Task 9 dataset (Patwa
et al., 2020), which contains Hindi–English code-
mixed tweets annotated with sentence-level senti-
ment labels (Positive, Negative, and Neutral).2 The
dataset contains 14,000 training, 3,000 validation,
and 3,000 test instances, with a label distribution
of 6,616 positive, 7,492 neutral, and 5,892 nega-
tive instances across all splits. Each code-mixed
sentence is translated into English and Hindi to pro-
duce trilingual sentence triples for each instance,
as detailed in Section F.2.
Hate Speech Detection. For hate speech detec-
tion, we use the Hindi–English code-mixed hate
speech dataset introduced by Bohra et al. (2018),
which consists of tweets annotated at the sentence
level as either Hate Speech or Non-Hate Speech.
The dataset contains 4,567 instances, with 1,656
Hate and 2,911 Non-Hate instances. Since the
dataset does not provide a predefined train-test split,
we construct an 80/10/10 stratified split to create
training, validation, and test sets while preserving
the original label distribution. Each instance is sim-
ilarly translated into English and Hindi, as detailed
in Section F.2.
2The dataset can be accessed at https://github.com/
singhnivedita/SemEval2020-Task9.
F.2 Translation Procedure
All translations are generated using

Chunk 40 · 1,994 chars

ng, validation, and test sets while preserving
the original label distribution. Each instance is sim-
ilarly translated into English and Hindi, as detailed
in Section F.2.
2The dataset can be accessed at https://github.com/
singhnivedita/SemEval2020-Task9.
F.2 Translation Procedure
All translations are generated using QWEN2.5-
72B-INSTRUCT-GPTQ-INT4 deployed through
the vLLM inference framework. To ensure label con-
sistency across translations, we use task-specific
prompts that instruct the model to preserve seman-
tic meaning and task labels during translation.
For the sentiment analysis dataset, the prompts
explicitly specify the original sentiment label and
instruct the model not to alter the sentiment during
translation. For the hate speech dataset, prompts
instruct the model to preserve the intent and sever-
ity of the original sentence during translation.
Since hate speech content may trigger refusals, the
prompts explicitly instruct the model to translate
all sentences regardless of their content.
All translations are returned in a structured
JSON format containing the translated sentence,
the original label, and the target language, ensuring
consistent parsing across all instances. As a qual-
ity control step, we manually verified 30 randomly
sampled instances from each dataset to confirm that
the translations preserved both the semantic mean-
ing and the associated labels. The dataset statistics
are presented in Table 7. The following prompts
were used for each task during translation:
Sentiment Translation Prompt
You are a sentiment-preserving translator. Trans-
late the following code-mixed (Hinglish) sentence
into <TARGET LANGUAGE>.
Rules:
• Do not change the sentiment of the sen-
tence.
• The original sentiment label is provided as
<SENTIMENT>.
• Return only a JSON object in the following
format:
{"translated_sentence": "<translation>",
"sentiment": "<SENTIMENT>",
"language": "<TARGET_LANGUAGE>"}
Sentence: <INPUT SENTENCE>
Hate Speech Translation

Chunk 41 · 1,997 chars

ules:
• Do not change the sentiment of the sen-
tence.
• The original sentiment label is provided as
<SENTIMENT>.
• Return only a JSON object in the following
format:
{"translated_sentence": "<translation>",
"sentiment": "<SENTIMENT>",
"language": "<TARGET_LANGUAGE>"}
Sentence: <INPUT SENTENCE>
Hate Speech Translation Prompt
You are a professional linguist working on an aca-
demic research dataset for hate speech detection.
Your task is to translate code-mixed (Hinglish)
sentences into <TARGET LANGUAGE> for research
purposes.
This is a research translation task. You must
translate all sentences regardless of their con-
tent. The dataset contains both hateful and non-
16

-- 16 of 24 --

hateful content and accurate translation of both
is critical for the research.
Rules:
• Translate the sentence while preserving the
original intent, aggression level, and emo-
tional tone.
• The sentence may contain hateful or non-
hateful content and must be translated re-
gardless of its content.
• You may replace slurs with semantically
equivalent expressions that preserve the
hate or non-hate property.
• Do not add warnings, disclaimers, or re-
fusals. Just translate.
• Return only a JSON object in the following
format:
{"translated_sentence": "<translation>",
"label": "<LABEL>",
"language": "<TARGET_LANGUAGE>"}
Sentence: <INPUT SENTENCE>
Label Train Val Test Total
Sentiment(Patwa et al., 2020)
Neutral 5,426 1,124 1,071 7,621
Positive 4,853 979 979 6,811
Negative 4,232 887 876 5,995
Total 14,511 2,990 2,926 20,427
Hate Speech(Bohra et al., 2018)
Non-Hate 2,328 292 291 2,911
Hate 	1,325 165 166 1,656
Total 3,653 457 457 4,567
Table 7: Dataset statistics for the sentiment analysis and
hate speech detection tasks after preprocessing.
F.3 Implementation Details
All models are fine-tuned using the HuggingFace
Transformers library with PyTorch. We use the
AdamW optimizer with learning rates selected
from {2e-5, 1.5e-5, 6.5e-6, 6e-6} and train for up
to 10 epochs with a batch size of

Chunk 42 · 1,992 chars

timent analysis and
hate speech detection tasks after preprocessing.
F.3 Implementation Details
All models are fine-tuned using the HuggingFace
Transformers library with PyTorch. We use the
AdamW optimizer with learning rates selected
from {2e-5, 1.5e-5, 6.5e-6, 6e-6} and train for up
to 10 epochs with a batch size of 32. During train-
ing, model checkpoints are evaluated on the valida-
tion set at each epoch, and the checkpoint with the
best validation score is selected for final evaluation.
For each model, training is performed separately
on the code-mixed, English, and Hindi variants of
the dataset, while evaluation is conducted on all
three variants.
Consistency To evaluate model robustness
across different evaluation settings, we compute
a Consistency score using the Macro-F1 values
obtained on the original test set and its English
and Hindi translations. The score is computed
as the difference between the mean Macro-F1
and the population standard deviation across the
three evaluations. Let s1, s2, s3 denote the Macro-
F1 scores on the original, English-translated, and
Hindi-translated test sets, respectively:
Consistency = mean(s1, s2, s3) − std(s1, s2, s3).
This metric favors models that achieve both
strong performance and stable behavior across the
three evaluation conditions.
F.4 Observations
F.4.1 Sentiment Analysis
Tables 8 and 9 report the results for each model
across all train-test language combinations for the
sentiment analysis task. We summarize the key
observations below.
• Trilingual alignment improves cross-
lingual consistency for mBERT. As shown
in Table 8, applying trilingual alignment to
mBERT improves the consistency score from
0.3283 to 0.4464 (∼ 36% ↑) when training on
code-mixed data, and from 0.5079 to 0.5519
(∼ 9% ↑) when training on Hindi. Although
the trilingual-aligned models do not always
achieve the highest individual macro-F1
scores, the consistency improvements suggest
that the alignment objective encourages
more balances

Chunk 43 · 1,999 chars

re from
0.3283 to 0.4464 (∼ 36% ↑) when training on
code-mixed data, and from 0.5079 to 0.5519
(∼ 9% ↑) when training on Hindi. Although
the trilingual-aligned models do not always
achieve the highest individual macro-F1
scores, the consistency improvements suggest
that the alignment objective encourages
more balances performance across language
variants.
• Similar trends are observed for code-mixed-
adapted models. In Table 9, trilingual align-
ment applied to Hing-mBERT improves the
consistency score from 0.3558 to 0.4052 (∼
14% ↑) when training on code-mixed data,
and from 0.2918 to 0.4470 (∼ 53% ↑) when
training on English. Improvements are also
visible for Hing-RoBERTa models, particu-
larly when training on code-mixed or English
data, where the consistency score improves
from 0.3350 to 0.6265 (∼ 87% ↑) and from
0.2666 to 0.5343 (∼ 100% ↑)respectively.
• Trilingual alignment yields moderate im-
provements for mixed-language models. As
shown in Table 9, Hing-mBERT (mixed) and
Hing-RoBERTa (mixed) already achieve rela-
tively strong cross-lingual consistency owing
17

-- 17 of 24 --

to their mixed-language pretraining. Nonethe-
less, trilingual alignment further improves
performance in several configurations, with
the some gains observed for Hing-RoBERTa
(mixed), where the consistency score im-
proves from 0.6124 to 0.6633 (∼ 8% ↑) when
training on English and marginal gain from
0.6864 to 0.7051 (∼ 3% ↑) when training on
Hindi. These results indicate that the trilingual
alignment objective provides complementary
benefits even for models that already possess
a degree of cross-lingual robustness through
pretraining.
• Ablation provides insight into the role of
alignment loss. As shown in Table 8, the –w/o
Alignment variant generally performs com-
petitively with the base models and in sev-
eral cases approaches the performance of the
fully aligned models, indicating that contin-
ued pretraining on code-mixed data itself con-
tributes meaningfully to cross-lingual

Chunk 44 · 1,996 chars

he role of
alignment loss. As shown in Table 8, the –w/o
Alignment variant generally performs com-
petitively with the base models and in sev-
eral cases approaches the performance of the
fully aligned models, indicating that contin-
ued pretraining on code-mixed data itself con-
tributes meaningfully to cross-lingual perfor-
mance. For instance, for mBERT trained on
code-mixed data, the ablation model achieves
a consistency of 0.3780 compared to 0.3283
for the base model. However, the trilingual-
aligned model still delivers the strongest gains
in linguistically diverse settings, such as when
training on Hindi, where it achieves the high-
est consistency of 0.5519 compared to 0.5457
for the ablation variant and 0.5079 for the
base model. Together, these results suggest
that while code-mixed pretraining provides a
useful foundation, the explicit trilingual align-
ment objective complements it by enabling
more balanced cross-lingual generalization.
F.4.2 Hate Speech Detection
Tables 10 and 11 report the results for each model
across all train–test language combinations for the
hate speech detection task. We summarize the key
observations below.
• Trilingual alignment improves cross-
lingual consistency for XLM-R. As shown
in Table 10, applying trilingual alignment to
XLM-R improves the consistency score from
0.6217 to 0.6794 (∼ 9% ↑) when training
on code-mixed data. Improvements are also
observed when training on English and Hindi,
where the aligned model improves from
0.6326 to 0.6732 (∼ 6% ↑) and from 0.6379
to 0.6577 (∼ 3% ↑) respectively.
• Stronger gains observed for code-mixed-
adapted models. In Table 11, trilingual align-
ment applied to Hing-mBERT improves the
consistency score from 0.4386 to 0.6272 (∼
43% ↑) when training on Hindi. A more pro-
nounced improvement is observed for Hing-
RoBERTa, where the consistency score in-
creases from 0.4130 to 0.6352 (∼ 54% ↑)
when training on Hindi, suggesting that trilin-
gual alignment is more effective when the

Chunk 45 · 1,993 chars

g-mBERT improves the
consistency score from 0.4386 to 0.6272 (∼
43% ↑) when training on Hindi. A more pro-
nounced improvement is observed for Hing-
RoBERTa, where the consistency score in-
creases from 0.4130 to 0.6352 (∼ 54% ↑)
when training on Hindi, suggesting that trilin-
gual alignment is more effective when the fine-
tuning language is Hindi.
• Trilingual alignment yields consistent im-
provements for mixed-language models. As
shown in Table 11, Hing-mBERT (mixed) and
Hing-RoBERTa (mixed) already achieve rela-
tively strong cross-lingual consistency owing
to their mixed-language pretraining. Nonethe-
less, trilingual alignment further improves
performance in several configurations, with
the most notable gains observed when train-
ing on Hindi, where Hing-mBERT-Mixed
Trilingual improves from 0.5440 to 0.6720
(∼ 24% ↑) and Hing-RoBERTa-Mixed-
Trilingual improves from 0.6162 to 0.6720
(∼ 9% ↑). These results indicate that the
trilingual alignment objective provides com-
plementary benefits even for models that al-
ready possess a degree of cross-lingual robust-
ness through pretraining.
• Ablation provides insight into the role of
alignment loss. As shown in Table 10, con-
tinued pretraining on code-mixed data alone
already yields substantial gains over the base
mBERT model, with the –w/o Alignment vari-
ant achieving a consistency score of 0.6072
against 0.5542 for the base model when
trained on code-mixed data. Nevertheless,
the trilingual-aligned model consistently out-
performs both the base and ablation variants
across all three training languages, achiev-
ing consistency scores of 0.6194, 0.6119, and
0.5545 for code-mixed, English, and Hindi
training respectively. A similar pattern holds
for XLM-R, where the aligned model achieves
the best consistency scores of 0.6794, 0.6732,
and 0.6577 across all three training languages.
Taken together, these findings suggest that the
trilingual alignment objective provides consis-
tent and additive benefits over

Chunk 46 · 1,987 chars

Hindi
training respectively. A similar pattern holds
for XLM-R, where the aligned model achieves
the best consistency scores of 0.6794, 0.6732,
and 0.6577 across all three training languages.
Taken together, these findings suggest that the
trilingual alignment objective provides consis-
tent and additive benefits over code-mixed pre-
training alone, particularly in the hate speech
18

-- 18 of 24 --

detection setting.
19

-- 19 of 24 --

mBERT family
XLM-R family
Figure 8: Layer-wise CKA alignment for encoders and their trilingual versions. Each subplot shows cross-lingual
representation alignment for EN-CM, EN-HI, and HI-CM across layers.
mBERT family
XLM-R family
Figure 9: Layer-wise SVCCA alignment for mBERT and XLM-R with their code-mixed adapted models. Each
subplot shows cross-lingual representation alignment for EN-CM, EN-HI, and HI-CM across layers.
20

-- 20 of 24 --

Train Model Test Set Consistency
CM EN HI
CM
mBERT 	0.5192 0.5672 0.3095 0.3283
mBERT Trilingual 0.6290 0.6420 0.4281 0.4464
–w/o Alignment 0.6537 0.6177 0.3538 0.3780
EN
mBERT 	0.4196 0.7323 0.4604 0.3674
mBERT Trilingual 0.5333 0.7277 0.5103 0.4710
–w/o Alignment 0.5286 0.7163 0.5201 0.4774
HI
mBERT 	0.4962 0.6271 0.6577 0.5079
mBERT Trilingual 0.5460 0.6325 0.6768 0.5519
–w/o Alignment 0.5575 0.5868 0.6730 0.5457
CM
XLM-R 	0.6571 0.6301 0.6445 0.6304
XLM-R Trilingual 0.6892 0.6905 0.6591 0.6618
–w/o Alignment 0.6839 0.6936 0.6456 0.6490
EN
XLM-R 	0.4589 0.7381 0.6315 0.4686
XLM-R Trilingual 0.4970 0.7316 0.6312 0.5022
–w/o Alignment 0.4898 0.7275 0.5946 0.4848
HI
XLM-R 	0.6102 0.7240 0.6935 0.6170
XLM-R Trilingual 0.6249 0.7276 0.7044 0.6318
–w/o Alignment 0.6323 0.7169 0.6934 0.6372
Table 8: Performance of models on sentiment classification task when trained on different source languages and
evaluated on code-mixed (CM), English (EN), and Hindi (HI) test sets. Notation: Except consistency, all others are
macro-F1 scores averaged across three random seeds.
Train Model Test Set

Chunk 47 · 1,996 chars

0.7169 0.6934 0.6372
Table 8: Performance of models on sentiment classification task when trained on different source languages and
evaluated on code-mixed (CM), English (EN), and Hindi (HI) test sets. Notation: Except consistency, all others are
macro-F1 scores averaged across three random seeds.
Train Model Test Set Consistency
CM EN HI
CM
Hing-mBERT 	0.6916 0.7141 0.3221 0.3558
Hing-mBERT Trilingual 	0.6722 0.6895 0.3784 0.4052
Hing-mBERT Mixed 	0.7241 0.7122 0.6812 0.6837
Hing-mBERT Mixed Trilingual 0.7139 0.7110 0.6900 0.6919
EN
Hing-mBERT 	0.5613 0.7311 0.2748 0.2918
Hing-mBERT Trilingual 	0.5572 0.7260 0.4607 0.4470
Hing-mBERT Mixed 	0.5524 0.7498 0.6104 0.5361
Hing-mBERT Mixed Trilingual 0.5268 0.7304 0.6725 0.5383
HI
Hing-mBERT 	0.4749 0.4386 0.6720 0.4029
Hing-mBERT Trilingual 	0.6137 0.6462 0.6415 0.6162
Hing-mBERT Mixed 	0.6878 0.7322 0.7062 0.6864
Hing-mBERT Mixed Trilingual 0.6886 0.7200 0.7044 0.6886
CM
Hing-RoBERTa 	0.7074 0.7127 0.2982 0.3350
Hing-RoBERTa Trilingual 	0.7049 0.6889 0.6204 0.6265
Hing-RoBERTa Mixed 	0.7063 0.7288 0.7075 0.7015
Hing-RoBERTa Mixed Trilingual 0.7304 0.7122 0.6975 0.6969
EN
Hing-RoBERTa 	0.5425 0.7351 0.2519 0.2666
Hing-RoBERTa Trilingual 	0.5535 0.7742 0.6163 0.5343
Hing-RoBERTa Mixed 	0.6200 0.7703 0.6750 0.6124
Hing-RoBERTa Mixed Trilingual 0.6678 0.7349 0.6899 0.6633
HI
Hing-RoBERTa 	0.6681 0.7361 0.7150 0.6716
Hing-RoBERTa Trilingual 	0.6899 0.7252 0.7092 0.6904
Hing-RoBERTa Mixed 	0.6907 0.7377 0.7036 0.6864
Hing-RoBERTa Mixed Trilingual 0.7057 0.7466 0.7243 0.7051
Table 9: Performance of Hing-mBERT and Hing-RoBERTa models on sentiment classification task when trained on
different source languages and evaluated on code-mixed (CM), English (EN), and Hindi (HI) test sets. Notation:
Except consistency, all others are macro-F1 scores averaged across three random seeds.
21

-- 21 of 24 --

Train Model Test Set Consistency
CM EN HI
CM
mBERT 	0.6706 0.6194 0.5373 0.5542
mBERT Trilingual 0.6631 0.6188 0.6427 0.6194
–w/o

Chunk 48 · 1,989 chars

nd evaluated on code-mixed (CM), English (EN), and Hindi (HI) test sets. Notation:
Except consistency, all others are macro-F1 scores averaged across three random seeds.
21

-- 21 of 24 --

Train Model Test Set Consistency
CM EN HI
CM
mBERT 	0.6706 0.6194 0.5373 0.5542
mBERT Trilingual 0.6631 0.6188 0.6427 0.6194
–w/o Alignment 0.6847 0.6575 0.5937 0.6072
EN
mBERT 	0.5777 0.6294 0.4732 0.4805
mBERT Trilingual 0.6391 0.6362 0.6094 0.6119
–w/o Alignment 0.6850 0.6779 0.5692 0.5791
HI
mBERT 	0.5879 0.5361 0.6653 0.5314
mBERT Trilingual 0.5520 0.6105 0.6529 0.5545
–w/o Alignment 0.5287 0.6308 0.6402 0.5381
CM
XLM-R 	0.6794 0.6856 0.6071 0.6217
XLM-R Trilingual 0.6972 0.6803 0.6863 0.6794
–w/o Alignment 0.6720 0.6540 0.4738 0.5104
EN
XLM-R 	0.6698 0.6889 0.6226 0.6326
XLM-R Trilingual 0.6816 0.7003 0.6759 0.6732
–w/o Alignment 0.6736 0.6967 0.6394 0.6464
HI
XLM-R 	0.6315 0.6632 0.6850 0.6379
XLM-R Trilingual 0.6628 0.6635 0.6856 0.6577
–w/o Alignment 0.6403 0.6894 0.6972 0.6504
Table 10: Performance of models on hate speech classification task when trained on different source languages and
evaluated on code-mixed (CM), English (EN), and Hindi (HI) test sets. Notation: Except consistency, all others are
macro-F1 scores averaged across three random seeds.
Train Model Test Set Consistency
CM EN HI
CM
Hing-mBERT 	0.6981 0.6748 0.3890 0.4468
Hing-mBERT Trilingual 	0.6556 0.6212 0.5136 0.5363
Hing-mBERT Mixed 	0.6961 0.6660 0.6109 0.6224
Hing-mBERT Mixed Trilingual 0.6915 0.6692 0.6475 0.6514
EN
Hing-mBERT 	0.6970 0.6883 0.3890 0.4482
Hing-mBERT Trilingual 	0.6722 0.6569 0.4189 0.4667
Hing-mBERT Mixed 	0.6980 0.7060 0.5836 0.6066
Hing-mBERT Mixed Trilingual 0.7098 0.6847 0.5845 0.6055
HI
Hing-mBERT 	0.5268 0.4189 0.6361 0.4386
Hing-mBERT Trilingual 	0.6414 0.6236 0.6534 0.6272
Hing-mBERT Mixed 	0.5830 0.5450 0.6948 0.5440
Hing-mBERT Mixed Trilingual 0.6729 0.6779 0.6984 0.6720
CM
Hing-RoBERTa 	0.7263 0.7093 0.3890 0.4530
Hing-RoBERTa Trilingual 	0.6872 0.6727 0.6609

Chunk 49 · 1,999 chars

gual 0.7098 0.6847 0.5845 0.6055
HI
Hing-mBERT 	0.5268 0.4189 0.6361 0.4386
Hing-mBERT Trilingual 	0.6414 0.6236 0.6534 0.6272
Hing-mBERT Mixed 	0.5830 0.5450 0.6948 0.5440
Hing-mBERT Mixed Trilingual 0.6729 0.6779 0.6984 0.6720
CM
Hing-RoBERTa 	0.7263 0.7093 0.3890 0.4530
Hing-RoBERTa Trilingual 	0.6872 0.6727 0.6609 0.6628
Hing-RoBERTa Mixed 	0.6971 0.6841 0.6271 0.6390
Hing-RoBERTa Mixed Trilingual 0.7223 0.6814 0.6717 0.6699
EN
Hing-RoBERTa 	0.7259 0.7294 0.3890 0.4551
Hing-RoBERTa Trilingual 	0.7036 0.6955 0.6929 0.6928
Hing-RoBERTa Mixed 	0.7191 0.7275 0.6645 0.6758
Hing-RoBERTa Mixed Trilingual 0.6778 0.6885 0.6759 0.6752
HI
Hing-RoBERTa 	0.4950 0.4047 0.6710 0.4130
Hing-RoBERTa Trilingual 	0.6318 0.6640 0.6493 0.6352
Hing-RoBERTa Mixed 	0.6303 0.6273 0.7196 0.6162
Hing-RoBERTa Mixed Trilingual 0.6862 0.6683 0.6973 0.6720
Table 11: Performance of Hing-mBERT and Hing-RoBERTa models on hate speech classification task when trained
on different source languages and evaluated on code-mixed (CM), English (EN), and Hindi (HI) test sets. Notation:
Except consistency, all others are macro-F1 scores averaged across three random seeds.
22

-- 22 of 24 --

EN→CM
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 49.11 53.99 50.98 53.91 57.90 55.03 54.60 53.72 50.43 52.52 53.49 55.73
Hing-mBERT 53.05 61.80 61.41 65.28 75.38 100.00 72.00 69.00 64.27 43.75 57.42 54.20
Hing-mBERT-Mixed 50.24 58.76 51.89 65.65 54.73 85.50 63.21 66.26 60.06 43.96 62.06 55.95
XLM-R 58.89 99.84 94.72 88.82 72.44 100.00 47.75 47.58 55.04 59.50 45.33 49.71
Hing-RoBERTa 58.41 36.27 41.95 98.18 99.99 97.97 53.98 55.22 72.18 71.18 64.19 64.86
Hing-RoBERTa-Mixed 41.24 73.63 80.07 92.95 87.22 100.00 54.40 70.18 54.00 62.76 56.73 86.61
CM→EN
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 49.76 58.39 54.53 50.49 55.37 51.59 50.70 49.03 47.19 45.54 44.95 43.11
Hing-mBERT 52.77 64.69 61.53 63.00 76.41 100.00 78.65 68.46 64.72 42.96 56.43 71.79
Hing-mBERT-Mixed 51.08 63.66 53.23 58.96 52.85 81.22 60.76

Chunk 50 · 1,994 chars

7.22 100.00 54.40 70.18 54.00 62.76 56.73 86.61
CM→EN
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 49.76 58.39 54.53 50.49 55.37 51.59 50.70 49.03 47.19 45.54 44.95 43.11
Hing-mBERT 52.77 64.69 61.53 63.00 76.41 100.00 78.65 68.46 64.72 42.96 56.43 71.79
Hing-mBERT-Mixed 51.08 63.66 53.23 58.96 52.85 81.22 60.76 66.14 59.48 42.20 62.27 76.17
XLM-R 55.69 99.74 95.64 89.38 81.66 100.00 52.50 47.26 55.65 58.66 42.49 50.38
Hing-RoBERTa 53.81 36.88 29.87 98.40 99.99 99.70 54.31 53.93 73.67 72.89 66.70 71.52
Hing-RoBERTa-Mixed 38.11 72.31 65.58 93.18 84.56 100.00 55.84 70.34 54.00 62.61 57.39 92.54
EN→HI
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 53.03 56.83 54.27 61.46 68.30 68.82 69.69 73.88 71.11 70.33 70.02 72.31
Hing-mBERT 35.34 27.82 24.13 26.21 62.36 100.00 75.64 53.32 40.16 25.77 48.28 33.75
Hing-mBERT-Mixed 27.14 50.77 41.25 51.02 36.18 86.52 57.67 55.68 49.29 34.64 56.02 47.35
XLM-R 61.34 99.80 92.02 90.27 73.31 100.00 49.88 48.51 55.54 60.84 41.86 58.12
Hing-RoBERTa 26.19 43.29 60.66 99.62 98.57 100.00 45.73 50.84 19.50 12.48 12.28 15.06
Hing-RoBERTa-Mixed 29.46 57.39 59.66 94.15 78.76 100.00 52.07 66.51 49.93 58.80 40.65 82.35
HI→EN
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 55.40 60.46 55.19 60.29 74.83 75.16 72.61 74.05 68.19 65.78 62.48 32.64
Hing-mBERT 36.74 39.94 36.31 31.68 67.77 100.00 88.19 46.87 39.89 21.69 42.90 32.93
Hing-mBERT-Mixed 35.58 51.50 40.93 43.37 33.36 82.85 56.69 51.54 47.24 31.97 55.98 61.90
XLM-R 57.90 99.72 89.69 89.67 77.41 100.00 51.99 47.32 56.29 60.85 41.40 54.15
Hing-RoBERTa 39.43 33.32 30.74 99.56 98.69 99.95 48.29 49.00 34.18 25.68 23.44 26.47
Hing-RoBERTa-Mixed 34.01 60.12 75.76 94.12 76.81 100.00 53.51 67.61 49.40 57.45 41.47 84.14
HI→CM
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 37.30 47.43 47.41 47.31 43.56 41.10 39.25 39.42 36.83 37.43 38.53 25.29
Hing-mBERT 30.64 37.16 36.29 39.78 72.17 100.00 83.99 45.34 45.20 27.36 49.77 24.52
Hing-mBERT-Mixed 39.78 50.39 44.09 56.37 46.84 89.32 60.14

Chunk 51 · 1,994 chars

6.81 100.00 53.51 67.61 49.40 57.45 41.47 84.14
HI→CM
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 37.30 47.43 47.41 47.31 43.56 41.10 39.25 39.42 36.83 37.43 38.53 25.29
Hing-mBERT 30.64 37.16 36.29 39.78 72.17 100.00 83.99 45.34 45.20 27.36 49.77 24.52
Hing-mBERT-Mixed 39.78 50.39 44.09 56.37 46.84 89.32 60.14 58.03 57.16 37.86 59.61 50.90
XLM-R 56.47 99.98 92.92 88.37 72.34 100.00 48.13 46.20 54.85 57.64 44.90 37.65
Hing-RoBERTa 33.14 29.15 40.07 99.62 98.88 98.15 47.70 49.39 33.28 24.44 22.50 25.34
Hing-RoBERTa-Mixed 24.13 62.49 80.28 92.28 90.18 100.00 52.52 66.43 49.22 52.78 37.74 72.68
CM→HI
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 37.06 48.61 50.91 46.62 37.19 36.71 36.77 36.82 36.30 34.78 34.01 37.65
Hing-mBERT 28.72 26.49 23.43 27.32 67.07 100.00 79.03 49.08 43.35 29.75 53.19 28.92
Hing-mBERT-Mixed 27.43 51.67 44.25 57.42 47.63 89.38 60.14 60.10 57.86 38.72 59.17 51.55
XLM-R 56.69 99.99 95.58 89.56 77.30 100.00 50.48 46.77 54.46 56.80 42.33 40.39
Hing-RoBERTa 20.57 37.46 52.29 99.66 98.87 100.00 45.35 50.46 19.84 12.49 12.06 14.43
Hing-RoBERTa-Mixed 19.64 58.49 57.95 92.57 89.00 100.00 52.14 66.32 49.60 52.66 37.54 77.57
Table 12: Layer-wise cross-lingual alignment accuracy (%) across transformer layers (L1–L12) for multilingual and
code-mixed adapted models using dot-product similarity with percentile-based negative sampling between English
(EN), Hindi (HI), and code-mixed (CM).
23

-- 23 of 24 --

EN→CM
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 50.05 54.75 51.04 54.80 59.83 56.90 56.99 56.50 53.46 55.32 56.45 59.93
Hing-mBERT 53.91 62.75 62.67 66.40 75.96 100.00 72.06 69.15 64.45 43.94 57.58 55.40
Hing-mBERT-Mixed 50.59 59.86 53.29 66.94 56.43 86.00 64.19 66.90 61.24 44.94 62.21 58.48
XLM-R 59.14 99.81 94.53 88.84 72.45 100.00 47.12 47.85 55.28 59.66 45.18 50.30
Hing-RoBERTa 58.99 36.52 40.55 98.22 99.99 97.73 54.40 55.14 73.76 73.05 66.99 67.85
Hing-RoBERTa-Mixed 42.63 74.01 79.89 92.94 87.29 100.00 54.81 70.64 55.58 64.09

Chunk 52 · 1,994 chars

xed 50.59 59.86 53.29 66.94 56.43 86.00 64.19 66.90 61.24 44.94 62.21 58.48
XLM-R 59.14 99.81 94.53 88.84 72.45 100.00 47.12 47.85 55.28 59.66 45.18 50.30
Hing-RoBERTa 58.99 36.52 40.55 98.22 99.99 97.73 54.40 55.14 73.76 73.05 66.99 67.85
Hing-RoBERTa-Mixed 42.63 74.01 79.89 92.94 87.29 100.00 54.81 70.64 55.58 64.09 58.57 88.23
CM→EN
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 52.04 59.62 53.65 54.46 59.36 56.40 55.84 54.12 52.21 50.51 50.40 49.20
Hing-mBERT 52.79 64.92 62.13 62.93 76.53 100.00 78.78 68.47 64.83 43.07 56.74 71.97
Hing-mBERT-Mixed 51.01 63.81 53.12 59.20 52.99 81.53 60.77 65.63 59.17 42.14 62.21 76.89
XLM-R 55.83 99.76 95.46 89.47 81.64 100.00 51.89 47.50 55.23 58.67 42.09 50.85
Hing-RoBERTa 54.88 36.82 29.72 98.35 99.99 99.73 54.36 53.69 74.59 74.55 69.11 73.80
Hing-RoBERTa-Mixed 38.99 72.39 65.59 93.11 84.56 100.00 56.39 70.85 55.08 64.14 58.93 93.25
EN→HI
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 53.21 57.68 53.97 64.13 71.34 73.05 73.73 76.66 74.59 73.99 73.70 75.73
Hing-mBERT 34.98 28.08 24.50 26.81 62.34 100.00 75.72 53.51 40.30 25.71 48.79 34.15
Hing-mBERT-Mixed 26.07 51.54 42.52 52.23 37.56 86.88 58.20 56.26 49.82 35.41 56.21 48.99
XLM-R 61.04 99.77 92.05 90.24 73.27 100.00 49.89 48.15 55.60 60.49 42.24 58.34
Hing-RoBERTa 25.80 43.49 60.72 99.58 98.57 100.00 46.14 50.56 19.78 14.06 14.01 15.96
Hing-RoBERTa-Mixed 30.72 57.99 60.33 94.10 78.60 100.00 52.50 67.02 51.78 60.10 41.98 84.13
HI→EN
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 57.61 61.76 54.22 63.68 78.15 78.30 76.63 78.04 73.05 71.27 68.66 40.27
Hing-mBERT 36.21 39.61 36.54 31.43 67.61 100.00 88.26 46.81 39.49 21.45 42.42 33.49
Hing-mBERT-Mixed 34.46 51.06 40.99 43.35 33.02 83.08 56.11 50.86 46.75 31.47 55.79 63.24
XLM-R 58.03 99.69 89.69 89.52 77.43 100.00 52.26 47.50 56.13 60.81 41.35 54.59
Hing-RoBERTa 40.54 34.04 30.65 99.56 98.68 99.98 48.41 49.11 37.33 27.04 25.21 29.29
Hing-RoBERTa-Mixed 36.39 60.92 76.19 94.12 76.72 100.00 54.19 68.01 50.80 58.81

Chunk 53 · 1,726 chars

xed 34.46 51.06 40.99 43.35 33.02 83.08 56.11 50.86 46.75 31.47 55.79 63.24
XLM-R 58.03 99.69 89.69 89.52 77.43 100.00 52.26 47.50 56.13 60.81 41.35 54.59
Hing-RoBERTa 40.54 34.04 30.65 99.56 98.68 99.98 48.41 49.11 37.33 27.04 25.21 29.29
Hing-RoBERTa-Mixed 36.39 60.92 76.19 94.12 76.72 100.00 54.19 68.01 50.80 58.81 42.79 85.76
HI→CM
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 39.23 47.87 48.11 47.89 46.86 43.78 42.04 42.44 40.03 41.25 42.75 31.38
Hing-mBERT 30.92 37.77 37.15 40.44 71.92 100.00 84.01 45.41 44.98 27.45 49.04 25.72
Hing-mBERT-Mixed 40.00 51.03 44.69 57.19 47.74 89.61 60.78 57.89 57.49 37.42 59.92 53.67
XLM-R 56.61 99.98 93.02 88.33 72.38 100.00 47.80 46.81 54.96 57.83 44.51 38.32
Hing-RoBERTa 34.27 29.51 38.86 99.62 98.88 97.94 47.41 49.45 37.04 25.77 24.72 28.25
Hing-RoBERTa-Mixed 26.13 63.17 80.18 92.17 90.15 100.00 53.14 67.11 50.84 53.82 39.33 75.27
CM→HI
Model L1 L2 L3 L4 L5 L6 L7 L8 L9 L10 L11 L12
mBERT 36.75 48.61 50.23 48.75 40.60 42.08 42.00 41.99 41.59 40.12 39.40 43.65
Hing-mBERT 28.52 26.64 23.81 27.53 67.27 100.00 78.96 49.65 43.54 29.95 53.63 29.71
Hing-mBERT-Mixed 26.19 52.01 44.67 58.04 48.19 89.67 60.43 60.55 58.42 39.12 59.70 52.98
XLM-R 56.70 99.97 95.58 89.67 77.30 100.00 50.77 47.00 54.93 56.73 42.53 40.37
Hing-RoBERTa 20.17 37.22 51.56 99.68 98.87 100.00 46.33 49.93 19.92 13.96 13.82 15.34
Hing-RoBERTa-Mixed 20.50 58.64 58.40 92.62 88.97 100.00 52.92 66.48 50.98 54.10 38.58 79.39
Table 13: Layer-wise cross-lingual alignment accuracy (%) across transformer layers (L1–L12) for multilingual and
code-mixed adapted models using dot-product similarity with FAISS-based negative sampling between English
(EN), Hindi (HI), and code-mixed (CM).
24

-- 24 of 24 --