You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.
Fine-tuning NLLB-200 for Hinglish & Spanglish → English translation using the LinCE benchmark — with training, BLEU/ChrF/COMET evaluation, batch inference, and ONNX export.