The MuHSiC (Multilingual Hispanic Speech in California) project is a corpus of bilingual Spanish-English speech from sociolinguistic interviews and naturalistic conversations across California. Speech data of diverse social and regional profiles were collected by undergraduate students trained by graduate students and the 3 faculty PIs.

This corpus features 1200 35-minute recordings of California Spanish-English bilinguals, 600 recorded in Spanish and 600 in English.

The Multilingual Hispanic Speech in California Corpus has been made possible in part by a Multicampus Research Programs and Initiatives Grant (MRPI GrantM23PL5866) awarded to Mark Amengual, Ji Young Kim, and Justin Davidson.

Any views, findings, conclusions, or recommendations expressed in this website and project, do not necessarily represent those of the University of California.

Principal Investigators

Dr. Mark Amengual portrait

Dr. Mark Amengual

Department of Languages and Applied Linguistics

University of California, Santa Cruz

amengualatucsc.edu

Website
Dr. Ji Young Kim portrait

Dr. Ji Young Kim

Department of Spanish and Portuguese

University of California, Los Angeles

jiyoungkimatucla.edu

Website
Dr. Justin Davidson portrait

Dr. Justin Davidson

Department of Spanish and Portuguese

University of California, Berkeley

justindavidsonatberkeley.edu

Website

Project Manager

Julian Vargo (Ph.D. Student) portrait

Julian Vargo (Ph.D. Student)

Department of Spanish and Portuguese

University of California, Berkeley

julianvargoatberkeley.edu

Website

For all inquiries regarding corpus processing, downloading, web development, or dataset access, please contact julianvargoatberkeley.edu