81 researchers
The MuHSiC (Multilingual Hispanic Speech in California) project is
a corpus of bilingual Spanish-English speech from
sociolinguistic interviews and naturalistic conversations across California.
Speech data of diverse social and regional profiles were collected by undergraduate students trained by graduate
students and the 3 faculty PIs.
This corpus features 1200 35-minute recordings of
California Spanish-English bilinguals, 600 recorded in Spanish and 600 in English.
The Multilingual Hispanic Speech in California Corpus has been made possible in part by a Multicampus Research Programs and Initiatives Grant (MRPI GrantM23PL5866) awarded to Mark Amengual, Ji Young Kim, and Justin Davidson.
Any views, findings, conclusions, or recommendations expressed in this website and project, do not necessarily represent those of the University of California.
Principal Investigators
Department of Languages and Applied Linguistics
University of California, Santa Cruz
amengual
ucsc.edu
Department of Spanish and Portuguese
University of California, Los Angeles
jiyoungkim
ucla.edu
Department of Spanish and Portuguese
University of California, Berkeley
justindavidson
berkeley.edu
Project Manager
Department of Spanish and Portuguese
University of California, Berkeley
julianvargo
berkeley.edu
For all inquiries regarding corpus processing, downloading, web development, or dataset access,
please contact julianvargo
berkeley.edu