Project BV - Catalan (ca-ES: Spain) ↔ English
Description
Budget: $42
Multilingual Voice Recording Project
- Code-Switching Conversations
Project Name: BV Project Type: Remote | Ongoing (Limited Slots per Locale)
Project Overview Project BV is a multilingual speech data collection initiative designed to enhance Automatic Speech Recognition (ASR) systems for high-value multilingual call center scenarios, including financial services, healthcare, and telecommunications.
The project focuses on collecting natural code-switching conversational audio, where speakers alternate between two languages within a single conversational turn. Scripts will be provided; however, natural delivery, fluency, and context-appropriate language switching are essential.
Language Requirements Primary Language (Native level
- one required): Catalan (ca-ES: Spain)
Secondary Language (Required): Catalan or Basque
- Fluent
Participants may be located anywhere globally, provided they are native in Catalan (ca-ES: Spain) and fluent in English
The Project Manager will request language verification samples for both primary and secondary languages.
Participation Structure Participation is one-time only per speaker. Two speakers will form a pair (agent and customer roles). Speakers do not need to be in the same location. Scripts and recording instructions will be provided. Zoom is the preferred method for recording the conversations. Audacity is the preferred tool for converting audio files to the required sample rates and for verifying signal-to-noise ratio compliance.
Recording Scope Target volume: 200 hours of audio per locale Per pair output: 1.2 hours of recorded audio Quality acceptance: Minimum 80% usable speech hours
Compensation (USD
- Per Pair, 2 Speakers) All rates below are paid in United States Dollars (USD) and apply to one hour of final audio (30 minutes per speaker).
Payment is split evenly between the two participants. Catalan (ca-ES: Spain) USD 40.00 per participant
Recording Specifications File Format: Stereo WAV Left channel: Agent Right channel: Customer
Sample Rates: 8kHz (telephone quality)
- 50% 16kHz
- 50%
Code-Switching Ratio: 70% primary language 30% English ±10% tolerance
Delivery Style: Natural, context-pecific language switching No forced or artificial transitions
Speaker Requirements Age Distribution Targets: 18–25 years: 30% 25–40 years: 30% 40–65 years: 20% 65+ years: 20%
Gender Balance: 50% male / 50% female
Language Proficiency: Fluent Catalan or Basque is mandatory; non-fluent speakers will be rejected Catalan (ca-ES: Spain) is mandatory
Participation Limit: Each speaker may participate once only
Audio Quality Standards Minimum 25dB signal-to-noise ratio No background noise, distortion, clipping, or echo Quiet, controlled recording environment required Headset or professional microphone strongly recommended
How to Apply Applicants should be prepared to provide:
- Native language and Catalan or Basque proficiency details
- Age range and gender
- Confirmation of one-time participation
- Language verification samples upon request
- Participants will be placed in a group chat with others of the same language to work out who will work with whom for the recordings.
The project will close once all required recordings are collected.
Apply and wait for the next steps if deemed by our team that you qualify. Note: If you do not hear from us, consider that you do not qualify for this project
Skills
Want AI to find more roles like this?
Upload your CV once. Get matched to relevant assignments automatically.