Multilingual AI Lyrics Sync
Description
Budget: $250 - $750
I’m building a tool very close to Youka, but I need it to go a step further by aligning and timing lyrics in several languages at once. My goal is for a single audio or video file to display perfectly synced English, Spanish, and French lyrics (plus any languages added later) as parallel lines, without forcing users to switch tracks.
Core requirements
• The AI must detect, segment, and timestamp each lyric line automatically.
• It has to handle multiple languages simultaneously, displaying them side-by-side on the same timeline.
• An editor panel should let me adjust individual timestamps, merge or split lines, and correct recognition errors before exporting.
• Final output formats: .lrc and .ass at minimum.
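To make the requirements above more concrete, here is a minimal Python sketch of one possible data model: a timed line that carries its text in every language, timestamp formatters for .lrc and .ass, and a merge helper like the one the editor panel would need. Everything here is an assumption for illustration only: the `LyricLine` class, the function names, and especially the " / " separator used to put several languages on one .lrc line, since .lrc has no standard multilingual convention that I know of.

```python
from dataclasses import dataclass, field


@dataclass
class LyricLine:
    """One timed line with its text in each language (hypothetical structure)."""
    start: float  # seconds from the start of the track
    end: float
    text: dict[str, str] = field(default_factory=dict)  # language code -> lyric


def lrc_timestamp(seconds: float) -> str:
    """Format seconds as the [mm:ss.xx] tag used in .lrc files."""
    centis = round(seconds * 100)
    minutes, rest = divmod(centis, 6000)
    return f"[{minutes:02d}:{rest // 100:02d}.{rest % 100:02d}]"


def ass_timestamp(seconds: float) -> str:
    """Format seconds as H:MM:SS.cc, the time format of .ass Dialogue events."""
    centis = round(seconds * 100)
    hours, rest = divmod(centis, 360000)
    minutes, rest = divmod(rest, 6000)
    return f"{hours}:{minutes:02d}:{rest // 100:02d}.{rest % 100:02d}"


def to_lrc(lines, languages, sep=" / "):
    """Render the requested languages on one .lrc line per timestamp.

    The separator is an assumption, not part of any .lrc standard.
    """
    out = []
    for line in sorted(lines, key=lambda item: item.start):
        parts = [line.text[lang] for lang in languages if lang in line.text]
        out.append(lrc_timestamp(line.start) + sep.join(parts))
    return "\n".join(out)


def merge(a: LyricLine, b: LyricLine) -> LyricLine:
    """Merge two adjacent lines into one, joining the text per language."""
    langs = list(a.text) + [k for k in b.text if k not in a.text]
    text = {
        k: " ".join(t for t in (a.text.get(k), b.text.get(k)) if t)
        for k in langs
    }
    return LyricLine(min(a.start, b.start), max(a.end, b.end), text)
```

For example, `to_lrc([LyricLine(12.0, 15.0, {"en": "Hello", "es": "Hola", "fr": "Bonjour"})], ["en", "es", "fr"])` produces a single line tagged `[00:12.00]` that holds all three languages. Keeping one timestamp per line and a dictionary of translations means adding a language later only adds a key, not a new timeline.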
Tech notes
I’m open to TensorFlow, PyTorch, or a lightweight custom model, whichever you’re fastest with, as long as accuracy stays high and the code is clean enough for future language expansion.
If you’ve already tackled lyric alignment, subtitle generation, or multilingual speech-to-text, let’s talk details. A quick demo or proof-of-concept that shows simultaneous English, Spanish, and French sync will be the first milestone.