Role: Linguist II – Brazilian Portuguese
Location: Onsite Sunnyvale, CA, New York, NY or Burlingame, CA
Job Term: 09 months contract with possibility of extension (only W2)
Our client is seeking a Linguist to help develop language components for various voice-enabled technologies and products. We are seeking candidates with native or near-native fluency in Brazilian Portuguese with strong linguistic data analysis and language technology experience to manage data collection, data synthesis and data annotation tasks, translation-localization, and ML model improvements.
Must-Have Skills
SQL and Python
Language skills: Brazilian Portugues
Job Responsibilities
• Provide linguistic expertise in the areas of syntax, semantics, pragmatics and sociolinguistics
• Collaborate with other linguists and data operations teams in data collection, data curation, translation, localization and annotation efforts
• Create annotation systems and guidelines
• Evaluate and curate data sets for ML models
• Assess model and data quality
• Collaboratively develop complex and consistent linguistic analyses
Required Qualifications
• Master’s degree in general Linguistics or Linguistics with an emphasis on Romance languages, Computational Linguistics, Speech Science, or a related field
• Native or near-native fluency in Brazilian Portuguese
• Awareness of Brazilian Portuguese linguistic, cultural, local norms
• Knowledge of syntax, semantics, pragmatics, sociolinguistics, corpus linguistics, and other areas of linguistics
• Experience working with speech and text data in multiple languages
• Familiar with Large Language Models (LLMs) and their applications
• Comfortable working in a fast paced, highly collaborative, dynamic work environment
• Strong organizational skills and detail oriented
• Excellent communication skills both verbal and written
Preferred (additional) Qualifications
• PhD in Linguistics or Romance languages, language technologies, computational linguistics, speech science, or related field
• Experience with database queries and data analysis processes (i.e. SQL, spreadsheets, R, Unix, or others)
• intermediate or above skills in Python
• Experience with machine learning frameworks, NLP Libraries and Tools