Linguist III
Onsite in Redmond or San Francisco Bay Area
Native Italian required
Summary:
We are looking for a Linguist III to help us develop language components for a variety of voice-enabled technologies and products. We are seeking candidates with native or near-native fluency in French and/or Italian and strong experience in linguistic data analysis and language technology to manage data collection, data synthesis, and data annotation tasks, translation and localization, and ML model improvements.
Responsibilities:
- Provide linguistic expertise in the areas of syntax, semantics, pragmatics and sociolinguistics
- Collaborate with other linguists and data operations teams in data collection, data curation, translation, localization and annotation efforts
- Create annotation systems and guidelines
- Evaluate and curate data sets for ML models
- Assess model and data quality
- Collaboratively develop complex and consistent linguistic analyses
Required Qualifications:
- Master’s degree in general Linguistics or Linguistics with an emphasis on Romance languages, Computational Linguistics, Speech Science, or related field
- Native or near-native fluency in French and/or Italian
- Awareness of Italian and/or French linguistic, cultural, and local norms
- Knowledge of syntax, semantics, pragmatics, sociolinguistics, corpus linguistics, and other areas of linguistics
- Experience working with speech and text data in multiple languages
- Familiarity with Large Language Models (LLMs) and their applications
- Comfortable working in a fast-paced, highly collaborative, dynamic work environment
- Strong organizational skills and attention to detail
- Excellent verbal and written communication skills
Preferred Qualifications:
- PhD in Linguistics or Romance languages, language technologies, computational linguistics, speech science, or related field
- Experience with database queries and data analysis processes (e.g., SQL, spreadsheets, R, Unix, or others)
- Intermediate or higher proficiency in Python
- Experience with machine learning frameworks and NLP libraries and tools