Top 3 must-have HARD skills:
1 Tech skills: SQL and Python
2 Regular expressions
Nice-to-have Skills:
1 Java
Summary:
We are looking for a Linguist III to help us develop language components for a variety of voice-enabled technologies and products. The role calls for experience in linguistic data analysis and language technology to manage data collection, data synthesis and data annotation tasks, as well as translation and localization work and ML model improvements.
Job Responsibilities:
- Provide linguistic expertise in the areas of syntax, semantics, pragmatics and sociolinguistics
- Collaborate with other linguists and data operations teams in data collection, data curation, translation, localization and annotation efforts
- Create annotation systems and guidelines
- Evaluate and curate data sets for ML models
- Assess model and data quality
- Collaboratively develop complex and consistent linguistic analyses
Required Qualifications:
- Master’s degree in general Linguistics, Linguistics with an emphasis on Romance languages, Computational Linguistics, Speech Science, or a related field
- Knowledge of syntax, semantics, pragmatics, sociolinguistics, corpus linguistics, and other areas of linguistics
- Experience working with speech and text data in multiple languages
- Familiarity with Large Language Models (LLMs) and their applications
- Comfortable working in a fast-paced, highly collaborative, dynamic work environment
- Strong organizational skills and attention to detail
- Excellent verbal and written communication skills
Preferred (additional) Qualifications:
- PhD in Linguistics or Romance languages, language technologies, computational linguistics, speech science, or a related field
- Experience with database queries and data analysis processes (e.g. SQL, spreadsheets, R, Unix, or others)
- Intermediate or higher proficiency in Python
- Experience with machine learning frameworks and NLP libraries and tools