Linguist II
Location: Remote
Duration: 12-month contract with possibility of extension.
Start: Targeting September 2024
Benefits: Medical, dental, vision, sick time based on state laws
Summary
We are looking for a Linguist to help us develop language components for a variety of voice-enabled technologies and products. We are seeking candidates with native fluency in Mexican Spanish and strong experience in linguistic data analysis and language technology to manage data collection, data synthesis, and data annotation tasks, as well as localization and ML model improvements.
Job Responsibilities
Provide linguistic expertise in the areas of syntax, semantics, pragmatics and sociolinguistics
Collaborate with other linguists and data operations teams in data collection, data curation, localization and annotation efforts
Create annotation systems and guidelines
Evaluate and curate data sets for ML models
Use grammar-based methods and programmatic templates for generating annotated data
Design and conduct experiments for evaluating model and data quality
Collaboratively develop complex and consistent linguistic analyses
Basic Qualifications
Master’s degree in general Linguistics or Linguistics with an emphasis on Romance languages, Computational Linguistics, Speech Science, or related field
Two (2) or more years of experience in Linguistics, Language Technologies, or NLP
Native fluency in Mexican Spanish
Knowledge of syntax, semantics, pragmatics, sociolinguistics, corpus linguistics, and other areas of linguistics
Experience with database queries and data analysis processes (e.g., SQL, spreadsheets, R, Unix, or others)
Experience with language annotation or other forms of data markup and tagging
Experience working with speech and text data in multiple languages
Excellent verbal and written communication skills
Preferred Qualifications
PhD in Linguistics or Romance languages, language technologies, computational linguistics, speech science, or related field
Experience in building semantic ontologies and semantic relation frameworks
Experience with statistical language modeling
Comfortable working in a fast-paced, highly collaborative, dynamic work environment
Strong organizational skills and attention to detail