Description
Who is Defined.ai? Well, from a technical point of view, we leverage the power of a global crowd to provide some of the world’s biggest companies with the high-quality data they need to power their artificial intelligence. We’re instrumental to the progression and development of artificial intelligence and we couldn’t be prouder or more inspired to be involved in an industry that is changing the world.
From a personal point of view, we’re a group of big thinkers, high achievers and creative problem solvers. We bond over our shared love of software engineering, data science, and strong coffee. We like online gaming, running marathons, and team drinks. We celebrate authenticity and diversity and we’re invested in what we do. Our mission? World domination, obviously!
About the role
At Defined.ai, we make machines smarter for Fortune 500 companies. Our focus is on improving Artificial Intelligence technologies through natural language processing – and we need linguists to train new models. We are looking for Senior Computational Linguists who are native speakers of Italian, English (UK), English (Ireland), or English (Australia) to work on a project with Meta.
Start & completion dates:
The start date is as soon as possible, and the initial contract runs until January 2026.
Weekly scope:
40 hours per week.
General:
Linguists will help build out the required NLU workflows and workstreams by defining and delivering data annotation pipelines, annotation guidelines, golden datasets, training datasets, evaluation criteria, process improvements, upskilling programs, etc. You would be working on the following workstreams:
- language modeling
- building test and evaluation training sets
- evaluation and analysis of product features
- rules engine: tuning the engine for the model outputs
- triage
- multimodal features, chatbot experiences, packing the AI into glasses
Job description:
The role of the computational linguist is to help develop and improve our client’s NLP/NLU systems. Tasks may include but are not limited to:
- Annotating and reviewing linguistic data – part of speech annotation, semantic annotation, phonetic transcription
- Collecting data and performing data analysis
- Labeling text for disambiguation and (inverse) text normalization
- Evaluating current system outputs, detecting incidental and systemic errors, and providing solutions
- Translation and localization tasks
- Creating and evaluating training and test sets
- Prompt engineering
Minimum Requirements:
- Native speaker of the target language and fluent in English
- Experience in using, adapting, and creating scripts in Python
- Knowledge of relational databases and experience in using, adapting, and creating SQL queries
- Experience in annotation work
- Knowledge of semantics, syntax, morphology or lexicography
- Excellent oral and written communication skills
- Attention to detail and good organizational skills
- Ability to work independently with confidence and little oversight
Desired Skills:
- Degree in Linguistics or Computational Linguistics, or a degree in Computer Science with a minor in Linguistics
- Ability to quickly grasp technical concepts and learn in-house tools
About Us
Defined.ai offers a platform with multiple data delivery options that leverages machine learning technology and human intelligence to deliver quality-guaranteed training data for AI systems. The platform offers self-service and fully customizable solutions that deliver high-quality, project-specific training data, enabling AI products to reach the market more quickly. It is this business model that has allowed Defined.ai to raise a total of $63.6M in funding over 4 rounds. Our value proposition is quality, privacy, speed, and scale, covering more than 50 different languages. With strong expertise in speech and natural language processing technologies, we have been serving AI companies and Fortune 500 companies since day one. Defined.ai was founded in Seattle and has an office in Lisbon.
Privacy Notice: https://defined.ai/dataset/privacy-notice-career