Match score not available

ML Data Linguist - Chinese (Mandarin), Bedrock

extra holidays - extra parental leave
Remote: 
Full Remote
Contract: 
Experience: 
Mid-level (2-5 years)
Work from: 
Massachusetts (USA), United States

Offer summary

Qualifications:

Bachelor's degree in Linguistics or related field, Native or Proficient in Mandarin, Strong communication skills in English, Experience with data annotation and markup, Proficiency in Python, Java, or scripting languages.

Key responsabilities:

  • Conduct training sessions and monitor performance
  • Annotate and perform QA on data
  • Collaborate with ML Data Linguists on data issues
  • Dive deep into data quality and trends
  • Develop language artifacts for model development
Amazon Web Services, Inc. logo
Amazon Web Services, Inc. XLarge https://aws.amazon.com/
10001 Employees
See more Amazon Web Services, Inc. offers

Job description

Description

Amazon Web Services (AWS) is looking for a data associate to help with annotations, data analysis and quality assurance. As part of the Ai Data Team at AWS you will responsible for delivering high-quality training data to ensure the best performance of the AWS machine learning systems.

This role will be responsible to lead training and provide continuous feedback for a vendor team.

Key job responsibilities

  • Conduct regular training sessions, monitor performance, provide feedback, and analyze calibration tests results to identify trends, gaps in knowledge, and areas of improvement of the BDB Team.
  • Build a thorough understanding of data collection and annotation guidelines and various annotation tools.
  • Annotate, generate and QA data, identifying linguistic categories based on detailed annotation and adhering to guidelines.
  • Perform annotation related tasks; you participate in data generation, collection and quality assurance tasks
  • Collaborate with other ML Data Linguists to resolve data ambiguities and annotation disagreements.
  • Dive deep into the data to perform qualitative error trend analysis, and devise action plan to improve data quality.
  • Provide feedback to Language Engineers and Scientists on tool improvements and annotation processes.
  • Diving deep into issues and implement solutions independently
  • Contribute to process improvements to reduce handling time and improve resource output.
  • Develop a variety of language artifacts crucial for model development such as datasets for training and evaluation.
  • Collaborate with LEs, scientists, and Ops Manager to innovate processes, tracker automations, and workflows.

Basic Qualifications

  • Bachelor's degree in Linguistics, Philosophy, Cognitive Science, a foreign language, or Literature.
  • Native or Proficient (C1, C2 level) in Mandarin.
  • Strong communication skills and comfortable leading group calls, training sessions and delivering feedback.
  • Proficiency in American English vocabulary, sentence structure and nuances and ability to assess naturalness in a wide range of contexts.
  • Ability to identifying linguistic ambiguity, and other inaccuracies in linguistic data, as well as identify basic parts of speech, and produce reports of analyzed data.
  • Experience with natural language data labeling, data annotation, linguistic annotation or other forms of data markup.
  • Teaching experience and/or experience leading a team of peers.
  • Knowledge of different domains such as Finance, Health Care, and/or Insurance.
  • Ability to generate innovative and diverse inputs to explore various aspects of an AI model's capabilities
  • Familiarity with json, yaml, xml or other forms of text markup.
  • Ability to navigate a Unix terminal and use common command line tools
  • Knowledge of Python, Java or any other scripting language.
  • Strong organizational and leadership skills and detail-oriented.
  • Comfortable working in a fast paced, collaborative work environment.
  • Be able to start at 8 am EST

Preferred Qualifications

  • Master's degree in a relevant field, such as Linguistics, Communications, a foreign language,- computational linguistics or other language or data-related disciplines is a plus.
  • Proficient in another foreign language.
  • Familiarity with common text processing tools.
  • Passion for language, linguistics, human language technology and AI.
  • Ability to work in different operating systems (Windows, MacOS, or Linux).
  • Strong understanding of NLP concepts and techniques

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.


Company - Amazon Web Services, Inc.

Job ID: A2782012

Required profile

Experience

Level of experience: Mid-level (2-5 years)
Spoken language(s):
EnglishEnglish
Check out the description to know which languages are mandatory.

Other Skills

  • Detail Oriented
  • Problem Solving
  • Verbal Communication Skills
  • Organizational Skills
  • Financial Literacy
  • Leadership
  • Quality Assurance
  • Collaboration

Computational Linguist Related jobs