Note: The job is a remote job and is open to candidates in USA. iSoftStone, Inc. is a global IT service and consulting company seeking Data Analysts to join their team in Menlo Park, CA. The role focuses on managing data labeling workflows, maintaining data processing pipelines, and ensuring data governance while collaborating with engineers to enhance model performance through comprehensive data curation and evaluation solutions.
Responsibilities
- Data Curation: Manage data labeling workflows, including data enqueueing for labeling, UI for labeling, and extracting labels into datasets for the modeling team
- Data Engineering (Pipelines): Maintain large-scale, efficient, and reliable data processing pipelines (billions of images). This includes data sourcing, running machine learning models to understand content, and using LLMs to clean data
- Data Engineering (Governance): Maintain our portfolio of datasets, ensuring governance of access, retention, and privacy compliance
- Spend time manually annotating training data based on modeling team requirements
- Use of LLMs and other models to annotate training data or to evaluate generated content. Then apply auditing to understand these model performances
- Collaborate with engineers to identify and summarize model gaps based on evaluations. Utilize these findings to identify necessary data, and then mine and prepare that data for subsequent model training iterations
- Scale validated evaluation protocols with PDO teams, including coordination and auditing. Also, audit and correct human-labeled data
Skills
- Verbal and written communication skills, problem solving skills, and interpersonal skills
- Attention to details and an aptitude to experimental investigations
- Basic ability to work independently and manage one's time
- Basic knowledge of Python, and SQL
- Basic knowledge of computer vision and generative models
- Basic knowledge with data ETL workflows & pipelines
- Usage of LLM for data labeling related work
- Associate's degree or equivalent training required in Computer Science, Electronic Engineering, Physics, Bioinformatics, or other STEM subjects
- Be onsite, in Menlo Park, CA, working with engineers
- Prior industrial experience in software development and testing and / or research experience in human computer interaction
- Worked at Meta before
Benefits
- 1099 – No Benefits
- Medical, dental, vision, 401k
- Medical, dental, vision, 401k, holidays
Company Overview
- 19 Years of Dedicated Dance Education Click here for REGISTRATION info It was founded in undefined, and is headquartered in West Columbia, South Carolina, us, with a workforce of 11-50 employees. Its website is http://celestialstarsdance.com.