Offer summary
Qualifications:
Expertise in multi-modal data processing, Proficiency in Python and PyTorch, Experience with distributed systems frameworks, Familiarity with cloud storage and databases.Key responsabilities:
- Lead ingestion and organization of large datasets
- Develop and optimize distributed data processing systems
- Build pipelines for synthetic data generation
- Conduct experiments on dataset quality