
Data Science Specialist | Feature Store & ML Platform | Specialist (Remote)

Requirements:

  • Proven expertise in feature engineering on corporate ML platforms (Feast, Tecton, Hopsworks or equivalents)
  • Advanced proficiency in Apache Spark / PySpark for large-scale distributed processing
  • Deep knowledge of Apache Iceberg and lakehouse architectures (comparison with Delta Lake and Hudi)
  • Expertise in Redis for low-latency feature serving, including cache invalidation strategies and efficient serialization

Roles & Responsibilities

  • Lead the development and evolution of Feature Store capabilities: data lineage, feature views, feature recommendation, and new query engines
  • Design and implement Apache Iceberg tables focused on read performance, versioning, and schema evolution
  • Architect and optimize the serving layer with Redis for real-time features with strict latency SLOs
  • Integrate and optimize Amazon EMR as a query and processing engine at scale

Job description



RESPONSIBILITIES AND ASSIGNMENTS


  • Lead the development and evolution of Feature Store capabilities: data lineage, feature views, feature recommendation, and new query engines;
  • Design and implement Apache Iceberg tables with a focus on read performance, versioning, and schema evolution;
  • Architect and optimize the Redis serving layer for real-time features with strict latency SLOs;
  • Integrate and optimize Amazon EMR as a query and processing engine at scale;
  • Define and implement feature selection and transformation pipelines with end-to-end traceability;
  • Establish quality, versioning, and governance standards for features across the entire platform;
  • Act as the technical reference for the data and data science teams that consume the Feature Store.



REQUIREMENTS AND QUALIFICATIONS


  • Proven expertise in feature engineering on corporate ML platforms (Feast, Tecton, Hopsworks or equivalents)
  • Advanced proficiency in Apache Spark / PySpark for distributed processing at scale
  • Deep knowledge of Apache Iceberg and lakehouse architectures (comparison with Delta Lake and Hudi)
  • Expertise in Redis for low-latency feature serving, including cache invalidation strategies and efficient serialization
  • Solid experience with AWS and its data services in production (S3, Glue, EMR, Redshift, Athena)


Nice to have:


  • Command of data lineage and metadata catalogs (DataHub, OpenMetadata, Marquez) in production;
  • Experience with Amazon EMR: cluster configuration and optimization, and Spark job tuning;
  • Expertise in MLOps practices with a focus on versioning and traceability of data artifacts;
  • Previous work in a financial context with high-cardinality, high-frequency data and regulatory requirements;
  • Knowledge of data quality tools at scale (Great Expectations, Soda, dbt tests).



Become a Compasser, be part of AI/R.


Compass UOL is a global firm and part of the AI Revolution Company. Together, we transform organizations using Artificial Intelligence, Generative AI, and other advanced technologies.


We equip our team with proprietary and external AI-driven tools to design and build digital-native platforms, integrating cutting-edge technologies and enabling companies to innovate, transform their businesses, and drive success in their markets.

To achieve this, we attract and develop the best talent, creating opportunities that enhance people’s lives and highlight the positive impact of disruptive technologies.

We empower borderless talent and promote knowledge and opportunities in the latest market trends, driving significant personal and professional growth.

Join us and be part of the AI-driven revolution.

