Logo for Cloudera

Staff Software Engineer (Java Or Scala)

Roles & Responsibilities

  • Master's degree in Computer Science or a related field with 4–6 years of experience, or Bachelor's degree with more than 6 years of relevant industry experience, or 6–8 years of relevant industry experience.
  • Strong backend engineering skills with expertise in Java, Scala, or Kotlin.
  • Ability to read large codebases and write succinct, clean code.
  • Experience with system software design and development.

Requirements:

  • Build and maintain large-scale replication systems on top of the Cloudera Data Platform stack.
  • Be responsible for our products running in production.
  • Design cloud-based, low RPO, low RTO replication architectures and support replication across multiple Cloudera components such as HDFS, Ozone, Hive, HBase, Iceberg, Atlas, and Ranger.
  • Mentor junior engineers and collaborate with product management and field engineers on the product roadmap and early access feature introductions.

Job description

Business Area:

Engineering

Seniority Level:

Mid-Senior level

Job Description: 

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry.  Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance.

The Replication Manager team is seeking passionate developers to enhance replication support for the Cloudera Data Platform. The team’s mission is to provide a seamless experience for customers moving data and associated entities to support migration, replication, and disaster recovery.

Replication Manager enables customers to replicate data across data centers or between on-premises and cloud environments. This includes data in HDFS, Ozone, or cloud buckets; Hive, HBase, or Iceberg tables; Ranger permissions; and Atlas lineage. Datasets range from terabytes to petabytes, with challenges such as millions of directories, large file sizes, and near real-time HBase WAL replication.

As a Staff Software Engineer, you will

  • Build and maintain large-scale replication systems on top of the Cloudera Data Platform stack

  • Be responsible for our products running in production

  • Work with a distributed team of engineers to design cloud-based, low RPO, RTO replication architectures

  • Support replication across multiple Cloudera components like HDFS, Ozone,  Hive, HBase, Iceberg, Atlas, and Ranger

  • Give and take actionable feedback

  • Mentor junior engineers

  • Work with product management and occasionally, with field engineers on the product roadmap and early access feature introductions

We’re excited about you if you have:

  • Masters in Computer Science or related field and 4-6 years of experience - or Bachelors and more than 6 years of relevant industry experience - or 6-8 years of relevant industry experience

  • Strong backend engineering skill set with expertise in Java or Scala or Kotlin

  • Ability to read large codebases and write succinct, clean code

  • Experience with system software design and development

You may also have

  • Experience with large-scale, distributed systems design and development with an understanding of scaling, replication, consistency, and high availability

  • Understanding of computer architecture, storage, network, and IO subsystems

  • Current expertise with Java/Scala/Kotlin developer ecosystems

  • Experience with AWS, Azure, or GCP

  • Test automation experience along with Python basics

  • Background in performance tuning, identifying performance bottlenecks, and implementing performance optimizations

  • Systems/DevOps experience

Why this role matters: 

You will tackle complex distributed systems challenges, crafting the foundational software for the control and data planes that powers CDP and keeps it running at massive scale. Working at the forefront of hybrid and multi-cloud technology, you will empower data scientists, engineers, and analysts with the tools and infrastructure they need for advanced analytics and modeling.

Collaboration is key, you will work alongside brilliant minds across product, data science, and engineering to drive innovation, standardize best practices, and shape the future of enterprise AI and data platforms. This is your chance to build the future of data and see your work make a global impact.

This role is not eligible for immigration sponsorship.

What you can expect from us:

  • Generous PTO Policy 

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy 

  • Mental & Physical Wellness programs 

  • Phone and Internet Reimbursement program 

  • Access to Continued Career Development 

  • Comprehensive Benefits and Competitive Packages 

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

#LI-VG1

#LI-REMOTE

Software Engineer Related jobs

Other jobs at Cloudera

We help you get seen. Not ignored.

We help you get seen faster — by the right people.

🚀

Auto-Apply

We apply for you — automatically and instantly.

Save time, skip forms, and stay on top of every opportunity. Because you can't get seen if you're not in the race.

AI Match Feedback

Know your real match before you apply.

Get a detailed AI assessment of your profile against each job posting. Because getting seen starts with passing the filters.

Upgrade to Premium. Apply smarter and get noticed.

Upgrade to Premium

Join thousands of professionals who got noticed and hired faster.