Platform Engineer (AWS)

Key Facts

Remote From: 
Full time
Senior (5-10 years)
English

Roles & Responsibilities

  • Own and evolve the AWS infrastructure for the multi-tenant SaaS platform
  • Build and maintain delivery pipelines for rapid, reliable deployments
  • Design, operate, and optimize the container orchestration platform
  • Build and maintain monitoring, alerting, and observability systems

Requirements:

  • 5+ years of progressive DevOps/SRE experience in SaaS or enterprise environments
  • Infrastructure as Code using Terraform (AWS provider)
  • AWS core services: EKS, ECR, RDS, VPC, IAM, CloudWatch
  • Kubernetes administration and Docker containerization experience

Job description

Embrace Legal Group is a trusted legal technology solutions provider within the Embrace portfolio of companies.

The Opportunity

We are seeking a highly capable Platform Engineer to join our Platform Engineering team and help build, operate, and scale the infrastructure that powers Embrace’s legal technology platform.

In this role, you will own critical areas of our cloud infrastructure, container orchestration platform, CI/CD systems, observability stack, and operational reliability practices. You will work closely with product engineering, security, and data teams to create resilient, secure, observable, and cost-efficient systems that enable rapid software delivery while maintaining the reliability and compliance standards expected by enterprise clients.

This is a high-impact opportunity for an engineer who thrives at the intersection of cloud infrastructure, automation, developer enablement, and operational excellence in a regulated, multi-tenant SaaS environment.

Key Responsibilities

Cloud Infrastructure & Platform Architecture

  • Own and evolve the AWS infrastructure that underpins our multi-tenant SaaS platform.
  • Design, provision, and manage production-grade AWS services including EC2, S3, RDS, ECR, VPC, IAM, CloudFront, Route 53, and EKS/ECS clusters.
  • Implement and maintain Infrastructure as Code (IaC) using Terraform or CloudFormation to ensure repeatable, version-controlled, and auditable environments across development, staging, and production.
  • Architect and optimize PostgreSQL infrastructure including automated backups, replication, failover strategies, and performance tuning for high-throughput transactional workloads.
  • Drive high availability, disaster recovery planning, scalability, and cloud cost optimization initiatives across the platform.
  • Contribute to infrastructure standards, platform governance, and operational best practices.

CI/CD & Release Engineering

  • Build and maintain delivery pipelines that enable rapid, safe, and reliable deployments.
  • Design and operate CI/CD workflows for Python (Django/Flask/FastAPI) and React applications across multiple services.
  • Automate build, test, deployment, and rollback workflows using GitHub Actions, GitLab CI, Jenkins, or equivalent tooling.
  • Implement deployment strategies including blue-green, canary, and rolling deployments to reduce production risk.
  • Manage artifact repositories, container registries (ECR), and deployment manifests with full traceability and rollback support.
  • Improve developer workflows and deployment automation to increase engineering velocity and platform reliability.

Container Platform & Orchestration

  • Design, operate, and optimize our container orchestration platform for scalability, reliability, and tenant isolation.
  • Manage Docker-based development and production environments, including image hardening and registry governance.
  • Implement and maintain Kubernetes (EKS) or ECS infrastructure for scalable application deployments.
  • Define and maintain Helm charts, Kubernetes manifests, and environment-specific deployment configurations.
  • Enforce networking policies, namespace isolation, resource quotas, and workload security standards.
  • Support platform scalability, cluster health, autoscaling, and operational resilience.

Observability, Reliability & Security

  • Build and maintain monitoring, alerting, and observability systems using CloudWatch, Datadog, Prometheus, Grafana, or similar tooling.
  • Implement centralized logging and audit trail solutions across application and infrastructure layers.
  • Define operational standards for incident response, alerting, reliability, and system health monitoring.
  • Enforce infrastructure security best practices including secrets management, IAM least-privilege access, network segmentation, and certificate management.
  • Support compliance initiatives including SOC 2 and HIPAA through infrastructure controls, audit readiness, and vulnerability management.
  • Lead incident response, root cause analysis, and blameless postmortem reviews.

Cross-Functional Collaboration & Platform Enablement

  • Partner with engineering teams to improve deployment reliability, operational efficiency, and developer experience.
  • Troubleshoot infrastructure, deployment, networking, and performance issues across environments.
  • Author and maintain infrastructure documentation, architecture diagrams, operational runbooks, and deployment playbooks.
  • Mentor team members on platform engineering, infrastructure-as-code practices, operational excellence, and cloud-native tooling.
  • Contribute to long-term platform scalability, automation, and engineering enablement initiatives.



Requirements

Must-Have Skills

  • 5+ years of progressive DevOps/SRE experience in SaaS or enterprise environments.
  • Infrastructure as Code using Terraform (AWS provider, modules, multi-environment state management).
  • AWS core services: EKS, ECR, RDS, VPC, IAM, CloudWatch, ALB, EFS, S3, CloudFront, Route 53.
  • Kubernetes administration: Helm charts, pods, deployments, services, kubectl, autoscaling.
  • Docker containerization including multi-stage builds and registry operations.
  • CI/CD pipelines: AWS CodeBuild, GitHub Actions, GitLab CI, or Jenkins.
  • PostgreSQL production management: backup automation, replication, monitoring, performance tuning.
  • Linux systems administration (Ubuntu/Amazon Linux) and shell scripting proficiency.
  • Networking fundamentals: DNS, load balancing, TLS/SSL, firewall rules, VPN configurations.
  • Monitoring and observability: Datadog, FluentBit, CloudWatch Logs.
  • Security: AWS Secrets Manager, ACM certificates, security groups, IAM policies.
  • Application stack: Django, Celery, Redis, PostgreSQL, Nginx.
  • Git workflows, branching strategies, and pull request review processes.
  • Strong problem-solving skills with a proactive, ownership-driven approach.

Good-to-Have Skills

  • Advanced AWS services: AWS Backup, Lambda, SNS, EventBridge.
  • Advanced Kubernetes: EFS CSI driver, AWS Load Balancer Controller, Cluster Autoscaler.
  • Python scripting for infrastructure automation and operational workflows.
  • Multi-tenant SaaS architecture, tenant isolation strategies, and data partitioning.
  • Third-party service integration (SendGrid, Twilio) at the infrastructure level.
  • FinOps practices: cloud cost management, reserved/spot instance optimization.
  • Compliance frameworks (SOC 2 Type II, HIPAA) and required infrastructure controls.
  • Service mesh technologies (Istio, Linkerd) or API gateway solutions.
  • Cluster management tools like Rancher.
  • Database disaster recovery: snapshots, cloning, multi-region considerations.
  • Container security scanning and ClamAV integration.
  • Infrastructure documentation and multi-environment workflows (dev → stg → prod).
  • AWS certifications (Solutions Architect, DevOps Engineer Professional).


Benefits

  • Competitive salary commensurate with experience.
  • Opportunities for career advancement and professional development.
  • Experience collaborating with a diverse, global team within a remote work setting.


This is a remote position.
