DevOps Engineer

Fluence is the first decentralized “Cloudless” computing platform, providing an open alternative to the giant internet cloud monopolies. Fluence is up to 80% cheaper than cloud providers and is both resilient and verifiable. Fluence assembles excess compute capacity from top-tier data centers around the world into a global, always-on DePIN network that is suitable for running a wide range of applications. The platform is open, allowing users to change providers easily, which keeps prices low and service levels high.

Who We’re Looking For:

We're looking for a DevOps Engineer/SRE to strengthen system stability, reliability, and scalability.

You think in terms of reliability, failure modes, SLOs, and error budgets, and know how to balance velocity and reliability in day-to-day engineering decisions.

You believe in Infrastructure as Code and automation, follow GitOps principles, love operators, and prefer building clean, reproducible systems on self-hosted, vendor-neutral building blocks rather than relying on convenient managed services.

You stay up to date with modern infra tech and practices and know when and how to adopt improvements to save time without compromising reliability.

What You'll Be Doing:

  • Help implement and evolve SRE practices to make systems more stable and predictable
  • Level up monitoring with metrics, alerts, and dashboards
  • Maintain CI/CD, improve infrastructure and tooling, and make developers' lives easier
  • Participate in cloud and platform R&D — work on storage, networking, VMs, GPU integration, and container runtimes
  • Act as a technical expert for the platform — helping internal teams, customers, and infrastructure providers resolve complex infrastructure and platform-level issues
  • Join the on-call rotation, handle incidents, and drive post-incident improvements

What You Bring:

  • 5+ years of production experience
  • SRE mindset (reliability engineering, error budgeting, postmortems, capacity planning)
  • Kubernetes expertise — experience designing, deploying, and managing production clusters (bare metal and cloud)
  • Good grasp of hardware and networking fundamentals
  • Terraform skills and an IaC mindset
  • Familiarity with GitOps principles and tools
  • Experience with the Prometheus ecosystem
  • Ability to own projects end to end, work independently and across teams, and deliver results quickly
  • Professional proficiency in English

What Will Make Us Extra Happy:

  • Strong understanding of datacenter environments end-to-end: server hardware (NICs, disks, BIOS/firmware, RAID), networking fundamentals (L2/L3, VLANs, BGP, LACP), and how DC operations work in practice (provisioning, rack/stack, power/cooling, failure domains, capacity planning)
  • Experience running validator nodes (Ethereum and Solana)
  • Expertise in running PostgreSQL and Kafka in production
  • Strong Linux internals knowledge with the ability to debug low-level performance issues, such as bottlenecks in the kernel or networking stack
  • Proficiency in Russian

What You'll Tackle First:

  • Help improve the release process with gates, SLOs, and error budgeting
  • Build self-service ephemeral developer environments

Future Challenges:

You'll be part of building our own cloud, working closely with hardware, advanced networking, and systems-level R&D.

You'll help expand our core platform by adding:

  • Advanced networking (NAT gateways, load balancers)
  • GPU integration with VMs in Kubernetes
  • Secure container runtime

Tech Stack Highlights:

Talos, Terraform, FluxCD and ArgoCD, HashiCorp Vault, GitHub Actions, KubeVirt, VictoriaMetrics stack (metrics, logs, traces), Authentik, Kube-OVN, Linstor, PostgreSQL (CNPG), Kafka (Redpanda operator)

Why Join Us:

  • Impact: Your work will directly impact how our clients access and utilize compute resources, driving the next generation of cloud-native infrastructure.
  • Innovation: Work with the latest in container orchestration, virtualization, and multi-cloud technologies.
  • Growth: Be part of a rapidly growing company with plenty of opportunities to advance your career.
  • Culture: Join a collaborative, inclusive, and supportive work environment that values innovation and professional development.

Location: Remote

Commitment: Full-time

Benefits:

  • Competitive compensation and equity options
  • Remote-first, work from home or use a co-working space of your choice
  • Friendly startup environment: no bureaucracy or time tracking
  • We will provide you with anything you need to learn, grow, and be more productive