DevOps Engineer/SRE - Infrastructure Team

ChainSafe Systems

Software Engineering, Other Engineering
Remote
Posted on Saturday, May 25, 2024

ChainSafe is a leading blockchain research and development firm specializing in infrastructure solutions for the decentralized web. Alongside its contributions to significant ecosystems such as Ethereum, Polkadot, Filecoin, and more, ChainSafe creates solutions for developers across the web3 space utilizing our expertise in gaming, bridging and decentralized storage. As part of the mission to build innovative products for users and better tooling for developers, ChainSafe embodies an open-source and community-oriented ethos.

At ChainSafe, you’ll be part of a global remote team that believes in the community's vital importance and contributes to advancing humanity with open-source and decentralized technology. To learn more about ChainSafe, look at our website or check out our work on GitHub.

About the role

As a DevOps Engineer/SRE on the Infrastructure Team, you will play a vital role in defining and implementing best-practice strategies and guides to ensure the reliability, scalability, and performance of the infrastructure that supports daily production activities across multiple blockchain ecosystems. This infrastructure spans multiple cloud and bare-metal service providers and is built on our containerized stack across Linux environments.

Your expertise will contribute to the sophistication of blockchain applications and redefine the boundaries of what's possible within this emerging technological sphere. All work across ChainSafe will be open-source, ensuring expansive opportunities for deep contribution and collaborative efforts across various web3 blockchains and ecosystems.

Responsibilities

What you will be doing

  • Oversee and enhance the health, performance, and security of environments, servers, and applications across the entire technology stack, including various blockchain services and full nodes.
    • Help manage various global environments, taking into account the resources and latency of the regions they serve
  • Be on-call, able to respond promptly outside of business hours
  • Implement automation efforts around builds, deployment, and automatic scaling
  • Work directly with the development and support teams to resolve issues
  • Design and implement procedures related to ChainSafe’s infrastructure operations
    • Execute deployments and network upgrades
    • Run and improve the incident response program
  • Provide training and guidance for other members of the infrastructure team, ensuring round-the-clock node operation and incident response.
  • Document and communicate technical details via open-source documentation
  • Collaborate with various internal teams and the wider community to build, expand, and scale ChainSafe’s infrastructure architecture, by tapping into new trends and opportunities highlighted by internal data, blockchain research, and the wider blockchain ecosystem

Requirements

  • Practical knowledge of at least one programming language (Go, TypeScript, Solidity, or Rust is a big plus)
  • Demonstrable experience with modern Infrastructure as Code (IaC) tools (Terraform, Helm, Ansible, etc.), deployment automation, and CI/CD best practices and tools.
  • 3+ years of experience managing resources in AWS, GCP, or Azure.
  • 3+ years of experience working with Linux.
  • 3+ years of experience with monitoring and alerting tools (DataDog, Grafana, Prometheus, etc.)
  • 3+ years of experience implementing distributed tracing, monitoring, and logging systems using the OpenTelemetry Protocol (OTLP)
  • 3+ years of experience building and participating in incident response systems (PagerDuty, etc.) and handling emergency response to production environment failures.
  • Excellent communication skills with the ability to document and convey technical details clearly
  • Ability to work autonomously as well as with the wider team

As a plus:

  • Experience working in the Web3 domain
  • Experience working with bare metal deployments
  • Experience automating network deployment
  • Understanding of at least two of the following domains: Web Security, Web3 Security, Cloud Security, Systems Security, and Applied Cryptography.

Hiring Steps

  • Selected candidates will be invited to a 30-to-45-minute values interview with one or two of our team members.
  • Next, a 60-minute technical interview with one or two of our engineers.
  • Then, candidates will be asked to complete a homework assignment that should take no more than 3-4 hours.
  • Lastly, a 60-minute final interview call with the hiring team to discuss the results.

Why Join ChainSafe

Founded by developers for developers, ChainSafe is a remote-first company with an international team. We continue to provide opportunities for personal and professional growth, value autonomy and responsibility, have a results-driven environment, and offer flexible work hours.

We care deeply about our values and look for these attributes in every new team member. In addition, we recognize the benefits of cultivating a diverse team and aspire to embed respect for all people into our culture. We encourage women, the LGBTQIA+ community, people of colour, and members of any other group underrepresented in the blockchain space (or tech in general) to apply.

How to Apply

Please fill out the Greenhouse application form below, attach your resume, and link your GitHub/GitLab profile or any software project you have contributed to (if applicable).